The false positive tax
If you manage an expense audit team, you already know the frustration. Your rules engine generates hundreds of exceptions per week. Your reviewers spend their mornings opening flagged transactions, checking receipts, reading policy language, and then clearing the item because nothing is actually wrong. Then they do it again. And again.
Let's put numbers to it. Say your rules engine flags 500 exceptions per week and 40% of them are false positives - 200 flags with nothing actually wrong behind them.
At 12 minutes per review, that's 40 hours per week. A full-time employee doing nothing but clearing legitimate transactions.
In other words, a 40% false positive rate means you're paying one person's full salary to verify that things are fine. Spread across a team of four reviewers, that's 10 hours of each person's week - an entire FTE-equivalent dedicated to processing noise. That's not audit coverage. That's overhead.
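The arithmetic above can be sketched in a few lines. The figures are the illustrative numbers from this article, not benchmarks:

```python
# Back-of-the-envelope model of the weekly false positive tax.
# All inputs are the example figures used in the text.

EXCEPTIONS_PER_WEEK = 500   # total flagged transactions
FALSE_POSITIVE_RATE = 0.40  # share of flags that turn out to be noise
MINUTES_PER_REVIEW = 12     # time to open, check, and clear one flag

false_positives = EXCEPTIONS_PER_WEEK * FALSE_POSITIVE_RATE  # 200 per week
wasted_hours = false_positives * MINUTES_PER_REVIEW / 60     # 40 hours

print(f"False positives per week: {false_positives:.0f}")
print(f"Hours spent clearing noise: {wasted_hours:.0f}")
```

Swap in your own queue volume, false positive rate, and review time to estimate the tax on your team.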
The hidden costs
The time cost is obvious. The downstream effects are worse.
Alert fatigue
When 4 out of every 10 exceptions are false positives, reviewers stop treating exceptions seriously. They develop scanning habits - quick glances instead of thorough reviews. The real issues that do come through get the same half-attention as the noise. This is how genuine fraud passes through a team that's technically looking at it.
Reviewer attrition
Nobody went into audit to clear 200 false positives per week. The repetitive, low-value nature of false positive review is one of the top reasons experienced auditors leave for other roles. You're not just burning time. You're burning out your best people.
Missed escalations
When the queue is always full, prioritization becomes impossible. Reviewers triage by volume, not by risk. A genuinely suspicious $50,000 vendor pattern sits in the same queue as a $47 coffee receipt that got flagged for a category mismatch. The high-value exceptions don't get the attention they deserve because the queue is clogged with noise.
The paradox: Teams respond to missed fraud by writing more rules. More rules generate more false positives. More false positives create more alert fatigue. More alert fatigue leads to more missed fraud. The cycle accelerates.
Where do false positives come from?
Most false positives in expense audit come from three sources:
Over-broad category rules
"Flag all meals over $150" sounds reasonable until you realize that a legitimate client dinner in Manhattan routinely exceeds that threshold. The rule doesn't understand context - it only sees a number. So every sales rep in a high-cost-of-living market generates weekly false positives.
Merchant Category Code (MCC) mismatches
MCCs are assigned by payment networks, not by the merchant. A hotel restaurant might carry a "bar" MCC. An office supply purchase at a department store might carry a "general merchandise" code. The rule sees "bar" and flags it. The reviewer sees a hotel dinner receipt and clears it. Repeat daily.
One-size-fits-all thresholds
A $200 taxi ride is suspicious for a local office worker and completely normal for someone traveling to JFK during rush hour. Static thresholds can't account for the variance in legitimate spend patterns across roles, geographies, and business contexts. So they flag everything above the threshold and let humans sort it out.
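A static threshold is, quite literally, a one-line rule. The sketch below makes the context blindness concrete - the dollar amounts and the $100 cutoff are illustrative, not real policy values:

```python
# A static threshold rule, written out literally. It sees only the
# number, so a legitimate airport run and a padded local ride are
# indistinguishable to it. The $100 cutoff is an illustrative value.

def static_taxi_rule(amount: float, threshold: float = 100.0) -> bool:
    """Flag any taxi expense over the threshold, regardless of context."""
    return amount > threshold

jfk_rush_hour = 200.0  # legitimate trip to the airport
padded_local_ride = 200.0  # the abuse the rule was written to catch

print(static_taxi_rule(jfk_rush_hour))     # True - a false positive
print(static_taxi_rule(padded_local_ride)) # True - the flag we wanted
```

Both calls return the same answer because the rule has no input other than the amount - which is exactly why every traveler above the threshold ends up in the review queue.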
What a low false positive rate actually looks like
The goal isn't zero false positives. That would mean you're only catching the most obvious violations. The goal is a false positive rate low enough that your reviewers trust the system and investigate every exception thoroughly.
| Metric | 40% FP Rate (Typical) | Under 10% FP Rate (Behavioral AI) |
|---|---|---|
| Exceptions per week | 500 | 180 (fewer, higher-quality) |
| False positives per week | 200 | Under 18 |
| Hours wasted clearing noise | 40 hrs | Under 4 hrs |
| Reviewer trust in exceptions | Low (scan and dismiss) | High (investigate fully) |
| Real issues caught | Only the obvious ones | Behavioral + obvious |
| Team morale | Grinding through noise | Focused investigation |
The difference isn't just efficiency. It's the quality of the audit itself. When reviewers trust that a flagged exception is likely a real issue, they investigate it properly. They pull related transactions. They check vendor histories. They escalate when warranted. The audit team becomes an investigation unit instead of a processing queue.
The shift: Behavioral AI achieves lower false positive rates not by setting higher thresholds (which would miss real issues) but by adding context. It asks: is this amount unusual for this person, in this role, in this market, at this time of year? A $300 dinner flagged in isolation is noise. A $300 dinner from someone who normally submits $60 lunches, at a restaurant with no other company transactions, on a weekend - that's a signal worth investigating.
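A minimal sketch of the "unusual for this person" check described above, using a simple one-sided z-score against the submitter's own history. The function name, the history field, and the 3-sigma cutoff are illustrative assumptions, not a production scoring model - real behavioral systems combine many more signals (merchant history, role, geography, seasonality):

```python
# Illustrative "unusual for this person" check: compare an amount to the
# submitter's own spending baseline instead of a global threshold.
from statistics import mean, stdev

def is_behavioral_outlier(amount: float, history: list[float],
                          z_cutoff: float = 3.0) -> bool:
    """Flag only if the amount is far above this person's baseline."""
    if len(history) < 2:
        return False  # not enough data to establish a baseline
    mu, sigma = mean(history), stdev(history)
    if sigma == 0:
        return amount != mu  # perfectly uniform history: any change stands out
    return (amount - mu) / sigma > z_cutoff  # one-sided: high outliers only

# A $300 dinner from someone who normally submits ~$60 lunches:
lunches = [55, 60, 58, 62, 65, 59, 61]
print(is_behavioral_outlier(300, lunches))  # True - signal worth investigating
print(is_behavioral_outlier(68, lunches))   # False - within normal variance
```

The same $300 that a static rule would flag for everyone is only flagged here because it breaks *this person's* pattern - which is the shift from thresholds to context in miniature.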
The bottom line
False positives aren't a minor inconvenience. They're a systemic tax on your audit operation that degrades coverage quality, burns out experienced reviewers, and creates the exact conditions under which real fraud goes undetected.
The fix isn't more rules. It's smarter context. When every exception your team reviews has a high probability of being a genuine issue, the math changes entirely. And your auditors go from processing noise to doing actual audit work.