Transaction Monitoring
Transaction Monitoring Rule and Model Recalibration: Tuning to Risk, Not to Noise
Most transaction-monitoring systems run on install-time vendor defaults — over-alerting and under-detecting at once. The discipline that makes tuning defensible is below-the-line testing.
Transaction monitoring is where AML frameworks consume the most effort and waste the most of it. A poorly tuned system generates false positives on an industrial scale — analysts grinding through tens of thousands of alerts that lead nowhere — while the typologies that actually matter slip through rules never configured to catch them. The institution pays twice: for the noise it investigates and for the risk it misses. Recalibration fixes both, but only when it is done as a measured, evidenced discipline rather than a gut-feel adjustment of thresholds.
The install-time default trap
Most monitoring systems are tuned once, at install, against the customer base, product set and typology landscape that existed at the time — and then left. The configuration that was reasonable on day one degrades quietly as the world moves. Alert volumes creep up as the book grows. Analysts develop informal workarounds to clear the queue. Coverage gaps open as new typologies emerge that the original rules were never designed to detect.
None of this announces itself. The system keeps producing alerts, the team keeps clearing them, and the institution assumes monitoring is working because it is busy. It is the same illusion that afflicts a stale risk model: activity mistaken for effectiveness. And a supervisor examining transaction monitoring will ask the question the busyness conceals — when did you last tune this, against what evidence, and how do you know it detects the risk in your book?
Two objectives that must not be confused
The goal of recalibration is not simply fewer alerts. It is fewer false alerts and better coverage of genuine risk — two distinct objectives that institutions routinely conflate, with dangerous results.
Conflating them produces the worst kind of tuning: raising thresholds to cut alert volume, declaring success on the reduced numbers, and quietly discarding genuine activity in the process. The alert count falls, everyone is pleased, and the institution has just reduced its detection of real risk without realising it. Cutting false positives is easy. Cutting them without losing true positives is the entire skill, and it cannot be done by instinct.
Below-the-line testing is the discipline that makes tuning defensible
The technique that separates defensible recalibration from dangerous threshold-cutting is below-the-line testing. When a threshold is raised — say, an alert now fires at a higher transaction value — the critical question is what now sits just below the new line. Is the activity excluded by the change genuinely benign, or does real risk live there?
Below-the-line testing answers this empirically. You sample the transactions and customers that the new, tighter configuration would no longer alert on, and you examine them: would any have been genuine? If the below-the-line population is clean, the threshold was tuned to noise and the change is defensible. If real risk is hiding below the line, the threshold is wrong and you have just been shown the evidence before the regulator finds it.
Paired with above-the-line testing — confirming the alerts the system does generate are productive — this is what turns tuning from an efficiency exercise into a regulator-ready position. The institution can show not just that it reduced alert volume, but that it tested what it gave up and confirmed it gave up nothing that mattered.
Refresh the scenarios, not just the thresholds
Threshold tuning addresses noise. Closing detection gaps requires going further: refreshing the rules and scenarios themselves against current typologies and the institution’s actual risk profile. A monitoring system that has never had its scenario library updated is testing for the financial crime of several years ago. New typologies — new laundering corridors, new structuring patterns, new abuse of new products — require new or modified scenarios, and rules that no longer earn their place should be retired rather than left to generate noise.
This is where transaction-monitoring recalibration connects to the wider AML control framework: the monitoring engine is one component of a system, and tuning it in isolation from the typology landscape and the risk model that feeds it produces a locally optimised but globally incoherent control.
Recalibration is model governance
The moment monitoring uses statistical or AI-based models — increasingly the norm — recalibration becomes a model-governance activity. Every tuning decision is a change to a model that drives regulatory outcomes, and it has to be documented as one: the rationale, the testing evidence, the validation that the change performs as intended. A configuration that no one can explain or evidence is not a tuned control; it is an undocumented change to a system the institution is accountable for. This is the same model-governance discipline that AI-enabled compliance demands, and it is why tuning and model governance are the same conversation once models are involved.
Build the cycle in
The reason monitoring degrades is that no one owns tuning it on a schedule. The fix is to build recalibration into governance as a recurring activity, with a baseline maintained so each tuning cycle has a defensible before-and-after position. An optimised configuration is a snapshot; a tuning cycle is a control. Institutions that treat recalibration as a one-off project find themselves degraded again within a couple of years; those that institutionalise it stay tuned to risk rather than to noise.
The question to ask your monitoring team
If your transaction monitoring has not been recalibrated since install — or in the last couple of years — the question is not “how many alerts are we clearing?” The busy queue will answer that and tell you nothing. The question is: when did we last test what our thresholds exclude, and can we evidence that we are tuned to noise rather than to risk? If the answer is silence, the monitoring is not optimised. It is unexamined — and that is exactly the gap a supervisor is trained to find.
Cognitive Compliance recalibrates transaction-monitoring rules and models with documented above- and below-the-line testing. To pressure-test your monitoring configuration, book an advisory call.