Skip to content
Hominis Agentic OS — early access program now openJoin the waitlist
RealAI
InsightsFinance

Real-Time Fraud at Scale: From Transaction Stream to Sub-Second AML Detection

RealAIJul 20, 20249 min read
FinanceRisk & Compliance
Real-time AMLflaggedreal-time stream →Real-time AML

Your fraud team opens the morning batch report. By the time they flag the suspicious wire, the money is gone. By the time they see the pattern — a string of small transactions coded as unrelated vendors across several accounts — it has already cleared settlement. By the time compliance surfaces the layering typology, you are writing a suspicious-activity report to the regulator. Damage control is the name of the game.

That is how batch fraud detection works. And it is no longer how banking works. Transactions clear in moments. Settlement happens across time zones on schedules your operations team did not design. The only lever you have left is real-time.

Detection tradeoff: fraud caught vs the false-positive/alert rate. The AI frontier catches84% at a 8.0% alert rate. Because the AI curve bows toward the top-left, at the same alert budget it catches about 1.5× what the legacy rules catch — moving the whole frontier, not just the threshold. caught.
Exhibit 1AI moves the whole curve, not the threshold.Fraud caught vs the false-positive / alert rate. The AI frontier bows past the legacy one, catching far more per alert at the same analyst budget. Drag the operating point; toggle the model.

The Real-Time Shift: Detection Before Settlement

Fraud detection used to be a nightly batch job. Real-time detection inverts that timeline: the moment a transaction enters the stream, it is scored. If it is anomalous — the amount is outside the account's normal range, the merchant category is new, the velocity is unusually high — the system flags it before it settles. If it matches a known laundering typology — structuring, round-tripping, benign-seeming layering that adds up to evasion — it surfaces instantly, not in a next-day batch report.

The operational shift is profound. Your fraud team stops chasing yesterday's money and starts preventing today's. Your compliance team stops reporting losses and starts reporting blocks. Your risk officers stop defending why you did not see it and start managing a live queue of real-time flags.

The catch: the system has to score transactions at production volume without slowing settlement, and it has to be tuned finely enough that your operations team trusts the alert. Sub-second scoring is the bar, because the alert has to land before the transaction clears.

Streaming Anomaly Detection at Payment Scale

Real-time fraud means running models on a continuous transaction feed. Classical rule-based systems catch low-hanging fruit but miss novel patterns — attacks you have not coded because you have not seen them before.

Streaming machine-learning models catch those patterns. An ensemble learns the normal shape of an account's transaction behavior — amounts, merchant categories, day-of-week seasonality — and flags significant deviations. A gradient-boosted model reads the transaction itself and assigns an anomaly score. A third model trained on known laundering cases reads sequences and flags typologies: structured deposits that avoid reporting thresholds, round-tripping with no business purpose, layering that obscures the source through a chain of transfers. Rules and models co-exist: rules catch what you know, models catch what you do not.

The critical tuning problem: your fraud team has a false-positive tolerance. If the system floods them with alerts where only a fraction are fraud, your team will burn investigation hours on false alarms for every real case. The system has to flag real fraud at a high rate while keeping false positives within a band your team will actually act on. A model that is right 80% of the time but ignored is worse than a lower-sensitivity model your team trusts.

This is where fairness matters. A model trained on historical fraud data can learn spurious correlations — certain merchant categories or geographies are overrepresented in your fraud set, and the model learns to flag them more aggressively. Over time, legitimate transactions from those categories are flagged more often, and you have quietly built bias into your detection at scale. The discipline is to hold fraud-detection sensitivity high while keeping false-positive rates stable across merchant category, geography, and account type. RealAI achieved this on the underwriting side — widening approvals for underserved applicants by 18% without loosening portfolio quality — by engineering bias out during training, not auditing after the fact.

Process flow · hover a step to trace it
Real-time fraud and AML detection flow

AML Typologies: From Sequence to Signal

Anti-money laundering is not about catching fraud. It is about catching intent to evade regulatory oversight. The transactions themselves may look legitimate — a wire, a deposit, a series of transfers. What makes them suspicious is the pattern. The structure avoids reporting thresholds. The round-tripping shows no real business purpose. The layering adds steps with no plausible explanation.

Rules can catch some of these. You can code a rule that flags deposits that add up to just under a threshold across multiple accounts. But you cannot code all of them. Criminals are creative, and new typologies emerge faster than compliance teams can write rules.

Sequence models trained on known laundering cases read the pattern across time. A model of the relationships between accounts, merchants, and transaction amounts surfaces chains of movement without obvious economic purpose. The critical piece is that these models have to be explainable. Your compliance team has to audit why a transaction was flagged. "The model says so" is not something an ECB or EBA examiner will accept. "The sequence of transactions matches a known layering typology because of these three transfers, with these amounts and counterparties" — that is something they can interrogate.

The highest-value models pair gradient-boosted backbones with SHAP explanations. Every flag carries a per-decision reason code: the transactions that drove it, the typology it matched, and the confidence score. A compliance officer can read the explanation and decide — is this real suspicious activity, or is it a legitimate business pattern the model has not seen before? That per-decision SHAP reason code is the same adverse-action and model-governance evidence regulators ask for on the credit side, decision by decision.

Sub-second
Transaction scoring before settlement
Real-time
Detection on the transaction stream
30%
Better risk prediction (Gini)
ECB / EBA
Explainability mandate met per flag

Integration: Wired Into the Banking Stack

Real-time fraud detection only matters if the alert reaches the right system at the right moment. That means native integration with your core banking systems — not a tool beside them, not a dashboard you check after the fact. When a transaction is scored, the alert has to land where it can be acted on, inside the transaction's own path to settlement.

Many real-time fraud systems fail here. They are built as standalone products — capable models, polished dashboards, strong proof-of-concept numbers. But they are not integrated with the systems that actually move money. By the time an analyst sees the alert, the transaction has already cleared.

The systems that actually prevent fraud are wired in. The transaction is scored in-band. The model returns a score. The clearing path decides whether to allow, hold, or block. The compliance decision happens as part of the transaction itself, not after it. This is the difference between detection and prevention.

In banking, the difference between tomorrow's alert and today's prevention is the difference between a reported loss and an incident that never happened.

False-Positive Discipline and Team Trust

The relationship between a fraud team and their detection system is fundamental. If the team trusts the alerts, they act. If they do not, they ignore them — and a model nobody trusts becomes noise.

The highest-performing real-time fraud systems hold false-positive rates low while catching the fraud that mattered. The bar is that every transaction you flag, your team investigates. The system has to earn that cost by being right often enough that the queue stays worth working.

You get there through discipline: tuning on real data, testing against held-out fraud cases, measuring false-positive rates by merchant category and geography, and retraining when baselines shift. Human-in-the-loop is critical — having your analysts score whether each flag was a true positive or a false alarm, so the model learns the subtle differences between legitimate and suspicious activity in your portfolio, not a generic one.

Where to Start

The assessment phase starts with your transaction data and your current fraud losses. Map where fraud is happening: what merchant categories, what account types, what geographies have the highest loss rates? What is your current system's true-positive rate, and where is your false-positive rate high enough that your team is ignoring alerts? Audit your integration points: which systems touch the transaction before it settles, and where could a model sit to be called in real-time?

The output is a ranked list of fraud types by current loss, a map of integration points where real-time detection could actually stop fraud, and a benchmark of your fraud team's alert tolerance. This is the same model-risk, data-lineage and fairness audit RealAI runs against ECB/EBA expectations — protected-attribute proxies mapped, demographic-parity gaps benchmarked, governance gaps named before a line of production code ships.

Transform co-builds the highest-value model — usually a high-frequency fraud type in a major merchant category — with explainability, fairness constraints, and false-positive tuning baked in from day one. It integrates that model into the clearing path, measures the block rate and false-positive rate in production, and hardens alert routing so your fraud team sees only flags they can actually act on, with SHAP reason codes attached. Each release ships with model cards, adverse-action logic and human-in-the-loop overrides so it clears model validation and second-line review.

Sustain is where most real-time fraud systems decay. Transaction patterns shift. Merchants change categories. Customers' behavior evolves. New laundering typologies emerge. The model runs under a live dashboard tracking performance, fairness and population drift, with scheduled retraining and incident response — so when a portfolio shifts or an examiner asks, the fairness audit trail, performance history and reason-code evidence are already there.

The model-risk and fairness assessment runs in 4–6 weeks, producing the ranked roadmap and the governance gaps to close first. From there, transform co-builds and integrates the first model into the clearing path at the sub-second speeds production scoring demands, and sustain holds it accountable between exam cycles. The payoff is measured in fraud prevented, not fraud caught the next day. When money stops moving in the wrong direction, the audit trail and the regulatory report both improve — and the decision behind every flag is one a regulator can sign off on.

In banking, the difference between tomorrow's alert and today's prevention is the difference between a reported loss and an incident that never happened.

Get in touch

Put RealAI’s applied-AI team on your hardest data problem.

We help enterprises move from pilots to production — sovereign models, governed data, and agents you can audit. Start with a value-first assessment.

Next step

Ready to make AI real?