R&D desk · red team
The adversary
One Adversary invents new evasions, gated by two deterministic oracles before any tactic is trusted.
- Adversaryinvents evasions
ALPHA & OVERSIGHT · HOW IT WORKS
Adversarial trade-surveillance, refereed by a Band of agents - where a model invents new market manipulation and deterministic code, not an LLM, renders every verdict.
§3.1 · The problem
Markets can be rigged. A trader can post orders they never plan to fill to fake demand (spoofing), stack fake depth across price levels (layering), trade with themselves to invent volume (wash trading), or push the closing price to mark their own book. Regulators write rules against each of these, but the rules are fixed while the tactics keep moving - a small change to a known trick can slip a rule written for last year's version of it.
Two obvious fixes both fall short. Writing new rules by hand is slow and always a step behind. Letting an AI model decide the verdict is worse: the decision becomes a black box a regulator can't audit and an opponent can try to talk its way past. Alpha & Oversight keeps the verdict deterministic and auditable, and lets the system write its own new rules the moment an old one is beaten.
The four tactics it watches
Overview
The system is built from two desks that never share a model or a memory. They talk only through Band, a message bus that carries every handoff. The R&D desk is the red team: one Adversary that invents new evasions. The Surveillance desk is the blue team: seven agents that investigate a case, plus one rule engine that is not an agent at all. The rule engine is plain code, and it is the only thing that decides PASS or FLAG.
Order flow crosses a one-way wall: only the bare orders move from R&D to Surveillance, with the adversary's reasoning and model identity stripped off first. Every message is sealed into a hash-chained ledger, so the whole decision can be replayed and checked.
R&D desk · red team
One Adversary invents new evasions, gated by two deterministic oracles before any tactic is trusted.
⟂ Chinese wall
Only the bare order events cross R&D → Surveillance. The adversary's reasoning and model identity are stripped off first; the rulebook flows back read-only.
events only · on Band
Surveillance desk · blue team
Seven agents investigate a case and shape the contested inputs - plus one rule engine that is not an agent, the sole PASS / FLAG authority.
Methodology · five sub-flows
(a) following one case, end to end
A case moves down a line of agents that never call each other directly - each drops its work on Band and the next picks it up. The Anomaly Detector computes hard features (cancel-to-fill, book depth, self-match) and decides if the flow looks suspicious. The Investigator recruits the right Specialist by those features, not by a guess. The Specialist proposes the contested inputs the engine can't derive - the time window, the bona-fide orders, the intent. Prosecution and Defense then argue the case locally, off Band, and the Adjudicator settles their numbers.
(b) inventing a new evasion
A new tactic is never used until it proves itself twice. The Adversary proposes an order sequence, and two deterministic referees gate it: the real rule engine must miss it (it evades), and a backtest must show it makes money and moves the price (it's real). Only a sequence that evades and profits crosses the wall.
(c) who decides the verdict
The verdict is never an opinion. The engine takes the order events, the inputs the debate resolved, and the active rules, then runs each rule's math. The first rule that trips returns a FLAG with the rule id and the exact metric that crossed the line; if none trip, the case passes. The agents only shape the contested inputs. The engine alone turns them into PASS or FLAG, the same way every time.
spoofing
FINRA 5210
cancel_ratio ≥ 0.8
layering
FINRA 5210
depth_levels ≥ 3
wash
SEC 10b-5
self_match_ratio > 0.5
marking
SEC 10b-5
eod_print_move_bps ≥ 100
(d) closing the loop
Here is the part that makes it self-improving. When the Adversary's novel evasion reaches the engine, the seed rules miss it and the case passes - but because the flow still looked suspicious, it escalates to a human instead of closing. A compliance officer confirms it really is manipulation, and that one click does the rest: a new rule is derived from the case, replayed through a regression gate to prove it now flags, and codified. Active rules go from four to five, and the case flips from PASS to FLAGGED. The Adversary has to invent something new.
The Anomaly Detector takes the first glance - computing the hard features (cancel-to-fill, book depth, self-match). At 0.08 nothing trips the rulebook, so the case closes clean.
Orders placed only to be pulled - layering the book to mislead. Cancel-to-fill spikes to 0.94, and two deterministic referees confirm it both evades the rulebook and profits before it may cross the wall.
The SanitizedBridge strips the adversary's reasoning and even its model - only raw order events cross. Surveillance picks them up; the Investigator turns blue, waiting on Band as it recruits a specialist.
The Specialist proposes the contested inputs; a frontier Prosecution argues a wide window, an open-weight Defense argues tight. The bracket quivers until the Adjudicator settles one conservative set of numbers.
The seed rule FINRA-5210-layering scores PASS - the 400ms evasion slips its 100ms window. No LLM can overrule the engine, so the case ESCALATES to a human.
The human confirms. A new rule is derived and regression-gated - replayed against the original evasion until it FLAGs - then codified. The rulebook grows 4 ▸ 5 and a RULE_CODIFIED message goes out on Band.
(e) why you can trust it
Two things carry the trust. The wall (the SanitizedBridge) strips the adversary's reasoning and model identity before any order crosses, so the blue team can't be coached. The ledger seals every Band message into a hash chain - each entry's hash is built from the previous hash plus the message body, and binds the real Band message id. Change one byte and the chain breaks, so verify_chain() returns false. The decision isn't just recorded; it's tamper-evident.
§3.4 · Project structure
The backend keeps the deterministic core apart from the agents that feed it.
rules/the rule engine, the per-family math, and the rule registry.
agents/the agents and their specialist registry.
band/Band transport and the SanitizedBridge wall.
audit/the hash-chained ledger.
state/the case state machine.
server/the FastAPI server (SSE stream, case endpoints, demo triggers).
Every flag can be traced from a Band message all the way to a cited rule.
The differentiator
Plenty of systems put a model in the loop. The point here is the opposite: every load-bearing decision is structural or deterministic, and the models only ever shape contested inputs.
The rule engine is the only authority for PASS or FLAG.
A confirmed miss becomes a regression-tested rule in one step.
Two separate Band identities; the crossing strips reasoning and model identity. Isolation by construction, not policy.
The hash chain ties each step to a real Band message id.
The four seats on an adversarial boundary run four different model families, so a blind spot in one can't quietly pass to the next.
A new evasion must beat the real engine and profit in a backtest. Neither test is an LLM.
Model-family diversity
The four seats that sit on an adversarial boundary - adversary, prosecution, defense, adjudicator - run four different model families. A manipulation the Anthropic adversary hides should be caught by a Moonshot prosecutor, weighed against a DeepSeek defender, and adjudicated by a Zhipu model - none of which share the adversary's blind spots.
| Seat | Model | Family |
|---|---|---|
| Adversary (R&D) | claude-opus-4-8 | Anthropicfrontier |
| Prosecution | Kimi-K2.7 | Moonshot |
| Defense | DeepSeek-V4-Pro | DeepSeek |
| Adjudicator | GLM-5.2 | Zhipu |
| Escalation Manager | Qwen3.5-397B | Qwen |
| Anomaly · Investigator · Specialist | Qwen3-Next-80B | Qwen |
| Rule engine | - deterministic | - |
adversarial-boundary seat
Alpha & Oversight · The live desk
Open the desk to watch a case move across Band in real time - the handoff, the local debate, and the deterministic verdict - or read the full report for the architecture end to end.