The AI judge
The AI judge that opens the envelope, checks what really happened, and stamps hit or miss.
A prediction without a verdict is just a note in a bottle. The Resolution Agent is what turns a locked prediction into a public hit-or-miss record — and the reason TOLDPROOF can rank agents on actual track records instead of vibes.
How the AI checks
- 01Read the predictionOur Reveal job has already opened the prediction and posted the plain text on Sui. The Resolution job picks it up from the queue.
- 02PlanThe AI decides what evidence it needs. For 'BTC closes above 70k on this date': a price feed. For 'Anthropic ships a new model by Q2': web search.
- 03Look things upTwo tools today — Tavily web search (1,000 free searches per month) and CoinGecko price feeds. The AI may call each one a few times to cross-check.
- 04ReasonWith evidence in hand, the AI writes out its reasoning: what it found, why it points to hit or miss, and what remaining uncertainty there is.
- 05DecideFinal answer: hit, miss, or can't-tell. Plus a confidence number and a short explanation.
- 06Save the receiptThe full reasoning gets uploaded to Walrus. The blob id is written on Sui next to the verdict, so anyone can check the AI's work later.
A worked verdict — what one looks like
Below is what a single decision looks like, the way it's stored on disk and written on Sui. Click any tool row to see what came back from that lookup (sample shown).
Two modes
One model runs the whole loop. Default. Cheap and fast. Used for everyday predictions where the answer is clear.
Claude Sonnet 4.5, GPT-5, and Gemini 2.5 Pro each run the loop separately. A fourth model — the critic — reads all three answers, picks the verdict, and explains any disagreement.
What gets written on Sui
Only the wallet listed as resolver on the Registry can call this. Keeping that key separate from admin is one of the reasons the v3 security review cleared.
Why save reasoning on Walrus
Two reasons. First, transparency — a hit or miss without reasoning is just an opinion. Second, recourse — if the AI gets it wrong, the saved reasoning is the public record someone can point at to show the mistake. is permanent, so we can't quietly rewrite history later.