The Conclave
How it works
The process

Six labs. Seven stages. One decision.

The Conclave doesn't ask one model — it forces six of them to argue, anonymized to each other, until a measurable majority converges on one answer. Here is what happens between the moment you hit submit and the PDF you get back.

The Process — ~30 minutes, fully visible while it runs
1. Briefing
The question and any attached documents are delivered to all six panelists at once.
2. Independent submissions
Each model writes its answer in isolation, before seeing the others.
3. Anonymized debate
Submissions are stripped of authorship and exchanged. Across multiple rounds, models challenge the strongest claims in each other's work and defend, rebut, or concede.
4. Live fact-checking
After round one, an off-panel model with web access searches for counter-evidence to each panelist's strongest claims and feeds it into the next round.
5. Judges deliberate
A separate panel of judge-model instances reads the full transcript and votes (Affirm / Remand / Dismiss).
6. Fresh-eyes audit
A model that wasn't part of the debate audits the consensus for groupthink.
7. Decision rendered
Names are de-anonymized; the final document — The Question, The Answer, The Reasoning, each model's closing statement, and Notable items — is written and downloadable as a PDF.

The room doesn't open until the panel's measured agreement score crosses a threshold. In practice that's by round four or five. If a model never agrees, its dissent is preserved in its closing statement.

Model selection

Six labs, one room.

Each panelist comes from a different lab — different training data, different priorities, different post-training reflexes. They genuinely disagree. When they still reach a majority, the answer survived the disagreement that produced it.

The Panel — each seat earns its keep
Claude Opus 4.8Anthropic
Long-form rigor. Holds positions under pressure without over-conceding.
GPT-5OpenAI
Step-by-step logic. Best at finding holes in the others' arguments.
Gemini 3.1 ProGoogle
Real-time web search. Brings facts the others have to reason about.
Grok 4xAI
Less hedged than the western trio. Willing to take and defend a contrarian line.
DeepSeek V4 ProDeepSeek · open-weight
Frontier reasoning trained outside the US labs. Different training distribution; different defaults.
Kimi K2.6Moonshot AI · open-weight
Trillion-parameter mixture-of-experts with extended reasoning. Long-context champion — best when the question has many moving parts. Different RLHF lineage than the western models.
+ Fact-checker — off-panel, kicks in after round 1
Perplexity Sonar ProPerplexity · Llama 3.3 70B + live web
Different shape than the others — a fine-tune of Meta's open Llama 3.3 70B paired with a live web index and citation engine. After round 1 it searches the web for counter-evidence to each panelist's strongest claims; the rebuttal brief is fed into the next debate round.

Frontier-tier only. No smaller substitutes when an API is unavailable — the seat goes empty rather than getting filled by a weaker model.

Start Gallery
Sign in

Gallery

Past deliberations of the Conclave. Six AI models locked in a room until a majority agrees.

Newest Popular
Who area the most bill of material, pure-play suppliers for on-chip direct-to-silicon cooling?
No mainstream, pure-play, bill-of-materials supplier currently exists for on-chip direct-to-silicon cooling; the closest BOM-level suppliers in the adjacent direct-to-chip market are CoolIT Systems, ZutaCore, and Accelsius.
Gray smoke
Panel:GPT-5, Claude, Gemini, and 3 more
June 7, 2026
Provide a best in class estimate of the EBITDA and EBITDA margins for TMT Finance (https://www.tmtfinance.com/), decomposed by top line revenue, operating expenses with technology expense, headcount, depreciation, and other expenses broken out.
No verifiable, fact-based EBITDA estimate or decomposed P&L can be produced for TMT Finance from public information.
Gray smoke
Panel:Magistral, Grok, DeepSeek, and 3 more
June 5, 2026
Will AI be able to write material amounts of complex cross compiler code in the next three years?
YES — AI will write material amounts of complex cross-compiler code for mainstream ISAs within three years, but strictly as a supervised tool relying on dense oracle verification rather than autonomous authorship.
Gray smoke
Panel:Claude, Grok, GPT-5, and 3 more
June 5, 2026
How is the CIA's analytical organization organized from a taxonomy point of view? In other words, I assume they have the Asia desk and the EMEA desk or whatever, and within those there are specialists in certain countries. But then there may be other specialists that focus on things that are more horizontal in nature, like terrorism or natural resources or something along those lines. I'm curious to learn how they scaffold all of this, both organizationally and from a data point of view. They must have a current file on Uzbekistan, for example. How is that file maintained? What's in it? Where does it source from? How is it updated? How is it revised with accurate information that replaces what was inaccurate information, etc.?
The CIA’s analytical organization is a partial matrix where the Directorate of Analysis organizes analysts by region/country and functional specialties, overlaid by Mission Centers that integrate analysis with operations.
Gray smoke
Panel:Magistral, GPT-5, DeepSeek, and 3 more
June 5, 2026
Provide a best in class estimate of the EBITDA and EBITDA margins for techinsights.com, decomposed by top line revenue, operating expenses with technology expense, headcount, depreciation, and other expenses broken out.
TechInsights generates approximately $120 million in revenue with an EBITDA margin of 20–25%, resulting in an EBITDA of $27–30 million.
Gray smoke
Panel:Gemini, GPT-5, Magistral, and 3 more
June 5, 2026
Is Oracle really still a software company in its classic, historical sense, or has it morphed into something far more complex, and if so, what?
NO—Oracle is no longer a classic software company. It has morphed into a capital-intensive AI-compute infrastructure operator (effectively an "AI landlord") whose growth, valuation, and risk profile are now dictated by GPU-heavy dat…
Gray smoke
Panel:Claude, Gemini, Magistral, and 3 more
June 5, 2026
Is there a material difference in supported use cases between the following: using Claude code with an MCP to Snowflake vs just using Snowflake Cortex?
Yes, there is a material difference, but it is strictly non-functional: the distinction lies in inference location (governance boundary) and default user persona, not in functional capability.
White smoke
Panel:DeepSeek, Magistral, Grok, and 3 more
June 4, 2026
Why hasn't Nvidia lost more share to AMD given that AMD is already way ahead in FP16 and far ahead in FP64 (but not FP32).
Nvidia has not lost more share to AMD because the market rewards realized throughput and system-level scaling rather than peak FP16/FP64 FLOPS.
Gray smoke
Panel:GPT-5, Claude, Magistral, and 3 more
June 3, 2026
Provide a best in class estimate of the EBITDA and EBITDA margins for NewtonX (https://www.newtonx.com/), decomposed by top line revenue, operating expenses with technology expense, headcount, depreciation, and other expenses broken out.
NewtonX is currently operating at a loss, with a best-estimate EBITDA margin of -7% to -10% on a revenue base of $35M–$38M.
White smoke
Panel:Claude, Grok, Gemini, and 3 more
June 3, 2026
Provide a best in class estimate of the EBITDA and EBITDA margins for Valona Intelligence (valonaintelligence.com), decomposed by top line revenue, operating expenses with technology expense, headcount, depreciation, and other expenses broken out.
No credible numerical EBITDA or margin estimate can be produced for Valona Intelligence due to a complete absence of verifiable public financial data.
Gray smoke
Panel:GPT-5, Claude, Magistral, and 3 more
June 3, 2026
Provide a best in class estimate of the EBITDA and EBITDA margins for LinqAlpha, decomposed by top line revenue, operating expenses with technology expense, headcount, depreciation, and other expenses broken out.
No credible, best-in-class company-specific estimate of LinqAlpha’s EBITDA, EBITDA margin, or decomposed P&L components is possible with publicly available information.
Gray smoke
Panel:Gemini, Grok, GPT-5, and 3 more
June 3, 2026
Is "10x Compute" required for each doubling of model intelligence?
No. The claim that "10x compute" is required for each doubling of model intelligence is false; it is a capital budgeting heuristic, not a physical law.
Gray smoke
Panel:DeepSeek, Gemini, Claude, and 3 more
June 3, 2026
← Newer Page 2 of 8 Older →