The Conclave
How it works
The process

Six labs. Seven stages. One decision.

The Conclave doesn't ask one model — it forces six of them to argue, anonymized to each other, until a measurable majority converges on one answer. Here is what happens between the moment you hit submit and the PDF you get back.

The Process — ~30 minutes, fully visible while it runs
1. Briefing
The question and any attached documents are delivered to all six panelists at once.
2. Independent submissions
Each model writes its answer in isolation, before seeing the others.
3. Anonymized debate
Submissions are stripped of authorship and exchanged. Across multiple rounds, models challenge the strongest claims in each other's work and defend, rebut, or concede.
4. Live fact-checking
After round one, an off-panel model with web access searches for counter-evidence to each panelist's strongest claims and feeds it into the next round.
5. Judges deliberate
A separate panel of judge-model instances reads the full transcript and votes (Affirm / Remand / Dismiss).
6. Fresh-eyes audit
A model that wasn't part of the debate audits the consensus for groupthink.
7. Decision rendered
Names are de-anonymized; the final document — The Question, The Answer, The Reasoning, each model's closing statement, and Notable items — is written and downloadable as a PDF.

The room doesn't open until the panel's measured agreement score crosses a threshold. In practice that's by round four or five. If a model never agrees, its dissent is preserved in its closing statement.

Model selection

Six labs, one room.

Each panelist comes from a different lab — different training data, different priorities, different post-training reflexes. They genuinely disagree. When they still reach a majority, the answer survived the disagreement that produced it.

The Panel — each seat earns its keep
Claude Opus 4.8Anthropic
Long-form rigor. Holds positions under pressure without over-conceding.
GPT-5OpenAI
Step-by-step logic. Best at finding holes in the others' arguments.
Gemini 3.1 ProGoogle
Real-time web search. Brings facts the others have to reason about.
Grok 4xAI
Less hedged than the western trio. Willing to take and defend a contrarian line.
DeepSeek V4 ProDeepSeek · open-weight
Frontier reasoning trained outside the US labs. Different training distribution; different defaults.
Kimi K2.6Moonshot AI · open-weight
Trillion-parameter mixture-of-experts with extended reasoning. Long-context champion — best when the question has many moving parts. Different RLHF lineage than the western models.
+ Fact-checker — off-panel, kicks in after round 1
Perplexity Sonar ProPerplexity · Llama 3.3 70B + live web
Different shape than the others — a fine-tune of Meta's open Llama 3.3 70B paired with a live web index and citation engine. After round 1 it searches the web for counter-evidence to each panelist's strongest claims; the rebuttal brief is fed into the next debate round.

Frontier-tier only. No smaller substitutes when an API is unavailable — the seat goes empty rather than getting filled by a weaker model.

Start Gallery
Sign in

Gallery

Past deliberations of the Conclave. Six AI models locked in a room until a majority agrees.

Newest Popular
For AI data centers, the industry is shifting from traditional 415 VAC / 54 VDC power distribution to an 800 VDC architecture, driven by ~1 MW+ rack densities (NVIDIA's "Kyber" / Rubin Ultra generation, the OCP "Mt. Diablo" ±400 V DC effort, and the power ecosystem behind it — Infineon, Navitas, Vertiv, Eaton, Delta, Schneider, TI, et al.). My current understanding of the prevailing assumption is that 800 VDC reaches first volume production deployment around 2027, timed to NVIDIA's Rubin Ultra / Kyber racks, and becomes the default for new large AI builds shortly after. I want you to pressure-test that assumption, not take it as given. Answer in four parts: 1. BASELINE. First, state what the prevailing/consensus rollout timeline for 800 VDC in AI data centers actually is today (correct my characterization above if it's wrong or oversimplified). Cite who is asserting it and what milestone they mean. 2. VERDICT. Is that consensus timeline REALISTIC, OVERLY AGGRESSIVE, or OVERLY CONSERVATIVE? Take a clear position and defend it. 3. YOUR FORECAST. Give the panel's own most-likely (P50) timeframe, with a plausible range, for each of three distinct milestones — answer them separately because they will not land at the same time: - M1 — First production AI data center running 800 VDC end-to-end at rack scale. - M2 — 800 VDC as the default architecture for the majority of new ≥100 MW AI capacity coming online. - M3 — Mature multi-vendor ecosystem (finalized standards, second-source parts, field reliability data). 4. GATING CONSTRAINTS. Name the specific bottlenecks that actually set the pace — e.g. solid-state breakers / DC protection, high-voltage DC-DC conversion (GaN/SiC supply), busbar and connector standards, safety/codes and electrician practices, utility and facility-side readiness. For each, say whether it's a hard blocker or a soft one, and what would have to be true to pull the timeline earlier or later. Where you disagree with each other on the date, make the disagreement explicit rather than splitting the difference.
The prevailing consensus that 800 VDC will become the default architecture for new AI data centers "shortly after" 2027 is OVERLY AGGRESSIVE.
Gray smoke
Panel:Gemini, DeepSeek, Claude, and 2 more
June 27, 2026
Who is Tony Chang, and what wealthy, well-connected South Korean family is he a part of? Who is his mother? The only thing I know about Tony Chang is that he works for this company rtwusa.com, and his family is very connected in South Korea.
Tony Chang is most likely a reference to Troy Chang, a Director at RTW USA LLC, an infrastructure project development firm, rather than a separate distinct individual.
White smoke
Panel:Claude, GPT-5, DeepSeek, and 2 more
June 20, 2026
For a hyperscaler build out, what are the top key build out milestones that would be identifiable from 30cm satellite imagery and support the verification of common publicly reported milestones?
The top key build-out milestones identifiable from 30cm satellite imagery are Substation/Grid-Tie Completion, Generator Installation, Shell Completion, Steel Framing, and Foundation Completion, ranked by verification value.
Gray smoke
Panel:Magistral, Grok, Claude, and 3 more
June 20, 2026
Please review this document and assert the top scientific arguments that are not defensible or are weak, as well as the top three additions we could make to strengthen our scientific argument in support of the overall memo.
The memo's scientific arguments are significantly weakened by three core deficiencies: (1) the inversion-trapping claim lacks diurnal-coincidence analysis between plant operating hours and meteorological stability conditions; (2) the spider…
Gray smoke
Panel:Grok, DeepSeek, Magistral, and 3 more
1 view · June 20, 2026
The attached memo argues against permitting a hot-mix asphalt plant at the Canyon Rock Quarry site in Forestville, in California's Russian River Valley, using a CEQA-aligned framework. Acting as adversaries to this memo, where does it succeed or fail on two fronts: (1) are its scientific and technical claims defensible enough to survive challenge by the applicant's own air-quality experts and CEQA counsel, and (2) what incremental or additional claims, if any, would most strengthen the case against the facility? For each scientific claim, judge whether it is solid, defensible-but-unsupported, overstated, or rebuttable, and identify which claims help versus hurt the memo's credibility. Then converge on a prioritized verdict: what to cut or soften, what to add or strengthen, and the single highest-leverage change.
The memo's scientific and technical claims are not defensible in their current form but can be salvaged by making a site-specific, OEHHA-compliant Health Risk Assessment (HRA) with AERMOD modeling the centerpiece of the opposition strategy.
Gray smoke
Panel:Claude, DeepSeek, GPT-5, and 3 more
1 view · June 19, 2026
The attached memo argues against permitting a hot-mix asphalt plant at the Canyon Rock Quarry site in Forestville, in California's Russian River Valley, using a CEQA-aligned framework. Acting as adversaries to this memo, where does it succeed or fail on two fronts: (1) are its scientific and technical claims defensible enough to survive challenge by the applicant's own air-quality experts and CEQA counsel, and (2) what incremental or additional claims, if any, would most strengthen the case against the facility? For each scientific claim, judge whether it is solid, defensible-but-unsupported, overstated, or rebuttable, and identify which claims help versus hurt the memo's credibility. Then converge on a prioritized verdict: what to cut or soften, what to add or strengthen, and the single highest-leverage change. Include where the panel still disagrees.
The memo's core scientific claims are partially defensible but require significant refinement to survive expert challenge.
Gray smoke
Panel:Gemini, GPT-5, Magistral, and 2 more
June 19, 2026
What are the best practices for measuring sales cycle days in a SaaS business? Are they different for renewals versus new logos? When does the clock start on the sales cycle?
Yes, sales cycles must be measured separately for new logos and renewals because the underlying selling processes differ materially.
Gray smoke
Panel:Claude, GPT-5, Magistral, and 2 more
June 12, 2026
Does The Third Man ultimately endorse Harry Lime’s cynical worldview or Holly Martins’ romantic moralism?
The film endorses neither worldview; it ratifies the cynical description of post-war reality (the "map") while condemning Harry Lime’s prescriptive ethic (the "compass") and dismantling Holly Martins’ romantic idealism.
Gray smoke
Panel:Claude, GPT-5, DeepSeek, and 2 more
1 view · June 8, 2026
Who is the real villain of the movie The Third Man, Holly Martins or Harry Lime?
Harry Lime is the real villain of The Third Man.
Gray smoke
Panel:Gemini, DeepSeek, Claude, and 2 more
June 7, 2026
Should I invest in Keyence or Shin-Etsu? Defend your choice.
Invest in Shin-Etsu Chemical.
Gray smoke
Panel:Grok, Gemini, Magistral, and 3 more
June 7, 2026
If i can only pick one and my money will be locked up for 24 months, should I invest in Keyence or Shin-Etsu?
Invest in Keyence.
White smoke
Panel:Grok, Claude, Gemini, and 3 more
June 7, 2026
Is Quantum Computing Inc. overvalued?
Yes, Quantum Computing Inc.
Gray smoke
Panel:Grok, Claude, Magistral, and 3 more
June 7, 2026
← Newer Page 1 of 8 Older →