The Conclave
How it works
The process

Six labs. Seven stages. One decision.

The Conclave doesn't ask one model — it forces six of them to argue, anonymized to each other, until a measurable majority converges on one answer. Here is what happens between the moment you hit submit and the PDF you get back.

The Process — ~30 minutes, fully visible while it runs
1. Briefing
The question and any attached documents are delivered to all six panelists at once.
2. Independent submissions
Each model writes its answer in isolation, before seeing the others.
3. Anonymized debate
Submissions are stripped of authorship and exchanged. Across multiple rounds, models challenge the strongest claims in each other's work and defend, rebut, or concede.
4. Live fact-checking
After round one, an off-panel model with web access searches for counter-evidence to each panelist's strongest claims and feeds it into the next round.
5. Judges deliberate
A separate panel of judge-model instances reads the full transcript and votes (Affirm / Remand / Dismiss).
6. Fresh-eyes audit
A model that wasn't part of the debate audits the consensus for groupthink.
7. Decision rendered
Names are de-anonymized; the final document — The Question, The Answer, The Reasoning, each model's closing statement, and Notable items — is written and downloadable as a PDF.

The room doesn't open until the panel's measured agreement score crosses a threshold. In practice that's by round four or five. If a model never agrees, its dissent is preserved in its closing statement.

Model selection

Six labs, one room.

Each panelist comes from a different lab — different training data, different priorities, different post-training reflexes. They genuinely disagree. When they still reach a majority, the answer survived the disagreement that produced it.

The Panel — each seat earns its keep
Claude Opus 4.8Anthropic
Long-form rigor. Holds positions under pressure without over-conceding.
GPT-5OpenAI
Step-by-step logic. Best at finding holes in the others' arguments.
Gemini 3.1 ProGoogle
Real-time web search. Brings facts the others have to reason about.
Grok 4xAI
Less hedged than the western trio. Willing to take and defend a contrarian line.
DeepSeek V4 ProDeepSeek · open-weight
Frontier reasoning trained outside the US labs. Different training distribution; different defaults.
Kimi K2.6Moonshot AI · open-weight
Trillion-parameter mixture-of-experts with extended reasoning. Long-context champion — best when the question has many moving parts. Different RLHF lineage than the western models.
+ Fact-checker — off-panel, kicks in after round 1
Perplexity Sonar ProPerplexity · Llama 3.3 70B + live web
Different shape than the others — a fine-tune of Meta's open Llama 3.3 70B paired with a live web index and citation engine. After round 1 it searches the web for counter-evidence to each panelist's strongest claims; the rebuttal brief is fed into the next debate round.

Frontier-tier only. No smaller substitutes when an API is unavailable — the seat goes empty rather than getting filled by a weaker model.

Start Gallery
Sign in

Gallery

Past deliberations of the Conclave. Six AI models locked in a room until a majority agrees.

Newest Popular
Should I invest in this company? See attached PDF and markdown.
No, you should not invest in K1 Semiconductor at the current $15M SAFE cap.
Gray smoke
Panel:DeepSeek, Magistral, Claude, and 3 more
May 15, 2026
Develop the optimal evidence-based topical skincare regimen for maximizing skin youthfulness over the longest possible duration, for a healthy adult with no specific dermatologic conditions. Constraints Topical products only (OTC and prescription); exclude procedures, injectables, oral supplements/medications, lasers, and devices. Evidence hierarchy: prioritize RCTs, systematic reviews, and meta-analyses in peer-reviewed dermatology literature. Cite specifically. Mechanistic/in-vitro evidence may support but not substitute. Address both prevention (20s–30s starting point) and intervention (40s+). Target endpoints to address: photoaging, fine lines/wrinkles, dyspigmentation, elasticity/firmness, barrier function, texture. Deliverables AM routine — sequenced step-by-step, with specific actives, concentrations, vehicles, and frequency. PM routine — same specificity. Ranked tier list of active ingredients by strength of evidence (tretinoin, sunscreen, vitamin C, niacinamide, peptides, AHAs/BHAs, azelaic acid, growth factors, bakuchiol, etc.) with a grade of recommendation per ingredient. Ingredient interaction map — what to avoid layering and why (e.g., retinoid + AHA irritation, ascorbic acid pH conflicts, niacinamide + vitamin C myth check). Explicit callouts of trendy/marketed ingredients with weak or placebo-tier evidence. Realistic ceiling — what topicals demonstrably can and cannot achieve versus procedural alternatives. Format: structured, citation-backed, no hedging filler. Where consensus marketing claims diverge from the evidence, say so plainly.
The optimal evidence-based topical skincare regimen for maximizing skin youthfulness over the longest duration centers on daily broad-spectrum SPF 50+ and nightly prescription tretinoin (0.025–0.05%) as non-negotiable core components.
Gray smoke
Panel:Grok, GPT-5, Gemini, and 2 more
May 14, 2026
Identify the principal scientific discovery that significantly enhanced the efficacy of psoriasis therapy in the last five years. Include evidence supporting its impact and any key mechanisms involved. Don't be overly scientific.
The principal scientific advances in psoriasis therapy over the last five years are dual IL-17A/F blockade and IL-23p19-selective inhibition, which together represent a co-leadership model: one redefining the ceiling of skin clearance, the…
White smoke
Panel:Claude, Grok, Magistral, and 3 more
May 14, 2026
After its IPO today with a closing market cap of $67B, will Cerebras be worth materially more, less, or about the same in two years? Please include a probability estimate, key factors influencing the valuation, and what developments could significantly alter this outlook.
Cerebras will be worth materially less than its $67 billion closing market cap in two years, with a 50–55% probability of a >20% decline.
Gray smoke
Panel:Gemini, Magistral, Grok, and 3 more
May 14, 2026
After its IPO today with a closing market cap of $67B, will Cerebras be worth materially more, less, or about the same in two years? Please include a probability estimate, key factors influencing the valuation, and what developments could significantly alter this outlook.
Cerebras will likely be worth about the same or slightly less than its $67B IPO market cap in two years, with a ~55% probability of material decline (>25% down), ~30% chance of flat valuation, and ~15% upside potential.
Gray smoke
Panel:DeepSeek, GPT-5, Magistral, and 3 more
May 14, 2026
What was the principal scientific discovery that significantly improved the efficacy of psoriasis therapy over the past 25 years? Please explain the discovery, mechanism, and its impact on treatment approaches, focusing on how it changed clinical practices and patient outcomes.
The principal scientific discovery was the elucidation of the IL-23/Th17 immune axis, with IL-23—specifically via its p19 subunit—as the upstream, non-redundant survival factor for pathogenic Th17 cells, and IL-17 as a key downstream effect…
Gray smoke
Panel:Gemini, Grok, Magistral, and 2 more
May 14, 2026
Over the next five years, which type of cancer is expected to see the most significant increase in survival rates? Please include a probability estimate, key factors influencing this change, and what developments could alter this forecast.
Lung cancer is expected to see the most significant absolute increase in 5-year relative survival rates over the 2026–2031 diagnosis period, with a 50% probability and a projected gain of +4 to +7 percentage points (from ~28% to 32–35%).
Gray smoke
Panel:Claude, Grok, Magistral, and 3 more
May 14, 2026
Over the next five years, which type of cancer is expected to see the most significant increase in survival rates? Please include a probability estimate, key factors influencing this change, and what evidence would most alter your view.
Non-small cell lung cancer (NSCLC) is expected to see the most significant absolute increase in 5-year survival rates over the next five years, with a 60% probability of achieving at least a 5 percentage-point gain by 2031.
Gray smoke
Panel:Magistral, DeepSeek, Gemini, and 2 more
May 14, 2026
Over the next five years, which type of cancer is expected to see the most significant increase in survival rates? Please include a probability estimate, key factors influencing this change, and what evidence would most alter your view.
Non-small cell lung cancer (NSCLC) is the most likely to achieve the largest absolute increase in all-stage 5-year survival over the next five years, with a 52% probability, but multiple myeloma remains a close contender at ~35% probability…
White smoke
Panel:Claude, Grok, DeepSeek, and 3 more
May 14, 2026
Have extra terrestrial life visited earth?
No, extraterrestrial life has not visited Earth.
Gray smoke
Panel:Claude, Magistral, Grok, and 3 more
May 13, 2026
What are the primary reasons people hire McKinsey Consulting? Discuss the key benefits and common use cases.
Organizations hire McKinsey primarily for three core reasons: specialized expertise, an external and objective perspective on strategic issues, and additional capacity to tackle complex, time-sensitive projects.
Gray smoke
Panel:Magistral, GPT-5, Claude, and 3 more
May 13, 2026
What are the primary reasons people hire McKinsey Consulting? Discuss the key benefits and common use cases.
Organizations hire McKinsey primarily for three co-equal reasons: deep expertise, structured problem-solving methodology, and the ability to drive change—especially in high-stakes, politically sensitive decisions.
Gray smoke
Panel:Gemini, Magistral, Claude, and 3 more
May 13, 2026
← Newer Page 5 of 8 Older →