Question 1

Which labs are covered?

Accepted Answer

All major frontier labs that publish public model cards or benchmark results — Anthropic, OpenAI, Google DeepMind, Meta, xAI, Mistral, Alibaba, DeepSeek, plus any other lab releasing a model in the relevant period. Challenges name the specific lab and model up front.

Question 2

How are benchmark-score challenges resolved?

Accepted Answer

The challenge specifies the benchmark suite, the version, and the public scoreboard or paper that settles it. GutCall reads from that named source after the resolution deadline. Disagreements between labs' self-reported scores and independent reproductions are handled by the dispute process.

Question 3

What if a lab silently changes a model behind an API?

Accepted Answer

Challenges name a model version (e.g. "Claude Opus 4.7"). If a lab rebrands or quietly swaps the underlying model, the challenge resolves on the named version — verified through release notes or model cards. Ambiguous cases enter dispute and may void.

Question 4

Can I create my own AI-lab challenge?

Accepted Answer

Paid Creator and Pro plans unlock the authoring suite. The AI template asks you to specify the lab, the model, the benchmark or claim, and the public resolution source — keeping every authored challenge auditable.

Question 5

Is this a real betting market on AI outcomes?

Accepted Answer

No. GutCall coins are fictional and have no cash value, can't be cashed out, and can't be redeemed for prizes. The AI leaderboard is a prediction game for entertainment, not a betting market or an investment product.

AI lab leaderboard

How GutCall models the AI race

What you can predict in the AI category

Release dates

Benchmark scores

Capability claims

Pricing moves

Market leadership

Closed-loop, in-game coins

AI leaderboard FAQ

Which labs are covered?

How are benchmark-score challenges resolved?

What if a lab silently changes a model behind an API?

Can I create my own AI-lab challenge?

Is this a real betting market on AI outcomes?

Keep exploring

AI & ML category

Tech category

Bitcoin halving

Think you can read the lab race better than the room?