Best AI Model This Month? Prediction Market Odds

LIVE
Updated 9 minutes ago Β· 2:41 PM PST

Anthropic is a near certain win for the best AI model prediction market in February, breaking a six-month streak of Google victories on both Kalshi and Polymarket. Claude has held the #1 spot on the Arena leaderboard since February 6, and with one day left before resolution, both platforms are pricing Anthropic 98.7%. Total volume combined from both platforms $585.0K. The March market is where things get interesting. Anthropic opened the month as the favorite but without the 90%+ dominance Google held in prior months.Our cross-platform odds for this month’s β€œBest AI Model by Company” market aggregates live probability blend from Kalshi and Polymarket plus arbitrage flags and venue-by-venue comparisons.

Largest Spread
0.50%
Google
Consensus Leader
98.7%
Anthropic +12.2%
24H Volume (Share)
$585.0K
K: 4.4% P: 95.6%
Momentum Leader
+12.2%
Anthropic 7D change
Period:
Platform:

Probability Over Time

Hover for details Β· Cursor-synced tooltips

Tap "Chart settings" to adjust the chart.

Chart settings
Outcome:
Aggregated (VWAP)
Volume
Kalshi
Polymarket
Cross-Venue Spread

Current Odds Snapshot

Current probabilities across platforms with liquidity indicators

Sort:
AN
Anthropic
Vol $164.1K Spread 0.4%
Agg 98.7%
↑ +12.2%
K 99.0%
P 98.6%
Kalshi 99.0%
Vol $25.4K 98–100Β’
Polymarket 98.6%
Vol $138.7K 98.4–98.8Β’
GO
Google
Vol $133.3K Spread 0.5%
Agg 1.0%
↓ -8.5%
P 1.0%
K 0.5%
Polymarket 1.0%
Vol $133.1K 0.8–1.2Β’
Kalshi 0.5%
Vol $178 0–1Β’
BY
ByteDance
Vol $0 Spread β€”
Agg 0.5%
β€” +0.0%
K 0.5%
Kalshi 0.5%
Vol $0 0–1Β’
ME
Meta
Vol $0 Spread β€”
Agg 0.5%
β€” +0.0%
K 0.5%
Kalshi 0.5%
Vol $0 0–1Β’
OP
OpenAI
Vol $122.6K Spread 0.3%
Agg 0.3%
↓ -1.0%
K 0.5%
P 0.3%
Kalshi 0.5%
Vol $235 0–1Β’
Polymarket 0.3%
Vol $122.4K 0.2–0.3Β’
XA
Xai
Vol $88.5K Spread 0.4%
Agg 0.2%
↓ -1.6%
K 0.5%
P 0.2%
Kalshi 0.5%
Vol $56 0–1Β’
Polymarket 0.2%
Vol $88.5K 0.1–0.2Β’
AL
Alibaba
Vol $12 Spread 0.5%
Agg 0.1%
β€” +0.0%
K 0.5%
P 0.1%
Kalshi 0.5%
Vol $0 0–1Β’
Polymarket 0.1%
Vol $12 0.1Β’
DE
DeepSeek
Vol $52.0K Spread β€”
Agg 0.1%
↓ -0.5%
P 0.1%
Polymarket 0.1%
Vol $52.0K 0.1Β’
MO
Moonshot
Vol $9.2K Spread β€”
Agg 0.1%
β€” +0.0%
P 0.1%
Polymarket 0.1%
Vol $9.2K 0.1Β’
BA
Baidu
Vol $365 Spread β€”
Agg 0.1%
β€” +0.0%
P 0.1%
Polymarket 0.1%
Vol $365 0.1Β’
ZA
Z.ai
Vol $5.0K Spread β€”
Agg 0.1%
β€” +0.0%
P 0.1%
Polymarket 0.1%
Vol $5.0K 0.1Β’
ME
Meituan
Vol $5.0K Spread β€”
Agg 0.1%
β€” +0.0%
P 0.1%
Polymarket 0.1%
Vol $5.0K 0.1Β’
MI
Mistral
Vol $5.0K Spread β€”
Agg 0.1%
β€” +0.0%
P 0.1%
Polymarket 0.1%
Vol $5.0K 0.1Β’
OutcomeAggregatedSpreadVolumeKalshiPolymarket
AN
Anthropic
98.7%
↑ +12.2%
0.4%
$164.1K
Kalshi 99.0%
98–100Β’ Vol $25.4K
Polymarket 98.6%
98.4–98.8Β’ Vol $138.7K
GO
Google
1.0%
↓ -8.5%
0.5%
$133.3K
Kalshi 0.5%
0–1Β’ Vol $178
Polymarket 1.0%
0.8–1.2Β’ Vol $133.1K
BY
ByteDance
0.5%
β€” +0.0%
β€”
$0
Kalshi 0.5%
0–1Β’ Vol $0
Polymarket β€”
β€” Vol $0
ME
Meta
0.5%
β€” +0.0%
β€”
$0
Kalshi 0.5%
0–1Β’ Vol $0
Polymarket β€”
β€” Vol $0
OP
OpenAI
0.3%
↓ -1.0%
0.3%
$122.6K
Kalshi 0.5%
0–1Β’ Vol $235
Polymarket 0.3%
0.2–0.3Β’ Vol $122.4K
XA
Xai
0.2%
↓ -1.6%
0.4%
$88.5K
Kalshi 0.5%
0–1Β’ Vol $56
Polymarket 0.2%
0.1–0.2Β’ Vol $88.5K
AL
Alibaba
0.1%
β€” +0.0%
0.5%
$12
Kalshi 0.5%
0–1Β’ Vol $0
Polymarket 0.1%
0.1Β’ Vol $12
DE
DeepSeek
0.1%
↓ -0.5%
β€”
$52.0K
Kalshi β€”
β€” Vol $0
Polymarket 0.1%
0.1Β’ Vol $52.0K
MO
Moonshot
0.1%
β€” +0.0%
β€”
$9.2K
Kalshi β€”
β€” Vol $0
Polymarket 0.1%
0.1Β’ Vol $9.2K
BA
Baidu
0.1%
β€” +0.0%
β€”
$365
Kalshi β€”
β€” Vol $0
Polymarket 0.1%
0.1Β’ Vol $365
ZA
Z.ai
0.1%
β€” +0.0%
β€”
$5.0K
Kalshi β€”
β€” Vol $0
Polymarket 0.1%
0.1Β’ Vol $5.0K
ME
Meituan
0.1%
β€” +0.0%
β€”
$5.0K
Kalshi β€”
β€” Vol $0
Polymarket 0.1%
0.1Β’ Vol $5.0K
MI
Mistral
0.1%
β€” +0.0%
β€”
$5.0K
Kalshi β€”
β€” Vol $0
Polymarket 0.1%
0.1Β’ Vol $5.0K

Anthropic breaks Google’s winning streak in the #1 AI model market

Kalshi and Polymarket both run monthly markets on which company holds the top-ranked AI model on the Arena leaderboard. For February, the two sites agreed: Anthropic’s Claude Opus 4.6 is #1 and the market is pricing it as settled with Kalshi at 99% and Polymarket at 98%. Unless something drastic happens in the next 24 hours, this market will resolve as is. It will also end Google’s six-month streak as the leading AI model.

One difference worth noting: Kalshi resolves to the specific model (gemini-3-pro, claude-opus-4-6), while Polymarket resolves to the company (Google, Anthropic). Kalshi also runs both variants β€” a model-level market and a company-level market. Our charts are aggregating data from the model market.

Monthly AI prediction results

MonthKalshiPolymarketNotes
Feb 2026claude-opus-4-6 (99%, $2.48M)Anthropic (98%, $21.66M)First non-Google winner. Claude launched Feb 5, took Arena #1 by Feb 6. Kalshi split: opus-4-6 at 85%, opus-4-6-thinking at 17%
Jan 2026gemini-3-pro ($1.54M)Google ($28.97M)Uncontested
Dec 2025gemini-3-pro ($471K)Google ($36.33M)Uncontested
Nov 2025gemini-3-pro ($810K)Google (β€”)Gemini 3 Pro launched mid-month, dethroned Gemini 2.5 Pro. gemini-2.5-pro led ~50% early
Oct 2025gemini-2.5-pro ($279K)Google ($4.66M)Volatile early on Kalshi, settled by mid-month
Sep 2025gemini-2.5-pro ($92K)Google ($2.92M)Uncontested on both
Aug 2025gemini-2.5-pro ($291K)Google ($7.49M)GPT-5 High hit 32% on Kalshi, OpenAI hit 73% on Polymarket mid-month before Google retook lead on both

How to bet on AI prediction markets

Each company (or model, on Kalshi) gets its own contract. You pick Yes or No on whether that AI company will hold #1 at the end of the month. Only one can win, so in practice this is a winner-take-all contract. If you buy Anthropic Yes at 98Β’ and Anthropic holds #1 on February 28, the contract pays $1.00 and every other company’s contract resolves No.

The contracts trade between 1Β’ and 99Β’. With only one day left in February and Anthropic at 97-99%, there isn’t much edge left to gain this month. The opportunity is in forward months where the outcome is less certain, particularly if GPT-5.3 or Grok 5 launches and disrupts the leaderboard.

The two platforms handle timing differently. Kalshi opens each month’s market on the 1st, so you can only trade the current month. Polymarket lists future months alongside the active one β€” March, June, and beyond are already tradeable. That means Polymarket lets you take a position on where the leaderboard will be months from now, while Kalshi is limited to the race in progress.

For a full comparison of where to trade, see our list of prediction market apps.

Anthropic beats Google for the first time in six months

Claude Opus 4.6 launched on February 5 and took #1 on Arena by February 6, dethroning Gemini 3 Pro. The thinking variant hit 1506 Elo, the standard version 1502. It’s held the top spot for three weeks with no serious challenger, which is why pricing has converged toward certainty on both markets.

The Polymarket chart tells the story clearly. Google led early February around 75%, Claude spiked past it around Feb 5-6, dipped briefly mid-month, then locked in above 90% by Feb 20. On Kalshi, the interesting dynamic is the internal Claude race β€” opus-4-6 (standard) started behind opus-4-6-thinking but has pulled ahead to 85% vs 17% as the month closes.

The March market is already shaping up differently. On Polymarket, Anthropic opened the month at 54% with Google at 24%, OpenAI at 12%, and xAI at 5.5% β€” with $3.13 million in early volume. The market is pricing meaningful confidence in multiple contenders, a sharp contrast to the 90%+ dominance Google held in most prior months.

What to watch this month

Several releases could reshape the March market and beyond:

  • Google’s Gemini 3.1 Pro launched on February 19 and is already within 4 Elo points of Claude Opus 4.6 on Arena. As more votes accumulate through March, the gap could narrow further or flip entirely β€” making Google the most immediate threat to Anthropic’s lead.
  • OpenAI’s GPT-5.3 general model is still expected in Q1 2026. The coding variant (GPT-5.3-Codex) shipped in early February, but the full model hasn’t arrived on Arena yet. If it launches in March, OpenAI becomes a real contender.
  • xAI’s Grok 5 is confirmed for Q1 2026 with a public beta likely landing in March or April. Grok 4.1 briefly held Arena #1 at its November 2025 launch before Gemini 3 dethroned it a day later, so the gap is closable.

The volume trend is also worth watching. Polymarket’s monthly AI market peaked at $36.33 million in December and has been declining β€” $28.97 million in January, $21.66 million in February. A competitive March could reverse that trend. Track live volume across Kalshi and Polymarket on our prediction markets dashboard.

How do these markets resolve?

Kalshi and Polymarket both resolve against the Arena leaderboard, but they use different columns β€” which means they could theoretically resolve to different winners.

  • Kalshi checks the “Rank (UB)” column on the text leaderboard with Style Control removed on the last day of the month. Tiebreaker goes to highest Arena Score, then most votes, then earliest release.
  • Polymarket checks the “Arena Score” column on the Leaderboard tab at 12:00 PM ET on the last day of the month with style control off. Tiebreaker is alphabetical.

Rank (UB) reflects the upper bound of a model’s confidence interval, while Arena Score is the raw Elo. In a tight race where two models are separated by a few points, one platform could resolve to a different winner than the other β€” use our arbitrage calculator to compare pricing across platforms.

How Arena ranks AI models

Kalshi and Polymarket both resolve against the Arena leaderboard, formerly known as the LMSYS Chatbot Arena. Arena presents users with a prompt and two anonymous model responses side by side. The user picks which response is better without knowing which model produced it. Over 5.3 million of these blind comparisons have been collected, and each model receives an Elo rating derived from its win rate against other models in the pool.

Style Control is a leaderboard filter that adjusts for formatting bias β€” models that use more markdown, bullet points, or longer responses tend to score higher in raw comparisons even when the underlying reasoning is equivalent. With Style Control on, Arena isolates substance from presentation. Both Kalshi and Polymarket require Style Control off in their resolution criteria, meaning the raw Elo without formatting adjustment is what determines the winner.

  • “Which companies will have a top-ranked AI model this year?” on Kalshi. An annual market with 12 active contracts ($832K volume). Unlike the monthly markets, these resolve Yes if a company reaches #1 at any point during 2026. Google and Anthropic have already resolved Yes. OpenAI leads the remaining contracts at 62%, xAI at 55%.
  • “AI capability growth before July?” on Kalshi. A related bet on whether any model will hit an Arena Score of 1550 or higher before July ($77K volume). Currently pricing 46% for 1550 and 30% for 1575. Claude Opus 4.6 sits at 1506 today, so this is effectively a bet on the next generation of models.
  • “When will Claude 5 be released?” on Polymarket. Currently pricing 36% for April 30. If Claude 5 arrives and tops Arena, Anthropic could hold #1 for consecutive months.
  • “When will GPT-5.3 be released?” on Polymarket. Directly relevant to whether OpenAI can challenge for March. If GPT-5.3 launches and climbs Arena, expect the monthly market to move in tandem.
  • EU AI Act compliance deadline arrives August 2, 2026. If a lab delays a frontier model release to meet compliance requirements, that’s one fewer chance to top the leaderboard within a given month.

Liquidity varies significantly across these markets. Polymarket’s monthly AI market routinely draws $20-36 million per month, making it one of the deeper non-political betting markets on the platform. Kalshi’s monthly markets are thinner at $2-3 million but growing β€” February volume nearly doubled January. The annual and milestone markets on either site are much smaller, typically under $1 million, so larger positions may face slippage.

The fee structures are quite different between Kalshi and Polymarket and can affect net returns, particularly on lower-volume contracts. If you’re new to betting on Kalshi or Polymarket, you will also want to learn how order books work and market resolution before making your first trade.

FAQ

What does “best AI model” mean in these markets?

The company (or model) that holds the top rank on the Arena text leaderboard at the end of the month. Kalshi uses the Rank (UB) column, Polymarket uses Arena Score.

Can Kalshi and Polymarket resolve to different winners?

In theory, yes. They use different columns from the same leaderboard. In practice, the #1 model under Rank (UB) and Arena Score has been the same, but a tight race could produce a split.

Why is Polymarket’s volume so much higher than Kalshi’s?

Polymarket generally carries more liquidity on AI-related markets and draws a larger crypto-native trading audience. Kalshi is a regulated US exchange that has been expanding internationally, while Polymarket US continues to roll out to domestic traders.

Why is Anthropic favored for March but not dominant?

Anthropic opened March at 54% because Claude Opus 4.6 is the incumbent β€” it holds Arena #1 at 1506 Elo with a large gap over Gemini 3 Pro. Google sits at 24% partly because Gemini 3.1 Pro launched on February 19 and is already within 4 Elo points of Opus 4.6 on Arena. As more votes accumulate through March, that gap could close. OpenAI is at 12% because the general GPT-5.3 model is expected in Q1 2026 β€” the coding variant (GPT-5.3-Codex) shipped in early February, but the full model hasn’t launched yet. If it arrives in March and climbs Arena, OpenAI could become a real contender. xAI rounds out at 5.5% with Grok 5 confirmed for Q1 2026, though a public beta likely lands closer to March or April.

How often do these markets run?

Monthly on Kalshi and Polymarket. New contracts open as the prior month’s market approaches resolution. Polymarket has results going back to at least August 2025, Kalshi to at least August 2025 as well.