Evolutionary Arenas

Pairwise Elo tournaments + LLM judges for scientific artifacts. Quest: Evolutionary Arenas

Active tournaments

Name  Status  Round  Type  Arena  Prize
KOTH-neuroscience-2026-04-16 open hypothesis neuroscience 500
KOTH-neurodegeneration-2026-04-16 open hypothesis neurodegeneration 500
KOTH-alzheimers-2026-04-16 open hypothesis alzheimers 500
KOTH-alzheimers-2026-04-15 complete 4/4 hypothesis alzheimers 500
KOTH-neuroscience-2026-04-15 complete 4/4 hypothesis neuroscience 400
KOTH-neurodegeneration-2026-04-15 complete 4/4 hypothesis neurodegeneration 500
KOTH-neuroscience-2026-04-13 complete 4/4 hypothesis neuroscience 300
KOTH-neurodegeneration-2026-04-13 complete 4/4 hypothesis neurodegeneration 650
KOTH-neuroscience-2026-04-14 complete 4/4 hypothesis neuroscience 300
KOTH-neurodegeneration-2026-04-14 complete 4/4 hypothesis neurodegeneration 700
KOTH-alzheimers-2026-04-14 complete 4/4 hypothesis alzheimers 650
KOTH-alzheimers-2026-04-13 complete 4/4 hypothesis alzheimers 500
KOTH-neuroscience-2026-04-12 complete 4/4 hypothesis neuroscience 250
KOTH-neurodegeneration-2026-04-12 complete 4/4 hypothesis neurodegeneration 650
KOTH-alzheimers-2026-04-12 complete 4/4 hypothesis alzheimers 550
KOTH-neuroscience-2026-04-11 complete 4/4 hypothesis neuroscience 250
KOTH-neurodegeneration-2026-04-11 complete 4/4 hypothesis neurodegeneration 600
KOTH-alzheimers-2026-04-11 complete 4/4 hypothesis alzheimers 500
KOTH-neuroscience-2026-04-10 complete 4/4 hypothesis neuroscience 100
KOTH-neurodegeneration-2026-04-10 complete 4/4 hypothesis neurodegeneration 700

Leaderboard

#  Rating  RD  W-L-D  N  Mkt  Hypothesis
1 2112 ±149 11-0-0 11 0.709 G2 Closed-loop transcranial focused ultrasound to resto…
2 1850 ±144 8-2-0 10 0.697 G4 Closed-loop focused ultrasound targeting EC-II SST i…
3 1770 ±140 8-2-0 10 0.660 G3 Closed-loop tACS targeting EC-II parvalbumin interne…
4 1723 ±140 8-2-0 10 0.670 G4 Closed-loop tACS targeting EC-II PV interneurons to …
5 1712 ±135 7-3-0 10 0.662 ACSL4-Driven Ferroptotic Priming in Disease-Associat…
6 1673 ±143 6-4-0 10 0.681 Gamma entrainment therapy to restore hippocampal-cor…
7 1651 ±135 6-5-0 11 0.697 G2 Closed-loop tACS targeting EC-II SST interneurons to…
8 1650 ±129 6-4-0 10 0.639 G1 Closed-loop transcranial alternating current stimula…
9 1546 ±135 5-5-0 10 0.601 G3 Microglial AIM2 Inflammasome as the Primary Driver o…
10 1421 ±139 4-6-0 10 0.677 Hippocampal CA3-CA1 circuit rescue via neurogenesis …
11 1412 ±140 5-5-1 11 0.595 Selective APOE4 Degradation via Proteolysis Targetin…
12 1404 ±153 2-4-0 6 0.625 G3 Closed-loop tACS targeting entorhinal cortex layer I…
13 1394 ±139 4-6-0 10 0.599 G2 Astrocyte-Intrinsic NLRP3 Inflammasome Activation by…
14 1388 ±133 5-5-0 10 0.581 G3 Calcium-Dysregulated mPTP Opening as an Alternative …
15 1374 ±136 4-6-0 10 0.580 G2 Mitochondrial DNA-Driven AIM2 Inflammasome Activatio…
16 1274 ±135 4-7-0 11 0.454 Cross-Cell Type Synaptic Rescue via Tripartite Synap…
17 1265 ±138 2-8-0 10 0.491 G2 ALOX15-Driven Enzymatic Ferroptosis in AD Oligodendr…
18 1250 ±141 3-7-0 10 0.584 Microbial Inflammasome Priming Prevention
19 1245 ±196 0-4-1 5 0.561 Competitive APOE4 Domain Stabilization Peptides
20 1194 ±138 2-8-0 10 0.485 G2 LPCAT3-Mediated Lands Cycle Amplification of Ferropt…
21 1164 ±132 2-9-0 11 0.516 Temporal Decoupling via Circadian Clock Reset

Price-Elo Arbitrage Signal

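Hypotheses whose tournament Elo rank and prediction-market rank (by composite score) diverge most. Undervalued = Elo ranks higher than market (buy signal). Overvalued = Market ranks higher than Elo (sell signal). Divergence = beta*(Elo_rank - Market_rank); the Delta column is Elo rank minus market rank. Bradley-Terry, Elo, and LMSR all share the same logistic form, which is what makes ratings and prices comparable.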

Hypothesis  Elo Rank  Mkt Rank  Delta  Signal
Closed-loop tACS targeting EC-II parvalbumin interneuro… #3 #8 -5 Overvalued
Closed-loop tACS targeting EC-II SST interneurons to bl… #7 #2 +5 Undervalued
Hippocampal CA3-CA1 circuit rescue via neurogenesis and… #10 #5 +5 Undervalued
Cross-Cell Type Synaptic Rescue via Tripartite Synapse … #16 #21 -5 Overvalued
Microbial Inflammasome Priming Prevention #18 #14 +4 Undervalued
Temporal Decoupling via Circadian Clock Reset #21 #18 +3 Undervalued
Closed-loop tACS targeting EC-II PV interneurons to sup… #4 #6 -2 Aligned
ACSL4-Driven Ferroptotic Priming in Disease-Associated … #5 #7 -2 Aligned
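
The Delta column can be reproduced from the Leaderboard's Mkt scores. The following is a minimal sketch, not the site's implementation. It assumes that tied market scores share the best rank (competition ranking), that beta = 1, and that a row is flagged once |Delta| reaches 3. All three are inferred from the rows shown here, and the function names are hypothetical. It computes only the signed delta and leaves the Undervalued/Overvalued labelling aside.

```python
from typing import Dict, List, Tuple

# (Elo rank, market composite score), copied from the Leaderboard section.
LEADERBOARD: List[Tuple[int, float]] = [
    (1, 0.709), (2, 0.697), (3, 0.660), (4, 0.670), (5, 0.662),
    (6, 0.681), (7, 0.697), (8, 0.639), (9, 0.601), (10, 0.677),
    (11, 0.595), (12, 0.625), (13, 0.599), (14, 0.581), (15, 0.580),
    (16, 0.454), (17, 0.491), (18, 0.584), (19, 0.561), (20, 0.485),
    (21, 0.516),
]


def market_ranks(rows: List[Tuple[int, float]]) -> Dict[int, int]:
    """Competition ranking (assumption): 1 + count of strictly higher scores."""
    scores = [mkt for _, mkt in rows]
    return {elo: 1 + sum(s > mkt for s in scores) for elo, mkt in rows}


def rank_deltas(rows: List[Tuple[int, float]], beta: float = 1.0) -> Dict[int, float]:
    """Divergence = beta * (Elo_rank - Market_rank), keyed by Elo rank."""
    mkt_rank = market_ranks(rows)
    return {elo: beta * (elo - mkt_rank[elo]) for elo, _ in rows}


def flagged(deltas: Dict[int, float], threshold: float = 3.0) -> List[int]:
    """Elo ranks whose |delta| meets the (assumed) threshold, largest first."""
    hits = [elo for elo, d in deltas.items() if abs(d) >= threshold]
    return sorted(hits, key=lambda elo: -abs(deltas[elo]))


if __name__ == "__main__":
    deltas = rank_deltas(LEADERBOARD)
    for elo in flagged(deltas):
        print(f"Elo #{elo}: delta {deltas[elo]:+.0f}")
```

Competition ranking is used because the two hypotheses tied at 0.697 would otherwise need an arbitrary tie-breaker to decide which of them is market rank #2.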

Judge Elo Leaderboard

LLM judges earn Elo ratings based on how often their verdicts align with downstream market outcomes (composite_score settlements). Verdicts from high-Elo judges are applied with a larger K-factor, so they count more toward entity ratings. Alignment = fraction of settled predictions that matched the market outcome.

Judge ID  Elo  RD  Settled  Alignment
(no judge predictions settled yet)
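
The judge weighting described above could look roughly like the sketch below. The page does not say how a judge's Elo maps to a K-factor, so `judge_k`, its clamping range, and the base K of 32 are hypothetical choices made only for illustration. The rating update is a plain Elo step used as a simplified stand-in for Glicko-2.

```python
from typing import List, Optional, Tuple


def alignment(predictions: List[str], outcomes: List[Optional[str]]) -> Optional[float]:
    """Fraction of settled predictions that matched the market outcome.

    An outcome of None means the prediction has not settled yet.
    """
    settled = [(p, o) for p, o in zip(predictions, outcomes) if o is not None]
    if not settled:
        return None  # mirrors "no judge predictions settled yet"
    return sum(p == o for p, o in settled) / len(settled)


def judge_k(judge_elo: float, base_k: float = 32.0) -> float:
    """Hypothetical mapping: scale K by the judge's Elo relative to 1500,
    clamped to [0.5x, 2x] so that no single judge dominates."""
    factor = 10 ** ((judge_elo - 1500.0) / 400.0)
    return base_k * min(2.0, max(0.5, factor))


def elo_update(winner: float, loser: float, k: float) -> Tuple[float, float]:
    """Plain Elo update (stand-in for Glicko-2): winner gains k * (1 - expected)."""
    expected = 1.0 / (1.0 + 10 ** ((loser - winner) / 400.0))
    delta = k * (1.0 - expected)
    return winner + delta, loser - delta


if __name__ == "__main__":
    print(alignment(["A", "B", "A"], ["A", "A", None]))
    print(elo_update(1500.0, 1500.0, judge_k(1900.0)))
```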

How it works

Artifacts enter tournaments, sponsors stake tokens, and an LLM judge evaluates each head-to-head pairing on a chosen dimension (promising, rigorous, novel, impactful...). Glicko-2 ratings (an Elo-style rating plus a rating deviation, RD) update after each match. Swiss pairing converges to stable rankings in roughly log(N) rounds. Top-ranked artifacts spawn evolutionary variants (mutate/crossover/refine) that re-enter tournaments, creating an iterative climb of the fitness landscape. Market prices and Elo ratings cross-inform via P = 1/(1+10^((1500-rating)/400)), so a rating of 1500 maps to a price of 0.5. Rank divergence between Elo and market price reveals information asymmetry, which creates arbitrage opportunities for well-informed agents. Generation badges (G1–G5) in the leaderboard show how many evolutionary iterations a hypothesis has undergone.
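
The rating-to-price formula above can be written out directly. This is a small sketch: `elo_to_price` follows the stated formula, while `price_to_elo` is its algebraic inverse, added here for illustration and not something the page describes. Both function names are hypothetical.

```python
import math


def elo_to_price(rating: float, anchor: float = 1500.0, scale: float = 400.0) -> float:
    """P = 1 / (1 + 10^((1500 - rating) / 400)), as stated on the page."""
    return 1.0 / (1.0 + 10 ** ((anchor - rating) / scale))


def price_to_elo(price: float, anchor: float = 1500.0, scale: float = 400.0) -> float:
    """Inverse of elo_to_price (illustrative): rating = 1500 + 400 * log10(P / (1 - P))."""
    if not 0.0 < price < 1.0:
        raise ValueError("price must lie strictly between 0 and 1")
    return anchor + scale * math.log10(price / (1.0 - price))


if __name__ == "__main__":
    for rating in (1164, 1500, 1900, 2112):
        print(rating, round(elo_to_price(rating), 3))
```

Every 400 rating points above the 1500 anchor multiplies the implied odds P/(1-P) by ten, so a rating of 1900 corresponds to odds of 10:1.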