[Atlas] Wiki-claim debates - extract claims from wiki pages and debate each done

← Artifact Debates
Extract atomic claims from wiki pages, register as artifacts, and debate each with evidence_assessment personas; surface in talk-page sidebar.

Completion Notes

Auto-completed by supervisor after successful deploy to main

Git Commits (2)

Squash merge: orchestra/task/a4c450f7-biomni-analysis-parity-port-15-use-cases (87 commits) (#717)2026-04-27
[Atlas] Wiki-claim debates: extract claims and run evidence_assessment debates [task:9ed2ce80-3e73-40ae-99b3-3f2e0c64d3ac] (#689)2026-04-27
Spec File

Goal

The 18,447 wiki pages contain thousands of factual claims. Most are
unverified by structured debate. Build a pipeline that (a) extracts
top-level claims from a wiki page using a lightweight LLM call, (b) registers
each claim as an artifact_type='claim' artifact (or reuses artifact_type='hypothesis' with a claim subkind), and (c) auto-triggers
a 2-persona evidence_assessment debate on each claim against the citing
wiki page. Result: every wiki page acquires a tail of debated claims linked
back to it.

Acceptance Criteria

☐ CLI: python3 -m scidex.atlas.wiki_claim_extractor --pages-batch 25
--max-claims-per-page 5 --dry-run.
☐ Without --dry-run, registers claim artifacts (or
hypothesis with metadata.kind='claim'); links each claim to its
source wiki page via artifact_links link_type='derived_from'.
☐ For each claim, queues a debate (debate_type='evidence_assessment',
personas [evidence_auditor, counterexample_finder], num_rounds=2).
☐ Debate verdict updates the claim's quality_score; if
does_not_support, also writes an artifact_links contradicts row
pointing back at the wiki page (so the talk-page surfaces the dispute).
☐ Cron: hourly, batch 10 pages per run, idempotent (skip pages already
processed in last 30 days).
☐ Smoke: process 5 wiki pages with ≥1 claim each; verify per-claim debate
session and quality_score ≠ 0.5.

Approach

  • New module scidex/atlas/wiki_claim_extractor.py with the LLM extraction
  • prompt and the registration/queueing pipeline.
  • Add a claim value to ARTIFACT_TYPES in
  • scidex/atlas/artifact_registry.py (with metadata schema:
    claim_text, source_page_slug, polarity).
  • Reuse queue_debate and the existing verdict parser.
  • Surface the debated claims in a "Claims under review" sidebar widget on
  • the wiki page (small, lazy-loaded).

    Dependencies

    • agr-ad-01-TARG — debate targeting.
    • q-dsc-comments-on-wiki-pages — talk-page surfaces dispute UX.

    Work Log

    Sibling Tasks in Quest (Artifact Debates) ↗