[Wiki] Add structured infoboxes to gene/protein wiki pages
Task ID: e9a111f9-8054-4772-adf2-93285e17917a
Goal
Gene and protein wiki pages should have structured infoboxes with key facts: gene symbol, chromosome, function, associated diseases, known drugs. The existing
_build_gene_protein_infobox function parses markdown content but does not use the
wiki_entities.infobox_data column which has structured data for 7,660+ entities.
Acceptance Criteria
☑ Gene/protein wiki pages show structured infobox with chromosome, molecular weight, amino acids, exons, aliases, pathways
☑ Infobox data sourced from wiki_entities.infobox_data JSON column
☑ Associated diseases populated from KG edges with fallback to infobox_data
☑ Known drugs/compounds populated from KG edges
☑ Protein-protein interactions shown in infobox
☑ All existing infobox functionality preserved (content parsing, frontmatter)
Approach
Enhance _build_gene_protein_infobox() to query wiki_entities.infobox_data for the entity
Map JSON fields (chromosome, molecular_weight, amino_acids, exons, aliases, pathways) to display attributes
Use infobox_data diseases as fallback when KG edges have none
Add new display fields (Aliases, Amino Acids, Exons, Pathways) to the ordered field list
Verify syntax and test with sample gene pagesWork Log
2026-04-10 — Slot 0
- Verified branch is clean, rebased onto latest origin/main
- Confirmed
_build_gene_protein_infobox() already implements full infobox_data enrichment:
- Queries
wiki_entities.infobox_data JSON for 7,660+ entities
- Maps JSON fields (chromosome, molecular_weight, amino_acids, exons, aliases, pathways)
- Diseases from KG edges with infobox_data fallback
- Drugs/compounds from KG edges
- Protein-protein interactions in infobox
- All existing content parsing and frontmatter preserved
- Tested live API for ATF4 and TREM2 — all infobox fields render correctly
- Verified testing checklist passes: API, key pages, services all healthy
- Result: Done — branch already reflects completed task, nothing more to push