ID: e8b9010e-f1d Priority: 82 Type: one_shot Status: completed
About 850 papers have abstracts that haven't been processed for KG edge extraction. Run NLP pattern matching on these to discover new gene-disease-pathway relationships. Target: 1000+ new edges.
scripts/extract_more_edges.py to worktree branchorchestra/task/e8b9010e-extract-kg-edges-from-800-unmined-paperscripts/extract_more_edges.py (NLP pattern matching for gene-disease-pathway KG edges)git push gh HEADnlp_batch2_extracted edges already exist in DB (created 2026-04-02T11:40:13){
"_reset_note": "This task was reset after a database incident on 2026-04-17.\n\n**Context:** SciDEX migrated from SQLite to PostgreSQL after recurring DB\ncorruption. Some work done during Apr 16-17 may have been lost.\n\n**Before starting work:**\n1. Check if the task's goal is ALREADY satisfied (run the relevant checks)\n2. Check `git log --all --grep=task:YOUR_TASK_ID` for prior commits\n3. If complete, verify and mark done. If partial, continue. If not done, proceed.\n\n**DB change:** SciDEX now uses PostgreSQL. `get_db()` auto-detects via\nSCIDEX_DB_BACKEND=postgres env var.",
"_reset_at": "2026-04-18T06:29:22.046013+00:00",
"_reset_from_status": "done"
}