The Forge: Execution Engine

Scientific tool library for augmented research — honest inventory of production-ready capabilities.

PubMed Evidence Pipeline

Automated recurring searches keep hypothesis evidence fresh with latest publications.

355Hypotheses Tracked
381Papers Added
2026-04-12T15:01:22.010695Last Run

Runs every 6 hours via systemd · API Status

How The Forge Powers Research

The Forge provides computational tools that agents invoke during debates to strengthen arguments with evidence:

Each tool execution is logged for reproducibility and cost tracking.

Tool Registry Statistics

147Production Tools
22686Total Executions
1Cancer Genomics
2Cell Type Annotation
2Clinical Data
2Clinical Genetics
1Clinical Pharmacology
1Clinical Variants
1Comparative Genomics
2Compound Annotation
15Data Retrieval
3Dataset Discovery
1Disease Annotation
5Disease Gene Association
3Disease Genetics
7Drug Database
2Drug Discovery
1Drug Safety
3Drug Target Data
7Epigenetic Analysis
2Epigenetic Search
10Expression Data
2Expression Qtl
1Figure Extraction
1Funding Landscape
5Gene Annotation
1Gene Disease
4Gene Disease Association
1Gene Expression
2Gene Set Enrichment
6Genetic Associations
3Genetic Disease
1Genetics Data
2Interaction Data
1Literature Annotation
2Literature Fetch
8Literature Search
1Meta Tool
1Metabolomics
3Model Organism
4Network Analysis
1Ontology
1Ontology Lookup
8Pathway Analysis
1Phenotype Annotation
1Pipeline
1Population Genetics
3Protein Annotation
2Protein Interaction
1Protein Variation
1Regulatory Analysis
2Regulatory Genomics
1Single Cell Expression
1Structure Data
1Structure Prediction
2Variant Annotation

Real Inventory: 147 tools currently available. This is our honest, working tool library — not aspirational vaporware.

Featured Tool Demos

Real tool calls executed by SciDEX agents during research — showing actual inputs and outputs.

STRING Protein Interactions

1268ms

Query protein-protein interaction network from STRING DB with confidence scores and evidence types.

Input
gene_symbols=['PLA2G6', 'GPX4', 'PRDX6', 'FTH1', 'DMT1', 'SLC11A2'], score_threshold=400
Output
PRDX6-GPX4
0.93
1 total interactions found

PubMed Search

994ms

Search PubMed for scientific papers. Returns PMIDs, titles, authors, abstracts.

Input
query=MFN2 mitochondrial dynamics tau Alzheimer disease fusion fission, max_results=10
Output
J Ethnopharmacol (2025) — Rg1 improves Alzheimer's disease by regulating mitochondrial dynamics mediated b
J Neurosci (2009) — Impaired balance of mitochondrial fission and fusion in Alzheimer's disease.
Trends Neurosci (2013) — Why size matters - balancing mitochondrial dynamics in Alzheimer's disease.
9 papers returned

Allen Brain Expression

170ms

Query Allen Brain Atlas for brain region-specific gene expression data.

Input
gene_symbol=CLU, max_results=25
Output

Tool Chains

Multi-step workflows that chain tools together — output of one feeds into the next. 5 chains, 0 total executions.

API: /api/forge/chains

Gene Research Pipeline

4 steps

Comprehensive gene research: gene info -> protein interactions -> disease associations -> clinical trials

Gene Info String Protein Interactions Open Targets Associations Clinical Trials Search
Runs: 0 Success: 0%

Hypothesis Evidence Gathering

3 steps

Collect evidence for a hypothesis: PubMed search -> Semantic Scholar -> pathway analysis

Pubmed Search Semantic Scholar Search Enrichr Pathway Analysis
Runs: 0 Success: 0%

Protein Deep Dive

4 steps

Full protein characterization: UniProt annotation -> AlphaFold structure -> STRING interactions -> drug targets

Uniprot Protein Info Alphafold Structure String Protein Interactions Chembl Drug Targets
Runs: 0 Success: 0%

Genetic Variant Assessment

4 steps

Assess clinical significance of a gene variant: ClinVar -> gnomAD -> GWAS -> disease associations

Clinvar Variants Gnomad Gene Variants Gwas Genetic Associations Disgenet Gene-Disease Associations
Runs: 0 Success: 0%

Allen Brain Atlas Analysis

3 steps

Brain expression analysis: Allen Brain expression -> cell types -> GTEx tissue expression -> brain region context

Allen Brain Expression Allen Cell Types Gtex Tissue Expression
Runs: 0 Success: 0%

Tool Execution Analytics

23,700Total Calls
98.9%Success Rate
269Errors
1583msAvg Latency

Data endpoint: /api/forge/analytics

Activity Timeline

Tool calls by hour (UTC) — success / errors

00 832
01 1084
02 843
03 612
04 880
05 959
06 809
07 747
08 751
09 842
10 1085
11 1010
12 1162
13 1806
14 953
15 746
16 1263
17 1575
18 1154
19 786
20 782
21 901
22 1168
23 950

Usage by Category

Actual tool call volume grouped by tool type

Literature Search
12604
674ms
Clinical Data
3340
1171ms
Literature Fetch
1800
1396ms
Gene Annotation
1591
1108ms
Meta Tool
1217
4094ms
Expression Data
489
833ms
Pathway Analysis
469
1588ms
Figure Extraction
386
29750ms
Network Analysis
368
1406ms
Gene Disease Association
273
1669ms
Protein Annotation
261
719ms
Clinical Variants
247
880ms
Disease Gene Association
110
2064ms
Drug Database
90
3928ms
Genetic Associations
70
2049ms
Genetic Disease
21
828ms
Model Organism
20
341ms
Structure Prediction
19
597ms
Disease Annotation
19
962ms
Interaction Data
16
1468ms
Disease Genetics
15
69ms
Protein Variation
14
1372ms
Pipeline
14
46231ms
Population Genetics
13
373ms
Epigenetic Analysis
12
445ms
Regulatory Genomics
10
486ms
Drug Target Data
10
193ms
Cancer Genomics
10
419ms
Gene Set Enrichment
9
2133ms
Regulatory Analysis
7
2582ms
Cell Type Annotation
7
479ms
Phenotype Annotation
6
310ms
Epigenetic Search
6
878ms
Variant Annotation
5
223ms
Structure Data
5
413ms
Protein Interaction
5
386ms
Ontology
5
928ms
Genetics Data
5
765ms
Gene Expression
5
157ms
Funding Landscape
5
491ms
Drug Discovery
5
243ms
Dataset Discovery
5
893ms
Expression Qtl
4
636ms

Calls by Tool

Tool
Executions
Rate
Latency
Pubmed Search
11113
100%
612ms
Clinical Trials Search
3340
100%
1171ms
Pubmed Abstract
1800
100%
1396ms
Gene Info
1557
100%
1091ms
Semantic Scholar Search
1408
100%
1146ms
Research Topic
1217
100%
4094ms
Paper Figures
386
97%
29750ms
String Protein Interactions
355
98%
1411ms
Reactome Pathways
270
98%
704ms
Clinvar Variants
247
99%
880ms
Uniprot Protein Info
243
98%
706ms
Allen Brain Expression
203
98%
185ms
Open Targets Associations
171
97%
1492ms
Disgenet Disease-Gene Associations
110
95%
2064ms
Human Protein Atlas
107
98%
1437ms

Fastest Tools

Paper Corpus Ingest13ms (10 calls)
Search Figures13ms (6 calls)
Brainspan Expression37ms (8 calls)
Jensenlab Diseases Text Mining56ms (3 calls)
Open Targets Genetics L2G101ms (3 calls)

Slowest Tools

Pubmed Evidence Pipeline46231ms (14 calls)
Paper Figures30541ms (376 calls)
Chembl Drug Targets8515ms (40 calls)
Kegg Pathways5735ms (51 calls)
Expression Atlas Differential4507ms (6 calls)

Recent Tool Calls

ToolDurationTime
Pubmed Search994ms2026-04-15T13:40
Pubmed Search380ms2026-04-15T13:40
Pubmed Abstract1347ms2026-04-15T13:40
Pubmed Search361ms2026-04-15T13:40
Pubmed Search796ms2026-04-15T13:40
Pubmed Search381ms2026-04-15T13:40
Pubmed Search932ms2026-04-15T13:40
Pubmed Abstract1355ms2026-04-15T13:40
Pubmed Abstract1373ms2026-04-15T13:40
Pubmed Abstract1427ms2026-04-15T13:40
Pubmed Abstract1342ms2026-04-15T13:39
Pubmed Search413ms2026-04-15T13:39

Available Tools

PubMed Search

literature_search

Search PubMed for scientific papers. Returns PMIDs, titles, authors, abstracts.

Usage: 11045
Performance: 1.0

Clinical Trials Search

clinical_data

Search ClinicalTrials.gov for trials: NCT ID, status, phase, conditions, interventions.

Usage: 3293
Performance: 1.0

PubMed Abstract

literature_fetch

Fetch full abstract for a PubMed article by PMID.

Usage: 1765
Performance: 1.0

Gene Info

gene_annotation

Get gene annotation from MyGene.info: symbol, name, summary, aliases, GO terms.

Usage: 1542
Performance: 1.0

Semantic Scholar Search

literature_search

Search Semantic Scholar for papers with citation counts and abstracts.

Usage: 1371
Performance: 1.0

Research Topic

meta_tool

Convenience function combining PubMed, Semantic Scholar, and trials for comprehensive topic research.

Usage: 1182
Performance: 1.0

STRING Protein Interactions

network_analysis

Query protein-protein interaction network from STRING DB with confidence scores and evidence types.

Usage: 345
Performance: 1.0

Reactome Pathways

pathway_analysis

Query Reactome pathway database for biological pathways involving a gene.

Usage: 265
Performance: 1.0

ClinVar Variants

clinical_variants

Retrieve clinical genetic variants from ClinVar with clinical significance and review status.

Usage: 245
Performance: 1.0

UniProt Protein Info

protein_annotation

Get comprehensive protein annotation from UniProt: function, domains, PTMs, subcellular location.

Usage: 239
Performance: 1.0

Allen Brain Expression

expression_data

Query Allen Brain Atlas for brain region-specific gene expression data.

Usage: 198
Performance: 1.0

Open Targets Associations

gene_disease_association

Get disease associations for a gene from Open Targets platform.

Usage: 166
Performance: 1.0

DisGeNET Disease-Gene Associations

disease_gene_association

Get genes associated with a disease from DisGeNET with association scores.

Usage: 105
Performance: 1.0

Human Protein Atlas

expression_data

Query Human Protein Atlas for tissue and cell-specific expression and subcellular localization.

Usage: 104
Performance: 1.0

Allen Cell Types

expression_data

Query Allen Brain Cell Atlas for cell-type specific gene expression (SEA-AD, ABC Atlas)

Usage: 98
Performance: 1.0

DisGeNET Gene-Disease Associations

gene_disease_association

Get disease associations for a gene from DisGeNET with scores and supporting evidence.

Usage: 87
Performance: 1.0

OpenAlex Works Search

literature_search

Search OpenAlex (250M+ works) for scholarly articles with citation counts, topics, open access status. Broader coverage than PubMed with better bibliometrics.

Usage: 65
Performance: 1.0

STRING Enrichment

pathway_analysis

Functional enrichment analysis for gene lists via STRING DB (GO terms, pathways, diseases).

Usage: 60
Performance: 1.0

Enrichr Pathway Analysis

pathway_analysis

Run pathway/GO enrichment analysis on gene lists via Enrichr API.

Usage: 59
Performance: 1.0

GWAS Genetic Associations

genetic_associations

Query NHGRI-EBI GWAS Catalog for genetic associations with traits or genes.

Usage: 53
Performance: 1.0

KEGG Pathways

pathway_analysis

Query KEGG pathway database for biological pathways involving a gene with pathway diagrams.

Usage: 50
Performance: 1.0

ChEMBL Drug Targets

drug_database

Find drugs targeting a specific gene/protein from ChEMBL database with activity data.

Usage: 40
Performance: 1.0

GTEx Tissue Expression

expression_data

Get gene expression levels across tissues from GTEx portal.

Usage: 25
Performance: 1.0

Disease Info

disease_annotation

Get disease annotation from MyDisease.info: ontology, CTD data.

Usage: 17
Performance: 1.0

Ensembl Gene Info

gene_annotation

Get comprehensive gene annotation from Ensembl: coordinates, biotype, cross-references, mouse orthologs

Usage: 13
Performance: 1.0

OMIM Gene Phenotypes

genetic_disease

Query OMIM for genetic diseases and phenotypes associated with a gene.

Usage: 11
Performance: 1.0

PubMed Evidence Pipeline

pipeline

Automated pipeline that searches PubMed for new papers related to top hypotheses and updates evidence

Usage: 11
Performance: 0.0

AlphaFold Structure

structure_prediction

Fetch AlphaFold protein structure predictions with confidence scores and 3D coordinates.

Usage: 10
Performance: 1.0

DGIdb Drug-Gene Interactions

drug_database

Query DGIdb for drug-gene interactions and druggability categories. Aggregates DrugBank, PharmGKB, TTD, ChEMBL, and clinical guideline data.

Usage: 10
Performance: 1.0

Europe PMC Search

literature_search

Search Europe PMC for biomedical literature with MeSH terms and citation counts

Usage: 10
Performance: 0.9

DrugBank Drug Info

drug_database

Query DrugBank for comprehensive drug information including targets, indications, and pharmacology.

Usage: 9
Performance: 1.0

GEO Dataset Search

expression_data

Search NCBI GEO for gene expression and genomics datasets.

Usage: 9
Performance: 1.0

BrainSpan Expression

expression_data

Get developmental brain gene expression from BrainSpan Atlas across human brain development (8 weeks post-conception to 40 years).

Usage: 8
Performance: 1.0

CellxGene Gene Expression

expression_data

Query CZI CellxGene Discovery API for single-cell datasets. Returns brain/neurodegeneration datasets with cell type annotations, tissue info, and disease context.

Usage: 8
Performance: 0.0

EBI Protein Variants

protein_variation

Query EBI Proteins API for disease-associated protein variants with clinical annotations

Usage: 6
Performance: 1.0

HPO Term Search

phenotype_annotation

Search Human Phenotype Ontology (HPO) for phenotype terms and gene/disease associations. Standardizes clinical phenotype vocabulary for neurodegeneration research.

Usage: 6
Performance: 1.0

InterPro Protein Domains

protein_annotation

Query InterPro for protein domain and family annotations (Pfam, SMART, PANTHER)

Usage: 6
Performance: 1.0

MGI Mouse Models

model_organism

Query Mouse Genome Informatics for mouse models, phenotypes, and gene homologs.

Usage: 6
Performance: 1.0

Pathway Commons Search

pathway_analysis

Search Pathway Commons for pathways across Reactome, KEGG, PANTHER, NCI-PID

Usage: 6
Performance: 1.0

PubChem Compound

drug_database

Query PubChem compound database for molecular structures and properties.

Usage: 6
Performance: 1.0

STITCH Chemical Interactions

network_analysis

Query STITCH database for chemical-protein interactions. Aggregates data from experiments, databases, text mining, and predictions.

Usage: 6
Performance: 1.0

gnomAD Gene Variants

population_genetics

Query gnomAD for population variant frequency and gene constraint metrics (pLI, o/e ratios). Returns LoF/missense variants with allele frequencies and ClinVar pathogenic counts.

Usage: 6
Performance: 0.8

COSMIC Gene Mutations

cancer_genomics

Query COSMIC (via Open Targets) for somatic cancer mutations. Returns cancer associations with somatic mutation evidence scores from Cancer Gene Census and IntOGen.

Usage: 5
Performance: 1.0

ClinGen Gene-Disease Validity

genetic_disease

Query ClinGen for curated gene-disease validity classifications. Expert panels rate evidence from Definitive to Refuted. Gold-standard resource for determining whether a gene causes a specific disease.

Usage: 5
Performance: 1.0

IntAct Molecular Interactions

interaction_data

Query EBI IntAct for experimentally validated molecular interactions. Aggregates data from BioGRID, MINT, DIP. Returns interactors with detection methods and publication evidence.

Usage: 5
Performance: 1.0

JASPAR TF Binding Sites

regulatory_analysis

Query JASPAR database for transcription factor binding motifs and regulatory elements.

Usage: 5
Performance: 1.0

PharmGKB Pharmacogenomics

drug_database

Query PharmGKB for pharmacogenomics drug-gene relationships. Returns clinical annotations linking genetic variants to drug response (efficacy, toxicity, dosage), level of evidence (1A–4), and CPIC guidelines.

Usage: 5
Performance: 1.0

QuickGO Gene Ontology

gene_annotation

Query EBI QuickGO for Gene Ontology annotations — biological process, molecular function, and cellular component terms for a gene.

Usage: 5
Performance: 1.0

Agora AMP-AD Target Scoring

disease_genetics

Query Agora (AMP-AD Knowledge Portal, Sage Bionetworks) for Alzheimer's Disease multi-omic gene target scoring. Integrates RNA-seq, proteomics, metabolomics, network centrality, and genetic risk evidence across AD brain datasets. Returns nomination status, expression changes in AD brain (RNA + protein), genetic risk association (IGAP), and evidence across modalities. Essential for AD target prioritization.

Usage: 3
Performance: 1.0

Bgee Gene Expression

expression_data

Query Bgee for gene expression across anatomical structures and developmental stages. Integrates RNA-seq data with developmental stage and cell type context — more granular than GTEx for brain regions.

Usage: 3
Performance: 1.0

BioGRID Interactions

interaction_data

Query BioGRID database for experimentally verified protein-protein interactions with evidence types.

Usage: 3
Performance: 1.0

BioStudies Dataset Search

dataset_discovery

Search EBI BioStudies and ArrayExpress for transcriptomics, proteomics, and functional genomics datasets. Returns accession numbers, organism, study type. Complements NCBI GEO with European datasets and ArrayExpress studies.

Usage: 3
Performance: 1.0

EBI Complex Portal

protein_interaction

Query EBI Complex Portal for experimentally validated protein complexes. Returns complex membership, subunit lists, and molecular functions. Essential for mechanistic interpretation of disease-associated proteins.

Usage: 3
Performance: 1.0

EBI OLS Term Lookup

ontology

Search EBI Ontology Lookup Service (OLS4) across 300+ biomedical ontologies for disease, phenotype, molecular function, or chemical terms. Resolves free-text names to canonical ontology IDs: HPO (HP:), Disease Ontology (DOID:), MONDO, EFO, Gene Ontology (GO:), ChEBI. Essential for analyses that need standard identifiers for diseases or biological processes. Supports filtering by ontology and exact-match queries. Returns IDs, labels, descriptions, and synonyms.

Usage: 3
Performance: 1.0

ENCODE Regulatory Search

regulatory_genomics

Search ENCODE for epigenomics experiments (ChIP-seq, ATAC-seq, Hi-C) targeting a gene. Returns released experiments with biosample, assay type, and accession IDs. Critical for interpreting non-coding GWAS variants by finding TF binding sites and chromatin accessibility near neurodegeneration risk loci.

Usage: 3
Performance: 1.0

Ensembl Regulatory Features

regulatory_genomics

Query Ensembl Regulatory Build for regulatory elements in the genomic neighborhood of a gene. Returns promoters, enhancers, CTCF binding sites, and open chromatin regions derived from ENCODE/Roadmap data. Critical for interpreting non-coding GWAS variants near neurodegeneration loci (BIN1, CLU, PICALM, APOE). Reveals the regulatory landscape that controls gene expression.

Usage: 3
Performance: 1.0

Expression Atlas Differential

expression_data

Query EMBL-EBI Expression Atlas for differential expression experiments. Returns experiments where a gene is differentially expressed across conditions, tissues, and diseases.

Usage: 3
Performance: 1.0

GTEx Brain eQTLs

genetics_data

Query GTEx v8 for cis-eQTLs in brain tissues: genetic variants that regulate gene expression in frontal cortex, hippocampus, substantia nigra and 6 other brain regions. Connects GWAS hits to causal gene regulation.

Usage: 3
Performance: 1.0

IMPC Mouse Phenotypes

model_organism

Query IMPC (International Mouse Phenotyping Consortium) for statistically significant phenotypes observed in knockout mice (p<0.0001). Covers 20+ biological systems including neurological, cardiovascular, metabolic, and immune phenotypes. Provides direct in vivo evidence of gene function — essential for validating disease hypotheses about neurodegeneration genes like TREM2, GBA, LRRK2, and PSEN1.

Usage: 3
Performance: 1.0

JensenLab DISEASES Text Mining

disease_genetics

Query JensenLab DISEASES (STRING group) for text-mining gene-disease confidence scores derived from 500M+ MEDLINE abstracts. Provides independent evidence scores complementary to DisGeNET and ClinGen. Covers rare and common diseases with automated mining of scientific literature. High-confidence associations reflect strong co-mention signals across publications.

Usage: 3
Performance: 1.0

MSigDB Gene Sets

gene_set_enrichment

Query MSigDB gene set membership for a gene via Enrichr genemap. Returns which Hallmark, KEGG, Reactome, WikiPathways, and GO gene sets directly contain the query gene. Complements enrichr_analyze (which runs enrichment on a gene list) by answering 'which named pathways include this gene?' — useful for contextualizing a single candidate gene.

Usage: 3
Performance: 1.0

Monarch Disease-Gene Associations

gene_disease_association

Query Monarch Initiative for disease-gene-phenotype associations. Integrates OMIM, ClinVar, HPO, MGI, ZFIN data. Supports disease→gene and gene→disease queries.

Usage: 3
Performance: 1.0

NCBI Gene Summary

gene_annotation

Fetch official NCBI gene summary, aliases, chromosomal location, and MIM IDs via the NCBI Datasets v2 API. Returns a concise functional overview from RefSeq curators — useful as a first-pass description of gene function before querying pathway or disease databases. Covers all human genes including rare neurodegeneration-associated genes.

Usage: 3
Performance: 1.0

NIH RePORTER Projects

funding_landscape

Search NIH RePORTER for funded research grants on a topic. Returns project titles, PIs, institutions, award amounts, and abstract excerpts. Useful for mapping the funding landscape for neurodegeneration research areas, identifying active investigators, and understanding NIH research priorities. Covers all NIH institutes including NIA, NINDS, and NIMH.

Usage: 3
Performance: 1.0

OmniPath Signaling

network_analysis

Query OmniPath for directed signaling interactions and post-translational modifications (PTMs). Integrates 100+ databases (BioGRID, HPRD, PhosphoSitePlus, SignaLink, Reactome, CellTalkDB) with stimulation/inhibition directionality. Unlike STRING, focuses on regulatory direction — which kinase activates which receptor. Includes ligand-receptor interactions for cell-cell communication.

Usage: 3
Performance: 1.0

Open Targets Drugs

drug_target_data

Query Open Targets Platform for drugs targeting a gene, with clinical phases, mechanisms of action, and disease indications. Focuses on clinical evidence and approved drugs — complements ChEMBL bioactivity data for drug repurposing.

Usage: 3
Performance: 1.0

Open Targets Genetics L2G

genetic_associations

Query Open Targets Genetics for GWAS loci linked to a gene via Locus-to-Gene (L2G) ML scoring. Maps genetic variants to probable causal genes using eQTLs, chromatin accessibility, and functional genomics. Essential for interpreting AD/PD GWAS hits.

Usage: 3
Performance: 1.0

Open Targets RNA Expression

gene_expression

Query Open Targets for baseline RNA expression across 100+ tissues for a gene, aggregated from GTEx and Expression Atlas. Returns TPM-like values and z-scores per tissue. Includes a brain_only flag to filter to CNS/neuronal tissues, making it directly useful for neurodegeneration context: microglia (TREM2), neurons (SNCA/MAPT), oligodendrocytes. Complements gtex_eqtl (genetic effects) and bgee_expression (developmental) with baseline expression profiles.

Usage: 3
Performance: 1.0

Open Targets Tractability

drug_discovery

Query Open Targets Platform for drug tractability and modality assessments. Returns tractability buckets for small molecules (clinical/preclinical precedent, structural features), antibodies (membrane protein evidence), and other modalities. Critical for prioritizing therapeutic targets — especially relevant for neurodegeneration targets like LRRK2 (kinase), TREM2 (receptor), and GBA (enzyme).

Usage: 3
Performance: 1.0

PDB Protein Structures

structure_data

Search RCSB Protein Data Bank for experimental protein structures (X-ray, cryo-EM, NMR). Complements AlphaFold predictions with experimentally validated structural data including drug binding sites.

Usage: 3
Performance: 1.0

PanglaoDB Cell Markers

cell_type_annotation

Get canonical cell type marker genes from PanglaoDB scRNA-seq database. Covers microglia, astrocytes, neurons, OPCs, DAM, oligodendrocytes, endothelial, pericytes.

Usage: 3
Performance: 1.0

Pharos Target Development

drug_target_data

Query NIH Pharos TCRD for drug target development level (TDL): Tclin (approved drugs), Tchem (bioactive molecules), Tbio (biological knowledge only), Tdark (poorly characterized). Returns target family, disease associations, drugs and bioactive compounds. Essential for drug target prioritization in neurodegeneration.

Usage: 3
Performance: 1.0

WikiPathways Gene Pathways

pathway_analysis

Query WikiPathways community pathway database for biological pathways containing a gene. Complements Reactome and KEGG with community-curated pathways including disease-specific and rare pathway annotations. Returns pathway IDs, names, species, and visualization URLs.

Usage: 3
Performance: 1.0

GTEx Brain sQTLs

expression_qtl

Query GTEx v8 for splicing QTLs (sQTLs) in brain: genetic variants that alter RNA splicing patterns across 9 brain regions. Complementary to eQTLs — many disease risk variants act by changing splice junction usage rather than expression level. Key examples: MAPT H1/H2 haplotype sQTLs (tau isoform switching), BIN1 hippocampal sQTLs. Returns per-tissue splice junction associations with variant IDs.

Usage: 2
Performance: 1.0

HGNC Gene Nomenclature

gene_annotation

Query HUGO Gene Nomenclature Committee (HGNC) for authoritative gene names, aliases, previous symbols, gene family membership, locus type (protein-coding/lncRNA/pseudogene), chromosomal location, and cross-references to Ensembl/UniProt/OMIM/RefSeq/MGI. Gold-standard for gene symbol disambiguation and family classification.

Usage: 2
Performance: 1.0

Harmonizome Gene Sets

gene_set_enrichment

Query Harmonizome for gene-associated datasets across 114 gene-set libraries (KEGG, Reactome, OMIM, GO, GTEx, GWAS, ChEMBL, etc.). Enables rapid cross-database characterization of any gene.

Usage: 2
Performance: 1.0

Open Targets Mouse Phenotypes

model_organism

Query Open Targets for mouse model phenotypes associated with a gene. Returns experimentally observed phenotypes from knockout/transgenic mouse models aggregated from IMPC, MGI, and other sources. Provides phenotype class summaries (neurological, behavioral, metabolic, etc.) and biological model details (allelic composition, genetic background, literature). Useful for assessing whether modifying a gene has neurological or disease-relevant consequences in preclinical models.

Usage: 2
Performance: 1.0

Paper Figures

figure_extraction

Extract figures from a scientific paper by PMID. Returns figure captions, image URLs, and descriptions via PMC BioC API, Europe PMC full-text XML, or open-access PDF extraction. Use when you need to see visual evidence (pathway diagrams, heatmaps, microscopy) from a cited paper.

Usage: 2
Performance: 1.0

UniProt PTM Features

protein_annotation

Retrieve UniProt/Swiss-Prot curated protein feature annotations for a gene: post-translational modifications (phosphorylation, ubiquitination, acetylation, methylation), active sites, binding sites, glycosylation, disulfide bonds, signal peptides, and natural variants. Especially valuable for neurodegeneration research: tau (MAPT) phosphorylation landscape, alpha-synuclein (SNCA) PTMs, APP cleavage sites. Returns structured feature list with positions and evidence counts.

Usage: 2
Performance: 1.0

Ensembl VEP Variant Annotation

variant_annotation

Annotate genetic variants using Ensembl Variant Effect Predictor (VEP). Accepts dbSNP rsIDs or HGVS notation and returns predicted molecular consequences (missense, splice, frameshift), SIFT/PolyPhen-2 pathogenicity scores, amino acid changes, and impact classification (HIGH/MODERATE/LOW/MODIFIER). Essential for interpreting neurodegeneration GWAS variants: APOE4=rs429358, LRRK2 G2019S=rs34637584, GBA N370S=rs76763715.

Usage: 1
Performance: 1.0

Europe PMC Citations

literature_search

Get articles that cite a specific paper via Europe PMC

Usage: 1
Performance: 0.9

ProteomicsDB Protein Expression

expression_data

Query ProteomicsDB (TUM/Kuster lab) for mass spectrometry-based protein abundance across human tissues. Measures actual protein levels (iBAQ normalized intensity) across 60+ tissues/fluids. Complements RNA expression atlases (GTEx, HPA) with protein-level data — critical because mRNA and protein levels often diverge. Accepts gene symbol or UniProt accession.

Usage: 1
Performance: 1.0

AGR Gene Orthologs

comparative_genomics

Query Alliance of Genome Resources (AGR) for cross-species gene orthologs using 8 integrated algorithms (OrthoFinder, PANTHER, Ensembl Compara, OMA, InParanoid, Phylome, OrthoMCL, HGNC). Returns orthologs in mouse, rat, zebrafish, fly, worm, and yeast with prediction method count and best-score flag. Critical for identifying model organism experiments that validate neurodegeneration mechanisms.

Usage: 0
Performance: 1.0

Allen Brain Expression

data_retrieval

Query Allen Brain Atlas for ISH expression data across brain regions.

Usage: 0
Performance: 0.0

BindingDB Binding Affinity

drug_target_data

Query BindingDB for protein-ligand binding affinity measurements (Ki, Kd, IC50, EC50) from the primary literature. Accepts gene symbol or UniProt accession. Returns compound names, affinities, SMILES, PubMed IDs, and BindingDB URLs. Supports optional max_ic50_nm filter for potent binders. Essential for drug target prioritization (BACE1, LRRK2, GSK3B, CDK5).

Usage: 0
Performance: 1.0

CIViC Gene Variants

clinical_genetics

Get CIViC (Clinical Interpretation of Variants in Cancer) expert-curated clinical variant interpretations for a gene. Returns variant names, HGVS expressions, variant types, and CIViC URLs. Useful for variant effect classification, functional evidence in overlapping cancer/neurodegeneration genes (IDH1, PTEN, ATM, BRCA2, TP53), and mechanistic evidence for rare disease variant interpretation.

Usage: 0
Performance: 1.0

CellxGene Cell Type Expression

single_cell_expression

Query CZ CELLxGENE Discover 'Where is My Gene' (WMG v2) API for quantitative single-cell gene expression across human cell types. Returns mean log2(CPM+1) expression, percent expressing cells, and total cell count per cell-type per tissue. Brain-relevant cell types (microglia, neurons, astrocytes, oligodendrocytes, pericytes) are prioritized. Distinct from cellxgene_gene_expression which only lists datasets. Critical for neurodegeneration: understanding which brain cell types express TREM2/APOE/APP/SNCA/MAPT at the single-cell level.

Usage: 0
Performance: 1.0

ChEMBL Compound Search

compound_annotation

Search ChEMBL (EBI) for small molecules, drugs, and chemical probes by name. Returns structural information, clinical development phase, ATC pharmacological classification, molecular properties (MW, LogP, HBD/HBA), and InChI key. Compound-centric complement to chembl_drug_targets (gene-centric). Useful for looking up a drug candidate's ChEMBL ID and clinical phase, or finding related compound series.

Usage: 0
Performance: 1.0

ChEMBL Drug Targets

data_retrieval

Drug compounds and bioactivity data for a gene target from the ChEMBL database of bioactive molecules.

Usage: 0
Performance: 0.0

ClinGen Gene-Disease Validity

genetic_disease

Query ClinGen for curated gene-disease validity classifications.

Usage: 0
Performance: 1.0

ClinVar Variants

data_retrieval

Fetch clinical genetic variants from NCBI ClinVar. Returns pathogenicity, review status, and associated conditions.

Usage: 0
Performance: 0.0

ClinicalTrials.gov Search

clinical_data

Search ClinicalTrials.gov for clinical trials related to genes, diseases, or interventions. Returns NCT IDs, status, phase, conditions, interventions, enrollment, and sponsor info.

Usage: 0
Performance: 1.0

CrossRef Paper Metadata

literature_fetch

Retrieve publication metadata from CrossRef by DOI. Returns title, authors, journal, year, citation count, publisher, open access PDF link, subject areas, and funders. CrossRef indexes 150M+ scholarly works across all publishers — covers papers from any journal (not just biomedical), including clinical trials, meta-analyses, and specialized neuro-chemistry publications not in PubMed. Essential for DOI-based paper lookup when PMID is unavailable.

Usage: 0
Performance: 1.0

CrossRef Preprint Search

literature_search

Search for bioRxiv/medRxiv preprints via CrossRef API. Returns preprints indexed by CrossRef with title, authors, DOI, posted date, and abstract snippet. Useful for finding the latest findings before peer review, especially for fast-moving topics like tau pathology, neuroinflammation, TREM2 biology, Parkinson's alpha-synuclein research, and ALS/FTD mechanisms.

Usage: 0
Performance: 1.0

DGIdb Drug-Gene Interactions

drug_database

Query DGIdb for drug-gene interactions and druggability categories. Aggregates DrugBank, PharmGKB, TTD, ChEMBL, and clinical guideline data.

Usage: 0
Performance: 1.0

DisGeNET Disease Similarity

disease_genetics

Find diseases related to a query disease based on shared gene-disease associations in DisGeNET. Uses top-scoring genes for the query disease to identify comorbid or mechanistically related conditions. Helps map shared molecular substrates across neurodegeneration (e.g. diseases sharing APOE, LRRK2, or GBA pathways). Requires DISGENET_API_KEY for full results.

Usage: 0
Performance: 1.0

DisGeNET Disease-Gene

disease_gene_association

Get genes associated with a disease from DisGeNET with association scores.

Usage: 0
Performance: 1.0

DisGeNET Gene-Disease

gene_disease_association

Get disease associations for a gene from DisGeNET with scores and supporting PMIDs.

Usage: 0
Performance: 1.0

EBI eQTL Catalog

expression_qtl

Query EBI eQTL Catalog v2 for tissue/cell-type-specific expression QTLs. Covers 100+ studies beyond GTEx including microglia, iPSC-derived neurons, brain organoids, and disease cohorts. Returns rsid, variant, p-value, beta, SE, MAF, tissue_label, and qtl_group. Supports optional tissue_keyword filter (e.g. 'microglia', 'brain', 'iPSC').

Usage: 0
Performance: 1.0

Enrichr GO Enrichment

data_retrieval

Gene set enrichment against GO Biological Process. Enter a gene list to find enriched pathways.

Usage: 0
Performance: 0.0

Ensembl Gene Phenotype Associations

gene_disease

Retrieve multi-source phenotype and disease associations for a gene via Ensembl REST API. Aggregates disease-gene links from Orphanet, OMIM/MIM morbid, NHGRI-EBI GWAS Catalog, DECIPHER, ClinVar, and UniProtKB into a unified view with MONDO/HP/Orphanet ontology accessions and data source provenance. Distinct from omim_gene_phenotypes (OMIM only), clinvar_variants (variant-level), and monarch_disease_genes (Monarch KG). Useful for comprehensive disease context, rare disease associations via Orphanet, and multi-source phenotype validation.

Usage: 0
Performance: 1.0

FinnGen Disease Loci

genetic_associations

Query FinnGen R10 (N=520,000 Finns, 2,408 endpoints) for fine-mapped genetic loci. Returns SuSiE credible sets with lead SNP, gene, p-value, beta, and cross-trait annotations.

Usage: 0
Performance: 0.9

GWAS Catalog

data_retrieval

Genome-wide association study hits from the NHGRI-EBI GWAS Catalog. Query by gene or trait.

Usage: 0
Performance: 0.0

GWAS Catalog Variant Associations

genetic_associations

Query EBI GWAS Catalog for all phenotypic associations reported for a specific genetic variant (rsID). Returns distinct traits grouped by study count and best p-value. Complements gwas_genetic_associations (gene-centric): this tool lets you see the full phenotypic spectrum of a known variant. Essential for major neurodegeneration variants: APOE4=rs429358 (Alzheimer, lipids, cognition), LRRK2 G2019S=rs34637584 (Parkinson), GBA N370S=rs76763715.

Usage: 0
Performance: 1.0

Gene Info

data_retrieval

Look up any human gene — returns full name, summary, aliases, and gene type from MyGene.info.

Usage: 0
Performance: 0.0

Genomics England PanelApp

clinical_genetics

Query Genomics England PanelApp for disease gene panel memberships. Used by NHS Genomic Medicine Service for rare disease diagnosis. Returns panel name, disease group, confidence level (green/amber/red), mode of inheritance, penetrance, phenotypes, and gene evidence. Covers hereditary dementias, Parkinson, motor neuron disease, ataxias, and leukodystrophies.

Usage: 0
Performance: 1.0

Human Protein Atlas

data_retrieval

Protein expression across human tissues and cell types from the Human Protein Atlas. Includes subcellular localisation.

Usage: 0
Performance: 0.0

KEGG Disease Genes

disease_gene_association

Query the KEGG Disease database for curated disease entries and their causal/associated genes. Returns KEGG disease IDs, gene lists with subtypes (e.g. AD1/APP, AD17/TREM2), approved drugs, and linked pathways. Disease-centric view distinct from kegg_pathways (gene-to-pathway). Valuable for getting the official KEGG gene list for neurodegeneration diseases with clinical subtypes.

Usage: 0
Performance: 1.0

LIPID MAPS Lipid Search

metabolomics

Search LIPID MAPS for lipid structures, classifications, and biological roles via KEGG cross-reference. Returns LMID, formula, exact mass, main/sub class, InChIKey, HMDB/ChEBI IDs, and LIPID MAPS URL. Covers glycerophospholipids, sphingolipids, gangliosides, ceramides, oxysterols, and plasmalogens — all implicated in neurodegeneration (NPC, Alzheimer, Parkinson, ALS).

Usage: 0
Performance: 1.0

Metabolomics Workbench Search

dataset_discovery

Search the NIH Metabolomics Workbench for public metabolomics datasets. Returns study IDs, titles, species, analysis type (LC-MS, GC-MS, NMR), sample counts, and URLs. Useful for finding published CSF, brain region, or plasma metabolomics datasets for neurodegeneration diseases. Complements RNA/protein atlases with metabolite-level evidence.

Usage: 0
Performance: 1.0

MethBase Age Correlation

epigenetic_analysis

Get age-related methylation changes

Usage: 0
Performance: 0.0

MethBase Conservation

epigenetic_analysis

Query cross-species methylation conservation

Usage: 0
Performance: 0.0

MethBase CpG Islands

epigenetic_analysis

Get CpG island information for gene regions

Usage: 0
Performance: 0.0

MethBase Developmental Methylation

epigenetic_analysis

Query developmental stage methylation dynamics

Usage: 0
Performance: 0.0

MethBase Differential Analysis

epigenetic_analysis

Search differential methylation between conditions

Usage: 0
Performance: 0.0

MethBase Disease Methylation

epigenetic_search

Search disease-associated methylation changes

Usage: 0
Performance: 0.0

MethBase Gene Methylation

epigenetic_search

Query DNA methylation studies for specific genes

Usage: 0
Performance: 0.0

MethBase Tissue Comparison

epigenetic_analysis

Compare methylation patterns across tissues

Usage: 0
Performance: 0.0

MethBase Tissue Methylation

epigenetic_analysis

Get tissue-specific methylation patterns

Usage: 0
Performance: 0.0

Monarch Disease-Gene Associations

disease_gene_association

Query Monarch Initiative for disease-gene-phenotype associations from OMIM, ClinVar, HPO

Usage: 0
Performance: 0.8

NCBI GeneRIF Citations

literature_search

Retrieve NCBI Gene Reference Into Function (GeneRIF) linked publications for a gene. GeneRIF is a curated set of brief functional annotations each backed by a PubMed paper. Returns titles, journals, and URLs for the top-cited papers supporting gene function annotations — a fast path to high-quality mechanistic evidence for any neurodegeneration gene.

Usage: 0
Performance: 1.0

NCBI MeSH Term Lookup

ontology_lookup

Look up official NCBI Medical Subject Headings (MeSH) descriptors for diseases, pathways, and biological concepts. Returns the preferred descriptor name, hierarchical tree codes (e.g. C10.228.140.380 for Alzheimer Disease), scope note definition, and entry terms (all synonyms accepted by PubMed). Essential for standardizing terminology, discovering correct PubMed search terms, and navigating the MeSH vocabulary hierarchy from broad to specific.

Usage: 0
Performance: 1.0

NCBI SRA Search

dataset_discovery

Search the NCBI Sequence Read Archive (SRA) for public high-throughput sequencing datasets. Returns run accessions, titles, organisms, library strategy (RNA-Seq, ATAC-seq, ChIP-seq, scRNA-seq), and center names. Essential for finding publicly available neurodegeneration patient or model datasets for reanalysis.

Usage: 0
Performance: 1.0

NCBI dbSNP Variant Lookup

variant_annotation

Look up a variant in NCBI dbSNP for chromosomal position (GRCh38), reference/alternate alleles, functional class, gene context, and population allele frequencies from studies including 1000 Genomes, ALFA, and TOPMED. Complements Ensembl VEP (consequence prediction) and gnomAD (constraint) with official dbSNP registration metadata. Essential for characterizing a specific variant rsID.

Usage: 0
Performance: 1.0

OmniPath PTM Interactions

protein_interaction

Query OmniPath for post-translational modification (PTM) interactions integrating PhosphoSitePlus, PhosphoELM, SIGNOR, DEPOD, dbPTM, HPRD and 30+ other sources. Returns enzymes modifying the query gene AND substrates the gene modifies — phosphorylation, ubiquitination, acetylation, SUMOylation, methylation. Critical for tau kinase/phosphatase networks, alpha-synuclein ubiquitination (PINK1/Parkin), LRRK2 substrates, TDP-43/FUS in ALS.

Usage: 0
Performance: 1.0

Open Targets Disease Gene Scoring

disease_gene_association

Get top-scored target genes for a disease from Open Targets Platform, integrating 14+ evidence types (GWAS, rare genetics, expression, pathways, animal models, literature). Disease-centric: given a disease name, returns ranked gene list with overall association score and per-evidence-type breakdown. Essential for target prioritization debates. Distinct from open_targets_associations (gene→diseases).

Usage: 0
Performance: 1.0

Open Targets Evidence

data_retrieval

Disease associations and therapeutic evidence for a gene from Open Targets Platform, scored across multiple evidence sources.

Usage: 0
Performance: 0.0

Open Targets Safety Liability

drug_safety

Query Open Targets for target safety liability evidence curated from literature, clinical genetics, and animal toxicology. Returns safety events, affected tissues, inhibition effects, and supporting PMIDs. Critical for neurodegeneration drug discovery: CNS off-target effects, gene essentiality, tissue-specific toxicity, and high-risk indications for targets like LRRK2, BACE1, MAPT, CDK5, GSK3B.

Usage: 0
Performance: 1.0

OpenGWAS PheWAS Associations

genetic_associations

Query the MRC IEU OpenGWAS platform for phenome-wide association study (PheWAS) data. Returns all GWAS traits where a given variant reaches genome-wide significance across 10K+ studies including UK Biobank, FinnGen, and many GWAS consortia. Complements GWAS Catalog by covering unpublished and OpenGWAS-native summary statistics. Useful for characterizing pleiotropic variants like rs429358 (APOE4).

Usage: 0
Performance: 1.0

PGS Catalog Polygenic Risk Scores

genetic_associations

Search PGS Catalog (EMBL-EBI) for published polygenic risk score (PRS) models for a disease. Returns multi-SNP scoring models with variant counts, effect weight methods, publication DOI/year, and FTP download links. Covers 47+ Alzheimer disease PRS, 11+ Parkinson disease PRS, and hundreds of cognitive/brain trait models. Complements GWAS tools (single variants) with complete polygenic models ready for individual risk stratification. Essential for precision medicine analyses in neurodegeneration.

Usage: 0
Performance: 1.0

PanglaoDB Cell Markers

cell_type_annotation

Get canonical cell type marker genes from PanglaoDB scRNA-seq database. Covers microglia, astrocytes, neurons, OPCs, DAM, oligodendrocytes.

Usage: 0
Performance: 1.0

Paper Corpus Ingest

data_retrieval

Ingest a list of paper dicts into the local PaperCorpus cache for persistent storage. Each paper needs at least one ID (pmid, doi, or paper_id).

Usage: 0
Performance: 0.0

Paper Corpus Search

data_retrieval

Search across PubMed, Semantic Scholar, OpenAlex, and CrossRef with unified results and local caching. Use providers param to filter to specific sources.

Usage: 0
Performance: 0.0

Paper Corpus Session

data_retrieval

Start a stateful multi-page search session. Call again with incremented page param to fetch subsequent pages.

Usage: 0
Performance: 0.0

Paperclip Search

literature_search

Search Paperclip MCP for biomedical papers (8M+ across arXiv, bioRxiv, PMC, OpenAlex, OSF). Uses hybrid BM25+embedding search with TL;DR summaries.

Usage: 0
Performance: 1.0

PharmGKB Pharmacogenomics

drug_database

Query PharmGKB for pharmacogenomics drug-gene relationships. Returns clinical annotations linking genetic variants to drug response.

Usage: 0
Performance: 1.0

PubChem Target BioAssays

drug_discovery

Find bioassay-confirmed active compounds against a protein target in PubChem BioAssay. Returns assay IDs linked to the gene and CIDs of active compounds. Useful for identifying chemical probes, drug leads, validated inhibitors, and tool compounds for neurodegeneration target validation (BACE1, CDK5, GSK3B, LRRK2, GBA, HDAC6, PARP1).

Usage: 0
Performance: 1.0

PubMed Search

data_retrieval

Search PubMed for papers by keyword. Returns titles, authors, journals, PMIDs.

Usage: 0
Performance: 0.0

PubTator3 Gene Annotations

literature_annotation

Extract standardized gene, disease, chemical, and variant mentions from PubMed literature using NCBI PubTator3 AI annotation. Resolves synonyms to canonical IDs (NCBI Gene IDs, MeSH disease IDs). Useful for finding papers where specific genes and diseases co-occur, building evidence chains for hypotheses, and disambiguating biomedical entity names.

Usage: 0
Performance: 1.0

Reactome Pathway Search

pathway_analysis

Search Reactome for pathways by concept name and return constituent genes for each pathway. Complements the gene→pathway direction by going pathway-name→genes. Essential for building mechanistic gene sets for neurodegeneration analyses: mitophagy, tau clearance, NLRP3 inflammasome, amyloid processing, autophagy, endosomal sorting, complement activation, mTOR signaling.

Usage: 0
Performance: 1.0

Reactome Pathways

data_retrieval

Look up biological pathways a gene participates in, from Reactome.

Usage: 0
Performance: 0.0

STRING Functional Network

network_analysis

Query STRING DB for a multi-gene functional interaction network with per-channel evidence scores: coexpression (escore), experimental binding/co-IP (ascore), text-mining (tscore), curated database (dscore), neighborhood (nscore), and gene fusion (fscore). Unlike the basic STRING protein interactions tool, this returns the complete evidence breakdown for every pairwise link so you can distinguish mechanistic (experimental) from correlative (coexpression/text-mining) evidence. Essential for assessing gene module cohesion in AD (TREM2/TYROBP/SYK/PLCG2), PD (PINK1/PARK2/LRRK2/SNCA), and ALS/FTD (TDP-43/FUS/C9orf72) networks.

Usage: 0
Performance: 1.0

STRING Protein Interactions

data_retrieval

Find physical protein-protein interactions from the STRING database. Enter 2+ gene symbols.

Usage: 0
Performance: 0.0

UniChem Compound Xrefs

compound_annotation

Cross-reference a compound across 40+ chemistry and pharmacology databases via EBI UniChem. Given a compound name or InChI key, returns the compound's IDs in ChEMBL, DrugBank, PubChem, BindingDB, DrugCentral, KEGG, ChEBI, and more. Essential for integrating multi-database drug evidence and resolving compound identity across sources.

Usage: 0
Performance: 1.0

UniProt Protein Info

data_retrieval

Comprehensive protein annotation from UniProt/Swiss-Prot: function, domains, subcellular location, disease associations.

Usage: 0
Performance: 0.0

gProfiler Gene Enrichment

pathway_analysis

Functional enrichment analysis via g:Profiler (ELIXIR bioinformatics platform). Tests a gene list against GO Biological Process, GO Molecular Function, GO Cellular Component, KEGG, Reactome, WikiPathways, Human Phenotype Ontology, and TRANSFAC databases. Complements Enrichr with different statistical correction (g:SCS method), independent database versions, and broader ontology coverage including HP phenotypes and miRTarBase targets.

Usage: 0
Performance: 1.0

openFDA Adverse Events

clinical_pharmacology

Query FDA Adverse Event Reporting System (FAERS) via openFDA API for drug safety signals. Returns ranked adverse reactions with report counts and percentages from post-market surveillance. Useful for identifying neurological adverse events, CNS toxicity, ARIA risk, and safety liabilities for drugs being repurposed in neurodegeneration (donepezil, memantine, lecanemab, aducanumab, levodopa, riluzole).

Usage: 0
Performance: 1.0