ENCODE Toolkit

效率与工作流

by ammawla

20 MCP tools + 48 skills for ENCODE Project genomics — search, download, pipelines

什么是 ENCODE Toolkit

20 MCP tools + 48 skills for ENCODE Project genomics — search, download, pipelines

README

ENCODE Toolkit — Genomics Research Infrastructure for Claude

<!-- mcp-name: io.github.ammawla/encode-toolkit -->

License: AGPL-3.0 Python 3.10+ Version Status Skills Tools Pipelines Databases<br> Tests Coverage Security Claude Code Provenance smithery badge <br> PyPI version PyPI Downloads npm version NPM Downloads GitHub Clones<br> DOI

<a href="https://glama.ai/mcp/servers/ammawla/encode-toolkit"> <img width="380" height="200" src="https://glama.ai/mcp/servers/ammawla/encode-toolkit/badge" /> </a>

Search ENCODE, cross-reference 14 databases, run 7 analysis pipelines, and generate publication-ready methods — all from natural language in Claude Code.

Start from ENCODE but go everywhere: discover histone peaks, cross-reference with GWAS variants, check ClinVar pathogenicity, pull GTEx expression, analyze TF binding motifs from JASPAR, run pipelines, and generate publication-ready methods with full provenance — in one conversation.


Citation Notes

If you use ENCODE-Toolkit, please cite:

Alex M. Mawla. (2026). ENCODE-Toolkit: an MCP server, Claude plugin, and skills suite for ENCODE genomic data access and analysis. Zenodo. https://doi.org/10.5281/zenodo.18917511

BibTeX

bibtex
@software{mawla_2026_encode_toolkit,
  author  = {Mawla, Alex M.},
  title   = {ENCODE-Toolkit: an MCP server, Claude plugin, and skills suite for ENCODE genomic data access and analysis},
  year    = {2026},
  publisher = {Zenodo},
  doi     = {10.5281/zenodo.18917511},
  url     = {https://doi.org/10.5281/zenodo.18917511}
}

Quick Start

Claude Code Plugin (recommended)

Start a new Claude Code session and enter:

code
/plugin marketplace add ammawla/encode-toolkit

/plugin install encode-toolkit

That's it. All 20 tools, 47 skills, and the MCP connector are now available.

<details> <summary><strong>MCP-only install (tools only, no skills)</strong></summary>

If you only need the 20 MCP tools without the 47 workflow skills:

bash
claude mcp add encode -- uvx encode-toolkit
</details> <details> <summary><strong>Other editors and platforms</strong></summary>

npx (Node.js)

bash
npx encode-toolkit

Or in MCP client config: { "command": "npx", "args": ["encode-toolkit"] }

pip install

bash
pip install encode-toolkit

Then use encode-toolkit as the command in any MCP client configuration:

json
{
  "mcpServers": {
    "encode": {
      "command": "encode-toolkit"
    }
  }
}
</details> <details> <summary><strong>Claude Desktop (MCP only)</strong></summary>

Add to your claude_desktop_config.json:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json Windows: %APPDATA%\Claude\claude_desktop_config.json

json
{
  "mcpServers": {
    "encode": {
      "command": "uvx",
      "args": ["encode-toolkit"]
    }
  }
}

No installation needed when using uvx. Just add the config and restart Claude.

</details> <details> <summary><strong>VS Code / Copilot</strong></summary>

Add to .vscode/mcp.json in your workspace:

json
{
  "mcp": {
    "servers": {
      "encode": {
        "command": "uvx",
        "args": ["encode-toolkit"]
      }
    }
  }
}
</details> <details> <summary><strong>Cursor</strong></summary>

Add to .cursor/mcp.json:

json
{
  "mcpServers": {
    "encode": {
      "command": "uvx",
      "args": ["encode-toolkit"]
    }
  }
}
</details> <details> <summary><strong>Windsurf</strong></summary>

Add to .windsurf/mcp.json:

json
{
  "mcpServers": {
    "encode": {
      "command": "uvx",
      "args": ["encode-toolkit"]
    }
  }
}
</details>

Connected Databases

ENCODE Toolkit integrates 14 databases through live API tools and guided skills.

DatabaseAccess MethodUse Case
ENCODE20 MCP tools (live API)ChIP-seq, ATAC-seq, RNA-seq, Hi-C, WGBS, CUT&RUN data
GTExREST API (skill)Tissue-specific gene expression across 54 tissues
ClinVarE-utilities (skill)Variant clinical significance and pathogenicity
GWAS CatalogREST API (skill)Trait-variant associations from genome-wide studies
JASPARREST API (skill)Transcription factor binding motif profiles
CellxGeneCensus API (skill)Single-cell expression atlas across tissues
gnomADGraphQL (skill)Population allele frequencies and gene constraint
EnsemblREST API (skill)VEP annotation, Regulatory Build, coordinate liftover
UCSC Genome BrowserREST API (skill)cCRE tracks, TF clusters, sequence retrieval
GEOE-utilities (skill)Complementary expression/epigenomic datasets
PubMedMCP serverLiterature search and citation
bioRxivMCP serverPreprint discovery
ClinicalTrials.govMCP serverClinical trial cross-reference
Open TargetsMCP serverDrug target identification

What You Can Ask Claude

Search and explore

  • "Find all histone ChIP-seq experiments for human pancreas tissue"
  • "What ATAC-seq data is available for mouse brain?"
  • "Search for RNA-seq on GM12878 cell line"
  • "What histone marks have ChIP-seq data for pancreas?"

Download and track

  • "Download all BED files from ENCSR133RZO to ~/data/encode"
  • "Track experiment ENCSR133RZO with its publications"
  • "Export citations for my tracked experiments as BibTeX"

Cross-reference databases

  • "What GWAS variants overlap islet enhancers?"
  • "Check ClinVar pathogenicity for rs7903146"
  • "Pull GTEx expression for TCF7L2 across tissues"
  • "Find JASPAR motifs for HNF4A binding sites"

Run pipelines

  • "Set up a ChIP-seq pipeline for my H3K27ac experiments"
  • "Run ATAC-seq analysis with ENCODE-standard QC thresholds"

Generate methods and provenance

  • "Log that I created filtered_peaks.bed from ENCSR133RZO using bedtools"
  • "Generate a methods section for my analysis with citations"
<details> <summary><strong>More example prompts</strong></summary>

Experiment details

  • "Show me the full details for experiment ENCSR133RZO"
  • "What files are available for ENCSR133RZO?"
  • "List only the BED files from ENCSR133RZO"

Bulk downloads

  • "Download all FASTQs from human pancreas ChIP-seq to /data/fastqs"
  • "Get the IDR thresholded peaks from these experiments"
  • "Download the bigWig signal tracks for H3K27me3 in GRCh38"

Compatibility analysis

  • "Are experiments ENCSR133RZO and ENCSR000AKS compatible for combined analysis?"
  • "Compare these two ChIP-seq experiments"

Provenance chains

  • "Show me the provenance chain for my derived files"
  • "What files have I derived from ENCSR133RZO?"
</details>

The Problem

Using genomics databases today means:

  1. Navigate web portals, click through dozens of filters
  2. Manually find the right experiments and files across multiple databases
  3. Write custom scripts to batch download
  4. Lose track of which files came from where

With ENCODE Toolkit, just tell Claude what you need:

"Find all histone ChIP-seq data for human pancreas tissue"

Claude searches ENCODE, returns a structured table of 66 experiments with targets, replicates, and file counts. Downloads are organized by experiment with MD5 verification and full provenance tracking.


Available Tools (20)

Five core tools are shown below. The remaining 15 are collapsed for readability.

encode_search_experiments

Search ENCODE experiments with 20+ filters.

ParameterTypeDescription
assay_titlestringAssay type: "Histone ChIP-seq", "ATAC-seq", "RNA-seq", "Hi-C", etc.
organismstringSpecies (default: "Homo sapiens")
organstringOrgan: "pancreas", "brain", "liver", "heart", "kidney", etc.
biosample_typestring"tissue", "cell line", "primary cell", "organoid"
targetstringChIP target: "H3K27me3", "H3K4me3", "CTCF", etc.
biosample_term_namestringSpecific biosample: "GM12878", "HepG2", etc.
limitintMax results (default: 25)

encode_get_experiment

Get full details for a single experiment including all files, quality metrics, and audit info.

ParameterTypeDescription
accessionstringExperiment ID (e.g., "ENCSR133RZO")

encode_download_files

Download specific files by accession to a local directory.

ParameterTypeDescription
file_accessionslist[str]File IDs to download (e.g., ["ENCFF635JIA"])
download_dirstringLocal path to save files
organize_bystring"flat", "experiment", "format", "experiment_format"
verify_md5boolVerify file integrity (default: true)

encode_batch_download

Search + download in one step. Runs in preview mode by default.

ParameterTypeDescription
download_dirstringLocal path to save files
file_formatstringFile format to download
assay_titlestringAssay type filter
organstringOrgan filter
dry_runboolPreview only (default: true). Set false to download.

encode_track_experiment

Track an experiment locally with its publications, methods, and pipeline info.

ParameterTypeDescription
accessionstringExperiment ID to track
fetch_publicationsboolFetch associated publications (default: true)
fetch_pipelinesboolFetch pipeline/analysis info (default: true)
notesstringOptional notes to attach
<details> <summary><strong>Search and discovery tools (4)</strong></summary>

encode_list_files

List files for a specific experiment with format/type filters.

ParameterTypeDescription
experiment_accessionstringExperiment ID
file_formatstring"fastq", "bam", "bed", "bigWig", "bigBed", etc.
output_typestring"reads", "peaks", "signal", "alignments", etc.
assemblystring"GRCh38", "mm10", etc.
preferred_defaultboolOnly return recommended files

encode_search_files

Search files across all experiments with combined experiment + file filters.

ParameterTypeDescription
file_formatstringFile format filter
assay_titlestringAssay type of parent experiment
organstringOrgan of parent experiment
targetstringChIP/CUT&RUN target
output_typestringOutput type filter
assemblystringGenome assembly

encode_get_metadata

List valid filter values for any parameter.

ParameterTypeDescription
metadata_typestring"assays", "organisms", "organs", "biosample_types", "file_formats", "output_types", "assemblies"

encode_get_facets

Get live counts from ENCODE showing what data exists for given filters.

ParameterTypeDescription
assay_titlestringPre-filter by assay
organismstringPre-filter by organism
organstringPre-filter by organ
</details> <details> <summary><strong>File and credential tools (2)</strong></summary>

encode_get_file_info

Get detailed metadata for a single file.

ParameterTypeDescription
accessionstringFile ID (e.g., "ENCFF635JIA")

encode_manage_credentials

Store, check, or clear ENCODE credentials for restricted data access.

ParameterTypeDescription
actionstring"store", "check", or "clear"
access_keystringENCODE access key (for "store")
secret_keystringENCODE secret key (for "store")
</details> <details> <summary><strong>Tracking and provenance tools (4)</strong></summary>

encode_list_tracked

List all experiments in your local tracker with metadata, publication counts, and derived file counts.

ParameterTypeDescription
assay_titlestringFilter by assay type
organismstringFilter by organism
organstringFilter by organ

encode_get_citations

Get publications for tracked experiments. Export as BibTeX or RIS for reference managers.

ParameterTypeDescription
accessionstringSpecific experiment (or all if omitted)
export_formatstring"json" (default), "bibtex", or "ris"

encode_compare_experiments

Analyze whether two experiments are compatible for combined analysis.

ParameterTypeDescription
accession1stringFirst experiment ID
accession2stringSecond experiment ID

encode_summarize_collection

Get grouped statistics of your tracked experiment collection.

ParameterTypeDescription
assay_titlestringFilter by assay type
organismstringFilter by organism
organstringFilter by organ
</details> <details> <summary><strong>Provenance and export tools (4)</strong></summary>

encode_log_derived_file

Log a file you created from ENCODE data for provenance tracking.

ParameterTypeDescription
file_pathstringPath to your derived file
source_accessionslist[str]ENCODE accessions this was derived from
descriptionstringWhat the file contains
tool_usedstringTool/software used
parametersstringCommand or parameters used

encode_get_provenance

View provenance chains from derived files back to source ENCODE data.

ParameterTypeDescription
file_pathstringGet provenance for a specific file
source_accessionstringList all files derived from an accession

encode_export_data

Export tracked experiments as a table (CSV, TSV, or JSON) for Excel, R, pandas.

ParameterTypeDescription
formatstring"csv" (default), "tsv", or "json"
assay_titlestringFilter by assay type

encode_link_reference

Link external references (PubMed, bioRxiv, ClinicalTrials, GEO) to tracked experiments.

ParameterTypeDescription
experiment_accessionstringENCODE experiment accession
reference_typestring"pmid", "doi", "nct_id", "preprint_doi", "geo_accession", "other"
reference_idstringThe identifier value

encode_get_references

Get external references linked to tracked experiments for cross-server workflows.

ParameterTypeDescription
experiment_accessionstringFilter by experiment (optional)
reference_typestringFilter by type (optional)
</details>

Authentication

Most ENCODE data is public and requires no authentication. Just install and use.

For restricted/unreleased data, ask Claude: "Store my ENCODE credentials"

Credentials are encrypted using your OS keyring (macOS Keychain, Linux Secret Service, Windows Credential Locker) and never stored in plaintext. Get your access keys from your ENCODE profile.


Plugin Skills (47)

When installed as a Claude Code plugin, ENCODE Toolkit includes 47 literature-backed workflow skills that guide Claude through complex genomics tasks. Each analysis skill includes evidence-based quality thresholds, assay-specific metrics, and citations to primary literature.

Core Skills

SkillDescription
setupInstall and configure the ENCODE Toolkit server
search-encodeSearch and explore ENCODE experiments and files
download-encodeDownload files with organization and verification
track-experimentsTrack experiments, citations, and provenance locally
cross-referenceConnect ENCODE data to PubMed, bioRxiv, ClinicalTrials.gov
<details> <summary><strong>Analysis skills (9)</strong></summary>
SkillDescription
quality-assessmentEvaluate experiment quality using ENCODE metrics — assay-specific thresholds for ChIP-seq (FRiP, NSC, RSC, NRF, IDR), ATAC-seq (TSS enrichment, NFR ratio), RNA-seq (mapping rate, gene body coverage), WGBS (bisulfite conversion, CpG coverage), Hi-C (cis/trans ratio), and CUT&RUN/CUT&Tag. Backed by Landt 2012, Buenrostro 2013, ENCODE Phase 3 (2020), Li 2011
integrative-analysisCombine multiple experiments with batch effect awareness — integration strategies (peak overlap, signal correlation, DiffBind, DESeq2, ChromHMM, ABC model). Backed by Ernst & Kellis 2012, Ross-Innes 2012, Love 2014, Fulco 2019
regulatory-elementsDiscover enhancers, promoters, insulators from combinatorial histone marks — ENCODE cCRE classification (926,535 elements), ChromHMM state interpretation. Backed by ENCODE Phase 3 (2020), Roadmap Epigenomics (2015), Whyte 2013
epigenome-profilingBuild comprehensive chromatin state profiles — three-tiered histone panels, ChromHMM 15-state model, bivalent chromatin analysis. References the chromatin biology catalog
compare-biosamplesCompare experiments across tissues and cell types — biosample hierarchy, tissue-specific regulation, batch effect detection. Backed by Roadmap Epigenomics (2015), Leek 2010
visualization-workflowGenerate publication-quality visualizations: genome browser tracks, heatmaps, and signal profiles
motif-analysisDiscover and analyze TF binding motifs in regulatory regions using HOMER, MEME, and JASPAR
peak-annotationAnnotate genomic peaks with features (promoter/enhancer/intergenic), nearest genes, and functional categories
batch-analysisBatch processing and QC screening across multiple ENCODE experiments with systematic quality filtering
</details> <details> <summary><strong>Functional genomics skills (1)</strong></summary>
SkillDescription
functional-screen-analysisAnalyze CRISPR screens, MPRA, and STARR-seq data from ENCODE — MAGeCK, BAGEL2, MPRAflow integration
</details> <details> <summary><strong>Data aggregation skills (4)</strong></summary>
SkillDescription
histone-aggregationUnion merge of histone ChIP-seq peaks across studies — signalValue-based noise filtering, sample-of-origin tagging, ENCODE blacklist removal. Backed by ChIP-Atlas (Oki 2018), Amemiya 2019, Perna 2024
accessibility-aggregationUnion merge of ATAC-seq and DNase-seq peaks — cross-platform integration, peak summit preservation. Backed by Corces 2017, Amemiya 2019, Zhao 2020
hic-aggregationUnion catalog of Hi-C chromatin loops (BEDPE) — resolution-aware anchor matching, loop caller concordance tracking. Backed by Loop Catalog (Reyna 2025), Mustache (Roayaei Ardakany 2020)
methylation-aggregationAggregate WGBS methylation profiles — per-CpG weighted averaging, HMR/UMR/PMD identification. Backed by Schultz 2015, DMRcate (Peters 2021), Zhou 2020
</details> <details> <summary><strong>Multi-omics and meta-analysis skills (2)</strong></summary>
SkillDescription
scrna-meta-analysisCross-study meta-analysis of scRNA-seq data — reproducibility assessment, TIN-based quality filtering, ambient RNA quantification. Backed by Tran 2020, Luecken & Theis 2019, Stuart 2019, Korsunsky 2019
multi-omics-integrationIntegrate RNA-seq, ATAC-seq, Histone ChIP-seq, and TF ChIP-seq — ABC model regulatory predictions, signal correlation. Backed by Fulco 2019, Corces 2018, ENCODE Phase 3 (2020)
</details> <details> <summary><strong>Workflow skills (7)</strong></summary>
SkillDescription
data-provenanceFull reproducibility tracking — tool versions, reference files, scripts, exact commands, timestamps, source-to-derived provenance chains
cite-encodeGenerate proper citations, BibTeX/RIS export, data availability statements
variant-annotationAnnotate GWAS/disease variants with ENCODE functional data — variant-to-gene mapping via cCREs. Backed by Finucane 2015, Maurano 2012
pipeline-guideUnderstand ENCODE uniform analysis pipelines and output types — pipeline specifications, Nextflow integration
single-cell-encodeWork with scRNA-seq and scATAC-seq data — platform comparison, cross-study integration, WNN multimodal analysis. Backed by Hao 2021, Stuart 2019
disease-researchDisease-focused workflows — GWAS variant interpretation, disease-tissue mapping, heritability enrichment, drug target identification via Open Targets. Backed by Buniello 2019, Finucane 2015
publication-trustPublication integrity assessment — 5-level trust scoring, retraction/erratum detection, citation analysis. Integrates with PubMed, bioRxiv, and Consensus
bioinformatics-installerInstall all bioinformatics tools for ENCODE analyses — 7 conda environment YAMLs, 3 install scripts, 134+ tools across ChIP-seq, ATAC-seq, RNA-seq, WGBS, Hi-C, DNase-seq, CUT&RUN
scientific-writingGenerate publication-ready methods sections, figure legends, supplementary tables, and data availability statements with full tool citations
liftover-coordinatesConvert genomic coordinates between assembly versions (hg19/hg38, mm9/mm10) using UCSC liftOver, CrossMap, Ensembl REST API, and rtracklayer
</details> <details> <summary><strong>External database skills (9)</strong></summary>
SkillDescription
gtex-expressionQuery GTEx tissue expression data via REST API for gene expression context across 54 tissues
clinvar-annotationAnnotate variants with ClinVar clinical significance, pathogenicity, and review status
cellxgene-contextQuery CellxGene single-cell atlas for cell type expression context across tissues
gwas-catalogSearch NHGRI-EBI GWAS Catalog for trait associations, risk alleles, and study metadata
jaspar-motifsQuery JASPAR database for transcription factor binding motifs and matrix profiles
ensembl-annotationEnsembl VEP variant annotation, Regulatory Build, coordinate liftover, gene lookup via REST API
geo-connectorSearch NCBI GEO for complementary datasets, cross-reference with ENCODE, FTP downloads
gnomad-variantsgnomAD population allele frequencies, gene constraint (LOEUF/pLI), structural variants via GraphQL
ucsc-browserUCSC Genome Browser REST API for cCRE tracks, TF binding clusters, and sequence retrieval
</details> <details> <summary><strong>Pipeline execution skills (7)</strong></summary>
PipelineAssayAlignerCaller
pipeline-chipseqChIP-seqBWA-MEMMACS2 + IDR
pipeline-atacseqATAC-seqBowtie2MACS2 (Tn5-adjusted)
pipeline-rnaseqRNA-seqSTARRSEM + Kallisto
pipeline-wgbsWGBSBismarkMethylDackel
pipeline-hicHi-CBWAJuicer + HiCCUPS
pipeline-dnaseseqDNase-seqBWAHotspot2
pipeline-cutandrunCUT&RUNBowtie2SEACR

Each pipeline includes a SKILL.md overview, 5-stage reference files (preprocessing through QC), a complete Nextflow DSL2 pipeline, a Dockerfile, and deployment configurations for local, SLURM, GCP, and AWS.

</details> <details> <summary><strong>Reference files</strong></summary>
FileDescription
skills/histone-aggregation/references/histone-marks-reference.mdComprehensive chromatin biology catalog (1,442 lines) — 21 histone marks with writers/erasers/readers, 5 novel acylation marks, ChromHMM state models (5 to 51 states), TF co-binding patterns, chromatin remodeling complexes, DNA methylation-chromatin interplay, nucleosome dynamics, 3D genome organization, chromatin in disease. 74 primary references
skills/*/references/literature.md33 per-skill literature reference documents — ~250 papers cataloged with DOI, PMID, citation counts, and skill-relevant key findings
</details>

Why ENCODE Toolkit

Most genomics tools give you one thing. ENCODE Toolkit gives you the full research loop:

CapabilityENCODE ToolkitTypical MCP servers
Live database access20 tools across 14 databasesSingle database, read-only
Executable pipelines7 Nextflow DSL2 pipelines with Docker and cloud configsNone
Provenance trackingFull audit trail from source data to derived filesNone
Publication outputBibTeX/RIS citations, auto-generated methods sectionsNone
Literature backing100+ primary references with assay-specific QC thresholdsNone
Workflow skills47 guided skills covering search to publicationStatic documentation

Supported Assay Types

<details> <summary><strong>50+ assay types across 7 categories</strong></summary>
CategoryAssays
Histone/ChromatinHistone ChIP-seq, TF ChIP-seq, ATAC-seq, DNase-seq, CUT&RUN, CUT&Tag, MNase-seq
TranscriptionRNA-seq, total RNA-seq, small RNA-seq, long read RNA-seq, CAGE, RAMPAGE, PRO-seq, GRO-seq
3D GenomeHi-C, intact Hi-C, Micro-C, ChIA-PET, HiChIP, PLAC-seq, 5C
DNA MethylationWGBS, RRBS, MeDIP-seq, MRE-seq
FunctionalSTARR-seq, MPRA, CRISPR screen, eCLIP, iCLIP
Single CellscRNA-seq, snATAC-seq, 10x multiome, SHARE-seq, Parse SPLiT-seq
PerturbationCRISPRi + RNA-seq, shRNA + RNA-seq, siRNA + RNA-seq

Supported file formats: fastq bam bed bigWig bigBed tsv csv hic tagAlign bedpe pairs fasta vcf tar

</details>

Security and Privacy

  • 100% local execution — no telemetry, no analytics, no tracking
  • Credentials encrypted at rest via OS keyring with Fernet fallback
  • Certificate verification enforced — no verify=False
  • Rate limited to respect ENCODE's 10 req/sec policy
  • MD5 verification on all downloads by default
  • No data leaves your machine except queries to public APIs over HTTPS

Vignettes

Step-by-step walkthroughs showing real Claude sessions, including actual API output and scientific interpretation.

VignetteSkills Demonstrated
01 — Discovery & SearchFacets, search, metadata, quality-aware selection
02 — Download & TrackFile listing, download, tracking, citations, provenance
03 — Epigenomics WorkflowHistone marks, ATAC-seq, aggregation skills
04 — Variant & Disease ResearchGWAS catalog, ClinVar, GTEx, JASPAR, gnomAD
05 — Expression & Single-CellRNA-seq, scRNA-seq, GTEx, CellxGene, meta-analysis
06 — Motif & Regulatory AnalysisTF ChIP-seq, chromatin states, HOMER/MEME
07 — 3D Genome & MethylationHi-C loops, WGBS methylation, integrative analysis
08 — Pipeline ExecutionChIP-seq/ATAC-seq/RNA-seq pipelines, Nextflow
09 — Cross-Reference & IntegrationGEO, PubMed, Ensembl, UCSC, multi-omics
<details> <summary><strong>Individual skill vignettes</strong></summary>

Every skill has a dedicated vignette in docs/skill-vignettes/ with a complete example session. Highlights:

SkillVignette Scenario
data-provenanceDownload, blacklist-filter, liftover, auto-generate methods section
histone-aggregationUnion merge of H3K27ac across 5 pancreas experiments
variant-annotationrs7903146 in TCF7L2 with islet enhancer evidence scoring
pipeline-chipseqFull Nextflow pipeline execution with ENCODE QC thresholds
gwas-catalogT2D GWAS variants overlaid on islet H3K27ac enhancers
publication-trustTrust assessment of artemisinin transdifferentiation claim
scrna-meta-analysis3-study islet integration following Mawla et al. 2019 framework

See the full showcase for 15 detailed examples.

</details>

Development

bash
git clone https://github.com/ammawla/encode-toolkit.git
cd encode-toolkit
python -m venv .venv
source .venv/bin/activate
pip install -e ".[dev]"

Run the server locally:

bash
encode-toolkit

Run tests:

bash
pytest

Troubleshooting

<details> <summary><strong>"Server not found" in Claude Desktop</strong></summary>
  • Make sure you restarted Claude Desktop after adding the config
  • Verify uvx is installed: pip install uv or curl -LsSf https://astral.sh/uv/install.sh | sh
</details> <details> <summary><strong>"Connection refused" or timeout errors</strong></summary>
  • Check your internet connection
  • ENCODE API rate limit is 10 requests/sec — the server handles this automatically
</details> <details> <summary><strong>Downloads fail with 403</strong></summary>
  • The file may require authentication. Ask Claude: "Store my ENCODE credentials"
  • Or check if the file status is "released" on encodeproject.org
</details> <details> <summary><strong>No results for a search</strong></summary>
  • Try broader filters (remove biosample_type or organ)
  • Use encode_get_facets to see what data actually exists for your filters
  • Use encode_get_metadata to check valid filter values
</details>

Author

Dr. Alex M. Mawla, PhD

License

AGPL-3.0. See LICENSE for full terms.

For commercial licensing inquiries: ammawla@ucdavis.edu

常见问题

ENCODE Toolkit 是什么?

20 MCP tools + 48 skills for ENCODE Project genomics — search, download, pipelines

相关 Skills

技能工坊

by anthropics

Universal
热门

覆盖 Skill 从创建到迭代优化全流程:起草能力、补测试提示、跑评测与基准方差分析,并持续改写内容和描述,提升效果与触发准确率。

技能工坊把技能从创建、迭代到评测串成闭环,方差分析加描述优化,特别适合把触发准确率打磨得更稳。

效率与工作流
未扫描149.6k

PPT处理

by anthropics

Universal
热门

处理 .pptx 全流程:创建演示文稿、提取和解析幻灯片内容、批量修改现有文件,支持模板套用、合并拆分、备注评论与版式调整。

涉及PPTX的创建、解析、修改到合并拆分都能一站搞定,连备注、模板和评论也能处理,做演示文稿特别省心。

效率与工作流
未扫描149.6k

PDF处理

by anthropics

Universal
热门

遇到 PDF 读写、文本表格提取、合并拆分、旋转加水印、表单填写或加解密时直接用它,也能提取图片、生成新 PDF,并把扫描件通过 OCR 变成可搜索文档。

PDF杂活别再来回切工具了,文本表格提取、合并拆分到OCR识别一次搞定,连扫描件也能变可搜索。

效率与工作流
未扫描149.6k

相关 MCP Server

文件系统

编辑精选

by Anthropic

热门

Filesystem 是 MCP 官方参考服务器,让 LLM 安全读写本地文件系统。

这个服务器解决了让 Claude 直接操作本地文件的痛点,比如自动整理文档或生成代码文件。适合需要自动化文件处理的开发者,但注意它只是参考实现,生产环境需自行加固安全。

效率与工作流
87.1k

by wonderwhy-er

热门

Desktop Commander 是让 AI 直接执行终端命令、管理文件和进程的 MCP 服务器。

这工具解决了 AI 无法直接操作本地环境的痛点,适合需要自动化脚本调试或文件批量处理的开发者。它能让你用自然语言指挥终端,但权限控制需谨慎,毕竟让 AI 执行 rm -rf 可不是闹着玩的。

效率与工作流
6.2k

EdgarTools

编辑精选

by dgunning

热门

EdgarTools 是无需 API 密钥即可解析 SEC EDGAR 财报的开源 Python 库。

这个工具解决了金融数据获取的痛点——直接让 AI 读取结构化财报,比如让 Claude 分析苹果的 10-K 文件。适合量化分析师或金融开发者快速构建数据管道。但注意,它依赖 SEC 网站稳定性,高峰期可能延迟。

效率与工作流
2.3k

评论