EpiBench: Verifiable Evaluation of AI Agents on Epigenomics Analysis
Muralidharan 2026-06-11
Harihara Muralidharan, Reema Baskar, Soo Hee Lee
We introduce EpiBench, a verifiable benchmark for short-horizon epigenomics analysis. EpiBench evaluates whether agents can make well-defined analysis decisions from realistic workflow states and return deterministically gradable answers. The benchmark includes 106 evaluations across CUT&Tag/CUT&RUN, ATAC-seq, ChIP-seq, and DNA methylation workflows. Across 5,088 valid trajectories from 16 model
Read on arXiv. Data aggregated and editorially reviewed by TrendMing.
Key Contributions
- We introduce EpiBench, a verifiable benchmark for short-horizon epigenomics analysis.
- EpiBench evaluates whether agents can make well-defined analysis decisions from realistic workflow states and return deterministically gradable answers.
- The benchmark includes 106 evaluations across CUT&Tag/CUT&RUN, ATAC-seq, ChIP-seq, and DNA methylation workflows.
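
To make "deterministically gradable" concrete, the following is a minimal sketch of what such a grader could look like. It is an illustration only and not EpiBench's actual grading code: the function names `grade` and `pass_rate`, the answer types, and the tolerance parameter are all assumptions introduced here. The idea is that the same expected value and the same agent answer always produce the same pass/fail verdict, with no human or model judge involved.

```python
import math
from typing import Iterable


def grade(expected, answer, rel_tol: float = 0.0) -> bool:
    """Hypothetical deterministic grader (not the benchmark's real one).

    Numbers are compared within a relative tolerance, strings are compared
    case-insensitively after trimming whitespace, and sets are compared by
    membership regardless of order.
    """
    # bool is a subclass of int in Python, so exclude it from the numeric path.
    if isinstance(expected, (int, float)) and not isinstance(expected, bool):
        if isinstance(answer, bool) or not isinstance(answer, (int, float)):
            return False
        return math.isclose(answer, expected, rel_tol=rel_tol, abs_tol=0.0)
    if isinstance(expected, str):
        return isinstance(answer, str) and answer.strip().lower() == expected.strip().lower()
    if isinstance(expected, (set, frozenset)):
        if not isinstance(answer, (set, frozenset, list, tuple)):
            return False
        return set(answer) == set(expected)
    raise TypeError(f"unsupported expected type: {type(expected).__name__}")


def pass_rate(verdicts: Iterable[bool]) -> float:
    """Fraction of graded trajectories that passed."""
    verdicts = list(verdicts)
    if not verdicts:
        raise ValueError("no verdicts to aggregate")
    return sum(verdicts) / len(verdicts)
```

Under these assumptions, a numeric answer such as a peak count would pass when it falls within the stated tolerance of the reference value, a categorical answer would pass only on an exact normalized match, and per-model scores would be the mean of the resulting verdicts.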