EpiBench: Verifiable Evaluation of AI Agents on Epigenomics Analysis

Muralidharan 2026-06-11

Harihara Muralidharan, Reema Baskar, Soo Hee Lee

We introduce EpiBench, a verifiable benchmark for short-horizon epigenomics analysis. EpiBench evaluates whether agents can make well-defined analysis decisions from realistic workflow states and return deterministically gradable answers. The benchmark includes 106 evaluations across CUT&Tag/CUT&RUN, ATAC-seq, ChIP-seq, and DNA methylation workflows. Across 5,088 valid trajectories from 16 model



Key Contributions

We introduce EpiBench, a verifiable benchmark for short-horizon epigenomics analysis.
EpiBench evaluates whether agents can make well-defined analysis decisions from realistic workflow states and return deterministically gradable answers.
The benchmark includes 106 evaluations across CUT&Tag/CUT&RUN, ATAC-seq, ChIP-seq, and DNA methylation workflows.
