ResearcharXivNEW

EpiBench: Verifiable Evaluation of AI Agents on Epigenomics Analysis

Muralidharan 2026-06-11
Harihara MuralidharanReema BaskarSoo Hee Lee

We introduce EpiBench, a verifiable benchmark for short-horizon epigenomics analysis. EpiBench evaluates whether agents can make well-defined analysis decisions from realistic workflow states and return deterministically gradable answers. The benchmark includes 106 evaluations across CUT\&Tag/CUT\&RUN, ATAC-seq, ChIP-seq, and DNA methylation workflows. Across 5,088 valid trajectories from 16 model

Read on arXiv
Data aggregated and editorially reviewed by TrendMing.

Key Contributions

  • We introduce EpiBench, a verifiable benchmark for short-horizon epigenomics analysis.
  • EpiBench evaluates whether agents can make well-defined analysis decisions from realistic workflow states and return deterministically gradable answers.
  • The benchmark includes 106 evaluations across CUT\&Tag/CUT\&RUN, ATAC-seq, ChIP-seq, and DNA methylation workflows.

Research Themes

AIResearch