Genomics

Computational genomics, gene expression analysis, and DNA sequence modeling.

q-bio.GN · 53 papers

A Combinatorial Optimisation Approach to Multi-factorial Gap-filling in Genome-scale Metabolic Models (GEMs)

This paper presents a metaheuristic combinatorial optimization method for multi-factorial gap-filling in GEMs, significantly improving accuracy over prior approaches.

2604.25233Apr 28, 2026Philip Kilby, Sevvandi Kandanaarachchi, Matthew J. Morgan +4

T-cell repertoire response in individuals with post-acute sequelae of COVID-19

This study identifies distinct T-cell signatures and over 1,000 candidate TCRs associated with Post-Acute Sequelae of COVID-19 (PASC).

2604.26975Apr 28, 2026Zachary Montague, Rhea M Grover, Andrew Baumgartner +3

CMGL: Confidence-guided Multi-omics Graph Learning for Cancer Subtype Classification

CMGL is a two-stage framework that uses evidential deep learning to estimate per-sample modality reliability, improving cancer subtype classification.

2604.24201Apr 27, 2026Boyang Fan, Hengchuang Yin, Siyu Yi +5

Imaging Exploration of Molecular Subtypes in Tongue Squamous Cell Carcinoma

This study shows radiomic features can non-invasively distinguish molecular subtypes in tongue squamous cell carcinoma (TSCC), offering a new diagnostic tool.

2604.23679Apr 26, 2026Hao Pan, Peipei Wang, Yajie Chang +9

The Cathaya argyrophylla Genome Reveals the Evolutionary Trade-offs of a Living Fossil

A new 22.73 Gb genome for the endangered Cathaya argyrophylla reveals its gigantism, ancient divergence, and genomic trade-offs between resource adaptation and weak immunity.

2604.22440Apr 24, 2026Yun Wang, Peng Xie, Shaogang Fan +12

Supregraph: Enabling Information-Optimal Assembly Graph Representation of a Read Set

Supregraphs offer an information-optimal assembly graph representation, overcoming limitations of de Bruijn and overlap graphs for genome assembly.

2604.21951Apr 23, 2026Anton Bankevich

TorchGWAS : GPU-accelerated GWAS for thousands of quantitative phenotypes

TorchGWAS leverages GPU acceleration to perform genome-wide association studies on thousands of quantitative phenotypes, drastically speeding up analysis.

2604.21095Apr 22, 2026Xingzhong Zhao, Ziqian Xie, Islam +5

Conditional Monte Carlo Tree Diffusion for Designing Cell-Type-Specific and Biologically Faithful Regulatory DNA

DNA-CRAFT uses conditional Monte Carlo tree diffusion to design highly cell-type-specific and biologically faithful regulatory DNA elements.

2604.20488Apr 22, 2026Animesh Awasthi, Raphael Bednarsky, Moritz Schaefer +1

Quantum AI for Cancer Diagnostic Biomarker Discovery

Quantum AI identifies lung cancer biomarkers and classifies subtypes, demonstrating quantum advantage in diagnostics and multiomic data processing.

2604.18621Apr 18, 2026Mandeep Kaur Saggi, Amandeep Singh Bhatia, Humaira Gowher +1

Geometric coherence of single-cell CRISPR perturbations reveals regulatory architecture and predicts cellular stress

Shesha quantifies geometric coherence in single-cell CRISPR screens, revealing regulatory architecture and predicting cellular stress responses.

2604.16642Apr 17, 2026Prashant C. Raju

Combining Bayesian and Frequentist Inference for Laboratory-Specific Performance Guarantees in Copy Number Variation Detection

A hybrid Bayesian-frequentist method provides accurate, lab-specific performance guarantees for copy number variation detection in oncology panels.

2604.14305Apr 15, 2026Austin Talbot, Alex V. Kotlar, Yue Ke

oxo-call: Documentation-grounded Skill Augmentation for Accurate Bioinformatics Command-line Generation with Large Language Models

oxo-call is a Rust-based LLM assistant that generates accurate bioinformatics command-line invocations using documentation grounding and expert skill augmentation.

2604.12387Apr 14, 2026Yun Peng, Yujun Sun, Jia Ding +6

Interpretable DNA Sequence Classification via Dynamic Feature Generation in Decision Trees

DEFT uses large language models to dynamically generate interpretable, high-level features for DNA sequence classification in decision trees.

2604.12060Apr 13, 2026Nicolas Huynh, Krzysztof Kacprzyk, Ryan Sheridan +2

EvoLen: Evolution-Guided Tokenization for DNA Language Model

EvoLen introduces an evolution-guided tokenization method for DNA language models, improving the preservation of functional sequence patterns and DNALM performance.

2604.08698Apr 9, 2026Nan Huang, Xiaoxiao Zhou, Junxia Cui +4

Probing 3D Chromatin Structure Awareness in Evo2 DNA Language Model

Evo2 DNA language model learns local CTCF grammar but fails to grasp higher-order 3D chromatin organization, suggesting new architectures are needed.

2604.07196Apr 8, 2026UkJin Lee

WebCVTree4: A Newly Designed Phylogenetic and Taxonomic Study Platform for Prokaryotes Using Composition Vectors and Whole Genomes

WebCVTree4 is an upgraded web platform for prokaryotic phylogenetic and taxonomic studies using whole-genome composition vectors, supporting large-scale analysis.

2604.06835Apr 8, 2026Guanghong Zuo

PreviousPage 2 of 3Next

📬 Weekly AI Paper Digest

Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.