BioRAG: A RAG-LLM Framework for Biological Question Reasoning

Chengjun Wu; Chengrui Wang; Meng Xiao; Qingqing Long; Xuezhi Wang; Xunxin Cai; Yuanchun Zhou; Zhen Meng

arxiv: 2408.01107 · v2 · pith:PIWQDIPFnew · submitted 2024-08-02 · 💻 cs.CL · cs.AI· cs.IR

BioRAG: A RAG-LLM Framework for Biological Question Reasoning

Chengrui Wang , Qingqing Long , Meng Xiao , Xunxin Cai , Chengjun Wu , Zhen Meng , Xuezhi Wang , Yuanchun Zhou This is my paper

classification 💻 cs.CL cs.AIcs.IR

keywords knowledgebioragretrievalframeworkinformationlifemodelprocess

0 comments

read the original abstract

The question-answering system for Life science research, which is characterized by the rapid pace of discovery, evolving insights, and complex interactions among knowledge entities, presents unique challenges in maintaining a comprehensive knowledge warehouse and accurate information retrieval. To address these issues, we introduce BioRAG, a novel Retrieval-Augmented Generation (RAG) with the Large Language Models (LLMs) framework. Our approach starts with parsing, indexing, and segmenting an extensive collection of 22 million scientific papers as the basic knowledge, followed by training a specialized embedding model tailored to this domain. Additionally, we enhance the vector retrieval process by incorporating a domain-specific knowledge hierarchy, which aids in modeling the intricate interrelationships among each query and context. For queries requiring the most current information, BioRAG deconstructs the question and employs an iterative retrieval process incorporated with the search engine for step-by-step reasoning. Rigorous experiments have demonstrated that our model outperforms fine-tuned LLM, LLM with search engines, and other scientific RAG frameworks across multiple life science question-answering tasks.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 12 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

BioHarness: Substrate-Aware Evidence Assembly for Biomedical Question Answering across Literature, Knowledge Bases, and Biological Atlases
q-bio.QM 2026-06 unverdicted novelty 6.0

BioHarness improves pooled biomedical QA score from 65.9 to 71.0 on 19,302 items by using staged, substrate-aware evidence assembly that escalates only when needed.
CuraView: A Multi-Agent Framework for Medical Hallucination Detection with GraphRAG-Enhanced Knowledge Verification
cs.CL 2026-05 unverdicted novelty 6.0

CuraView detects sentence-level faithfulness hallucinations in medical discharge summaries via GraphRAG knowledge graphs and multi-agent evidence grading, achieving 0.831 F1 on critical contradictions with a fine-tune...
Leveraging LLM-GNN Integration for Open-World Question Answering over Knowledge Graphs
cs.CL 2026-04 unverdicted novelty 6.0

GLOW integrates a pre-trained GNN for candidate prediction with an LLM for joint symbolic-semantic reasoning over incomplete KGs, reporting up to 53.3% gains on standard benchmarks and a new GLOW-BENCH dataset.
Knowledge Is Not Static: Order-Aware Hypergraph RAG for Language Models
cs.CL 2026-04 unverdicted novelty 6.0

OKH-RAG represents knowledge as ordered hyperedges and retrieves coherent interaction sequences via a learned transition model, outperforming permutation-invariant RAG baselines on order-sensitive QA tasks.
SciHorizon-GENE: Benchmarking LLM for Life Sciences Inference from Gene Knowledge to Functional Understanding
q-bio.GN 2026-01 unverdicted novelty 6.0

SciHorizon-GENE is a large-scale benchmark evaluating LLMs on gene-to-function inference across four perspectives, revealing heterogeneity and challenges in faithful, complete, literature-grounded outputs.
Understanding Structured Financial Data with LLMs: A Case Study on Fraud Detection
cs.LG 2025-12 unverdicted novelty 6.0

FinFRE-RAG combines importance-guided feature reduction with label-aware retrieval-augmented generation to boost LLM performance on tabular fraud detection across four public datasets while providing human-readable ra...
ARIA: A Causal-Aware Framework for Rescuing LLM Reasoning in Trustworthy Materials Discovery
cs.AI 2026-06 unverdicted novelty 5.0

ARIA is a three-tier causal framework that conditions LLM knowledge use on mechanistic completeness for forward prediction and inverse design of 2D materials, producing auditable traces.
HPC-LLM: Practical Domain Adaptation and Retrieval-Augmented Generation for HPC Support
cs.LG 2026-05 unverdicted novelty 4.0

HPC-LLM fine-tunes Llama 3.1 8B via QLoRA on 9k-24k HPC examples and adds dense retrieval to deliver practical support for job scheduling, MPI, and GPU workflows, approaching the performance of larger general models a...
A Survey on LLM-as-a-Judge
cs.CL 2024-11 unverdicted novelty 4.0

A survey on LLM-as-a-Judge that reviews reliability strategies, proposes evaluation methods, and introduces a novel benchmark for assessing such systems.
Earth Science Foundation Models: From Perception to Reasoning and Discovery
astro-ph.IM 2026-05 unverdicted novelty 3.0

The paper delivers a unified review and roadmap of Earth science foundation models, structured by capability depth from perception to agentic reasoning and by application breadth across atmosphere, hydrosphere, lithos...
Large Language Model Agent: A Survey on Methodology, Applications and Challenges
cs.CL 2025-03 accept novelty 3.0

A survey that deconstructs LLM agent systems via a methodology-centered taxonomy linking design principles to emergent behaviors, applications, and challenges.
Earth Science Foundation Models: From Perception to Reasoning and Discovery
astro-ph.IM 2026-05 unverdicted novelty 2.0

A review of Earth science foundation models covering capability evolution from perception to discovery, applications across atmosphere/hydrosphere/lithosphere/biosphere/anthroposphere/cryosphere, over 200 datasets, an...