MedBioLM: Optimizing medical and biological QA with fine-tuned large language models and retrieval-augmented generation

MedCPT: Contrastive pre-trained transformers with large-scale PubMed search logs for zero-shot biomedical information retrieval · 2004 · arXiv 2502.03004

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

What Makes a Medical Checker Trainable? Diagnosing Signal Collapse and Reward Hacking in Checker-Guided RAG for Biomedical QA

cs.CL · 2026-05-25 · unverdicted · novelty 6.0

Empirical comparison of four NLI checkers as process rewards in GRPO-trained medical RAG shows log-prob scoring collapses to neutral labels while moderate local classifiers improve BERTScore without reward hacking.

CLIN-LLM: A Safety-Constrained Hybrid Framework for Clinical Diagnosis and Treatment Generation

cs.AI · 2025-10-26 · unverdicted · novelty 4.0

CLIN-LLM combines uncertainty-calibrated BioBERT classification with retrieval-augmented FLAN-T5 generation and safety post-processing to reach 98% accuracy on clinical cases while cutting unsafe antibiotic suggestions by 67%.

Perovskite-R1: a domain-specialized large language model for intelligent discovery of precursor additives and experimental design

cs.LG · 2025-07-22 · unverdicted · novelty 4.0

A fine-tuned LLM called Perovskite-R1, built from curated perovskite literature and material libraries, proposes precursor additives and designs with some experimental validation showing improved stability and performance.

citing papers explorer

Showing 3 of 3 citing papers.

What Makes a Medical Checker Trainable? Diagnosing Signal Collapse and Reward Hacking in Checker-Guided RAG for Biomedical QA
cs.CL · 2026-05-25 · unverdicted · none · ref 1
CLIN-LLM: A Safety-Constrained Hybrid Framework for Clinical Diagnosis and Treatment Generation
cs.AI · 2025-10-26 · unverdicted · none · ref 23
Perovskite-R1: a domain-specialized large language model for intelligent discovery of precursor additives and experimental design
cs.LG · 2025-07-22 · unverdicted · none · ref 23
