hub

Medalpaca–an open-source collection of medical conversational ai models and training data

Tianyu Han, Lisa C Adams, Jens-Michalis Papaioannou, Paul Grundmann, Tom Oberhauser, Alexander Löser, Daniel Truhn, Keno K Bressem · 2023 · arXiv 2304.08247

15 Pith papers cite this work. Polarity classification is still indexing.

15 Pith papers citing it

read on arXiv browse 15 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Is Biomedical Specialization Still Worth It? Insights from Domain-Adaptive Language Modelling with a New French Health Corpus

cs.CL · 2026-04-08 · unverdicted · novelty 7.0

Domain-adaptive pre-training on a new French health corpus yields limited gains and risks general capability loss unless followed by model merging, which can even boost specialized performance.

Do No Harm? Hallucination and Actor-Level Abuse in Web-Deployed Medical Large Language Models

cs.CL · 2026-05-20 · unverdicted · novelty 6.0

Evaluation of 6233 MedGPTs finds 25-30% with low factual accuracy, 33.6-54.3% violating operational thresholds, and 57% of action-enabled models lacking privacy disclosures.

Collaborative Parameter Learning: Mitigating Forgetting via Parameter-Level Gradient Analysis

cs.LG · 2026-01-29 · conditional · novelty 6.0

Collaborative Parameter Learning freezes 50-75% of parameters whose updates cause forgetting and updates only the 25-50% that mitigate it, allowing LLMs to learn 20-48% more new questions with negligible forgetting and lower compute cost.

CURE-Med: Curriculum-Informed Reinforcement Learning for Multilingual Medical Reasoning

cs.AI · 2026-01-19 · unverdicted · novelty 6.0

CURE-MED pairs a new 13-language medical reasoning benchmark with curriculum RL to raise logical correctness to 70% and language consistency to 95% at 32B scale while outperforming baselines.

Real-World Doctor Agent with Proactive Consultation through Multi-Agent Reinforcement Learning

cs.CL · 2025-05-26 · unverdicted · novelty 6.0

DoctorAgent-RL trains a Qwen2.5-7B doctor agent via multi-agent RL on the new MTMedDialog dataset to conduct dynamic, question-driven consultations, reaching 70% exact diagnostic match in real-patient trials.

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

cs.CL · 2024-12-25 · unverdicted · novelty 6.0

HuatuoGPT-o1 achieves superior medical complex reasoning by using a verifier to curate reasoning trajectories for fine-tuning and then applying RL with verifier-based rewards.

LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day

cs.CV · 2023-06-01 · unverdicted · novelty 6.0

LLaVA-Med is created via curriculum fine-tuning on PubMed figure-caption pairs and GPT-4 self-instructed data, achieving competitive or better results than prior supervised models on three biomedical VQA benchmarks.

FedSDR: Federated Self-Distillation with Rectification

cs.LG · 2026-05-18 · unverdicted · novelty 5.0

FedSDR augments federated self-distillation with dual LoRA streams (local smoothing and global rectification) to produce globally aligned, factually faithful models under statistical heterogeneity.

RadLite: Multi-Task LoRA Fine-Tuning of Small Language Models for CPU-Deployable Radiology AI

cs.CL · 2026-05-01 · unverdicted · novelty 5.0

LoRA fine-tuning of 3-4B SLMs on 162K multi-task radiology data yields strong performance deployable on consumer CPUs at 4-8 tokens/second.

Reinforcement Learning Improves LLM Accuracy and Reasoning in Disease Classification from Radiology Reports

cs.AI · 2026-04-21 · unverdicted · novelty 5.0

SFT followed by GRPO improves LLM accuracy and reasoning recall in disease classification from radiology reports on three radiologist-annotated datasets.

FedShield-LLM: A Secure and Scalable Federated Fine-Tuned Large Language Model

cs.CR · 2025-06-06 · unverdicted · novelty 5.0

FedShield-LLM integrates pruning and FHE on LoRA parameters to support secure, scalable federated fine-tuning of LLMs such as Llama-2.

MedRoute: RL-Based Dynamic Specialist Routing in Multi-Agent Medical Diagnosis

eess.IV · 2026-02-05 · unverdicted · novelty 4.0

MedRoute applies RL-based dynamic routing to select specialist LMM agents in a multi-agent medical diagnosis system, outperforming static baselines on text and image datasets.

Speech-Based Cognitive Screening: A Systematic Evaluation of LLM Adaptation Strategies

cs.CL · 2025-08-24 · unverdicted · novelty 4.0

Systematic comparison of nine text-only and three multimodal LLMs using in-context learning, reasoning prompts, fine-tuning, and multimodal fusion on DementiaBank speech data finds class-centroid demonstrations and token-level fine-tuning most effective, with adapted open models matching or beating

Data-Centric Foundation Models in Computational Healthcare: A Survey

cs.LG · 2024-01-04 · unverdicted · novelty 3.0

The paper surveys data-centric strategies for foundation models in computational healthcare and supplies a curated list of related models and datasets.

SciHorizon-GENE: Benchmarking LLM for Life Sciences Inference from Gene Knowledge to Functional Understanding

q-bio.GN · 2026-01-19

citing papers explorer

Showing 1 of 1 citing paper after filters.

SciHorizon-GENE: Benchmarking LLM for Life Sciences Inference from Gene Knowledge to Functional Understanding q-bio.GN · 2026-01-19 · unreviewed · ref 20

Medalpaca–an open-source collection of medical conversational ai models and training data

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer