Fine-tuned on-device LLMs achieve up to 87.9% diagnostic accuracy on clinical tasks, approaching GPT-5.1 at 89.4% while remaining smaller and local.
DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning,
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2025 1verdicts
CONDITIONAL 1representative citing papers
citing papers explorer
-
Benchmarking and Adapting On-Device LLMs for Clinical Decision Support
Fine-tuned on-device LLMs achieve up to 87.9% diagnostic accuracy on clinical tasks, approaching GPT-5.1 at 89.4% while remaining smaller and local.