GLUE: A multi-task benchmark and analysis platform for natural language understanding,

· 2018

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Benchmarking Local Language Models for Social Robots using Edge Devices

cs.RO · 2026-05-04 · unverdicted · novelty 5.0

Benchmarking 25 LLMs on Raspberry Pi hardware shows Granite4 Tiny Hybrid (7B) balances 2.5 tokens/s, 0.90 tokens/J, and 54.6% MMLU while teaching effectiveness does not require high general knowledge scores.

Communication-Efficient Federated Fine-Tuning

cs.LG · 2025-05-07 · unverdicted · novelty 5.0

FDA-Opt unifies and improves upon FedOpt and FDA for communication-efficient federated fine-tuning of language models on NLP tasks, outperforming optimized FedOpt baselines.

citing papers explorer

Showing 2 of 2 citing papers.

Benchmarking Local Language Models for Social Robots using Edge Devices cs.RO · 2026-05-04 · unverdicted · none · ref 10
Benchmarking 25 LLMs on Raspberry Pi hardware shows Granite4 Tiny Hybrid (7B) balances 2.5 tokens/s, 0.90 tokens/J, and 54.6% MMLU while teaching effectiveness does not require high general knowledge scores.
Communication-Efficient Federated Fine-Tuning cs.LG · 2025-05-07 · unverdicted · none · ref 22
FDA-Opt unifies and improves upon FedOpt and FDA for communication-efficient federated fine-tuning of language models on NLP tasks, outperforming optimized FedOpt baselines.

GLUE: A multi-task benchmark and analysis platform for natural language understanding,

fields

years

verdicts

representative citing papers

citing papers explorer