L oo GLE : Can Long-Context Language Models Understand Long Contexts?

Li, Jiaqi, Wang, Mengmeng, Zheng, Zilong, Zhang, Muhan · 2024 · DOI 10.18653/v1/2024.acl-long.859

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

open at publisher browse 7 citing papers

representative citing papers

Lost in a Single Vector: Improving Long-Document Retrieval with Chunk Evidence Aggregation

cs.CL · 2026-06-17 · unverdicted · novelty 7.0

DICE aggregates independently encoded document chunks into a single vector to reduce evidence dilution in long-document dense retrieval, reporting gains on LongEmbed especially beyond 4k tokens.

Attention Flows: Tracing LLM Conceptual Engagement via Story Summaries

cs.CL · 2026-04-07 · unverdicted · novelty 7.0

LLM novel summaries emphasize endings more than human ones, measured by aligning summary sentences to referenced chapters.

SeKV: Resolution-Adaptive KV Cache with Hierarchical Semantic Memory for Long-Context LLM Inference

cs.CL · 2026-06-30 · unverdicted · novelty 6.0

SeKV introduces resolution-adaptive semantic KV caching with GPU-CPU hierarchy and selective zoom-in reconstruction, achieving 5.9% average improvement over semantic baselines and 53.3% GPU memory reduction at 128K context.

Natural Language Access Control (NLAC): From Help Desk Requests to Structured Policies

cs.NI · 2026-06-04 · unverdicted · novelty 6.0

NLAC architecture translates natural language requests to access policies via LLMs, with embedding-based subgraph selection enabling up to 98.7% accuracy on large networks per NLACBench evaluations.

Enhancing Software Engineering Through Closed-Loop Memory Optimization

cs.SE · 2026-06-04 · unverdicted · novelty 6.0

MemOp is a closed-loop memory augmentation framework for SE agents that defines memory utility via downstream task impact and reports gains of up to 5.25% success rate, 4.63% resolve efficiency, and 9.79% cost reduction.

How Many Different Outputs Can a Transformer Generate?

cs.LG · 2026-05-21 · unverdicted · novelty 6.0

Transformers are limited to a linearly growing number of accessible output sequences with prompt length, with exponential decay in accessible proportion beyond a critical point, even under unbounded context.

MixRea: Benchmarking Explicit-Implicit Reasoning in Large Language Models

cs.CL · 2026-05-19 · unverdicted · novelty 6.0

MixRea benchmark reveals LLMs achieve at most 42.8% consistency on explicit-implicit reasoning tasks, with PRCP prompting proposed to recover overlooked relations.

citing papers explorer

Showing 7 of 7 citing papers after filters.

Lost in a Single Vector: Improving Long-Document Retrieval with Chunk Evidence Aggregation cs.CL · 2026-06-17 · unverdicted · none · ref 15
DICE aggregates independently encoded document chunks into a single vector to reduce evidence dilution in long-document dense retrieval, reporting gains on LongEmbed especially beyond 4k tokens.
Attention Flows: Tracing LLM Conceptual Engagement via Story Summaries cs.CL · 2026-04-07 · unverdicted · none · ref 22
LLM novel summaries emphasize endings more than human ones, measured by aligning summary sentences to referenced chapters.
SeKV: Resolution-Adaptive KV Cache with Hierarchical Semantic Memory for Long-Context LLM Inference cs.CL · 2026-06-30 · unverdicted · none · ref 44
SeKV introduces resolution-adaptive semantic KV caching with GPU-CPU hierarchy and selective zoom-in reconstruction, achieving 5.9% average improvement over semantic baselines and 53.3% GPU memory reduction at 128K context.
Natural Language Access Control (NLAC): From Help Desk Requests to Structured Policies cs.NI · 2026-06-04 · unverdicted · none · ref 18
NLAC architecture translates natural language requests to access policies via LLMs, with embedding-based subgraph selection enabling up to 98.7% accuracy on large networks per NLACBench evaluations.
Enhancing Software Engineering Through Closed-Loop Memory Optimization cs.SE · 2026-06-04 · unverdicted · none · ref 17
MemOp is a closed-loop memory augmentation framework for SE agents that defines memory utility via downstream task impact and reports gains of up to 5.25% success rate, 4.63% resolve efficiency, and 9.79% cost reduction.
How Many Different Outputs Can a Transformer Generate? cs.LG · 2026-05-21 · unverdicted · none · ref 33
Transformers are limited to a linearly growing number of accessible output sequences with prompt length, with exponential decay in accessible proportion beyond a critical point, even under unbounded context.
MixRea: Benchmarking Explicit-Implicit Reasoning in Large Language Models cs.CL · 2026-05-19 · unverdicted · none · ref 27
MixRea benchmark reveals LLMs achieve at most 42.8% consistency on explicit-implicit reasoning tasks, with PRCP prompting proposed to recover overlooked relations.

L oo GLE : Can Long-Context Language Models Understand Long Contexts?

fields

years

verdicts

representative citing papers

citing papers explorer