hub

Bert: Pre-training of deep bidirectional transformers for language understanding

· 2019

16 Pith papers cite this work. Polarity classification is still indexing.

16 Pith papers citing it

browse 16 citing papers

hub tools

JSON dossier citing papers JSON

citation-role summary

background 1 method 1

citation-polarity summary

background 1 use method 1

representative citing papers

QLAM: A Quantum Long-Attention Memory Approach to Long-Sequence Token Modeling

cs.LG · 2026-05-13 · unverdicted · novelty 7.0

QLAM extends state-space models with quantum superposition in the hidden state for linear-time long-sequence modeling and reports consistent gains over RNN and transformer baselines on sequential image tasks.

SI-Diff: A Framework for Learning Search and High-Precision Insertion with a Force-Domain Diffusion Policy

cs.RO · 2026-05-12 · unverdicted · novelty 7.0

SI-Diff uses a force-domain diffusion policy with mode conditioning and a search teacher to handle both misalignment search and precise insertion in one model, raising x-y tolerance from 2 mm to 5 mm.

HapticLDM: A Diffusion Model for Text-to-Vibrotactile Generation

cs.HC · 2026-05-11 · unverdicted · novelty 7.0

HapticLDM is the first latent diffusion model that generates vibrotactile signals directly from text, using dynamic text curation and global denoising to improve realism and semantic alignment over autoregressive baselines.

CBEN -- A Multimodal Machine Learning Dataset for Cloud Robust Remote Sensing Image Understanding

cs.CV · 2026-02-13 · accept · novelty 7.0

CBEN provides paired optical-radar images with cloud occlusion, revealing 23-33 point AP drops in clear-sky trained models and 17-29 point relative gains when models are trained on cloudy data.

UniT: Unified Geometry Learning with Group Autoregressive Transformer

cs.CV · 2026-05-20 · unverdicted · novelty 6.0

UniT unifies online and offline 3D geometry perception via a Group Autoregressive Transformer that processes observation groups with anchor-free point map prediction and a scale-adaptive loss.

Text-to-CAD Retrieval: a Strong Baseline

cs.CV · 2026-05-07 · unverdicted · novelty 6.0

Text-to-CAD retrieval is introduced as a cross-modal task with a baseline that learns joint embeddings from CAD construction sequences, point clouds, and text queries via a masked feature decoder.

Reward-Guided Semantic Evolution for Test-time Adaptive Object Detection

cs.CV · 2026-05-06 · unverdicted · novelty 6.0

RGSE adapts text embeddings at test time via evolutionary search, using cosine similarity rewards from high-confidence visual proposals to improve open-vocabulary object detection under distribution shifts.

Generative Learning Enhanced Intelligent Resource Management for Cell-Free Delay Deterministic Communications

cs.IT · 2026-04-23 · unverdicted · novelty 6.0

The proposed pretraining framework for safe DRL in CF-MIMO resource management doubles initial energy efficiency, achieves 4.7% higher final EE, maintains 1% delay violation rate, and cuts exploration steps by 50% compared to non-pretrained baselines while matching diffusion model performance at 14x

WiseOWL: A Methodology for Evaluating Ontological Descriptiveness and Semantic Correctness for Ontology Reuse and Ontology Recommendations

cs.AI · 2026-04-13 · unverdicted · novelty 6.0

WiseOWL introduces a four-metric scoring system with a Streamlit app to evaluate and recommend ontologies for reuse based on descriptiveness and semantic correctness.

SLaB: Sparse-Lowrank-Binary Decomposition for Efficient Large Language Models

cs.LG · 2026-04-06 · unverdicted · novelty 6.0

SLaB compresses LLM weights via sparse-lowrank-binary decomposition guided by activation-aware scores, achieving up to 36% lower perplexity than prior methods at 50% compression on Llama models.

Discriminative-Generative Target Speaker Extraction with Decoder-Only Language Models

eess.AS · 2026-01-09 · unverdicted · novelty 6.0

A hybrid two-stage framework pairs a discriminative front-end for interference suppression with a generative decoder-only LM back-end to improve perceptual quality and speaker consistency in target speaker extraction and speech enhancement.

SCOUT: A Defense Against Data Poisoning Attacks in Fine-Tuned Language Models

cs.CR · 2025-12-10 · unverdicted · novelty 6.0

SCOUT uses token saliency analysis to detect both standard and contextually-plausible backdoor attacks in language models while maintaining clean accuracy.

FAME: Failure-Aware Mixture-of-Experts for Message-Level Log Anomaly Detection

cs.SE · 2026-05-21 · unverdicted · novelty 5.0

FAME achieves F1 of 98.16 on BGL and 99.95 on Thunderbird for message-level log anomaly detection using at most K=100 labels per template, reducing annotation effort by 76x while detecting anomalies from unseen EventIDs.

Evaluating Tabular Representation Learning for Network Intrusion Detection

cs.LG · 2026-05-04 · unverdicted · novelty 5.0

Tabular representation learning for network intrusion detection exhibits strong dataset-model dependency, with supervised methods outperforming unsupervised anomaly detection and limited but possible cross-dataset generalization.

A Geometric Algebra-informed NeRF Framework for Generalizable Wireless Channel Prediction

cs.NI · 2026-04-13 · unverdicted · novelty 5.0

GAI-NeRF combines geometric algebra attention and an adaptive ray tracing module inside a NeRF model to deliver more accurate and generalizable wireless channel predictions across varied indoor environments.

Object-Attribute-Relation Model Driven Adaptive Hierarchical Transmission for Multimodal Semantic Communication

eess.SP · 2026-04-09 · unverdicted · novelty 5.0

An O-A-R model driven adaptive hierarchical transmission system for multimodal semantic communication achieves over 90% bandwidth savings at 1-3 kbps and eliminates cliff effects in deep fading channels by sending decision-oriented semantic graphs rather than pixels.

citing papers explorer

Showing 16 of 16 citing papers.

QLAM: A Quantum Long-Attention Memory Approach to Long-Sequence Token Modeling cs.LG · 2026-05-13 · unverdicted · none · ref 17
QLAM extends state-space models with quantum superposition in the hidden state for linear-time long-sequence modeling and reports consistent gains over RNN and transformer baselines on sequential image tasks.
SI-Diff: A Framework for Learning Search and High-Precision Insertion with a Force-Domain Diffusion Policy cs.RO · 2026-05-12 · unverdicted · none · ref 31
SI-Diff uses a force-domain diffusion policy with mode conditioning and a search teacher to handle both misalignment search and precise insertion in one model, raising x-y tolerance from 2 mm to 5 mm.
HapticLDM: A Diffusion Model for Text-to-Vibrotactile Generation cs.HC · 2026-05-11 · unverdicted · none · ref 50
HapticLDM is the first latent diffusion model that generates vibrotactile signals directly from text, using dynamic text curation and global denoising to improve realism and semantic alignment over autoregressive baselines.
CBEN -- A Multimodal Machine Learning Dataset for Cloud Robust Remote Sensing Image Understanding cs.CV · 2026-02-13 · accept · none · ref 84
CBEN provides paired optical-radar images with cloud occlusion, revealing 23-33 point AP drops in clear-sky trained models and 17-29 point relative gains when models are trained on cloudy data.
UniT: Unified Geometry Learning with Group Autoregressive Transformer cs.CV · 2026-05-20 · unverdicted · none · ref 15
UniT unifies online and offline 3D geometry perception via a Group Autoregressive Transformer that processes observation groups with anchor-free point map prediction and a scale-adaptive loss.
Text-to-CAD Retrieval: a Strong Baseline cs.CV · 2026-05-07 · unverdicted · none · ref 16
Text-to-CAD retrieval is introduced as a cross-modal task with a baseline that learns joint embeddings from CAD construction sequences, point clouds, and text queries via a masked feature decoder.
Reward-Guided Semantic Evolution for Test-time Adaptive Object Detection cs.CV · 2026-05-06 · unverdicted · none · ref 49
RGSE adapts text embeddings at test time via evolutionary search, using cosine similarity rewards from high-confidence visual proposals to improve open-vocabulary object detection under distribution shifts.
Generative Learning Enhanced Intelligent Resource Management for Cell-Free Delay Deterministic Communications cs.IT · 2026-04-23 · unverdicted · none · ref 42
The proposed pretraining framework for safe DRL in CF-MIMO resource management doubles initial energy efficiency, achieves 4.7% higher final EE, maintains 1% delay violation rate, and cuts exploration steps by 50% compared to non-pretrained baselines while matching diffusion model performance at 14x
WiseOWL: A Methodology for Evaluating Ontological Descriptiveness and Semantic Correctness for Ontology Reuse and Ontology Recommendations cs.AI · 2026-04-13 · unverdicted · none · ref 28
WiseOWL introduces a four-metric scoring system with a Streamlit app to evaluate and recommend ontologies for reuse based on descriptiveness and semantic correctness.
SLaB: Sparse-Lowrank-Binary Decomposition for Efficient Large Language Models cs.LG · 2026-04-06 · unverdicted · none · ref 3
SLaB compresses LLM weights via sparse-lowrank-binary decomposition guided by activation-aware scores, achieving up to 36% lower perplexity than prior methods at 50% compression on Llama models.
Discriminative-Generative Target Speaker Extraction with Decoder-Only Language Models eess.AS · 2026-01-09 · unverdicted · none · ref 73
A hybrid two-stage framework pairs a discriminative front-end for interference suppression with a generative decoder-only LM back-end to improve perceptual quality and speaker consistency in target speaker extraction and speech enhancement.
SCOUT: A Defense Against Data Poisoning Attacks in Fine-Tuned Language Models cs.CR · 2025-12-10 · unverdicted · none · ref 8
SCOUT uses token saliency analysis to detect both standard and contextually-plausible backdoor attacks in language models while maintaining clean accuracy.
FAME: Failure-Aware Mixture-of-Experts for Message-Level Log Anomaly Detection cs.SE · 2026-05-21 · unverdicted · none · ref 17
FAME achieves F1 of 98.16 on BGL and 99.95 on Thunderbird for message-level log anomaly detection using at most K=100 labels per template, reducing annotation effort by 76x while detecting anomalies from unseen EventIDs.
Evaluating Tabular Representation Learning for Network Intrusion Detection cs.LG · 2026-05-04 · unverdicted · none · ref 24
Tabular representation learning for network intrusion detection exhibits strong dataset-model dependency, with supervised methods outperforming unsupervised anomaly detection and limited but possible cross-dataset generalization.
A Geometric Algebra-informed NeRF Framework for Generalizable Wireless Channel Prediction cs.NI · 2026-04-13 · unverdicted · none · ref 36
GAI-NeRF combines geometric algebra attention and an adaptive ray tracing module inside a NeRF model to deliver more accurate and generalizable wireless channel predictions across varied indoor environments.
Object-Attribute-Relation Model Driven Adaptive Hierarchical Transmission for Multimodal Semantic Communication eess.SP · 2026-04-09 · unverdicted · none · ref 49
An O-A-R model driven adaptive hierarchical transmission system for multimodal semantic communication achieves over 90% bandwidth savings at 1-3 kbps and eliminates cliff effects in deep fading channels by sending decision-oriented semantic graphs rather than pixels.

Bert: Pre-training of deep bidirectional transformers for language understanding

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer