Decoupled weight decay regularization

· 2019

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

browse 6 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Roll Out and Roll Back: Diffusion LLMs are Their Own Efficiency Teachers

cs.CL · 2026-05-16 · unverdicted · novelty 6.0

Diffusion LLMs can act as their own efficiency teachers by using revokable parallel decoding to identify reliable token orders and then distilling those orders into the model parameters for faster inference.

Deep Image Segmentation via Discriminant Feature Learning

cs.CV · 2026-05-14 · unverdicted · novelty 6.0

Deep Discriminant Analysis (DDA) is a new loss that maximizes between-class variance and minimizes within-class variance to produce more compact and separable features for image segmentation.

EditTransfer++: Toward Faithful and Efficient Visual-Prompt-Guided Image Editing

cs.CV · 2026-05-08 · unverdicted · novelty 6.0

EditTransfer++ delivers state-of-the-art faithfulness to visual editing examples and faster inference by removing text conditioning during fine-tuning and applying best-worst contrastive refinement plus condition compression.

Rate-Distortion Optimization for Transformer Inference

cs.LG · 2026-01-29 · unverdicted · novelty 5.0

A rate-distortion framework for lossy compression of transformer representations yields substantial bitrate savings on language tasks while preserving accuracy, with observed rates aligning to derived information-theoretic bounds.

Audio Deepfake Detection at the First Greeting: "Hi!"

eess.AS · 2026-01-27 · unverdicted · novelty 5.0

S-MGAA adds pixel-channel enhancement and frequency compensation modules to improve audio deepfake detection on very short, degraded speech inputs.

SRC-Flow: Compact Semantic Representations Enable Normalizing Flows for Image Generation

cs.CV · 2026-05-18

citing papers explorer

Showing 6 of 6 citing papers.

Roll Out and Roll Back: Diffusion LLMs are Their Own Efficiency Teachers cs.CL · 2026-05-16 · unverdicted · none · ref 44
Diffusion LLMs can act as their own efficiency teachers by using revokable parallel decoding to identify reliable token orders and then distilling those orders into the model parameters for faster inference.
Deep Image Segmentation via Discriminant Feature Learning cs.CV · 2026-05-14 · unverdicted · none · ref 38
Deep Discriminant Analysis (DDA) is a new loss that maximizes between-class variance and minimizes within-class variance to produce more compact and separable features for image segmentation.
EditTransfer++: Toward Faithful and Efficient Visual-Prompt-Guided Image Editing cs.CV · 2026-05-08 · unverdicted · none · ref 56
EditTransfer++ delivers state-of-the-art faithfulness to visual editing examples and faster inference by removing text conditioning during fine-tuning and applying best-worst contrastive refinement plus condition compression.
Rate-Distortion Optimization for Transformer Inference cs.LG · 2026-01-29 · unverdicted · none · ref 70
A rate-distortion framework for lossy compression of transformer representations yields substantial bitrate savings on language tasks while preserving accuracy, with observed rates aligning to derived information-theoretic bounds.
Audio Deepfake Detection at the First Greeting: "Hi!" eess.AS · 2026-01-27 · unverdicted · none · ref 29
S-MGAA adds pixel-channel enhancement and frequency compensation modules to improve audio deepfake detection on very short, degraded speech inputs.
SRC-Flow: Compact Semantic Representations Enable Normalizing Flows for Image Generation cs.CV · 2026-05-18 · unreviewed · ref 38

Decoupled weight decay regularization

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer