FiLM: Visual Reasoning with a General Conditioning Layer

author author E · 2018 · DOI 10.1609/aaai.v32i1.11671

Tool reference. 83% of classified Pith citations use this work as a method, library, or software dependency, not as a substantive claim.

12 Pith papers citing it

citation-role summary

method: 5 · background: 1

citation-polarity summary

use method: 5 · background: 1
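
The 83% figure quoted above is consistent with these counts if the share is taken over classified citations only, not over all 12 citing papers. The page does not state the formula, so the calculation below is an assumption, and the variable names are illustrative.

```python
# Counts copied from the citation-role summary on this page.
role_counts = {"method": 5, "background": 1}
total_citing = 12  # all citing papers listed on this page

# Assumption: the percentage is method citations / classified citations.
classified = sum(role_counts.values())
method_share = role_counts["method"] / classified

print(f"classified: {classified} of {total_citing}")
print(f"method share: {method_share:.0%}")
```

Under this reading, only half of the citing papers carry a role label, so the 83% describes 5 of 6 classified citations.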

representative citing papers

X-VC: Zero-shot Streaming Voice Conversion in Codec Space

eess.AS · 2026-04-14 · unverdicted · novelty 7.0

X-VC achieves zero-shot streaming voice conversion via one-step codec-space conversion with dual-conditioning acoustic converter and role-assignment training on generated paired data.

Mechanisms of Misgeneralization in Physical Sequence Modeling

cs.LG · 2026-05-19 · unverdicted · novelty 6.0

Generative sequence models for physical tasks exhibit physical misgeneralization where local prediction errors propagate through physical measurements to distort aggregate distributions over quantities like distance or energy; a data deviation kernel explains and predicts the shifts and supports a kernel

Weather-Robust Cross-View Geo-Localization via Prototype-Based Semantic Part Discovery

cs.CV · 2026-05-12 · unverdicted · novelty 6.0 · 2 refs

SkyPart achieves state-of-the-art single-pass cross-view geo-localization on SUES-200, University-1652, and DenseUAV by using prototype-based part discovery, altitude-conditioned modulation, and Kendall-weighted loss, with widening gains under weather corruptions.

Quantum Injection Pathways for Implicit Graph Neural Networks

quant-ph · 2026-05-09 · unverdicted · novelty 6.0

Independent quantum signal injection into graph DEQs yields higher test accuracy and fewer solver iterations than state-dependent or backbone-dependent injection and classical equilibrium models on NCI1, PROTEINS, and MUTAG benchmarks.

Conditional Neural Field based Reduced Order Model for Dynamic Ditching Load Prediction

physics.flu-dyn · 2026-05-05 · unverdicted · novelty 6.0

Conditional neural fields combined with LSTM networks predict aircraft ditching loads accurately across heterogeneous spatial discretizations using fewer parameters than convolutional autoencoders.

Generative Modeling of Complex-Valued Brain MRI Data

eess.IV · 2026-04-16 · unverdicted · novelty 6.0

A cVAE plus flow-matching model generates realistic complex-valued brain MRI that preserves phase coherence above 0.997 and yields synthetic data that trains abnormality classifiers to 0.880 AUROC, beating the 0.842 real-data baseline on fastMRI.

NeuVolEx: Implicit Neural Features for Volume Exploration

cs.GR · 2026-04-13 · unverdicted · novelty 6.0

NeuVolEx extracts robust spatial features from INR training via a structural encoder and multi-task scheme to enable accurate ROI classification with limited supervision and unsupervised viewpoint clustering in volume exploration.

PREFAB: PREFerence-based Affective Modeling for Low-Budget Self-Annotation

cs.AI · 2026-01-20 · unverdicted · novelty 6.0

PREFAB applies preference learning grounded in the peak-end rule to let users annotate only key affective change segments while interpolating the rest, reducing workload and improving confidence in a 25-participant study.

CodecSep: Prompt-Driven Universal Sound Separation on Neural Audio Codec Latents

cs.SD · 2025-09-15 · unverdicted · novelty 6.0

CodecSep performs prompt-driven universal sound separation directly in neural audio codec latents by combining a frozen DAC backbone with a lightweight FiLM-conditioned Transformer masker driven by CLAP embeddings, yielding efficiency gains over AudioSep.

Memory-Efficient EDA Denoising via Knowledge Distillation for Wearable IoT Under Severe Motion Artifacts and Underwater Conditions

eess.SP · 2026-05-04 · conditional · novelty 5.0

Knowledge distillation from a hybrid CNN-Transformer teacher to a depth-wise separable CNN student, combined with realistic motion and environmental augmentation, produces a 15x smaller EDA denoiser that cuts underwater reconstruction error from 2.809 to 0.215 MAE and raises downstream CNS-OT AUROC.

On Optimal Hyperparameters for Differentially Private Deep Transfer Learning

cs.LG · 2025-10-23 · unverdicted · novelty 5.0

Empirical study of DP transfer learning reveals that larger clipping bounds outperform under tight privacy and cumulative DP noise explains batch-size effects better than existing heuristics.

A Survey on Vision-Language-Action Models: An Action Tokenization Perspective

cs.RO · 2025-07-02 · unverdicted · novelty 5.0

The survey frames VLA models as pipelines that generate progressively grounded action tokens and classifies those tokens into eight types to guide future development.

citing papers explorer

Showing 12 of 12 citing papers.

X-VC: Zero-shot Streaming Voice Conversion in Codec Space · eess.AS · 2026-04-14 · unverdicted · none · ref 30
Mechanisms of Misgeneralization in Physical Sequence Modeling · cs.LG · 2026-05-19 · unverdicted · none · ref 41
Weather-Robust Cross-View Geo-Localization via Prototype-Based Semantic Part Discovery · cs.CV · 2026-05-12 · unverdicted · none · ref 41 · 2 links
Quantum Injection Pathways for Implicit Graph Neural Networks · quant-ph · 2026-05-09 · unverdicted · none · ref 39
Conditional Neural Field based Reduced Order Model for Dynamic Ditching Load Prediction · physics.flu-dyn · 2026-05-05 · unverdicted · none · ref 36
Generative Modeling of Complex-Valued Brain MRI Data · eess.IV · 2026-04-16 · unverdicted · none · ref 22
NeuVolEx: Implicit Neural Features for Volume Exploration · cs.GR · 2026-04-13 · unverdicted · none · ref 19
PREFAB: PREFerence-based Affective Modeling for Low-Budget Self-Annotation · cs.AI · 2026-01-20 · unverdicted · none · ref 54
CodecSep: Prompt-Driven Universal Sound Separation on Neural Audio Codec Latents · cs.SD · 2025-09-15 · unverdicted · none · ref 28
Memory-Efficient EDA Denoising via Knowledge Distillation for Wearable IoT Under Severe Motion Artifacts and Underwater Conditions · eess.SP · 2026-05-04 · conditional · none · ref 23
On Optimal Hyperparameters for Differentially Private Deep Transfer Learning · cs.LG · 2025-10-23 · unverdicted · none · ref 11
A Survey on Vision-Language-Action Models: An Action Tokenization Perspective · cs.RO · 2025-07-02 · unverdicted · none · ref 268
