hub Mixed citations

An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling

Shaojie Bai, J. Zico Kolter, Vladlen Koltun · 2018 · cs.LG · arXiv 1803.01271

Mixed citation behavior. Most common role is background (67%).

86 Pith papers citing it

Background 67% of classified citations

open full Pith review browse 86 citing papers arXiv PDF

abstract

For most deep learning practitioners, sequence modeling is synonymous with recurrent networks. Yet recent results indicate that convolutional architectures can outperform recurrent networks on tasks such as audio synthesis and machine translation. Given a new sequence modeling task or dataset, which architecture should one use? We conduct a systematic evaluation of generic convolutional and recurrent architectures for sequence modeling. The models are evaluated across a broad range of standard tasks that are commonly used to benchmark recurrent networks. Our results indicate that a simple convolutional architecture outperforms canonical recurrent networks such as LSTMs across a diverse range of tasks and datasets, while demonstrating longer effective memory. We conclude that the common association between sequence modeling and recurrent networks should be reconsidered, and convolutional networks should be regarded as a natural starting point for sequence modeling tasks. To assist related work, we have made code available at http://github.com/locuslab/TCN .

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 8 baseline 3 method 1

citation-polarity summary

background 8 baseline 3 use method 1

claims ledger

abstract For most deep learning practitioners, sequence modeling is synonymous with recurrent networks. Yet recent results indicate that convolutional architectures can outperform recurrent networks on tasks such as audio synthesis and machine translation. Given a new sequence modeling task or dataset, which architecture should one use? We conduct a systematic evaluation of generic convolutional and recurrent architectures for sequence modeling. The models are evaluated across a broad range of standard tasks that are commonly used to benchmark recurrent networks. Our results indicate that a simple conv

co-cited works

representative citing papers

BAH Dataset for Ambivalence/Hesitancy Recognition in Videos for Digital Behavioural Change

cs.CV · 2025-05-25 · accept · novelty 8.0

Introduces the BAH dataset with 1,427 annotated videos for multimodal recognition of ambivalence/hesitancy in digital behavior change contexts.

Efficiently Modeling Long Sequences with Structured State Spaces

cs.LG · 2021-10-31 · unverdicted · novelty 8.0

S4 is an efficient state space sequence model that captures long-range dependencies via structured parameterization of the SSM, achieving state-of-the-art results on the Long Range Arena and other benchmarks while being faster than Transformers for generation.

Scale-Equivariant Generative Forecasting: Weight-Tied Dilated Convolutions, Wavelet Scattering Inputs, and Spectral-Consistency Training for Self-Similar Time Series

cs.LG · 2026-05-17 · unverdicted · novelty 7.0

Presents SE-WaveNet with weight-tied dilated convolutions plus wavelet and spectral components that reproduces empirical scaling collapse on financial returns while using L times fewer convolutional parameters.

U-STS-LLM A Unified Spatio-Temporal Steered Large Language Model for Traffic Prediction and Imputation

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

U-STS-LLM uses a spatio-temporally steered LLM with dynamic attention bias generation to achieve state-of-the-art results on long-horizon traffic forecasting and high-missing-rate imputation while remaining parameter-efficient.

TailedTS: Benchmark Dataset for Heavy-Tailed Time Series Prediction and Periodicity Quantification

cs.LG · 2026-05-09 · unverdicted · novelty 7.0

TailedTS supplies 24.69 billion Wikipedia page-view records as a public benchmark for heavy-tailed time series forecasting and periodicity analysis, revealing weaker periodic structure in high-traffic pages.

SIGMA-ASL: Sensor-Integrated Multimodal Dataset for Sign Language Recognition

cs.HC · 2026-05-07 · unverdicted · novelty 7.0

SIGMA-ASL is a multimodal dataset with 93,545 word-level ASL clips from Kinect RGB-D, mmWave radar, and dual IMUs, plus benchmarking protocols for single- and multi-modal recognition.

AegisTS: A Hierarchical Agent System with Reinforcement Learning for Multivariate Time Series Data Cleaning

cs.DB · 2026-05-06 · unverdicted · novelty 7.0

AegisTS uses a two-level RL agent architecture with a dual-stage reward to jointly optimize cleaning order and method selection for multivariate time series, delivering up to 96% better cleaning quality and 27% better downstream performance without ground truth.

BadmintonGRF: A Multimodal Dataset and Benchmark for Markerless Ground Reaction Force Estimation in Badminton

cs.CV · 2026-05-03 · unverdicted · novelty 7.0

BadmintonGRF is a new public multimodal dataset and benchmark that pairs multi-view video with instrumented GRF for markerless load estimation in badminton.

GAFSV-Net: A Vision Framework for Online Signature Verification

cs.CV · 2026-04-30 · unverdicted · novelty 7.0

GAFSV-Net encodes online signatures as asymmetric Gramian Angular Field images and processes them with dual-branch ConvNeXt plus cross-attention to outperform sequence-based baselines on DeepSignDB and BiosecurID.

Autocorrelation Reintroduces Spectral Bias in KANs for Time Series Forecasting

cs.LG · 2026-04-26 · unverdicted · novelty 7.0

Temporal autocorrelation reintroduces spectral bias in KANs for time series forecasting, which DCT preprocessing can mitigate.

A Convolutional Neural Network-Derived Catalog of Solar Flares from Soft X-Ray Observations

astro-ph.SR · 2026-04-10 · unverdicted · novelty 7.0

The CNN-derived catalog detects over seven times more solar flares than the GOES catalog and extends the power-law distribution of flare peak fluxes to smaller sizes.

Adversarial Robustness of Deep State Space Models for Forecasting

cs.LG · 2026-04-03 · conditional · novelty 7.0

Spacetime SSM forecasters represent optimal Kalman predictors for autoregressive data but remain vulnerable to model-free attacks that exploit local linearity and increase error by over 33% compared to projected gradient descent.

Self-Supervised Foundation Model for Calcium-imaging Population Dynamics

q-bio.QM · 2026-04-03 · unverdicted · novelty 7.0

CalM uses a discrete tokenizer and dual-axis autoregressive transformer pretrained self-supervised on calcium traces to outperform specialized baselines on population dynamics forecasting and adapt to superior behavior decoding.

T1: One-to-One Channel-Head Binding for Multivariate Time-Series Imputation

cs.LG · 2026-02-24 · conditional · novelty 7.0

T1 uses one-to-one channel-head binding in a CNN-Transformer hybrid to achieve robust multivariate time-series imputation, cutting average MSE by 46% versus the next-best baseline across 11 datasets even at 70% missingness.

MELT: A Behavioral Trace Dataset for High-Risk Memecoin Launch Detection

cs.CR · 2026-02-13 · unverdicted · novelty 7.0

MELT is the first behavioral trace dataset for high-risk memecoin launch detection on Solana, providing 122 features, risk annotations, and ML benchmarks that reduce investment loss when used for selection.

Causal Time Series Generation via Diffusion Models

cs.LG · 2025-09-25 · unverdicted · novelty 7.0

CaTSG is a unified diffusion model for causal time series generation that handles observational, interventional, and counterfactual tasks via backdoor adjustment and abduction-action-prediction.

Sundial: A Family of Highly Capable Time Series Foundation Models

cs.LG · 2025-02-02 · conditional · novelty 7.0

Sundial uses TimeFlow Loss for native pre-training of Transformers on continuous time series from TimeBench, achieving SOTA point and probabilistic forecasting with millisecond inference.

What Causes Performance Degradation in Cross-Subject EEG Classification?

cs.CE · 2024-10-04 · unverdicted · novelty 7.0

Controlled experiments attribute cross-subject EEG classification degradation to inter-subject variability in multi-class tasks and shortcut learning in single-class tasks.

Deep Time Series Models: A Comprehensive Survey and Benchmark

cs.LG · 2024-07-18 · unverdicted · novelty 7.0

This survey and benchmark of deep time series models using the released TSLib library finds that models with specific structures perform well only on distinct analysis tasks.

Time-LLM: Time Series Forecasting by Reprogramming Large Language Models

cs.LG · 2023-10-03 · conditional · novelty 7.0

Time-LLM reprograms frozen LLMs for time series forecasting via text prototypes and Prompt-as-Prefix, outperforming specialized models in standard, few-shot, and zero-shot settings.

Convolutional and Deep Learning based techniques for Time Series Ordinal Classification

cs.LG · 2023-06-16 · unverdicted · novelty 7.0

First benchmarking of ordinal adaptations of CNN and DL methods for time series shows they outperform nominal TSC techniques on ordinal metrics across 29 selected problems.

A Time Series is Worth 64 Words: Long-term Forecasting with Transformers

cs.LG · 2022-11-27 · conditional · novelty 7.0

PatchTST uses subseries patching and channel-independent Transformers to deliver significantly better long-term multivariate time series forecasting and strong self-supervised transfer performance.

Tropical time series, iterated-sums signatures and quasisymmetric functions

math.RA · 2020-09-17 · unverdicted · novelty 7.0

Defines iterated-sums signatures over commutative semirings (tropical case emphasized) for time-series feature extraction and links them to quasisymmetric functions over semirings.

ReactiveGWM: Steering NPC in Reactive Game World Models

cs.CV · 2026-05-14 · unverdicted · novelty 6.0

ReactiveGWM introduces a decoupled diffusion architecture for player-NPC interactions that learns game-agnostic response logic for zero-shot strategy transfer across games.

citing papers explorer

Showing 50 of 86 citing papers.

BAH Dataset for Ambivalence/Hesitancy Recognition in Videos for Digital Behavioural Change cs.CV · 2025-05-25 · accept · none · ref 4 · internal anchor
Introduces the BAH dataset with 1,427 annotated videos for multimodal recognition of ambivalence/hesitancy in digital behavior change contexts.
Efficiently Modeling Long Sequences with Structured State Spaces cs.LG · 2021-10-31 · unverdicted · none · ref 3 · internal anchor
S4 is an efficient state space sequence model that captures long-range dependencies via structured parameterization of the SSM, achieving state-of-the-art results on the Long Range Arena and other benchmarks while being faster than Transformers for generation.
Scale-Equivariant Generative Forecasting: Weight-Tied Dilated Convolutions, Wavelet Scattering Inputs, and Spectral-Consistency Training for Self-Similar Time Series cs.LG · 2026-05-17 · unverdicted · none · ref 2 · internal anchor
Presents SE-WaveNet with weight-tied dilated convolutions plus wavelet and spectral components that reproduces empirical scaling collapse on financial returns while using L times fewer convolutional parameters.
U-STS-LLM A Unified Spatio-Temporal Steered Large Language Model for Traffic Prediction and Imputation cs.LG · 2026-05-12 · unverdicted · none · ref 33 · internal anchor
U-STS-LLM uses a spatio-temporally steered LLM with dynamic attention bias generation to achieve state-of-the-art results on long-horizon traffic forecasting and high-missing-rate imputation while remaining parameter-efficient.
TailedTS: Benchmark Dataset for Heavy-Tailed Time Series Prediction and Periodicity Quantification cs.LG · 2026-05-09 · unverdicted · none · ref 18 · internal anchor
TailedTS supplies 24.69 billion Wikipedia page-view records as a public benchmark for heavy-tailed time series forecasting and periodicity analysis, revealing weaker periodic structure in high-traffic pages.
SIGMA-ASL: Sensor-Integrated Multimodal Dataset for Sign Language Recognition cs.HC · 2026-05-07 · unverdicted · none · ref 7 · internal anchor
SIGMA-ASL is a multimodal dataset with 93,545 word-level ASL clips from Kinect RGB-D, mmWave radar, and dual IMUs, plus benchmarking protocols for single- and multi-modal recognition.
AegisTS: A Hierarchical Agent System with Reinforcement Learning for Multivariate Time Series Data Cleaning cs.DB · 2026-05-06 · unverdicted · none · ref 3 · internal anchor
AegisTS uses a two-level RL agent architecture with a dual-stage reward to jointly optimize cleaning order and method selection for multivariate time series, delivering up to 96% better cleaning quality and 27% better downstream performance without ground truth.
BadmintonGRF: A Multimodal Dataset and Benchmark for Markerless Ground Reaction Force Estimation in Badminton cs.CV · 2026-05-03 · unverdicted · none · ref 2 · internal anchor
BadmintonGRF is a new public multimodal dataset and benchmark that pairs multi-view video with instrumented GRF for markerless load estimation in badminton.
GAFSV-Net: A Vision Framework for Online Signature Verification cs.CV · 2026-04-30 · unverdicted · none · ref 1 · internal anchor
GAFSV-Net encodes online signatures as asymmetric Gramian Angular Field images and processes them with dual-branch ConvNeXt plus cross-attention to outperform sequence-based baselines on DeepSignDB and BiosecurID.
Autocorrelation Reintroduces Spectral Bias in KANs for Time Series Forecasting cs.LG · 2026-04-26 · unverdicted · none · ref 10 · internal anchor
Temporal autocorrelation reintroduces spectral bias in KANs for time series forecasting, which DCT preprocessing can mitigate.
A Convolutional Neural Network-Derived Catalog of Solar Flares from Soft X-Ray Observations astro-ph.SR · 2026-04-10 · unverdicted · none · ref 12 · internal anchor
The CNN-derived catalog detects over seven times more solar flares than the GOES catalog and extends the power-law distribution of flare peak fluxes to smaller sizes.
Adversarial Robustness of Deep State Space Models for Forecasting cs.LG · 2026-04-03 · conditional · none · ref 5 · internal anchor
Spacetime SSM forecasters represent optimal Kalman predictors for autoregressive data but remain vulnerable to model-free attacks that exploit local linearity and increase error by over 33% compared to projected gradient descent.
Self-Supervised Foundation Model for Calcium-imaging Population Dynamics q-bio.QM · 2026-04-03 · unverdicted · none · ref 2 · internal anchor
CalM uses a discrete tokenizer and dual-axis autoregressive transformer pretrained self-supervised on calcium traces to outperform specialized baselines on population dynamics forecasting and adapt to superior behavior decoding.
T1: One-to-One Channel-Head Binding for Multivariate Time-Series Imputation cs.LG · 2026-02-24 · conditional · none · ref 1 · internal anchor
T1 uses one-to-one channel-head binding in a CNN-Transformer hybrid to achieve robust multivariate time-series imputation, cutting average MSE by 46% versus the next-best baseline across 11 datasets even at 70% missingness.
MELT: A Behavioral Trace Dataset for High-Risk Memecoin Launch Detection cs.CR · 2026-02-13 · unverdicted · none · ref 5 · internal anchor
MELT is the first behavioral trace dataset for high-risk memecoin launch detection on Solana, providing 122 features, risk annotations, and ML benchmarks that reduce investment loss when used for selection.
Causal Time Series Generation via Diffusion Models cs.LG · 2025-09-25 · unverdicted · none · ref 3 · internal anchor
CaTSG is a unified diffusion model for causal time series generation that handles observational, interventional, and counterfactual tasks via backdoor adjustment and abduction-action-prediction.
Sundial: A Family of Highly Capable Time Series Foundation Models cs.LG · 2025-02-02 · conditional · none · ref 3 · internal anchor
Sundial uses TimeFlow Loss for native pre-training of Transformers on continuous time series from TimeBench, achieving SOTA point and probabilistic forecasting with millisecond inference.
What Causes Performance Degradation in Cross-Subject EEG Classification? cs.CE · 2024-10-04 · unverdicted · none · ref 6 · internal anchor
Controlled experiments attribute cross-subject EEG classification degradation to inter-subject variability in multi-class tasks and shortcut learning in single-class tasks.
Deep Time Series Models: A Comprehensive Survey and Benchmark cs.LG · 2024-07-18 · unverdicted · none · ref 131 · internal anchor
This survey and benchmark of deep time series models using the released TSLib library finds that models with specific structures perform well only on distinct analysis tasks.
Time-LLM: Time Series Forecasting by Reprogramming Large Language Models cs.LG · 2023-10-03 · conditional · none · ref 71 · internal anchor
Time-LLM reprograms frozen LLMs for time series forecasting via text prototypes and Prompt-as-Prefix, outperforming specialized models in standard, few-shot, and zero-shot settings.
Convolutional and Deep Learning based techniques for Time Series Ordinal Classification cs.LG · 2023-06-16 · unverdicted · none · ref 70 · internal anchor
First benchmarking of ordinal adaptations of CNN and DL methods for time series shows they outperform nominal TSC techniques on ordinal metrics across 29 selected problems.
A Time Series is Worth 64 Words: Long-term Forecasting with Transformers cs.LG · 2022-11-27 · conditional · none · ref 1 · internal anchor
PatchTST uses subseries patching and channel-independent Transformers to deliver significantly better long-term multivariate time series forecasting and strong self-supervised transfer performance.
Tropical time series, iterated-sums signatures and quasisymmetric functions math.RA · 2020-09-17 · unverdicted · none · ref 8 · internal anchor
Defines iterated-sums signatures over commutative semirings (tropical case emphasized) for time-series feature extraction and links them to quasisymmetric functions over semirings.
ReactiveGWM: Steering NPC in Reactive Game World Models cs.CV · 2026-05-14 · unverdicted · none · ref 2 · internal anchor
ReactiveGWM introduces a decoupled diffusion architecture for player-NPC interactions that learns game-agnostic response logic for zero-shot strategy transfer across games.
Clin-JEPA: A Multi-Phase Co-Training Framework for Joint-Embedding Predictive Pretraining on EHR Patient Trajectories cs.LG · 2026-05-11 · unverdicted · none · ref 31 · 2 links · internal anchor
A five-phase co-training framework enables stable JEPA pretraining on EHR trajectories, producing converging latent rollouts and higher multi-task AUROC than baselines on MIMIC-IV ICU data.
What If We Let Forecasting Forget? A Sparse Bottleneck for Cross-Variable Dependencies cs.LG · 2026-05-08 · unverdicted · none · ref 99 · internal anchor
MS-FLOW uses a capacity-limited sparse routing mechanism to model only critical inter-variable dependencies in time series data, achieving state-of-the-art accuracy on 12 benchmarks with fewer but more reliable connections.
Dynamics Aware Quadrupedal Locomotion via Intrinsic Dynamics Head cs.RO · 2026-05-02 · unverdicted · none · ref 28 · internal anchor
Concurrent training of an Intrinsic Dynamics Head with a dynamics reward yields more efficient and smoother quadrupedal locomotion policies that transfer to real robots with 12-18% gains in efficiency metrics.
WISE-FM:Operation-Aware, Engineering-Informed Foundation Model for Multi-Task Well Design cs.LG · 2026-04-26 · unverdicted · none · ref 36 · internal anchor
WISE-FM is a design-aware, physics-informed multi-task foundation model that reduces virtual flow metering error by up to 13x on simulated wells and transfers to real Equinor data with high R-squared values by conditioning on design parameters and enforcing mass conservation.
Data-Driven Open-Loop Simulation for Digital-Twin Operator Decision Support in Wastewater Treatment cs.LG · 2026-04-22 · unverdicted · none · ref 43 · internal anchor
CCSS-RS achieves RMSE 0.696 and CRPS 0.349 at 1000-step horizons on a large public WWTP benchmark with 43% missingness, outperforming Neural CDE baselines by 40-46% in RMSE.
Conditional Attribution for Root Cause Analysis in Time-Series Anomaly Detection cs.LG · 2026-04-19 · unverdicted · none · ref 2 · internal anchor
Conditional attribution retrieves contextually similar normal states from VAE latent spaces and UMAP embeddings to explain time-series anomalies while preserving dependencies, improving root-cause accuracy on SWaT and MSDS benchmarks.
VQ-Wave: A physics-driven spatio-temporal deep learning approach for non-contrast-enhanced lung ventilation and perfusion MRI physics.med-ph · 2026-04-17 · unverdicted · none · ref 23 · internal anchor
VQ-Wave is a physics-driven spatio-temporal neural network that learns to extract ventilation and perfusion maps from non-contrast lung MRI by training on simulated signals with amplitude modulations, frequency drifts, and noise, showing better stability than matrix pencil decomposition in phantoms,
Modern Structure-Aware Simplicial Spatiotemporal Neural Network cs.LG · 2026-04-17 · unverdicted · none · ref 2 · internal anchor
ModernSASST is the first simplicial complex-based spatiotemporal model that combines random walks on high-dimensional complexes with parallelizable temporal convolutional networks for efficient high-order topology capture.
AIBuildAI: An AI Agent for Automatically Building AI Models cs.AI · 2026-04-15 · unverdicted · none · ref 63 · internal anchor
AIBuildAI uses a manager agent and three LLM sub-agents to fully automate AI model development and achieves a 63.1% medal rate on MLE-Bench, matching experienced human engineers.
MISID: A Multimodal Multi-turn Dataset for Complex Intent Recognition in Strategic Deception Games cs.AI · 2026-04-14 · unverdicted · none · ref 3 · internal anchor
MISID is a multimodal multi-turn dataset for intent recognition in strategic deception games, paired with the FRACTAM framework that improves MLLM performance on hidden intent detection via decouple-anchor-reason steps.
BiTA: Bidirectional Gated Recurrent Unit-Transformer Aggregator in a Temporal Graph Network Framework for Alert Prediction in Computer Networks cs.LG · 2026-04-03 · unverdicted · none · ref 3 · internal anchor
BiTA redesigns temporal aggregation in TGNs by jointly using bidirectional GRU for sequential dependencies and Transformer for long-range context to improve alert prediction accuracy on real network data.
A General Framework for Generative Self-supervised Learning in Non-invasive Estimation of Physiological Parameters Using Photoplethysmography eess.SP · 2026-04-03 · unverdicted · none · ref 1 · internal anchor
TS2TC combines cross-temporal fusion generative anchor pretraining with dual-process transfer to achieve 2.49% lower RMSE than prior methods on PPG parameter estimation using only 10% labeled data.
ROMAN: A Multiscale Routing Operator for Convolutional Time Series Models cs.LG · 2026-04-02 · conditional · none · ref 15 · internal anchor
ROMAN converts time series into a shorter multiscale channel representation that lets standard CNN classifiers access scale and coarse-position information explicitly.
WiFlow: A Lightweight WiFi-based Continuous Human Pose Estimation Network with Spatio-Temporal Feature Decoupling cs.CV · 2026-02-09 · accept · none · ref 18 · internal anchor
WiFlow achieves 97.25% PCK@20 and 99.48% PCK@50 on continuous pose estimation from WiFi CSI using a 2.23M-parameter network trained on 360,000 synchronized samples from 5 subjects.
Concurrence: A dependence criterion for time series, applied to biological data eess.SP · 2025-12-17 · unverdicted · none · ref 1 · 2 links · internal anchor
Concurrence detects dependence between time series by training a classifier to separate aligned from misaligned segments.
X-IONet: Cross-Platform Inertial Odometry Network for Pedestrian and Legged Robot cs.RO · 2025-11-11 · unverdicted · none · ref 22 · internal anchor
X-IONet combines rule-based platform classification with a dual-stage attention network to predict displacement and uncertainty from IMU data, then fuses outputs via EKF, achieving reported error reductions on pedestrian and quadruped datasets.
Vision-LLMs for Spatiotemporal Traffic Forecasting cs.LG · 2025-10-13 · conditional · none · ref 3 · internal anchor
ST-Vision-LLM reframes spatiotemporal traffic forecasting as vision-language fusion, using visual encoders on traffic grids and efficient numerical tokenization to achieve 15.6% better long-term accuracy and 30% gains in few-shot cross-domain settings.
Chinese Cyberbullying Detection: Dataset, Method, and Validation cs.CL · 2025-05-27 · unverdicted · none · ref 3 · internal anchor
Introduces CHNCI, the first Chinese cyberbullying incident detection dataset with 220,676 comments across 91 incidents, created via ensemble pseudo-labeling from explanation-generating methods followed by human annotation.
Physically Interpretable World Models via Weakly Supervised Representation Learning cs.LG · 2024-12-17 · unverdicted · none · ref 3 · internal anchor
PIWM aligns latent states in image-based world models with physical variables and constrains their dynamics to known equations via weak distribution supervision, yielding accurate long-horizon predictions and parameter recovery on Cart Pole, Lunar Lander, and Donkey Car.
iTransformer: Inverted Transformers Are Effective for Time Series Forecasting cs.LG · 2023-10-10 · unverdicted · none · ref 2 · internal anchor
By applying attention and feed-forward networks to inverted variate tokens instead of temporal tokens, iTransformer achieves state-of-the-art performance on real-world time series forecasting datasets.
Simplified State Space Layers for Sequence Modeling cs.LG · 2022-08-09 · accept · none · ref 87 · internal anchor
S5 uses a single MIMO state space model with S4-derived initialization to match S4 efficiency and reach 87.4% average accuracy on the Long Range Arena benchmark.
Multi-task Self-Supervised Learning for Human Activity Detection cs.LG · 2019-07-27 · unverdicted · none · ref 5 · internal anchor
A multi-task self-supervised approach trains a temporal CNN to detect transformations on sensory data, yielding features that match or exceed fully supervised performance in semi-supervised and transfer settings for smartphone-based HAR.
R-Transformer: Recurrent Neural Network Enhanced Transformer cs.LG · 2019-07-12 · unverdicted · none · ref 3 · internal anchor
R-Transformer integrates RNNs with multi-head attention to model local and global sequence dependencies without position embeddings and reports large-margin gains over prior methods on diverse tasks.
Bayesian Optimization in Variational Latent Spaces with Dynamic Compression cs.RO · 2019-07-10 · unverdicted · none · ref 36 · internal anchor
Sequential VAE embeds simulated trajectories into latent paths for Bayesian optimization with dynamic compression to enable data-efficient high-dimensional controller tuning on robots.
UTOPYA: A Multimodal Deep Learning Framework for Physics-Informed Anomaly Detection and Time-Series Prediction cs.LG · 2026-05-18 · unverdicted · none · ref 3 · internal anchor
UTOPYA fuses eight modalities via FiLM-conditioned attention and physics-informed regularization to reach AUROC 0.874 for anomaly detection in batch distillation, outperforming baselines by 0.147.
Efficient Serving for Dynamic Agent Workflows with Prediction-based KV-Cache Management cs.LG · 2026-05-07 · unverdicted · none · ref 35 · internal anchor
PBKV predicts agent invocations in dynamic LLM workflows to manage KV-cache reuse, delivering up to 1.85x speedup over LRU and 1.26x over KVFlow.

An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling

hub tools

citation-role summary

citation-polarity summary

claims ledger

co-cited works

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer