hub Mixed citations

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

URL http://arxiv · 2016 · cs.DC · arXiv 1603.04467

Mixed citation behavior. Most common role is method (64%).

53 Pith papers citing it

Method 64% of classified citations

open full Pith review browse 53 citing papers arXiv PDF

abstract

TensorFlow is an interface for expressing machine learning algorithms, and an implementation for executing such algorithms. A computation expressed using TensorFlow can be executed with little or no change on a wide variety of heterogeneous systems, ranging from mobile devices such as phones and tablets up to large-scale distributed systems of hundreds of machines and thousands of computational devices such as GPU cards. The system is flexible and can be used to express a wide variety of algorithms, including training and inference algorithms for deep neural network models, and it has been used for conducting research and for deploying machine learning systems into production across more than a dozen areas of computer science and other fields, including speech recognition, computer vision, robotics, information retrieval, natural language processing, geographic information extraction, and computational drug discovery. This paper describes the TensorFlow interface and an implementation of that interface that we have built at Google. The TensorFlow API and a reference implementation were released as an open-source package under the Apache 2.0 license in November, 2015 and are available at www.tensorflow.org.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

method 7 background 3 other 1

citation-polarity summary

use method 7 background 3 unclear 1

representative citing papers

Deep Variational Information Bottleneck

cs.LG · 2016-12-01 · unverdicted · novelty 8.0

Deep VIB is a neural-network parameterization of the information bottleneck objective trained via variational inference and the reparameterization trick, yielding improved generalization and adversarial robustness.

Floating-Point Networks with Automatic Differentiation Can Represent Almost All Floating-Point Functions and Their Gradients

cs.LG · 2026-05-03 · unverdicted · novelty 8.0

Floating-point neural networks with automatic differentiation can represent arbitrary floating-point functions and their gradients under mild conditions.

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer

cs.LG · 2017-01-23 · accept · novelty 8.0

A noisy top-k gated mixture-of-experts layer between LSTMs scales neural networks to 137B parameters with sub-linear compute, beating SOTA on language modeling and machine translation.

Density estimation using Real NVP

cs.LG · 2016-05-27 · accept · novelty 8.0

Real NVP uses affine coupling layers to create invertible transformations that support exact density estimation, sampling, and latent inference without approximations.

RLGT: A reinforcement learning framework for extremal graph theory

cs.LG · 2026-02-19 · unverdicted · novelty 7.0

RLGT is a modular reinforcement learning framework for extremal graph theory that handles undirected, directed, looped, and multi-colored graphs to facilitate future research.

IRNet: A General Purpose Deep Residual Regression Framework for Materials Discovery

physics.comp-ph · 2019-07-07 · unverdicted · novelty 7.0

IRNet uses per-layer residual shortcuts in fully connected networks to achieve better prediction accuracy and training convergence than prior ML methods on OQMD and Materials Project datasets for material properties.

Deep reinforcement learning from human preferences

stat.ML · 2017-06-12 · accept · novelty 7.0

Reinforcement learning agents solve complex tasks without access to the reward function by training a reward predictor from human comparisons of trajectory segments, requiring feedback on less than 1% of interactions.

SMART: A Spectral Transfer Approach to Multi-Task Learning

cs.LG · 2026-04-22 · unverdicted · novelty 7.0

SMART transfers knowledge in multi-task linear regression via spectral subspace similarity assumptions, achieving near-minimax Frobenius error rates while requiring only a fitted source model.

The Kinetics Human Action Video Dataset

cs.CV · 2017-05-19 · accept · novelty 7.0

Kinetics is a new video dataset of 400 human actions with over 160000 ten-second clips collected from YouTube, accompanied by baseline action-classification results from neural networks.

HyperNetworks

cs.LG · 2016-09-27 · unverdicted · novelty 7.0

Hypernetworks generate weights for a main network, allowing LSTMs to use non-shared weights and achieve near state-of-the-art results on sequence modeling tasks while using fewer parameters overall.

OAM-Induced Lattice Rotation Reveals a Fractional Optimum in Fault-Tolerant GKP Quantum Sensing

quant-ph · 2026-05-13 · unverdicted · novelty 6.0 · 2 refs

Fractional OAM charge ℓ=1.5 induces an optimal 67.5° GKP lattice rotation that reduces error rate 23.9× with <0.2% loss in Fisher information and yields 41% higher metrological capacity.

A Data-Driven Parametric Reduced-Order Chemical Kinetics Model Derived from Atomistic Simulations

physics.chem-ph · 2026-05-05 · unverdicted · novelty 6.0

A parametric autoencoder with non-negativity and softmax constraints learns interpretable latent chemical components and couples them to kinetics and heat release for improved reduced-order modeling of decomposition.

Image reconstruction with the JWST Interferometer

astro-ph.IM · 2025-10-13 · unverdicted · novelty 6.0

Dorito enables diffraction-limited image reconstruction from JWST AMI observations by deconvolving images or Fourier observables using maximum entropy and total variation regularization.

ShinkaEvolve: Towards Open-Ended And Sample-Efficient Program Evolution

cs.CL · 2025-09-17 · unverdicted · novelty 6.0

ShinkaEvolve improves sample efficiency in LLM-driven program evolution via parent sampling, code novelty rejection-sampling, and bandit LLM ensemble selection, achieving new SOTA circle packing with 150 samples and gains on math reasoning and competitive programming tasks.

Generative AI for image reconstruction in Intensity Interferometry: a first attempt

astro-ph.IM · 2025-08-05 · unverdicted · novelty 6.0

A conditional GAN reconstructs shape, size, and brightness distributions of simulated fast-rotating stars from intensity interferometry power spectra obtained with 6- and 9-telescope arrays.

Electroweak diboson production in association with a high-mass dijet system in semileptonic final states from $pp$ collisions at $\sqrt{s} = 13$ TeV with the ATLAS detector

hep-ex · 2025-03-21 · accept · novelty 6.0

Electroweak diboson plus high-mass dijet production observed at 7.4 sigma with signal strength 1.28, plus first semileptonic-channel limits on S02, T0 and M0 Wilson coefficients.

Convolutional Sparse Support Estimator Network (CSEN) From energy efficient support estimation to learning-aided Compressive Sensing

eess.SP · 2020-03-02 · unverdicted · novelty 6.0

CSEN is a compact convolutional neural network trained to estimate sparse support sets directly from measurements, claiming state-of-the-art accuracy at lower computational cost than iterative methods.

Jointly Aligning and Predicting Continuous Emotion Annotations

cs.LG · 2019-07-05 · unverdicted · novelty 6.0

A multi-delay sinc network jointly aligns speech signals with delayed continuous emotion labels and predicts arousal/valence, claiming state-of-the-art speech-only results on RECOLA and SEWA.

Who said that?: Audio-visual speaker diarisation of real-world meetings

cs.SD · 2019-06-24 · unverdicted · novelty 6.0

An iterative audio-visual approach for speaker diarisation in real-world meetings that enrolls speaker models via correspondence and outperforms prior methods on the AMI corpus.

Real-time Surface-Code Error Correction Using an FPGA-based Neural-Network Decoder

quant-ph · 2026-05-06 · unverdicted · novelty 6.0

An FPGA-based neural-network decoder achieves 550 ns deterministic closed-loop latency for real-time distance-3 surface code error correction on a superconducting processor, matching offline decoding performance.

Amortized Variational Inference for Joint Posterior and Predictive Distributions in Bayesian Uncertainty Quantification

stat.ML · 2026-05-05 · unverdicted · novelty 6.0

An amortized variational framework jointly targets the posterior and posterior-predictive distributions via a KL upper bound and moment regularization, yielding more accurate predictions at lower online cost than two-stage variational inference.

On Model-Based Clustering With Entropic Optimal Transport

stat.ME · 2026-05-05 · unverdicted · novelty 6.0

Entropic optimal transport yields a clustering loss with the same global optimum as log-likelihood but a better-behaved optimization surface, outperforming standard EM in experiments.

Alikhanov-XfPINNs: Adaptive Physics-Informed Learning for Nonlinear Fractional PDEs on Nonuniform Meshes

math.NA · 2026-05-02 · unverdicted · novelty 6.0

Alikhanov-XfPINNs integrates accelerated Alikhanov discretization on nonuniform time grids with physics-informed neural networks to solve general nonlinear fractional PDEs for both forward and inverse problems with improved efficiency and handling of initial singularities.

TCL: Enabling Fast and Efficient Cross-Hardware Tensor Program Optimization via Continual Learning

cs.LG · 2026-04-14 · conditional · novelty 6.0

TCL delivers 16.8x faster tuning on CPU and 12.48x on GPU with modestly lower inference latency by combining RDU active sampling, a lightweight Mamba cost model, and cross-platform continual knowledge distillation.

citing papers explorer

Showing 50 of 53 citing papers.

Deep Variational Information Bottleneck cs.LG · 2016-12-01 · unverdicted · none · ref 1 · internal anchor
Deep VIB is a neural-network parameterization of the information bottleneck objective trained via variational inference and the reparameterization trick, yielding improved generalization and adversarial robustness.
Floating-Point Networks with Automatic Differentiation Can Represent Almost All Floating-Point Functions and Their Gradients cs.LG · 2026-05-03 · unverdicted · none · ref 52
Floating-point neural networks with automatic differentiation can represent arbitrary floating-point functions and their gradients under mild conditions.
Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer cs.LG · 2017-01-23 · accept · none · ref 2
A noisy top-k gated mixture-of-experts layer between LSTMs scales neural networks to 137B parameters with sub-linear compute, beating SOTA on language modeling and machine translation.
Density estimation using Real NVP cs.LG · 2016-05-27 · accept · none · ref 1
Real NVP uses affine coupling layers to create invertible transformations that support exact density estimation, sampling, and latent inference without approximations.
RLGT: A reinforcement learning framework for extremal graph theory cs.LG · 2026-02-19 · unverdicted · none · ref 1 · internal anchor
RLGT is a modular reinforcement learning framework for extremal graph theory that handles undirected, directed, looped, and multi-colored graphs to facilitate future research.
IRNet: A General Purpose Deep Residual Regression Framework for Materials Discovery physics.comp-ph · 2019-07-07 · unverdicted · none · ref 3 · internal anchor
IRNet uses per-layer residual shortcuts in fully connected networks to achieve better prediction accuracy and training convergence than prior ML methods on OQMD and Materials Project datasets for material properties.
Deep reinforcement learning from human preferences stat.ML · 2017-06-12 · accept · none · ref 1 · internal anchor
Reinforcement learning agents solve complex tasks without access to the reward function by training a reward predictor from human comparisons of trajectory segments, requiring feedback on less than 1% of interactions.
SMART: A Spectral Transfer Approach to Multi-Task Learning cs.LG · 2026-04-22 · unverdicted · none · ref 39
SMART transfers knowledge in multi-task linear regression via spectral subspace similarity assumptions, achieving near-minimax Frobenius error rates while requiring only a fitted source model.
The Kinetics Human Action Video Dataset cs.CV · 2017-05-19 · accept · none · ref 1
Kinetics is a new video dataset of 400 human actions with over 160000 ten-second clips collected from YouTube, accompanied by baseline action-classification results from neural networks.
HyperNetworks cs.LG · 2016-09-27 · unverdicted · none · ref 1
Hypernetworks generate weights for a main network, allowing LSTMs to use non-shared weights and achieve near state-of-the-art results on sequence modeling tasks while using fewer parameters overall.
OAM-Induced Lattice Rotation Reveals a Fractional Optimum in Fault-Tolerant GKP Quantum Sensing quant-ph · 2026-05-13 · unverdicted · none · ref 35 · 2 links · internal anchor
Fractional OAM charge ℓ=1.5 induces an optimal 67.5° GKP lattice rotation that reduces error rate 23.9× with <0.2% loss in Fisher information and yields 41% higher metrological capacity.
A Data-Driven Parametric Reduced-Order Chemical Kinetics Model Derived from Atomistic Simulations physics.chem-ph · 2026-05-05 · unverdicted · none · ref 65 · internal anchor
A parametric autoencoder with non-negativity and softmax constraints learns interpretable latent chemical components and couples them to kinetics and heat release for improved reduced-order modeling of decomposition.
Image reconstruction with the JWST Interferometer astro-ph.IM · 2025-10-13 · unverdicted · none · ref 1 · internal anchor
Dorito enables diffraction-limited image reconstruction from JWST AMI observations by deconvolving images or Fourier observables using maximum entropy and total variation regularization.
ShinkaEvolve: Towards Open-Ended And Sample-Efficient Program Evolution cs.CL · 2025-09-17 · unverdicted · none · ref 33 · internal anchor
ShinkaEvolve improves sample efficiency in LLM-driven program evolution via parent sampling, code novelty rejection-sampling, and bandit LLM ensemble selection, achieving new SOTA circle packing with 150 samples and gains on math reasoning and competitive programming tasks.
Generative AI for image reconstruction in Intensity Interferometry: a first attempt astro-ph.IM · 2025-08-05 · unverdicted · none · ref 1 · internal anchor
A conditional GAN reconstructs shape, size, and brightness distributions of simulated fast-rotating stars from intensity interferometry power spectra obtained with 6- and 9-telescope arrays.
Electroweak diboson production in association with a high-mass dijet system in semileptonic final states from $pp$ collisions at $\sqrt{s} = 13$ TeV with the ATLAS detector hep-ex · 2025-03-21 · accept · none · ref 101 · internal anchor
Electroweak diboson plus high-mass dijet production observed at 7.4 sigma with signal strength 1.28, plus first semileptonic-channel limits on S02, T0 and M0 Wilson coefficients.
Convolutional Sparse Support Estimator Network (CSEN) From energy efficient support estimation to learning-aided Compressive Sensing eess.SP · 2020-03-02 · unverdicted · none · ref 55 · internal anchor
CSEN is a compact convolutional neural network trained to estimate sparse support sets directly from measurements, claiming state-of-the-art accuracy at lower computational cost than iterative methods.
Jointly Aligning and Predicting Continuous Emotion Annotations cs.LG · 2019-07-05 · unverdicted · none · ref 73 · internal anchor
A multi-delay sinc network jointly aligns speech signals with delayed continuous emotion labels and predicts arousal/valence, claiming state-of-the-art speech-only results on RECOLA and SEWA.
Who said that?: Audio-visual speaker diarisation of real-world meetings cs.SD · 2019-06-24 · unverdicted · none · ref 7 · internal anchor
An iterative audio-visual approach for speaker diarisation in real-world meetings that enrolls speaker models via correspondence and outperforms prior methods on the AMI corpus.
Real-time Surface-Code Error Correction Using an FPGA-based Neural-Network Decoder quant-ph · 2026-05-06 · unverdicted · none · ref 72
An FPGA-based neural-network decoder achieves 550 ns deterministic closed-loop latency for real-time distance-3 surface code error correction on a superconducting processor, matching offline decoding performance.
Amortized Variational Inference for Joint Posterior and Predictive Distributions in Bayesian Uncertainty Quantification stat.ML · 2026-05-05 · unverdicted · none · ref 24
An amortized variational framework jointly targets the posterior and posterior-predictive distributions via a KL upper bound and moment regularization, yielding more accurate predictions at lower online cost than two-stage variational inference.
On Model-Based Clustering With Entropic Optimal Transport stat.ME · 2026-05-05 · unverdicted · none · ref 199
Entropic optimal transport yields a clustering loss with the same global optimum as log-likelihood but a better-behaved optimization surface, outperforming standard EM in experiments.
Alikhanov-XfPINNs: Adaptive Physics-Informed Learning for Nonlinear Fractional PDEs on Nonuniform Meshes math.NA · 2026-05-02 · unverdicted · none · ref 42
Alikhanov-XfPINNs integrates accelerated Alikhanov discretization on nonuniform time grids with physics-informed neural networks to solve general nonlinear fractional PDEs for both forward and inverse problems with improved efficiency and handling of initial singularities.
TCL: Enabling Fast and Efficient Cross-Hardware Tensor Program Optimization via Continual Learning cs.LG · 2026-04-14 · conditional · none · ref 1
TCL delivers 16.8x faster tuning on CPU and 12.48x on GPU with modestly lower inference latency by combining RDU active sampling, a lightweight Mamba cost model, and cross-platform continual knowledge distillation.
MONAI: An open-source framework for deep learning in healthcare cs.LG · 2022-11-04 · accept · none · ref 31
MONAI is a community-supported PyTorch framework that extends deep learning to medical data with domain-specific architectures, transforms, and deployment tools.
Rethinking Atrous Convolution for Semantic Image Segmentation cs.CV · 2017-06-17 · unverdicted · none · ref 1
DeepLabv3 improves semantic segmentation by capturing multi-scale context with cascaded or parallel atrous convolutions and adding global context to ASPP, achieving better results on PASCAL VOC 2012 without DenseCRF post-processing.
Free surfaces in turbulence -- A unified framework from water surfaces to elastic solids physics.flu-dyn · 2026-05-22 · unverdicted · none · ref 11 · internal anchor
Linear theory predicts regimes for deformable surfaces in turbulence where the interface is enslaved by flow or shows intrinsic dynamics; simulations of air-water and rubber match predictions without wave turbulence.
Rethinking Expert Trajectory Utilization in LLM Post-training for Mathematical Reasoning cs.LG · 2025-12-12 · unverdicted · none · ref 2 · internal anchor
Sequential SFT followed by RL, guided by the Plasticity-Ceiling Framework, achieves higher performance ceilings in LLM mathematical reasoning than synchronized methods by optimizing data scale and transition timing.
Neural network-based deconvolution for GeV-Scale Gamma-Ray Spectroscopy physics.ins-det · 2025-12-01 · unverdicted · none · ref 51 · internal anchor
A denoising autoencoder followed by a U-Net reconstructs incident gamma spectra from measured positron spectra in a Monte Carlo-optimized spectrometer for multi-MeV to GeV energies.
High-precision measurement of the W boson mass with the CMS experiment hep-ex · 2024-12-18 · unverdicted · none · ref 51 · internal anchor
CMS measures the W boson mass as 80360.2 ± 9.9 MeV from 2016 data, consistent with the Standard Model prediction.
Usenix'23 Extended Version: Smart Learning to Find Dumb Contracts cs.CR · 2023-04-21 · unverdicted · none · ref 1 · internal anchor
DLVA trains neural networks on bytecode to match Slither source labels at 92.7% accuracy and 0.2 seconds per contract while outperforming nine other tools at 99.7% average accuracy.
Adversarial optimization for joint registration and segmentation in prostate CT radiotherapy eess.IV · 2019-06-28 · unverdicted · none · ref 15 · internal anchor
An end-to-end 3D adversarial network estimates deformation vector fields to align CT images and propagate segmentations, showing improved performance and speed over elastix for prostate radiotherapy.
Lit2Vec: A Reproducible Workflow for Building a Legally Screened Chemistry Corpus from S2ORC for Downstream Retrieval and Text Mining cs.DB · 2026-04-14 · unverdicted · none · ref 23
Lit2Vec delivers a documented, reproducible pipeline that extracts and annotates a large licensed chemistry paper corpus from S2ORC with paragraph embeddings and subfield labels.
SecureAFL: Secure Asynchronous Federated Learning cs.CR · 2026-04-04 · conditional · none · ref 5
SecureAFL secures asynchronous federated learning against poisoning attacks by detecting anomalous updates, estimating missing client contributions, and using Byzantine-robust aggregation.
Harnessing AI for Inverse Partial Differential Equation Problems: Past, Present, and Prospects cs.AI · 2026-05-16 · unverdicted · none · ref 1 · internal anchor
A survey organizing AI methods for inverse PDE problems into inverse problems, inverse design, and control categories, covering applications and future challenges like physics-informed models and uncertainty quantification.
Automated Big Data Quality Assessment using Knowledge Graph Embeddings cs.LG · 2026-05-12 · unverdicted · none · ref 46 · internal anchor
Knowledge graph embeddings predict missing connections to generate context-specific data quality assessment plans, tested on a radiation sensor dataset.
Advance Warning Methodologies for COVID-19 using Chest X-Ray Images eess.IV · 2020-06-07 · unverdicted · none · ref 56 · internal anchor
Introduces the Early-QaTa-COV19 dataset and reports that CSEN reaches over 97% sensitivity and over 95.5% specificity for early COVID-19 detection from X-rays.
Convolutional Sparse Support Estimator Based Covid-19 Recognition from X-ray Images eess.IV · 2020-05-08 · unverdicted · none · ref 49 · internal anchor
Introduces CSEN, a non-iterative network bridging sparse representation and deep learning, for Covid-19 detection from X-ray images with limited training data.
Automatically Learning Construction Injury Precursors from Text cs.CL · 2019-07-26 · unverdicted · none · ref 58 · internal anchor
Standard NLP classifiers can surface valid injury precursors from raw construction safety reports.
Deformable Registration Using Average Geometric Transformations for Brain MR Images cs.CV · 2019-07-23 · unverdicted · none · ref 12 · internal anchor
The method augments VoxelMorph with Jacobian and curl channels plus an average-transformation atlas and reports higher Dice scores and more valid Jacobians than the original VoxelMorph on ADNI and MRBrainS18 data.
An Actor-Critic-Attention Mechanism for Deep Reinforcement Learning in Multi-view Environments cs.LG · 2019-07-19 · unverdicted · none · ref 1 · internal anchor
An attention-augmented actor-critic agent learns to dynamically weight multiple environment views by importance and outperforms baselines on TORCS and three other 3D simulators under noise and partial observability.
Improving Semantic Segmentation via Dilated Affinity cs.CV · 2019-07-16 · unverdicted · none · ref 1 · internal anchor
Dilated affinity is jointly predicted with segmentation labels to strengthen features and support efficient label propagation refinement on benchmark datasets.
Autoencoding sensory substitution q-bio.NC · 2019-07-14 · unverdicted · none · ref 206 · internal anchor
Deep recurrent autoencoders convert images to shortened audio signals that incorporate hearing models, enabling above-chance hand posture discrimination and object reaching after a few hours of training instead of months.
Knowledge-incorporating ESIM models for Response Selection in Retrieval-based Dialog Systems cs.CL · 2019-07-11 · unverdicted · none · ref 2 · internal anchor
K-ESIM and T-ESIM extend ESIM by incorporating domain knowledge and similar-dialog information, yielding preliminary accuracy gains on Ubuntu and Advising datasets for next-utterance selection.
ELKPPNet: An Edge-aware Neural Network with Large Kernel Pyramid Pooling for Learning Discriminative Features in Semantic Segmentation cs.CV · 2019-06-27 · unverdicted · none · ref 29 · internal anchor
ELKPPNet combines a balanced encoder-decoder, large kernel spatial pyramid pooling for multi-scale fusion, and an edge-aware loss to claim superior semantic segmentation performance on Cityscapes, CamVid, and NYUDv2 versus prior methods.
Brain MR Image Segmentation in Small Dataset with Adversarial Defense and Task Reorganization eess.IV · 2019-06-25 · unverdicted · none · ref 8 · internal anchor
The method reaches 84.46% Dice score on brain MR segmentation of gray matter, white matter and major regions using only seven training subjects via adversarial defense and hierarchical task reorganization.
A numerical study into neural network surrogate model performance for uncertainty propagation stat.ML · 2026-05-15 · unverdicted · none · ref 45 · internal anchor
Numerical study comparing feedforward NN and DeepONet with data-driven and physics-informed losses on stochastic heat equation, highlighting larger errors at distribution tails due to extrapolation.
A Pedagogical Framework for Physics-Informed Machine Learning: From Classical Pendulum to Quantum Anharmonic Oscillator Using PyTorch on Modern GPU Hardware quant-ph · 2025-02-08 · unverdicted · none · ref 1 · internal anchor
A pedagogical framework implements and benchmarks ANN, CNN, LSTM, and PINN models for classical pendulum and quantum oscillator systems with reported MAEs and GPU speedups.
Array Programming with NumPy cs.MS · 2020-06-18 · unverdicted · none · ref 40 · internal anchor
NumPy provides array programming tools that form the foundation of the scientific Python ecosystem and enable data analysis across many disciplines.
Short-term Electric Load Forecasting Using TensorFlow and Deep Auto-Encoders eess.SP · 2019-07-21 · unverdicted · none · ref 29 · internal anchor
A TensorFlow-based deep auto-encoder model is proposed for short-term electric load forecasting and claimed to outperform traditional neural networks in accuracy and stability.

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer