pith. sign in

super hub Canonical reference

Long short -term memory

Canonical reference. 74% of citing Pith papers cite this work as background.

128 Pith papers citing it
80.8k external citations · Crossref
Background 74% of classified citations

hub tools

citation-role summary

background 15 baseline 2 method 2

citation-polarity summary

authors

co-cited works

clear filters

representative citing papers

Temporal Graph Networks for Deep Learning on Dynamic Graphs

cs.LG · 2020-06-18 · unverdicted · novelty 7.0

Temporal Graph Networks combine memory modules and graph operators to learn on dynamic graphs as timed event sequences, outperforming prior methods on transductive and inductive tasks while unifying earlier models as special cases.

Language Models as Knowledge Bases?

cs.CL · 2019-09-03 · accept · novelty 7.0

BERT stores relational knowledge extractable via cloze queries without fine-tuning and matches supervised baselines on open-domain QA tasks.

Mixed Precision Training

cs.AI · 2017-10-10 · accept · novelty 7.0

Mixed precision training uses FP16 for most computations, FP32 master weights for accumulation, and loss scaling to enable accurate training of large DNNs with halved memory usage.

Estimation--Prediction Tradeoff in Causal Probabilistic Temporal Graphs

cs.LG · 2026-06-26 · unverdicted · novelty 6.0

Characterizes an estimation-prediction tradeoff in binary logistic models for causal probabilistic temporal graphs and proposes a framework to jointly evaluate temporal link prediction with causal parameter recovery via Cramér-Rao bounds.

citing papers explorer

Showing 3 of 3 citing papers after filters.

  • Efficient Training of Language Models to Fill in the Middle cs.CL · 2022-07-28 · unverdicted · none · ref 85

    Autoregressive language models trained on data with middle spans relocated to the end learn infilling without degrading left-to-right perplexity or sampling quality.

  • Language Models (Mostly) Know What They Know cs.CL · 2022-07-11 · unverdicted · none · ref 228

    Language models show good calibration when asked to estimate the probability that their own answers are correct, with performance improving as models get larger.

  • Sketch of a novel approach to a neural model q-bio.NC · 2022-09-14 · unverdicted · none · ref 45

    The paper sketches a neuron-centric model of neuroplasticity that separates neural transmission from internal signal selection and storage within each neuron rather than relying solely on synaptic weights.