Discourse-Based Objectives for Fast Unsupervised Sentence Representation Learning

David Sontag; Samuel R. Bowman; Yacine Jernite

arxiv: 1705.00557 · v1 · pith:2IGAMTBSnew · submitted 2017-04-23 · 💻 cs.CL · cs.LG· cs.NE· stat.ML

Discourse-Based Objectives for Fast Unsupervised Sentence Representation Learning

Yacine Jernite , Samuel R. Bowman , David Sontag This is my paper

classification 💻 cs.CL cs.LGcs.NEstat.ML

keywords modelsobjectivesentencetrainunsupervisedallowingcoherencediscourse

0 comments

read the original abstract

This work presents a novel objective function for the unsupervised training of neural network sentence encoders. It exploits signals from paragraph-level discourse coherence to train these models to understand text. Our objective is purely discriminative, allowing us to train models many times faster than was possible under prior methods, and it yields models which perform well in extrinsic evaluations.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
cs.CL 2019-09 accept novelty 7.0

ALBERT reduces BERT parameters via embedding factorization and layer sharing, adds inter-sentence coherence pretraining, and reaches SOTA on GLUE, RACE, and SQuAD with fewer parameters than BERT-large.
Atlas: Few-shot Learning with Retrieval Augmented Language Models
cs.CL 2022-08 unverdicted novelty 6.0

Atlas reaches over 42% accuracy on Natural Questions with only 64 examples, outperforming a 540B-parameter model by 3% with 50x fewer parameters.
Unsupervised Dense Information Retrieval with Contrastive Learning
cs.IR 2021-12 unverdicted novelty 6.0

Contrastive learning trains unsupervised dense retrievers that beat BM25 on most BEIR datasets and support cross-lingual retrieval across scripts.
Learning Compressed Sentence Representations for On-Device Text Processing
cs.CL 2019-06 unverdicted novelty 5.0

Four binarization strategies turn continuous sentence embeddings into binary form, cutting storage by over 98% with only about 2% performance drop on downstream tasks.