Probing Biomedical Embeddings from Language Models

Bhuwan Dhingra; Qiao Jin; William W. Cohen; Xinghua Lu

arxiv: 1904.02181 · v1 · pith:IX4LF25C · submitted 2019-04-03 · cs.CL

Qiao Jin, Bhuwan Dhingra, William W. Cohen, Xinghua Lu

classification cs.CL

keywords biomedical, biobert, bioelmo, embeddings, models, probing, tasks, additional

Contextualized word embeddings derived from pre-trained language models (LMs) show significant improvements on downstream NLP tasks. Pre-training on domain-specific corpora, such as biomedical articles, further improves their performance. In this paper, we conduct probing experiments to determine what additional information is carried intrinsically by the in-domain trained contextualized embeddings. To this end, we use the pre-trained LMs as fixed feature extractors and restrict the downstream task models so that they have no additional sequence modeling layers. We compare BERT, ELMo, BioBERT and BioELMo, a biomedical version of ELMo trained on 10M PubMed abstracts. Surprisingly, while fine-tuned BioBERT is better than BioELMo in biomedical NER and NLI tasks, as a fixed feature extractor BioELMo outperforms BioBERT in our probing tasks. We use visualization and nearest neighbor analysis to show that better encoding of entity-type and relational information leads to this superiority.
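
The probing setup described in the abstract (frozen embeddings, no extra sequence modeling layers on top) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `train_linear_probe` and `predict` are hypothetical, and the feature matrix is synthetic data standing in for per-token embeddings that would normally be extracted once from a frozen LM such as BioELMo or BioBERT. The probe is a single softmax layer applied to each token vector independently, so any accuracy it reaches must come from information already present in the embeddings.

```python
import numpy as np

def train_linear_probe(features, labels, num_classes, lr=0.01, epochs=300):
    """Fit a softmax classifier on frozen per-token features.

    Only W and b are trained; the features are never updated, and each
    token is classified independently (no sequence modeling layer).
    """
    n, d = features.shape
    W = np.zeros((d, num_classes))
    b = np.zeros(num_classes)
    onehot = np.eye(num_classes)[labels]
    for _ in range(epochs):
        logits = features @ W + b
        logits -= logits.max(axis=1, keepdims=True)  # numerical stability
        probs = np.exp(logits)
        probs /= probs.sum(axis=1, keepdims=True)
        grad = (probs - onehot) / n                  # d(cross-entropy)/d(logits)
        W -= lr * (features.T @ grad)
        b -= lr * grad.sum(axis=0)
    return W, b

def predict(features, W, b):
    """Return the highest-scoring class for each token vector."""
    return (features @ W + b).argmax(axis=1)

# Synthetic stand-in for frozen embeddings (assumption): 300 tokens,
# 16 dimensions, 3 entity types, each type clustered around its own center.
rng = np.random.default_rng(0)
centers = rng.normal(size=(3, 16)) * 2.0
labels = rng.integers(0, 3, size=300)
features = centers[labels] + rng.normal(size=(300, 16))

W, b = train_linear_probe(features, labels, num_classes=3)
accuracy = (predict(features, W, b) == labels).mean()
print(f"probe accuracy on synthetic features: {accuracy:.2f}")
```

In a real probing experiment, `features` would be replaced by embeddings computed once from the frozen LM, and accuracy would be reported on held-out data. Keeping the probe this shallow is the design choice that makes the comparison between embedding sources meaningful.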

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

PubMedQA: A Dataset for Biomedical Research Question Answering
cs.CL 2019-09 unverdicted novelty 7.0

PubMedQA supplies 273k+ biomedical QA instances that require reasoning over research abstracts to produce yes/no/maybe answers.