TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Citation Intent Classification	ACL-ARC	BiLSTM-Attention + ELMo	F1	54.6	# 4
Named Entity Recognition (NER)	CoNLL++	BiLSTM-CRF+ELMo	F1	93.42	# 6
Named Entity Recognition (NER)	CoNLL 2003 (English)	BiLSTM-CRF+ELMo	F1	92.22	# 44
Only Connect Walls Dataset Task 1 (Grouping)	OCW	ELMo (LARGE)	# Correct Groups	55 ± 4	# 17
Only Connect Walls Dataset Task 1 (Grouping)	OCW	ELMo (LARGE)	Fowlkes Mallows Score (FMS)	29.5 ± .3	# 16
Only Connect Walls Dataset Task 1 (Grouping)	OCW	ELMo (LARGE)	Adjusted Rand Index (ARI)	11.8 ± .4	# 16
Only Connect Walls Dataset Task 1 (Grouping)	OCW	ELMo (LARGE)	Adjusted Mutual Information (AMI)	14.5 ± .4	# 16
Only Connect Walls Dataset Task 1 (Grouping)	OCW	ELMo (LARGE)	# Solved Walls	0 ± 0	# 10
Only Connect Walls Dataset Task 1 (Grouping)	OCW	ELMo (LARGE)	Wasserstein Distance (WD)	86.3 ± .6	# 3
Semantic Role Labeling	OntoNotes	He et al., 2017 + ELMo	F1	84.6	# 12
Coreference Resolution	OntoNotes	e2e-coref + ELMo	F1	70.4	# 20
Conversational Response Selection	PolyAI Reddit	ELMO	1-of-100 Accuracy	19.3%	# 5
Natural Language Inference	SNLI	ESIM + ELMo Ensemble	% Test Accuracy	89.3	# 17
Natural Language Inference	SNLI	ESIM + ELMo Ensemble	% Train Accuracy	92.1	# 32
Natural Language Inference	SNLI	ESIM + ELMo Ensemble	Parameters	40m	# 4
Natural Language Inference	SNLI	ESIM + ELMo	% Test Accuracy	88.7	# 29
Natural Language Inference	SNLI	ESIM + ELMo	% Train Accuracy	91.6	# 34
Natural Language Inference	SNLI	ESIM + ELMo	Parameters	8.0m	# 4
Question Answering	SQuAD1.1	BiDAF + Self Attention + ELMo (single model)	EM	78.58	# 85
Question Answering	SQuAD1.1	BiDAF + Self Attention + ELMo (single model)	F1	85.833	# 87
Question Answering	SQuAD1.1	BiDAF + Self Attention + ELMo (ensemble)	EM	81.003	# 55
Question Answering	SQuAD1.1	BiDAF + Self Attention + ELMo (ensemble)	F1	87.432	# 63
Question Answering	SQuAD1.1 dev	BiDAF + Self Attention + ELMo	F1	85.6	# 23
Question Answering	SQuAD2.0	BiDAF + Self Attention + ELMo (single model)	EM	63.372	# 266
Question Answering	SQuAD2.0	BiDAF + Self Attention + ELMo (single model)	F1	66.251	# 271
Sentiment Analysis	SST-5 Fine-grained classification	BCN+ELMo	Accuracy	54.7	# 7
Word Sense Disambiguation	Supervised:	ELMo	Senseval 2	71.6	# 23
Word Sense Disambiguation	Supervised:	ELMo	Senseval 3	69.6	# 20
Word Sense Disambiguation	Supervised:	ELMo	SemEval 2007	62.2	# 18
Word Sense Disambiguation	Supervised:	ELMo	SemEval 2013	66.2	# 22
Word Sense Disambiguation	Supervised:	ELMo	SemEval 2015	71.3	# 23

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-contextualized-word-representations/task-1-grouping-on-ocw)](https://paperswithcode.com/sota/task-1-grouping-on-ocw?p=deep-contextualized-word-representations)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-contextualized-word-representations/citation-intent-classification-on-acl-arc)](https://paperswithcode.com/sota/citation-intent-classification-on-acl-arc?p=deep-contextualized-word-representations)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-contextualized-word-representations/conversational-response-selection-on-polyai)](https://paperswithcode.com/sota/conversational-response-selection-on-polyai?p=deep-contextualized-word-representations)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-contextualized-word-representations/named-entity-recognition-on-conll)](https://paperswithcode.com/sota/named-entity-recognition-on-conll?p=deep-contextualized-word-representations)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-contextualized-word-representations/sentiment-analysis-on-sst-5-fine-grained)](https://paperswithcode.com/sota/sentiment-analysis-on-sst-5-fine-grained?p=deep-contextualized-word-representations)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-contextualized-word-representations/semantic-role-labeling-on-ontonotes)](https://paperswithcode.com/sota/semantic-role-labeling-on-ontonotes?p=deep-contextualized-word-representations)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-contextualized-word-representations/natural-language-inference-on-snli)](https://paperswithcode.com/sota/natural-language-inference-on-snli?p=deep-contextualized-word-representations)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-contextualized-word-representations/coreference-resolution-on-ontonotes)](https://paperswithcode.com/sota/coreference-resolution-on-ontonotes?p=deep-contextualized-word-representations)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-contextualized-word-representations/question-answering-on-squad11-dev)](https://paperswithcode.com/sota/question-answering-on-squad11-dev?p=deep-contextualized-word-representations)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-contextualized-word-representations/word-sense-disambiguation-on-supervised)](https://paperswithcode.com/sota/word-sense-disambiguation-on-supervised?p=deep-contextualized-word-representations)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-contextualized-word-representations/named-entity-recognition-ner-on-conll-2003)](https://paperswithcode.com/sota/named-entity-recognition-ner-on-conll-2003?p=deep-contextualized-word-representations)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-contextualized-word-representations/question-answering-on-squad11)](https://paperswithcode.com/sota/question-answering-on-squad11?p=deep-contextualized-word-representations)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-contextualized-word-representations/question-answering-on-squad20)](https://paperswithcode.com/sota/question-answering-on-squad20?p=deep-contextualized-word-representations)`

Deep contextualized word representations

NAACL 2018 · Matthew E. Peters, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark, Kenton Lee, Luke Zettlemoyer ·

We introduce a new type of deep contextualized word representation that models both (1) complex characteristics of word use (e.g., syntax and semantics), and (2) how these uses vary across linguistic contexts (i.e., to model polysemy). Our word vectors are learned functions of the internal states of a deep bidirectional language model (biLM), which is pre-trained on a large text corpus. We show that these representations can be easily added to existing models and significantly improve the state of the art across six challenging NLP problems, including question answering, textual entailment and sentiment analysis. We also present an analysis showing that exposing the deep internals of the pre-trained network is crucial, allowing downstream models to mix different types of semi-supervision signals.

PDF Abstract NAACL 2018 PDF NAACL 2018 Abstract

Code

Add Remove Mark official

flairNLP/flair

13,565

dmlc/gluon-nlp

2,548

allenai/bilm-tf

1,622

Hironsan/anago

1,477

HIT-SCIR/ELMoForManyLangs

1,455

See all 46 implementations

Tasks

Add Remove

Citation Intent Classification

Conversational Response Selection

Coreference Resolution

Language Modelling

Named Entity Recognition (NER)

Natural Language Inference

Only Connect Walls Dataset Task 1 (Grouping)

Question Answering

Semantic Role Labeling

Sentiment Analysis

Datasets

SST

SQuAD

SNLI CoNLL 2003

Reddit SST-5 OntoNotes 5.0 CoNLL Word Sense Disambiguation: a Unified Evaluation Framework and Empirical Comparison CoNLL++ ACL ARC

OCW

Reddit Corpus

Results from the Paper

Edit

Ranked #3 on Only Connect Walls Dataset Task 1 (Grouping) on OCW (Wasserstein Distance (WD) metric, using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Citation Intent Classification	ACL-ARC	BiLSTM-Attention + ELMo	F1	54.6	# 4	Compare
Named Entity Recognition (NER)	CoNLL++	BiLSTM-CRF+ELMo	F1	93.42	# 6	Compare
Named Entity Recognition (NER)	CoNLL 2003 (English)	BiLSTM-CRF+ELMo	F1	92.22	# 44	Compare
Only Connect Walls Dataset Task 1 (Grouping)	OCW	ELMo (LARGE)	# Correct Groups	55 ± 4	# 17	Compare
			Fowlkes Mallows Score (FMS)	29.5 ± .3	# 16	Compare
			Adjusted Rand Index (ARI)	11.8 ± .4	# 16	Compare
			Adjusted Mutual Information (AMI)	14.5 ± .4	# 16	Compare
			# Solved Walls	0 ± 0	# 10	Compare
			Wasserstein Distance (WD)	86.3 ± .6	# 3	Compare
Semantic Role Labeling	OntoNotes	He et al., 2017 + ELMo	F1	84.6	# 12	Compare
Coreference Resolution	OntoNotes	e2e-coref + ELMo	F1	70.4	# 20	Compare
Conversational Response Selection	PolyAI Reddit	ELMO	1-of-100 Accuracy	19.3%	# 5	Compare
Natural Language Inference	SNLI	ESIM + ELMo Ensemble	% Test Accuracy	89.3	# 17	Compare
			% Train Accuracy	92.1	# 32	Compare
			Parameters	40m	# 4	Compare
Natural Language Inference	SNLI	ESIM + ELMo	% Test Accuracy	88.7	# 29	Compare
			% Train Accuracy	91.6	# 34	Compare
			Parameters	8.0m	# 4	Compare
Question Answering	SQuAD1.1	BiDAF + Self Attention + ELMo (single model)	EM	78.58	# 85	Compare
Question Answering	SQuAD1.1	BiDAF + Self Attention + ELMo (single model)	F1	85.833	# 87	Compare
Question Answering	SQuAD1.1	BiDAF + Self Attention + ELMo (ensemble)	EM	81.003	# 55	Compare
Question Answering	SQuAD1.1	BiDAF + Self Attention + ELMo (ensemble)	F1	87.432	# 63	Compare
Question Answering	SQuAD1.1 dev	BiDAF + Self Attention + ELMo	F1	85.6	# 23	Compare
Question Answering	SQuAD2.0	BiDAF + Self Attention + ELMo (single model)	EM	63.372	# 266	Compare
Question Answering	SQuAD2.0	BiDAF + Self Attention + ELMo (single model)	F1	66.251	# 271	Compare
Sentiment Analysis	SST-5 Fine-grained classification	BCN+ELMo	Accuracy	54.7	# 7	Compare
Word Sense Disambiguation	Supervised:	ELMo	Senseval 2	71.6	# 23	Compare
			Senseval 3	69.6	# 20	Compare
			SemEval 2007	62.2	# 18	Compare
			SemEval 2013	66.2	# 22	Compare
			SemEval 2015	71.3	# 23	Compare

Methods

Add Remove

BiLSTM • ELMo • LSTM • Sigmoid Activation • Softmax • Tanh Activation

Edit Social Preview

Deep contextualized word representations

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove