TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Text Classification	AG News	ULMFiT	Error	5.01	# 4
Text Classification	DBpedia	ULMFiT	Error	0.80	# 6
Sentiment Analysis	IMDb	ULMFiT	Accuracy	95.4	# 15
Text Classification	TREC-6	ULMFiT	Error	3.6	# 5
Sentiment Analysis	Yelp Binary classification	ULMFiT	Error	2.16	# 7
Sentiment Analysis	Yelp Fine-grained classification	ULMFiT	Error	29.98	# 5

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/universal-language-model-fine-tuning-for-text/text-classification-on-ag-news)](https://paperswithcode.com/sota/text-classification-on-ag-news?p=universal-language-model-fine-tuning-for-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/universal-language-model-fine-tuning-for-text/text-classification-on-trec-6)](https://paperswithcode.com/sota/text-classification-on-trec-6?p=universal-language-model-fine-tuning-for-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/universal-language-model-fine-tuning-for-text/sentiment-analysis-on-yelp-fine-grained)](https://paperswithcode.com/sota/sentiment-analysis-on-yelp-fine-grained?p=universal-language-model-fine-tuning-for-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/universal-language-model-fine-tuning-for-text/text-classification-on-dbpedia)](https://paperswithcode.com/sota/text-classification-on-dbpedia?p=universal-language-model-fine-tuning-for-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/universal-language-model-fine-tuning-for-text/sentiment-analysis-on-yelp-binary)](https://paperswithcode.com/sota/sentiment-analysis-on-yelp-binary?p=universal-language-model-fine-tuning-for-text)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/universal-language-model-fine-tuning-for-text/sentiment-analysis-on-imdb)](https://paperswithcode.com/sota/sentiment-analysis-on-imdb?p=universal-language-model-fine-tuning-for-text)`

Universal Language Model Fine-tuning for Text Classification

ACL 2018 · Jeremy Howard, Sebastian Ruder ·

Inductive transfer learning has greatly impacted computer vision, but existing approaches in NLP still require task-specific modifications and training from scratch. We propose Universal Language Model Fine-tuning (ULMFiT), an effective transfer learning method that can be applied to any task in NLP, and introduce techniques that are key for fine-tuning a language model. Our method significantly outperforms the state-of-the-art on six text classification tasks, reducing the error by 18-24% on the majority of datasets. Furthermore, with only 100 labeled examples, it matches the performance of training from scratch on 100x more data. We open-source our pretrained models and code.

PDF Abstract ACL 2018 PDF ACL 2018 Abstract

Code

Add Remove Mark official

fastai/fastai official

↳ Quickstart in

Colab

25,608

mrdbourke/tensorflow-deep-learning

4,860

Socialbird-AILab/BERT-Classificatio…

526

cstorm125/thai2fit

188

cahya-wirawan/indonesian-language-m…

149

See all 65 implementations

Tasks

Add Remove

General Classification

Language Modelling

Sentiment Analysis

Text Classification

Transfer Learning

Datasets

IMDb Movie Reviews

WikiText-2

AG News

DBpedia

WikiText-103 Yelp

Results from the Paper

Edit

Ranked #4 on Text Classification on AG News

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Text Classification	AG News	ULMFiT	Error	5.01	# 4	Compare
Text Classification	DBpedia	ULMFiT	Error	0.80	# 6	Compare
Sentiment Analysis	IMDb	ULMFiT	Accuracy	95.4	# 15	Compare
Text Classification	TREC-6	ULMFiT	Error	3.6	# 5	Compare
Sentiment Analysis	Yelp Binary classification	ULMFiT	Error	2.16	# 7	Compare
Sentiment Analysis	Yelp Fine-grained classification	ULMFiT	Error	29.98	# 5	Compare

Methods

Add Remove

Activation Regularization • Adam • AWD-LSTM • Discriminative Fine-Tuning • DropConnect • Dropout • Embedding Dropout • LSTM • Sigmoid Activation • Slanted Triangular Learning Rates • Tanh Activation • Temporal Activation Regularization • ULMFiT • Variational Dropout • Weight Tying

Edit Social Preview

Universal Language Model Fine-tuning for Text Classification

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove