Faster transformers

Everybody is tired of the cost of working with transformers. This talk surveys the research that has been done to make them faster.

T3: High Performance Natural Language Processing

Better testing of NLP models

This talk by the author opens our eyes to better ways of testing our models.

Beyond Accuracy: Behavioral Testing of NLP Models with CheckList

Pre-trained models

Pre-trained Models for Natural Language Processing: A Survey

Language Models are Few-Shot Learners

It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

Transfer Learning In NLP - Part 2

A Primer in BERTology

Smaller models

Distilling models will remain a natural step for the foreseeable future, until we discover a way to train great small models directly.

Speeding Up Transformer Training and Inference By Increasing Model Size

Knowledge Distillation: A Survey

A Survey of Model Compression and Acceleration for Deep Neural Networks

Domain adaptation

Neural Unsupervised Domain Adaptation in NLP

Tricks For Domain Adaptation

Don't Stop Pretraining: Adapt Language Models to Domains and Tasks

Neural search

Pretrained Transformers for Text Ranking: BERT and Beyond

Generative models

Evaluation of Text Generation: A Survey

Dealing with scarcity of data

Revisiting Few-sample BERT Fine-tuning

Data Augmentation using Pre-trained Transformer Models

How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?

Knowledge graphs

A Survey on Knowledge Graphs

Language Models are Open Knowledge Graphs


All in all

