Materials for the EMNLP 2020 Tutorial on "Interpreting Predictions of NLP Models"

Eric-Wallace/interpretability-tutorial-emnlp2020

Interpreting Predictions of NLP Models

The tutorial was held on November 19th, 2020 on Zoom. The presenters were Eric Wallace, Matt Gardner, and Sameer Singh.

Slides

The PDF version of the slides is available here. The Google Drive version is here. Feel free to reuse any of our slides for your own purposes.

Video

The video is available here.

Abstract

Although neural NLP models are highly expressive and empirically successful, they also systematically fail in counterintuitive ways and are opaque in their decision-making process. This tutorial will provide a background on interpretation techniques, i.e., methods for explaining the predictions of NLP models. We will first situate example-specific interpretations in the context of other ways to understand models (e.g., probing, dataset analyses). Next, we will present a thorough study of example-specific interpretations, including saliency maps, input perturbations (e.g., LIME, input reduction), adversarial attacks, and influence functions. Alongside these descriptions, we will walk through source code that creates and visualizes interpretations for a diverse set of NLP tasks. Finally, we will discuss open problems in the field, e.g., evaluating, extending, and improving interpretation methods.
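To give a flavor of the saliency-map methods covered above, here is a minimal sketch of gradient-times-input attribution for a toy bag-of-words sentiment classifier. This is not the tutorial's own code: the vocabulary, weights, and model are hypothetical, chosen only so the example is self-contained and runnable.

```python
import numpy as np

# Hypothetical bag-of-words logistic-regression "sentiment" model.
vocab = ["great", "movie", "terrible", "plot"]
W = np.array([2.0, 0.1, -2.5, -0.3])  # per-word weights (illustrative)
b = 0.0

def predict(x):
    """Sigmoid probability of the positive class for a bag-of-words vector x."""
    return 1.0 / (1.0 + np.exp(-(W @ x + b)))

def saliency(x):
    """Gradient-times-input attribution.

    For logistic regression, d p / d x_i = p * (1 - p) * W_i; multiplying by
    x_i zeroes out words absent from the input, so |saliency| ranks the words
    that actually drove this prediction.
    """
    p = predict(x)
    return p * (1.0 - p) * W * x

x = np.array([1.0, 1.0, 0.0, 0.0])  # encodes the input "great movie"
scores = np.abs(saliency(x))
ranking = [vocab[i] for i in np.argsort(-scores)]
print(ranking[0])  # -> "great" (the word with the largest attribution)
```

Real interpretation toolkits compute the same quantity with automatic differentiation over a neural model's embedding layer; the linear model here just makes the gradient explicit.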

Paper

You can find our tutorial overview paper in the conference proceedings.

Citation

If you'd like to cite our tutorial, you can use the following citation:

@inproceedings{wallace2020interpreting,
  title={Interpreting Predictions of {NLP} Models},
  author={Wallace, Eric and Gardner, Matt and Singh, Sameer},
  booktitle={Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: Tutorial Abstracts},
  pages={20--23},
  year={2020}
}
