Using NLTK Data with Heroku Python
Last updated June 03, 2020
This article is a work in progress, or documents a feature that is not yet released to all users. This article is unlisted. Only those with the link can access it.
Table of Contents
NLTK is a platform for building Python programs to work with human language data. In order to use it, you have to download corpora and make it available to your application. The Heroku Python buildpack has built-in support for this with the nltk.txt
file.
Example nltk.txt
file
wordnet
pros_cons
reuters
This will tell Heroku to automatically download these three corpora and install them into your application at build time. Make sure you have nltk
specified in your Pipfile
or requirements.txt
!