Tutorials

Interested in getting started in a new NLP area? Here are some tutorials to help get started.

Data Loading and Vocabularies

Data Loading APIsnotes/data_api.html

Basics on how to load and process the sentiment dataset to form batches that can be processed efficiently.

Vocabulary APIsnotes/vocab_emb.html

Learn how to write simple code to create index for tokens.

Representation Learning

Using Pre-trained Word Embeddingsword_embedding/word_embedding.html

Basics on how to use word embedding with vocab in GluonNLP and apply it on word similarity and analogy problems.

Word Embeddings Training and Evaluationword_embedding/word_embedding_training.html

Learn how to train fastText and word2vec embeddings on your own dataset, and determine embedding quality through intrinsic evaluation.

Extracting Sentence Features with Pre-trained ELMosentence_embedding/elmo_sentence_representation.html

See how to use GluonNLP’s API to automatically download the pre-trained ELMo model, and extract features from it.

Fine-tuning Pre-trained BERT Modelssentence_embedding/bert.html

See how to use GluonNLP to fine-tune a sentence pair classification model with pre-trained BERT parameters.

Language Modeling

LSTM-based Language Modelslanguage_model/use_pretrained_lm.html

Learn what a language model is, what it can do, and how to train a word-level language model with truncated back-propagation-through-time (BPTT).

Machine Translation

Training GNMT on IWSLT 2015 Datasetmachine_translation/gnmt.html

Learn how to train Google Neural Machine Translation, a seq2seq with attention model.

Using Pre-trained Transformermachine_translation/transformer.html

Learn how to use a pre-trained transformer translation model for English-German translation.

Sentiment Analysis

Fine-tuning LSTM-based Language Modelsentiment_analysis/sentiment_analysis.html

See how to fine-tune a pre-trained language model to perform sentiment analysis on movie reviews.

Training Structured Self-attentive Sentence Embeddingsentiment_analysis/self_attentive_sentence_embedding.html

See how to use GluonNLP to build more advanced model structure for extracting sentence embeddings to predict Yelp review rating.

Text Generation

Sequence Generation with Beam Search Sampler and Sequence Samplersequence_sampling/sequence_sampling.html

Learn how to generate sentence from pre-trained language model through sampling and beam search.