NLP-Text-Generation

NOTE: Final copy of this is to be put in https://github.com/gradient-ai/NLP-Text-Generation ; this one in nmb-paperspace is not the public one to be run as a tutorial.

Last updated: Sep 13th 2021

This is our GitHub repository for the Paperspace Gradient NLP Text Generation Tutorial example. It runs the GPT-2 model from HuggingFace: https://huggingface.co/gpt2 .

The example shows:

Text generation from a modern deep-learning-based natural language processing model, GPT-2
Gradient Projects linked to GitHub repositories
Gradient Workflows
Triggering a Workflow to rerun based upon a change in the repository, as needed in many production systems
Versioned Gradient-managed Datasets as output

The repo contains 2 files: nlp_text_generation.py in the main directory, and nlp_text_generation.yamlin the .gradient/workflows directory. The YAML file contains the Gradient Workflow which in turn calls the Python script.

The Workflow is triggered to run when the YAML file is present in the .gradient/workflows/ directory, and the repo is linked to the user's Gradient project. The Workflow clones this repo and then in turn calls the Python script. The script outputs the generated text to the file outputs.txt in the Gradient-managed Dataset demo-dataset, which the user can then view.

The Workflow runs on the Paperspace HuggingFace NLP container (paperspace/transformers-gpu:0.4.0).

Steps to run this tutorial

Clone the example repository

Assuming you are up and running with Gradient, this project runs as a sample repository, available when creating a repo-linked project under the Projects tab. In the illustrated list of projects in the right-hand panel, select the one for NLP Text Generation and follow the instructions to run the YAML.

Alternative method

You can also fork your own copy of this repo, then create a repo-linked project that points to the fork:

Navigate to https://github.com/gradient-ai/NLP-Text-Generation in your browser
In the resulting GitHub GUI page, click "Fork" in the top right
Follow the usual GitHub procedure by selecting your GitHub account to fork the repo to
Create a Project in Gradient that is linked to this repo, e.g., under the GUI Project tab, instead of the right-hand side illustrated boxes, use the left-hand side and select your repo from the dropdown list. This assumes that account has the Paperspace Gradient app installed, the same as in the fork-a-sample-repo method above.
Change any of the files in your repo to trigger the nlp_text_generation.yaml file under .gradient/workflows/ to run. For example, add a few characters to the readme.md. The result should be the same as above.

Note: When running the Workflow from a project linked to your own fork of the repo, it will still be cloning from the original location https://github.com/gradient-ai/NLP-Text-Generation, unless you choose to alter it, which is optional.

Altering the model settings and triggering a Workflow rerun

The ability to trigger Workflow reruns is useful in several situations, especially more MLOps and production-oriented ones where the state of the collection of code, data, deployments, models, and other components should be consistent.

Here, changing the model settings can be used to trigger a rerun of the model. The 4 values under "Settings" in the nlp_text_generation.py script (random seed, maximum text length, number of returned text sequences, and the initial text sentence) can be altered to generate different text.

If the resulting updated version of the nlp_text_generation.py file is uploaded to the repo main directory to replace the one present, and the project remains linked to the repo, the Workflow will be rerun. A new output.txt file is generated, and placed in a new version of the output Gradient-managed Dataset.

Next Steps

See the documentation page for this tutorial for some suggested next steps (e.g., you can run the newer+larger GPT-Neo instead of GPT-2).

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
.gradient/workflows		.gradient/workflows
README.md		README.md
nlp_text_generation.py		nlp_text_generation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

NLP-Text-Generation

Steps to run this tutorial

Altering the model settings and triggering a Workflow rerun

Next Steps

About

Uh oh!

Releases

Packages

Languages

nmb-paperspace/NLP-Text-Generation

Folders and files

Latest commit

History

Repository files navigation

NLP-Text-Generation

Steps to run this tutorial

Altering the model settings and triggering a Workflow rerun

Next Steps

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages