ToxicCommentClassification

This project implements the Kaggle Toxic Comment Classification Challenge and is hosted here as a consumable application.

This project has three parts:

Preprocessing
Modeling
Dash Application

Preprocessing:

Removed stopwords, punctuation, blank lines, and some URLs, hyperlinks, and IP addresses from the input texts. Used WordNetLemmatizer to lemmatize the words and GloVe 100d word vectors as embeddings.
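The cleaning steps described above can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the code in preprocess.py: the function name `clean_comment`, the regular expressions, and the tiny stopword set are assumptions made for the example. The WordNetLemmatizer and GloVe embedding steps are left out because they need external data downloads.

```python
import re
import string

# Tiny illustrative stopword set (assumption); a real run would use a fuller list.
STOPWORDS = {"a", "an", "the", "is", "are", "to", "of", "and", "in", "at", "you", "this"}

# Hypothetical patterns for URLs and IPv4-style addresses.
URL_RE = re.compile(r"(?:https?://|www\.)\S+")
IP_RE = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")

# Map every punctuation character to a space so words stay separated.
PUNCT_TABLE = str.maketrans(string.punctuation, " " * len(string.punctuation))


def clean_comment(text: str) -> list[str]:
    """Lowercase a comment, strip URLs, IPs and punctuation, drop stopwords."""
    text = text.lower()
    text = URL_RE.sub(" ", text)        # remove URLs and hyperlinks
    text = IP_RE.sub(" ", text)         # remove IP addresses
    text = text.translate(PUNCT_TABLE)  # remove punctuation
    # split() also collapses blank lines and repeated whitespace
    return [tok for tok in text.split() if tok not in STOPWORDS]


tokens = clean_comment("Visit http://example.com NOW!\n\nYou are at 192.168.0.1, idiot.")
print(tokens)
```

In a fuller pipeline, each remaining token would then be lemmatized and mapped to its 100-dimensional GloVe vector before being fed to the models.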

Models:

Built three different models using Keras: a CNN, an RNN, and a Naive Bayes SVM. The outputs of these models are stacked to produce the final output.
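As a rough illustration of stacking, the sketch below fits a small meta-learner on the predicted probabilities of three base models for a single label. The choice of logistic regression as the meta-learner, the function names `fit_stacker` and `predict_stacked`, and the toy numbers are all assumptions for the example; the README does not say how the outputs are actually combined.

```python
import math


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


def fit_stacker(base_preds, labels, lr=0.5, epochs=2000):
    """Fit a logistic-regression meta-learner with batch gradient descent.

    base_preds: one row per comment, one column per base model.
    labels: 0/1 ground truth for a single toxicity label.
    """
    n_models = len(base_preds[0])
    n = len(labels)
    weights = [0.0] * n_models
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_models
        grad_b = 0.0
        for row, y in zip(base_preds, labels):
            p = sigmoid(sum(w * x for w, x in zip(weights, row)) + bias)
            err = p - y  # gradient of the log loss w.r.t. the logit
            for j, x in enumerate(row):
                grad_w[j] += err * x
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


def predict_stacked(row, weights, bias):
    """Combine one comment's base-model probabilities into a final probability."""
    return sigmoid(sum(w * x for w, x in zip(weights, row)) + bias)


# Hypothetical held-out probabilities from [CNN, RNN, NB-SVM] for one label.
held_out = [[0.9, 0.8, 0.7], [0.2, 0.1, 0.3], [0.8, 0.9, 0.6], [0.1, 0.3, 0.2]]
labels = [1, 0, 1, 0]

weights, bias = fit_stacker(held_out, labels)
print(predict_stacked([0.85, 0.9, 0.8], weights, bias))
```

The meta-learner should be fitted on predictions for data the base models did not train on; otherwise it tends to overweight whichever base model overfits most.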

Dash Application:

A consumable UI is built with Dash and hosted on Heroku.


GopalSeshadri/ToxicCommentClassification
