This project implements a Next-Word Generator using both MLP and LSTM neural network architectures. All models are trained on the Cleaned Indian Recipes Dataset from Kaggle.
It also includes an interactive Streamlit web app for text generation, deployed as Recipe Next Word Generator.
- MLP and LSTM Models: Choose between Multi-Layer Perceptron and LSTM for next-word generation.
- Interactive Streamlit App: User-friendly interface for generating text from a seed prompt, with customizable parameters such as context size, embedding dimension, activation function, random seed, and temperature.
- Pre-trained Models: No need to retrain; select from pre-trained variants.
- Word Embedding Visualization: Notebooks for t-SNE and other embedding visualizations.
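The temperature parameter listed above maps to a standard sampling trick: divide the model's logits by the temperature before applying softmax, so low temperatures sharpen the distribution and high temperatures flatten it. A minimal NumPy sketch (the function name and example logits are illustrative, not the app's actual code):

```python
import numpy as np

def sample_next_word(logits, temperature=1.0, rng=None):
    """Sample a word index from logits with temperature scaling.

    Lower temperature -> more deterministic (near-argmax) choices;
    higher temperature -> more diverse output.
    """
    rng = rng or np.random.default_rng(0)
    scaled = np.asarray(logits, dtype=float) / temperature
    scaled -= scaled.max()                      # numerical stability
    probs = np.exp(scaled) / np.exp(scaled).sum()
    return rng.choice(len(probs), p=probs)

# With a very low temperature the highest-logit word wins almost surely.
idx = sample_next_word([2.0, 0.5, -1.0], temperature=0.01)
```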
├── assets/
├── models/
│ ├── mlp/
│ └── lstm/
├── streamlit_app.py
├── MLP.ipynb
├── LSTM.ipynb
├── embeddings.ipynb
├── README.md
└── requirements.txt
- Clone the repository:

```sh
git clone https://github.com/ShardulJunagade/Next-Word-Generator.git
cd Next-Word-Generator
```

- Install dependencies:

```sh
pip install uv
uv pip install -r requirements.txt
```
- Run the Streamlit app:

```sh
streamlit run streamlit_app.py
```

The app will open in your browser. Enter a seed text or use the default, select a model and parameters, and generate text.
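Under the hood, next-word generation follows the usual sliding-window pattern: feed the last `context_size` words to the model, append its prediction, and repeat. A hypothetical sketch with a stand-in `predict_next` callable (not the project's real API):

```python
def generate(seed, predict_next, context_size=5, num_words=10):
    """Generate text by repeatedly predicting the next word.

    `predict_next` takes a list of the most recent `context_size`
    words and returns the predicted next word.
    """
    words = seed.split()
    for _ in range(num_words):
        context = words[-context_size:]  # sliding window over the tail
        words.append(predict_next(context))
    return " ".join(words)

# Toy "model" that always predicts the same word, for demonstration:
out = generate("add the", lambda ctx: "salt", context_size=3, num_words=2)
# out == "add the salt salt"
```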
- MLP.ipynb: Data processing, training, and evaluation for the MLP model.
- LSTM.ipynb: Data processing, training, and evaluation for the LSTM model.
- embeddings.ipynb: Visualization of learned word embeddings.
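A t-SNE projection like the one in embeddings.ipynb can be sketched as follows (toy random vectors stand in for the trained embedding matrix, and scikit-learn is assumed to be installed):

```python
import numpy as np
from sklearn.manifold import TSNE

rng = np.random.default_rng(0)
# 20 toy "word" vectors of dimension 32; in the notebook this would be
# the embedding matrix extracted from the trained MLP or LSTM model.
embeddings = rng.normal(size=(20, 32))

# perplexity must be smaller than the number of points
coords = TSNE(n_components=2, perplexity=5, random_state=0).fit_transform(embeddings)
print(coords.shape)  # one 2-D point per word, ready to scatter-plot
```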
This project is licensed under the MIT License. See the LICENSE file for details.