Sem6_NLP_Project

Multi-label multi-class classification(tag prediction) of Stack Overflow questions along with Topic Modelling. Tag prediction model is deployed locally and users can enter their questions in the Streamlit web app and get the predicted tags. Two approaches have been tried here, supervised - through the classification model and unsupervised through topic modelling.

Supervised approach - Different base classifiers were tried for the tag prediction model - Logistic Regression, Decision Trees, SGDClassifier and different wrappers over these simple classifiers(that do not natively support multi-target classification) - MultiOutputClassifier, BinaryRelevance, ClassifierChain, LabelPowerset, etc.

Unsupervised approach - In topic modelling, Latent Dirichlet Allocation (LDA) models have been used - LDAMultiCore & LDAMallet . Coherence Model and visualization through pyLDAvis have been used to evaluate how distinct and separable the generated topics are for each LDA model.

Both the approaches can be used and compared. Either one of them can be used finally or better, both can be used together. The predicted tags can be used to filter the topics and the user can get a more descriptive tag, so like instead of just python tag getting predicted something like python - TypeError would be more helpful.

Link to Kaggle dataset - https://www.kaggle.com/datasets/stackoverflow/stacksample

To run the streamlit app, just open up the terminal in the folder that contains the streamlit app python file and execute the command - streamlit run streamlit_tag_pred_app.py

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
NLP_Project_final.ipynb		NLP_Project_final.ipynb
README.md		README.md
Topic_Modelling_final.ipynb		Topic_Modelling_final.ipynb
streamlit_tag_pred_app.py		streamlit_tag_pred_app.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sem6_NLP_Project

About

Uh oh!

Releases

Packages

Languages

Anmol967/Stack-Overflow-tags-prediction

Folders and files

Latest commit

History

Repository files navigation

Sem6_NLP_Project

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages