Fix: Implemented Label Encoding and Unified ML Models using Pipelines #139
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Fix: Implemented Label Encoding and Unified ML Models using Pipelines
π οΈ Pull Request Template
π·οΈ PR Type
π Related Issue
π Rationale / Motivation
This PR fully resolves Issue #68, which reported errors and instability in the classical machine learning model training workflow.
LabelEncoder.Pipelineobject.The changes significantly improve stability, data consistency, and code clarity.
β¨ Description of Changes
Core Fixes Applied:
sklearn.preprocessing.LabelEncoderto convert the 6 unique string labels to integers across all affected files.sklearn.pipeline.Pipeline, combiningFiles Modified:
scripts/fake_news_logreg_rf.py: Implemented Pipelines formodule/liar-data-analysis.py: Implemented Label Encoding and Pipelines for analysis examples.module/fake-news-detection-using-nb.ipynb: Applied Label Encoding and theπ§ͺ Testing Instructions
.md, confusion matrix PNGs) should be generated in theresults/directory.π Impact Assessment
β‘ Checklist
None. This is an internal fix that preserves the input/output of the scripts.
π― Priority / Impact Level