Sentence Classification with Transformers

This project fine-tunes pre-trained models from Hugging Face's Transformers library for sentence classification and evaluates them with stratified k-fold cross-validation. It supports multiple classification tasks.

Features

Fine-tuning Hugging Face models for sentence classification.
Stratified k-fold cross-validation for robust evaluation.
Metrics computation: Precision, Recall, F1-Score, and Accuracy.
Support for multiple classification tasks.
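
As an illustration of the stratified k-fold idea, here is a minimal standard-library sketch. It is not taken from train_test_model.py; the function names stratified_folds and train_test_splits are hypothetical, and the script itself may rely on a library implementation such as scikit-learn's StratifiedKFold instead.

```python
import random
from collections import defaultdict


def stratified_folds(labels, k, seed=0):
    """Assign each example index to one of k folds, keeping class ratios similar."""
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)

    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal the shuffled indices of this class round-robin across the folds.
        for position, index in enumerate(indices):
            folds[position % k].append(index)
    return folds


def train_test_splits(labels, k, seed=0):
    """Yield (train_indices, test_indices) pairs, one per held-out fold."""
    folds = stratified_folds(labels, k, seed)
    for held_out in range(k):
        test = sorted(folds[held_out])
        train = sorted(
            index
            for fold_id, fold in enumerate(folds)
            if fold_id != held_out
            for index in fold
        )
        yield train, test


# Toy labels (made up for illustration): 8 negatives and 4 positives.
labels = [0] * 8 + [1] * 4
for train, test in train_test_splits(labels, k=4):
    positives = sum(labels[i] for i in test)
    print(len(train), len(test), positives)
```

With these toy labels every held-out fold contains three examples, one of them positive, so each fold mirrors the 2:1 class ratio of the full set.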

Requirements

Install the required Python packages using:

pip install -r requirements.txt

Usage

Run the script with the following arguments:

python train_test_model.py <dataset_name> <output_dir_name> <model_name> <num_train_epochs>

dataset_name: Path to the dataset file (tab-separated).
output_dir_name: Directory to save the model and tokenizer.
model_name: Hugging Face model name (e.g., bert-base-uncased).
num_train_epochs: Number of training epochs.
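
The four positional arguments above could be parsed along these lines. This is a sketch only: how train_test_model.py actually reads its arguments is not shown here, build_parser is a hypothetical helper, and treating num_train_epochs as an integer is an assumption.

```python
import argparse


def build_parser():
    """Build a parser for the four positional arguments described above."""
    parser = argparse.ArgumentParser(prog="train_test_model.py")
    parser.add_argument("dataset_name", help="Path to the tab-separated dataset file")
    parser.add_argument("output_dir_name", help="Directory to save the model and tokenizer")
    parser.add_argument("model_name", help="Hugging Face model name, e.g. bert-base-uncased")
    # Assumption: epochs are given as a whole number.
    parser.add_argument("num_train_epochs", type=int, help="Number of training epochs")
    return parser


# Parse an explicit argument list instead of sys.argv so the sketch runs anywhere.
args = build_parser().parse_args(
    ["data/dataset.tsv", "output", "bert-base-uncased", "3"]
)
print(args.dataset_name, args.output_dir_name, args.model_name, args.num_train_epochs)
```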

Example

python train_test_model.py data/dataset.tsv output bert-base-uncased 3

Dataset Format

The dataset should be a tab-separated file with the following structure:

sentence<TAB>fully_curatable<TAB>partially_curatable<TAB>language_related
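
A file in this layout can be read with the standard csv module, as sketched below. Two assumptions are made that the description above does not confirm: the first row is a header carrying the column names, and each label is stored as the integer 0 or 1. The helper read_dataset and the sample rows are hypothetical.

```python
import csv
import io

LABEL_COLUMNS = ["fully_curatable", "partially_curatable", "language_related"]


def read_dataset(handle):
    """Return (sentences, labels); labels maps each task name to a list of ints."""
    # QUOTE_NONE keeps quotation marks inside sentences as literal characters.
    reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    sentences = []
    labels = {name: [] for name in LABEL_COLUMNS}
    for row in reader:
        sentences.append(row["sentence"])
        for name in LABEL_COLUMNS:
            # Assumption: labels are encoded as 0 or 1.
            labels[name].append(int(row[name]))
    return sentences, labels


# In-memory sample with made-up rows, standing in for a real .tsv file.
sample = io.StringIO(
    "sentence\tfully_curatable\tpartially_curatable\tlanguage_related\n"
    "First example sentence.\t1\t0\t0\n"
    "Second example sentence.\t0\t0\t1\n"
)
sentences, labels = read_dataset(sample)
print(len(sentences), labels["fully_curatable"], labels["language_related"])
```

For a file on disk, pass open(path, newline="", encoding="utf-8") as the handle.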

Output

Fine-tuned models and tokenizers are saved in the specified output directory.
Evaluation metrics are printed for each classification task.
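
For reference, the four reported metrics can be computed per task as in the sketch below. It assumes binary labels with 1 as the positive class; whether the script uses this definition or an averaged variant is not stated here. The function binary_metrics and the toy predictions are hypothetical.

```python
def binary_metrics(y_true, y_pred):
    """Compute precision, recall, F1 and accuracy for binary labels (positive = 1)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    correct = sum(1 for t, p in pairs if t == p)

    # Fall back to 0.0 when a denominator would be zero.
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    accuracy = correct / len(pairs) if pairs else 0.0
    return {"precision": precision, "recall": recall, "f1": f1, "accuracy": accuracy}


# Made-up gold labels and predictions for two of the tasks.
predictions_by_task = {
    "fully_curatable": ([1, 0, 1, 1], [1, 0, 0, 1]),
    "language_related": ([0, 0, 1, 0], [0, 1, 1, 0]),
}
results = {}
for task, (y_true, y_pred) in predictions_by_task.items():
    results[task] = binary_metrics(y_true, y_pred)
    print(task, {name: round(value, 3) for name, value in results[task].items()})
```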

Pre-trained Models

Pre-trained models can be found on Hugging Face.
