A consultation chatbot trained with PyTorch, Hugging Face Transformers, and KLUE-RoBERTa-base on the AIHub sentiment conversation dataset and Songys's chatbot data.
On 20 May 2021, the KLUE (Korean Language Understanding Evaluation) paper was published. Along with it, the authors released pretrained language models (PLMs): KLUE-BERT and KLUE-RoBERTa.
I wanted to fine-tune the KLUE-RoBERTa model, a masked language model (MLM), for an autoregressive task such as language generation. BERT-style embeddings, trained with bidirectional attention, can capture context more effectively than left-to-right models such as GPT-2, so I wanted to build a conversational chatbot on top of them. Training such a model was hard, however: BERT has no decoder architecture, so it cannot be used directly for generation. I found a breakthrough in the paper Leveraging Pre-trained Checkpoints for Sequence Generation Tasks, and its approach is available in Hugging Face Transformers through the EncoderDecoderModel API, as sketched below.
- KLUE-RoBERTa-base : used as the pretrained model and tokenizer.
- EncoderDecoderModel : used to initialize a sequence-to-sequence model with KLUE-RoBERTa-base as both the encoder and the decoder (see the sketch after this list).
- The final model is a RoBERTa-to-RoBERTa model.
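
The following is a minimal sketch of this warm-starting step, assuming the `klue/roberta-base` checkpoint on the Hugging Face Hub. The example utterances are illustrative, and the actual preprocessing in `train.py` may differ.

```python
from transformers import AutoTokenizer, EncoderDecoderModel

# Load the KLUE-RoBERTa tokenizer; the same vocabulary serves encoder and decoder.
tokenizer = AutoTokenizer.from_pretrained("klue/roberta-base")

# Warm-start a seq2seq model from two klue/roberta-base checkpoints.
# The decoder copy gets cross-attention layers and a causal LM head added.
model = EncoderDecoderModel.from_encoder_decoder_pretrained(
    "klue/roberta-base", "klue/roberta-base"
)

# Generation needs these ids on the shared config; RoBERTa-style checkpoints
# conventionally reuse CLS/SEP as the start/end-of-sequence tokens.
model.config.decoder_start_token_id = tokenizer.cls_token_id
model.config.eos_token_id = tokenizer.sep_token_id
model.config.pad_token_id = tokenizer.pad_token_id

# One fine-tuning step: given `labels`, the model shifts them right internally
# to build decoder inputs and returns a cross-entropy loss.
enc = tokenizer("요즘 너무 우울해요.", return_tensors="pt")              # illustrative user utterance
labels = tokenizer("많이 힘드셨겠어요.", return_tensors="pt").input_ids  # illustrative target reply
loss = model(input_ids=enc.input_ids,
             attention_mask=enc.attention_mask,
             labels=labels).loss
```

Because both halves start from pretrained weights, only the cross-attention layers are initialized randomly, which is what makes warm-starting effective in the paper above.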
# Clone the repository and install dependencies.
git clone https://github.com/chaeyoon-jang/RobertaChatbot.git
pip install -r requirements.txt

# Fine-tune the RoBERTa-to-RoBERTa model.
python train.py --batch_size=32 --n_epoch=20 --lr=5e-5

# Chat with the trained model, or evaluate it on the test set.
python print_result.py --chat=CHAT
python print_result.py --test=TEST
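
Under the hood, the chat mode amounts to ordinary seq2seq generation. A minimal sketch, assuming the fine-tuned model was saved with `save_pretrained` (the checkpoint path here is hypothetical):

```python
from transformers import AutoTokenizer, EncoderDecoderModel

# Hypothetical path; use wherever train.py saved the fine-tuned model.
model = EncoderDecoderModel.from_pretrained("./checkpoints/roberta2roberta")
tokenizer = AutoTokenizer.from_pretrained("klue/roberta-base")

inputs = tokenizer("회사 일이 너무 힘들어요.", return_tensors="pt")  # illustrative user utterance
reply_ids = model.generate(
    inputs.input_ids,
    attention_mask=inputs.attention_mask,
    max_length=64,
    num_beams=5,
)
print(tokenizer.decode(reply_ids[0], skip_special_tokens=True))
```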
- AIHub sentiment conversation dataset
- KLUE (Korean Language Understanding Evaluation)
- Leveraging Pre-trained Checkpoints for Sequence Generation Tasks
