Hỏi cách huấn luyện trên tập dữ liệu tùy chỉnh

Cho mình hỏi là: Mình có 1 tập dữ liệu viết tay tầm 1300 hình. Mỗi hình là 1 chữ viết tay tiếng Nhật Kanji.

Mình muốn train model OCR cho tập dữ liệu này thì mình nên chỉnh sửa như thế nào ?

Hiện tại thì mình có làm như sau:
1. Chuẩn bị dataset để train, valid 
2. Sửa đổi config->vocab: thay bằng từ tiếng Nhật có tập dataset

Không rõ mình có cần phải sửa đổi model không nhỉ ? Hiện tại default mình thấy đang dùng VGG19 làm backbone.

Hiện tại mình đang theo hướng fine tune từ pre-tranined model. Đây là config của mình 
![CleanShot 2023-05-05 at 23 08 31](https://user-images.githubusercontent.com/16741872/236510477-5b5d5157-2e02-434d-b6f1-060ae8612072.png)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hỏi cách huấn luyện trên tập dữ liệu tùy chỉnh #99

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Hỏi cách huấn luyện trên tập dữ liệu tùy chỉnh #99

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions