font-diffusion

Install the dependencies listed in requirements.txt (for example, with pip install -r requirements.txt).

Use generate_dataset.py to build the dataset JSONL file and the image directories from font_dataset.csv.
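As a rough illustration of the CSV-to-JSONL step, here is a minimal sketch. It is not the contents of generate_dataset.py: the function name `build_dataset`, the column names `font_name` and `caption`, the output file name `dataset.jsonl`, and the `images/` layout are all assumptions made for the example, and it only creates empty per-font directories rather than rendering any images.

```python
import csv
import json
from pathlib import Path


def build_dataset(csv_path, out_dir):
    """Write one JSONL record per CSV row and create one image directory per font.

    Hypothetical helper: the column names and output layout are assumptions,
    not the actual behavior of generate_dataset.py.
    """
    out_dir = Path(out_dir)
    images_root = out_dir / "images"
    images_root.mkdir(parents=True, exist_ok=True)
    jsonl_path = out_dir / "dataset.jsonl"

    count = 0
    with open(csv_path, newline="", encoding="utf-8") as src, \
            open(jsonl_path, "w", encoding="utf-8") as dst:
        for row in csv.DictReader(src):
            # Assumed columns: "font_name" and "caption".
            font_dir = images_root / row["font_name"]
            font_dir.mkdir(exist_ok=True)
            record = {
                # Store the directory relative to out_dir, with forward slashes.
                "image_dir": font_dir.relative_to(out_dir).as_posix(),
                "text": row["caption"],
            }
            dst.write(json.dumps(record) + "\n")
            count += 1
    return count
```

Calling `build_dataset("font_dataset.csv", "out")` under these assumptions would leave `out/dataset.jsonl` with one JSON object per line and one subdirectory of `out/images/` per font.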

x_clip_playground.ipynb contains our work on training xclip.

TrORC_playground.ipynb contains our captioning experiments with various pretrained TrOCR models.

clip_train.py is a Python script containing our work toward training a CLIP model.

vit_train.py and vit-playground.ipynb are our attempts to train a ViT from scratch.

cog-stable-diffusion-custom contains the files we altered in, or added to, the cog-stable-diffusion repo in order to integrate our custom datasets with its BLIP framework.