- The bitahub environment cannot run app.py; use the sample scripts for inference instead.
- NSA sparse attention requires torch 2.9 + CUDA 12.6, which may fail to install; resolved by using torch 2.7.1.
- OSA sparse attention (OpenAI) needs hand-written Triton or CUDA kernels to actually get a speedup. TODO
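For reference, the effect OSA-style sparsity aims for can be sketched densely in NumPy: per query, keep only the top-k attention scores and softmax over those. This is an illustrative sketch of mine, not the OSA kernel; a dense mask like this computes all scores anyway and gives no speedup, which is exactly why a fused Triton/CUDA kernel is needed.

```python
import numpy as np

def topk_sparse_attention(q, k, v, topk=2):
    """Dense reference for top-k sparse attention: per query row,
    keep only the topk largest scores, mask the rest to -inf,
    then softmax and mix values. Ties may keep a few extra entries."""
    scores = q @ k.T / np.sqrt(q.shape[-1])            # (Tq, Tk)
    # threshold = the topk-th largest score in each row
    kth = np.sort(scores, axis=-1)[:, -topk][:, None]
    scores = np.where(scores >= kth, scores, -np.inf)  # drop non-top-k
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)
    return w @ v

rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((4, 8)) for _ in range(3))
out = topk_sparse_attention(q, k, v, topk=2)
print(out.shape)  # (4, 8)
```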
git config --global user.email "shuaic@mail.ustc.edu.cn"
git config --global user.name "Chen Shuai"
git config --global --unset http.proxy
git config --global --unset https.proxy
pip config set global.index-url https://pypi.tuna.tsinghua.edu.cn/simple
source ./venv/bin/activate
GPT prefill time: 0.849 s
GPT decode time: 14.263 s
Full sampling takes about 15.23 s; the VQ decoder takes about 0.77 s.
Naive inference: 4.37 s/prompt
COCO2014 eval:
FID (256px): 15.138
CLIP score: 0.3203
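As I understand it, the CLIP score above is the mean cosine similarity between CLIP image and text embeddings of each generated-image/caption pair. A minimal sketch with made-up embeddings (the real metric uses actual CLIP encoder outputs):

```python
import numpy as np

def clip_score(img_emb, txt_emb):
    """Mean cosine similarity over paired rows of image/text embeddings."""
    i = img_emb / np.linalg.norm(img_emb, axis=-1, keepdims=True)
    t = txt_emb / np.linalg.norm(txt_emb, axis=-1, keepdims=True)
    return float((i * t).sum(axis=-1).mean())

# toy check: identical embeddings give a score of 1.0
e = np.random.default_rng(0).standard_normal((5, 512))
print(clip_score(e, e))  # ~1.0
```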
PYTHONPATH=. python3 autoregressive/sample/sample_t2i.py \
    --vq-ckpt ./pretrained_models/vq_ds16_t2i.pt \
    --gpt-ckpt ./pretrained_models/t2i_XL_stage1_256.pt \
    --gpt-model GPT-XL \
    --image-size 256
PYTHONPATH=. python3 autoregressive/sample/sample_c2i.py \
    --gpt-model ddGPT-L \
    --gpt-ckpt ./pretrained_models/ddllamagen-L.pt \
    --vq-ckpt ./pretrained_models/vq_ds16_c2i.pt \
    --image-size 256 \
    --precision fp16
PYTHONPATH=. python3 autoregressive/train/train_t2i.py \
    --data-path /data/ChenShuai/coco2014/annotations \
    --t5-feat-path /data/ChenShuai/coco2014/t5 \
    --vq-ckpt ./pretrained_models/vq_ds16_t2i.pt \
    --results-dir /output/t2i_XL_512 \
    --global-batch-size 2 \
    --dataset t2i \
    --image-size 256 \
    --mixed-precision fp16 \
    --gpt-model GPT-B \
    --debug \
    --no-compile
(alternative model: --gpt-model Flash-GPT-B)
bash ./scripts/autoregressive/train_t2i.sh
PYTHONPATH=. python3 language/extract_t5_feature.py \
    --data-path /data/ChenShuai/coco2014/annotations \
    --t5-path /data/ChenShuai/coco2014/t5 \
    --data-start 0 --data-end 5000 \
    --t5-model-path ./pretrained_models/t5-ckpt
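Since extract_t5_feature.py takes --data-start/--data-end, the dataset can be sharded into fixed-size chunks and extracted in separate runs. A small helper of my own (not part of the repo) to print the ranges:

```python
def chunk_ranges(total, chunk=5000):
    """Yield (start, end) pairs covering [0, total) in chunk-sized
    slices, matching the --data-start/--data-end convention above."""
    for start in range(0, total, chunk):
        yield start, min(start + chunk, total)

for s, e in chunk_ranges(12000):
    print(f"--data-start {s} --data-end {e}")
# --data-start 0 --data-end 5000
# --data-start 5000 --data-end 10000
# --data-start 10000 --data-end 12000
```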
PYTHONPATH=. bash ./scripts/autoregressive/sample_t2i_coco.sh
PYTHONPATH=. python3 ./evaluations/t2i/evaluation.py \
    --fake_dir /data/ChenShuai/coco2014/val/GPT-XL-t2i_XL_stage1_256-coco_captions-size-256-size-256-VQ-16-topk-1000-topp-1.0-temperature-1.0-cfg-7.5-seed-0 \
    --ref_dir /data/ChenShuai/coco2014/val \
    --ref_data coco2014 \
    --ref_type val
PYTHONPATH=. python3 app.py \
    --gpt-model GPT-B \
    --gpt-type c2i \
    --precision fp16