
[25-Q4 Ecosystem Building] Model Migration - R&D Efficiency Department - Model Training - Support SigLIP2 training on Cifar100 with the PyTorch framework #450

Open
x0212wwl wants to merge 4 commits into Tecorigin:main from x0212wwl:SigLIP2


@x0212wwl

● Current software stack version:
image

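● Source code reference: https://github.com/huggingface/pytorch-image-models
● commit id: x0212wwl@ https://github.com/x0212wwl
● Working directory: PyTorch/build-in/classification/SigLIP2/
● Training task: using one TECO_AICARD_01 chip, support training SigLIP2 on the Cifar100 dataset with the PyTorch framework.
● Run the training with the following command: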
SDAA_VISIBLE_DEVICES=8,9,10,11 python weloTrain.py --arch siglip2 --print_freq 1 --steps 100 --dataset cifar100 --datapath ./data --batch_size 32 --epochs 100 | tee siglip2Cifar100Sdaa.log

● Loss over 100 iterations:
image

MeanRelativeError: -0.011135153214594594
MeanAbsoluteError: -0.07194500000000004
Rule,mean_absolute_error -0.07194500000000004
pass mean_relative_error=-0.011135153214594594 <= 0.05 or mean_absolute_error=-0.07194500000000004 <= 0.0002
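
The pass rule printed above can be sketched roughly as follows. This is a minimal illustration, not the comparison script used for this PR: the function name `compare_losses`, the signed-error definitions (test minus reference, with the relative error normalized by the reference loss), and the idea that the two inputs are per-iteration loss lists are all assumptions. Only the thresholds 0.05 and 0.0002 and the "or" between the two conditions come from the log line.

```python
from statistics import mean


def compare_losses(test_losses, ref_losses, rel_tol=0.05, abs_tol=0.0002):
    """Compare two per-iteration loss curves (hypothetical helper).

    Assumed definitions (not confirmed by the PR):
      mean_absolute_error = mean(test - ref)          # signed
      mean_relative_error = mean((test - ref) / ref)  # signed
    The run passes if either value is within its threshold,
    mirroring the "<= 0.05 or <= 0.0002" rule in the log.
    """
    if not test_losses or len(test_losses) != len(ref_losses):
        raise ValueError("loss lists must be non-empty and of equal length")

    diffs = [t - r for t, r in zip(test_losses, ref_losses)]
    # A reference loss of exactly 0 would raise ZeroDivisionError here.
    mean_relative_error = mean(d / r for d, r in zip(diffs, ref_losses))
    mean_absolute_error = mean(diffs)

    passed = mean_relative_error <= rel_tol or mean_absolute_error <= abs_tol
    return mean_relative_error, mean_absolute_error, passed


if __name__ == "__main__":
    # Made-up numbers for illustration only; not taken from this PR.
    rel, abs_err, ok = compare_losses([4.60, 4.55, 4.50], [4.62, 4.58, 4.51])
    print(f"MeanRelativeError: {rel}")
    print(f"MeanAbsoluteError: {abs_err}")
    print("pass" if ok else "fail")
```

Under these signed definitions, a test curve that sits below the reference yields negative errors, which is consistent with the negative values reported in the log.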

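@x0212wwl x0212wwl changed the title from "元碁智汇: Defining the Future of Training - BUPT & Soochow University - Model - Support SigLIP2 training on Cifar100 with the PyTorch framework" to "[25-Q4 Ecosystem Building] Model Migration - R&D Efficiency Department - Model Training - Support SigLIP2 training on Cifar100 with the PyTorch framework" on Dec 24, 2025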