Skip to content

【25-Q4-生态建设】模型迁移-研发效能部-模型训练-在PyTorch框架上支持 Sequencer2D 在Cifar100上的训练#449

Open
x0212wwl wants to merge 3 commits intoTecorigin:mainfrom
x0212wwl:Sequencer2D
Open

【25-Q4-生态建设】模型迁移-研发效能部-模型训练-在PyTorch框架上支持 Sequencer2D 在Cifar100上的训练#449
x0212wwl wants to merge 3 commits intoTecorigin:mainfrom
x0212wwl:Sequencer2D

Conversation

@x0212wwl
Copy link

● 当前软件栈版本:
image

● 源码参考链接:https://github.com/huggingface/pytorch-image-models
● commit id:x0212wwl@ https://github.com/x0212wwl
● 工作目录:PyTorch/build-in/classification/Sequencer2D/
● 训练内容:使用1张TECO_AICARD_01芯片,在PyTorch框架上支持Sequencer2D在Cifar100数据集上的训练。
● 运行脚本如下:
SDAA_VISIBLE_DEVICES=8,9,10,11 python weloTrain.py --arch sequencer2D --print_freq 1 --steps 100 --dataset cifar100 --datapath ./data --batch_size 32 --epochs 100 | tee sequencer2DCifar100Sdaa.log

● 100iters损失:
image

MeanRelativeError: 0.0033275151217559127
MeanAbsoluteError: 0.014867000000000035
Rule,mean_relative_error 0.0033275151217559127
pass mean_relative_error=0.0033275151217559127 <= 0.05 or mean_absolute_error=0.014867000000000035 <= 0.0002

@x0212wwl x0212wwl changed the title 元碁智汇·定义训练未来-北邮&苏大-模型-在PyTorch框架上支持Sequencer2D在Cifar100上的训练 【25-Q4-生态建设】模型迁移-研发效能部-模型训练-在PyTorch框架上支持 Sequencer2D 在Cifar100上的训练 Dec 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant