【25-Q4-生态建设】模型迁移-研发效能部-模型训练-在PyTorch框架上支持 Sequencer2D 在Cifar100上的训练#449
Open
x0212wwl wants to merge 3 commits intoTecorigin:mainfrom
Open
【25-Q4-生态建设】模型迁移-研发效能部-模型训练-在PyTorch框架上支持 Sequencer2D 在Cifar100上的训练#449x0212wwl wants to merge 3 commits intoTecorigin:mainfrom
x0212wwl wants to merge 3 commits intoTecorigin:mainfrom
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
● 当前软件栈版本:

● 源码参考链接:https://github.com/huggingface/pytorch-image-models
● commit id:x0212wwl@ https://github.com/x0212wwl
● 工作目录:PyTorch/build-in/classification/Sequencer2D/
● 训练内容:使用1张TECO_AICARD_01芯片,在PyTorch框架上支持Sequencer2D在Cifar100数据集上的训练。
● 运行脚本如下:
SDAA_VISIBLE_DEVICES=8,9,10,11 python weloTrain.py --arch sequencer2D --print_freq 1 --steps 100 --dataset cifar100 --datapath ./data --batch_size 32 --epochs 100 | tee sequencer2DCifar100Sdaa.log
● 100iters损失:

MeanRelativeError: 0.0033275151217559127
MeanAbsoluteError: 0.014867000000000035
Rule,mean_relative_error 0.0033275151217559127
pass mean_relative_error=0.0033275151217559127 <= 0.05 or mean_absolute_error=0.014867000000000035 <= 0.0002