【25-Q4-生态建设】模型迁移-研发效能部-模型训练-在PyTorch框架上支持 Semi-weakly Supervised (SWSL) ResNet 在Cifar100上的训练#415
Open
x0212wwl wants to merge 3 commits intoTecorigin:mainfrom
Open
【25-Q4-生态建设】模型迁移-研发效能部-模型训练-在PyTorch框架上支持 Semi-weakly Supervised (SWSL) ResNet 在Cifar100上的训练#415x0212wwl wants to merge 3 commits intoTecorigin:mainfrom
x0212wwl wants to merge 3 commits intoTecorigin:mainfrom
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
● 当前软件栈版本:

● 源码参考链接:https://github.com/huggingface/pytorch-image-models
● commit id:x0212wwl@ https://github.com/x0212wwl
● 工作目录:PyTorch/build-in/classification/VovNet/
● 训练内容:使用1张TECO_AICARD_01芯片,在PyTorch框架上支持SWSL在Cifar100数据集上的训练。
● 运行脚本如下:
SDAA_VISIBLE_DEVICES=8,9,10,11 python weloTrain.py --arch swslnet --print_freq 1 --steps 100 --dataset cifar100 --datapath ./data --batch_size 32 --epochs 100 | tee swslnetCifar100Sdaa.log
● 100iters损失:

MeanRelativeError: 0.006542727579724056
MeanAbsoluteError: 0.012236000000000065
Rule,mean_relative_error 0.006542727579724056
pass mean_relative_error=0.006542727579724056 <= 0.05 or mean_absolute_error=0.012236000000000065 <= 0.0002