Skip to content

Only save necessary parameters#32

Open
Line290 wants to merge 1 commit intoTsinghuaAI:mainfrom
Line290:patch-1
Open

Only save necessary parameters#32
Line290 wants to merge 1 commit intoTsinghuaAI:mainfrom
Line290:patch-1

Conversation

@Line290
Copy link

@Line290 Line290 commented Dec 21, 2020

@Line290
Copy link
Author

Line290 commented Dec 21, 2020

感觉qkv那边应该分的不对,有些feedforward的bias也没有分,
我之前用的脚本:https://github.com/Line290/CPM-ckpts-conversion

@zzy14
Copy link
Contributor

zzy14 commented Dec 25, 2020

感谢!我才看到,昨天我自己修了🤣

@Line290
Copy link
Author

Line290 commented Dec 28, 2020

没事哈,对了,还有一点就是最上面说的,torch tensor slice不会创建新的tensor,所以会把slice之前的整个tensor存下,会在磁盘上多存一些冗余的参数,不过不影响程序运行。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants