请问在1.45k步数下,未出现注意力模型,但是loss已经低于0.3,是否说明样本太少? #864
Unanswered
fatinghenji
asked this question in
Q&A
Replies: 1 comment
-
放弃吧,我记得作者在之前的回答里说至少要5个G还是说要100个小时来着,所以除非音源是天天演讲的知名人物,但是这样的人是不可能会让你练模型的,拿那个75K的模型做一些增量训练效果会非常好的 |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
RT,补充图像如下:


总样本库约1.5小时,从头开始训练。
Beta Was this translation helpful? Give feedback.
All reactions