Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

关于数据集制作 #8

Open
KeyaoZhao opened this issue May 23, 2023 · 3 comments
Open

关于数据集制作 #8

KeyaoZhao opened this issue May 23, 2023 · 3 comments

Comments

@KeyaoZhao
Copy link

您好!我想请问一下在新的小数据集上finetune时可能会遗忘学过的LAION 400M,导致finetune后的模型泛化性能下降。所以我计划在finetune时也加入部分LAION 400M数据,但是我使用LAION 400M聚类到1M时的类id可能和您训练时的不同,这是否会产生冲突呢?请问数据集的这些信息或者原始的制作方法您可以公布下吗?非常感谢~

@anxiangsir
Copy link
Collaborator

马上会把,做数据集的脚本和100w类中心的权重放出来。

@hbchen121
Copy link

请问能否先release一下400M特征时的聚类算法?想学习一下数据集如何制作,非常感谢!

@hbchen121
Copy link

马上会把,做数据集的脚本和100w类中心的权重放出来。

你好,请问有最近的计划了吗

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants