Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

生成特定领域的高质量数据集 #581

Open
lzl-hello opened this issue Apr 14, 2024 · 0 comments
Open

生成特定领域的高质量数据集 #581

lzl-hello opened this issue Apr 14, 2024 · 0 comments

Comments

@lzl-hello
Copy link

请问我想生成“安全隐私”这方面的数据集,是只需要运行1.5M文件夹下的指令即可吗?关于prompt的编写有什么好的建议吗?我看你们在hugging face上也开源了“数学题”这类的数据集,我想知道是如何提示大模型来生成的?还有我想问一下运行该文件后会生成1.5M这么大的数据吗?大约需要多大内存和显存,我后续可能要租服务器来运行该项目,本机配置太低,因此先来问一下;感谢答复!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant