Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About running the construction task. #2

Open
MurakamiNatsuki opened this issue Nov 1, 2024 · 1 comment
Open

About running the construction task. #2

MurakamiNatsuki opened this issue Nov 1, 2024 · 1 comment

Comments

@MurakamiNatsuki
Copy link

Hello!
Thank you for your interesting research!
I would like to try running the construction task, but could you please explain the method in a bit more detail?

@cnsdqd-dyb
Copy link
Owner

Thank you for your question. Recently, I've been attempting to fine-tune using reinforcement learning paradigms and planning to potentially open-source the fine-tuned small models, which caused me to overlook some issues.

Regarding the construction task, we provide 100 building blueprints that are described using blocks. Multiple agents analyze these blueprints and task instructions as input, then break down and execute the construction tasks. Their building process closely mirrors real construction processes, with strict constraints on block placement direction and position.

When running the construction task, we create multiple agents that join the server, along with a build_judger that enters in spectator mode to track and calculate task metrics. The task ends when either the metrics are completed or a timeout occurs.

This is a general overview of our task testing approach. If you have any specific questions about the details, please feel free to ask. Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants