Question about the rule-based criteria for GUI-Critic-R1 dataset construction

Hello,

Thanks for your great work. I have a question regarding the construction of the critic dataset as described in the paper. I would like to understand the **specifics of the rule-based criteria** used during the Negative Operations Sampling stage.

- Is the evaluation based on a direct match with the ground truth operations?

- For actions formatted as "click + coordinate," how is correctness judged? Any coordinate that falls within the correct bounding box is technically a valid action, so how do your evaluation criteria handle this?
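
To make the second point concrete, here is a minimal sketch of the two criteria I can imagine for click actions. Everything in it is my own assumption for illustration: the function names `exact_match` and `in_bbox`, the `(x, y)` point format, and the `(x_min, y_min, x_max, y_max)` box layout are hypothetical and not taken from the paper or the repository.

```python
from typing import Tuple

# Hypothetical formats, assumed only for this illustration.
Point = Tuple[float, float]               # (x, y) click coordinate
BBox = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def exact_match(pred: Point, gt: Point) -> bool:
    """Criterion A: the predicted click must equal the ground-truth coordinate."""
    return pred == gt


def in_bbox(pred: Point, bbox: BBox) -> bool:
    """Criterion B: any click inside the target element's box counts as correct."""
    x, y = pred
    x_min, y_min, x_max, y_max = bbox
    return x_min <= x <= x_max and y_min <= y <= y_max


# Made-up example values: a click that misses the annotated point
# but still lands on the intended element.
gt_click = (120.0, 340.0)
target_box = (100.0, 320.0, 180.0, 360.0)
candidate = (150.0, 350.0)

print(exact_match(candidate, gt_click))  # False
print(in_bbox(candidate, target_box))    # True
```

Under criterion A this candidate would be sampled as a negative operation even though it clicks the right element, which is the case I am unsure about.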

Question about the rule-based criteria for GUI-Critic-R1 dataset construction #149
