Add example for data augmentation in tuner #98

lingzhq · 2026-01-09T12:42:21Z

📝 PR Type

📚 Description

Add an example of a data augmentation strategy for AgentScope-Tuner.

🧪 Testing Validation

Follow the README to run the examples.

✅ Checklist

Please complete the following checks before submitting the PR:

All sample code has been formatted with pre-commit run --all-files
All new/modified test cases have passed (run pytest tests/)
Test coverage has not decreased (if applicable)
Sample code follows agentscope best practices (e.g., config management, logging)
Related documentation in agentscope-samples has been updated (e.g., README.md)

cla-assistant · 2026-01-09T12:42:35Z

All committers have signed the CLA.

Copilot

Pull request overview

This PR adds a new example demonstrating data augmentation strategies for AgentScope-Tuner, specifically focusing on difficulty-based task selection for training a math problem-solving ReAct agent.
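To make the contrast between the two selectors concrete, here is a minimal sketch of random selection versus difficulty-based selection. It is an illustration under stated assumptions, not the selector shipped in this PR: the `difficulty` field, the `target` parameter, and both function names are hypothetical, and the real selector may well use a different policy (for example, one that adapts during training).

```python
import random
from typing import Any, Dict, List

Task = Dict[str, Any]


def select_random(tasks: List[Task], k: int, seed: int = 0) -> List[Task]:
    """Baseline: pick k tasks uniformly at random (seeded for repeatability)."""
    return random.Random(seed).sample(tasks, k)


def select_by_difficulty(tasks: List[Task], k: int, target: float) -> List[Task]:
    """Pick the k tasks whose difficulty score is closest to `target`.

    Assumes each task carries a numeric "difficulty" field (hypothetical name).
    """
    return sorted(tasks, key=lambda t: abs(float(t["difficulty"]) - target))[:k]


if __name__ == "__main__":
    pool = [
        {"id": "a", "difficulty": 0.10},
        {"id": "b", "difficulty": 0.50},
        {"id": "c", "difficulty": 0.90},
        {"id": "d", "difficulty": 0.45},
    ]
    print([t["id"] for t in select_by_difficulty(pool, k=2, target=0.5)])
    print([t["id"] for t in select_random(pool, k=2)])
```

Choosing the tasks nearest a target difficulty is only one simple policy; it is used here because it is deterministic and easy to check.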

Changes:

Adds a complete data augmentation example with configuration for both random and difficulty-based task selectors
Includes a data preparation script to transform the LLM360/guru-RL-92k dataset into GSM8K format
Provides comprehensive documentation explaining data-centric training approaches
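As a rough illustration of the data preparation step, the sketch below maps one source record into the GSM8K layout, where each item has a `question` and an `answer` whose final line is `#### <final answer>`. The input field names (`prompt`, `solution`, `final_answer`, `difficulty_score`) are hypothetical placeholders and are not taken from the LLM360/guru-RL-92k schema or from `prepare_data.py`.

```python
from typing import Any, Dict


def to_gsm8k(record: Dict[str, Any]) -> Dict[str, Any]:
    """Convert one source record into a GSM8K-style item.

    All input keys used here are hypothetical; adapt them to the real schema.
    """
    reasoning = str(record["solution"]).strip()
    final = str(record["final_answer"]).strip()
    return {
        "question": str(record["prompt"]).strip(),
        # GSM8K stores the final answer after a "####" marker on the last line.
        "answer": f"{reasoning}\n#### {final}",
        # Carried along so a difficulty-aware selector can use it later.
        "difficulty": float(record["difficulty_score"]),
    }


if __name__ == "__main__":
    item = to_gsm8k(
        {
            "prompt": " What is 6 * 7? ",
            "solution": "6 * 7 = 42",
            "final_answer": "42",
            "difficulty_score": 0.2,
        }
    )
    print(item["answer"])
```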

Reviewed changes

Copilot reviewed 5 out of 6 changed files in this pull request and generated 5 comments.


| File | Description |
| --- | --- |
| tuner/data_augment/prepare_data.py | Script to download and transform math dataset from HuggingFace, extracting difficulty features |
| tuner/data_augment/main.py | Main training script implementing ReAct agent workflow and GSM8K judge function with tuner integration |
| tuner/data_augment/config_random.yaml | Configuration file for baseline experiment using random task selector |
| tuner/data_augment/config_difficulty.yaml | Configuration file for advanced experiment using difficulty-based task selector |
| tuner/data_augment/README.md | Comprehensive documentation explaining the data-centric approach, setup, and usage |
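For readers unfamiliar with a GSM8K-style judge, the following sketch shows the general idea: take the reference answer after the `####` marker, take the last number in the model response, and return a reward of 1.0 on a match. This is an assumption-laden illustration rather than the judge function in `main.py`; the function names and the "last number wins" heuristic are hypothetical choices.

```python
import re
from typing import Optional

# Matches integers and simple decimals, with an optional leading minus sign.
_NUMBER = re.compile(r"-?\d+(?:\.\d+)?")


def extract_reference(answer: str) -> str:
    """Return the text after the last '####' marker, without thousands commas."""
    return answer.split("####")[-1].strip().replace(",", "")


def extract_prediction(response: str) -> Optional[str]:
    """Return the last number mentioned in the response, or None if absent."""
    matches = _NUMBER.findall(response.replace(",", ""))
    return matches[-1] if matches else None


def judge(response: str, reference_answer: str) -> float:
    """Return 1.0 if the predicted number equals the reference, else 0.0."""
    prediction = extract_prediction(response)
    if prediction is None:
        return 0.0
    try:
        expected = float(extract_reference(reference_answer))
    except ValueError:
        return 0.0
    return 1.0 if abs(float(prediction) - expected) < 1e-6 else 0.0


if __name__ == "__main__":
    print(judge("So the total is 1,234 apples.", "step 1 ...\n#### 1234"))
    print(judge("I am not sure.", "#### 5"))
```

Taking the last number is a common but lossy heuristic; a stricter judge might require the answer in a fixed format such as a boxed expression.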


tuner/data_augment/README.md

lingzhq

Addressed the comments from Copilot.

tuner/data_augment/README.md

add example for data augmentation in tuner

4105239

lingzhq requested a review from a team January 9, 2026 12:42

lingzhq and others added 4 commits January 13, 2026 21:31

fixed main.py

8cc1fa0

fixed readme

c972b61

Merge branch 'agentscope-ai:main' into example/data

71e82b7

fixed precommit

5c3177e

rayrayraykk requested a review from Copilot January 16, 2026 09:20

Copilot started reviewing on behalf of rayrayraykk January 16, 2026 09:20

Copilot AI reviewed Jan 16, 2026


lingzhq commented Jan 16, 2026


tuner/data_augment/README.md: 3 review threads (outdated, resolved)

lingzhq and others added 3 commits January 16, 2026 22:34

Fix all comments and fix Trinity repo link.

0655188

Merge branch 'agentscope-ai:main' into example/data

58d19e0

Fix links.

40743b5
