train strategy #18900
Replies: 2 comments
-
👋 Hello @yeonhyochoi, thank you for your detailed post and for sharing your training strategy and insights! 🚀 It's great to see how much effort you've put into optimizing your training process and analyzing your results.

We recommend checking out the Docs for guidance on topics such as dataset balancing and augmentation strategies. For more in-depth information on improving training performance, take a look at our Tips for Best Training Results 📚. These might help refine your approach further.

If your post is related to a 🐛 Bug Report, we kindly ask for a minimum reproducible example so we can assist you quickly and efficiently. If you're looking for additional feedback on your custom training ❓, providing more details, such as logs, sample images (if possible), or snippets of your dataset metadata, could help the community and our team give better advice.

Regarding the slower processing speed of your approach, you may also want to explore dataset sampling techniques or caching optimizations. Check the Dataset Guide for tips on efficiently handling large datasets (such as your 2,000,000-image dataset).

For real-time help, join the discussion on our Discord server 🎧. If you prefer topic-specific threads or detailed posts, our Discourse Community or Subreddit would be excellent places to collaborate with other developers.

**Upgrade**

First, please ensure you're running the latest version of the package with `pip install -U ultralytics`, and make sure all requirements are installed. This ensures you're leveraging the latest improvements for data handling, training, and optimization.

**Environments**

Consider running your experiments in one of the verified environments, which include preinstalled dependencies like CUDA/cuDNN.
This is an automated response 🤖, but an Ultralytics engineer will review your discussion and provide further assistance shortly. Thank you for being a part of the Ultralytics community! 🌟
-
@yeonhyochoi it seems you're experimenting with data strategies to improve training outcomes. While replacing or dynamically sampling datasets can be beneficial in some cases, it introduces complexity and can slow down training significantly, as you've observed. Instead, consider techniques like data augmentation, multi-scale training, or mosaic augmentation, which can increase data variety without the need for dataset replacement. For background images, maintaining a 3–10% ratio is a sound practice, provided they are representative of your deployment environment. If you need further optimization, you can explore hyperparameter tuning or early stopping when metrics plateau to save resources. For more details, refer to the training tips here.
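As a minimal sketch of these suggestions (not the poster's exact setup): the `ultralytics` training API exposes mosaic augmentation, scale jitter, and early stopping directly as training arguments, so no dataset swapping is needed. The checkpoint name and `data.yaml` path below are placeholders for your own files.

```python
from ultralytics import YOLO

model = YOLO("yolo11n.pt")  # any pretrained checkpoint works here

model.train(
    data="data.yaml",  # placeholder: your dataset config
    epochs=300,
    patience=50,   # early stopping: halt if no val improvement for 50 epochs
    mosaic=1.0,    # mosaic augmentation (enabled by default)
    scale=0.5,     # random image-scale jitter for multi-scale variety
    fliplr=0.5,    # horizontal flips
)
```

With `patience`, a 500-epoch budget simply becomes an upper bound: training stops once the metrics plateau, which addresses the "improvement has stopped" observation without manual intervention.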
-
The training results continued to improve, but now the improvement has stopped. As stated in the documentation, background images should make up less than 10% of the dataset, but I don't think seeing the same backgrounds throughout 300- or 500-epoch training will help learning improve. So we are trying to replace the training dataset every 5 or 10 epochs.
I also tried it like this, but the process is very slow, so I think it is the wrong approach.
Is there anyone who can give me some advice on my code?
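One cheap way to get the effect described above, sketched here as plain Python (the function name and file lists are illustrative, not part of any Ultralytics API): keep the object images fixed and re-draw only which background (label-free) images are included, every N epochs, holding backgrounds at roughly 3–10% of the epoch. This avoids rebuilding the whole dataset.

```python
import random

def resample_backgrounds(object_imgs, background_pool, epoch,
                         interval=5, bg_ratio=0.05, seed=0):
    """Return this epoch's image list: all object images plus a freshly
    sampled slice of backgrounds, re-drawn every `interval` epochs."""
    # Seeding by epoch // interval keeps the draw stable within an interval
    # and changes it only at interval boundaries.
    rng = random.Random(seed + epoch // interval)
    n_bg = min(len(background_pool), int(len(object_imgs) * bg_ratio))
    return object_imgs + rng.sample(background_pool, n_bg)

# Illustrative usage with dummy file names:
objs = [f"obj_{i}.jpg" for i in range(1000)]
pool = [f"bg_{i}.jpg" for i in range(500)]
epoch0 = resample_backgrounds(objs, pool, epoch=0)  # 1000 objects + 50 backgrounds
epoch5 = resample_backgrounds(objs, pool, epoch=5)  # new background draw
```

The resulting list could be written out as the training image list between runs; the key point is that only the small background slice changes, so the bulk of the dataset (and any caching built on it) stays untouched.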