This repo use vision_base library extracts from FSNet to accelerate the development of segmentation related research.
The vision_base library contains generic codes for:
- Config-based model/pipeline construction.
- Common tools for pipeline organization, experiment logs.
- Common mathematical, pytorch, numerical tools.
- Write code for dataset fetching, dataset evaluator, and network model.
- Adapt config files based on examples to launch dataset/network/training with your own model.
- Use existing scripts and launchers to start experiments.
- Minimize the modification in vision_base. Just write new classes in another folders if needed.
This segmentation package implements a simple UNet with a Transformer block to conduct semantic segmentation on KITTI360 dataset.
Download the KITTI360 data2D image sequences and data2D 2D annotations. Organized in {...}/KITTI-360/{data_2d_raw|data_2d_semantics}
After modifying paths in the configuration file. Launch training with the standard script.