Vision Base - Segmentation

This repo use vision_base library extracts from FSNet to accelerate the development of segmentation related research.

The vision_base library contains generic codes for:

Recommended Practice

Write code for dataset fetching, dataset evaluator, and network model.
Adapt config files based on examples to launch dataset/network/training with your own model.
Use existing scripts and launchers to start experiments.
Minimize the modification in vision_base. Just write new classes in another folders if needed.

This segmentation package implements a simple UNet with a Transformer block to conduct semantic segmentation on KITTI360 dataset.

Download the KITTI360 data2D image sequences and data2D 2D annotations. Organized in {...}/KITTI-360/{data_2d_raw|data_2d_semantics}

After modifying paths in the configuration file. Launch training with the standard script.