Skip to content

Voxel Files Outnumber Point Cloud and Label Files in SemanticKITTI Sequences — Causing Mismatched Data Entries #171

@xyxiongcode

Description

@xyxiongcode

I'm currently working with the SemanticKITTI dataset for 3D occupancy prediction training , and I’ve encountered a data mismatch issue regarding the voxels subfolder.

Problem Description
In multiple sequences (e.g., sequences/00, sequences/01, etc.), the number of .npz voxel files is significantly greater than the number of corresponding .bin (velodyne) and .label files. Specifically:

Some voxel files (e.g., 002480.npz) exist in the voxels directory, but the corresponding 002480.bin and 002480.label do not exist in velodyne/ or labels/.

As a result, when training or preprocessing data that depends on synchronized inputs (voxels + lidar + labels), this inconsistency can lead to errors or ignored samples.

This also creates difficulty in aligning data for supervised learning or evaluation steps where perfect correspondence between modalities is required.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions