Conversation

@tango4j tango4j commented Sep 3, 2025

What does this PR do ?

This PR adds bf16 precision training and inference for Sortformer diarizer models.
Hardware: bf16 operations are natively supported starting with the Ampere architecture (e.g., A100).

Collection: ASR/speaker_task

Changelog

NeMo/nemo/collections/asr/losses/bce_loss.py
NeMo/examples/speaker_tasks/diarization/neural_diarizer/e2e_diarize_speech.py

Usage

Although the model weights are FP32, the e2e_diarize_speech.py script automatically casts them to bf16 and then runs inference in bf16.

python $BASEPATH/neural_diarizer/e2e_diarize_speech.py \
    precision="bf16" \
    model_path=/path/to/diar_sortformer_4spk_v1.nemo \
    batch_size=1 \
    dataset_manifest=/path/to/diarization_manifest.json

For training, specify the following configuration:

trainer.precision="bf16"
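As a rough illustration of what the script's FP32-to-bf16 conversion amounts to, here is a minimal PyTorch sketch with a toy module (not the actual Sortformer code; the layer and shapes are illustrative):

```python
import torch

# Toy stand-in for a diarizer model; weights start in FP32.
model = torch.nn.Linear(8, 4)

# Cast the FP32 weights to bf16, mirroring what the Usage section
# describes the inference script doing automatically.
model = model.to(torch.bfloat16)

# Inputs must match the compute dtype.
x = torch.randn(2, 8, dtype=torch.bfloat16)

with torch.inference_mode():
    y = model(x)

print(y.dtype)  # torch.bfloat16
```

On Ampere and newer GPUs this kind of cast halves activation memory and uses the bf16 tensor cores; on older hardware bf16 falls back to slower emulated kernels.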

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items, you can still open a "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contain specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

KunalDhawan previously approved these changes Sep 3, 2025

@KunalDhawan KunalDhawan left a comment

LGTM, thanks Taejin!

@@ -82,6 +82,7 @@ class DiarizationConfig:
no_der: bool = False
out_rttm_dir: Optional[str] = None
save_preds_tensors: bool = False
precision: str = "bf16" # 32, bf16
Let's also add the bf16-mixed option and maybe add a small comment about possible gains expected with bf16 training/inference?
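To sketch the suggestion, one hypothetical way to interpret the three option strings (function and key names are illustrative, not taken from the NeMo codebase):

```python
def resolve_precision(precision: str) -> dict:
    """Hypothetical mapping from a precision string to casting behavior.

    "32"         -- keep FP32 weights, no autocast
    "bf16"       -- cast weights to bf16, compute fully in bf16
    "bf16-mixed" -- keep FP32 weights, autocast matmuls to bf16
    """
    table = {
        "32": {"weights_dtype": "float32", "autocast": False},
        "bf16": {"weights_dtype": "bfloat16", "autocast": False},
        "bf16-mixed": {"weights_dtype": "float32", "autocast": True},
    }
    if precision not in table:
        raise ValueError(f"unsupported precision: {precision!r}")
    return table[precision]
```

The "bf16-mixed" case keeps an FP32 master copy of the weights (useful for training stability) while still getting most of the bf16 speed-up on matmul-heavy layers.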

@ipmedenn ipmedenn left a comment
LGTM!
