HERIUN/fsdp_s2tt_seamlessM4T_v2_finetuning: fine-tuning SeamlessM4T v2 S2TT with FSDP

Fine-tuning SeamlessM4T v2 on custom data with FSDP

0. (Optional) Generate sample S2TT data (n_samples=100) with ChatGPT and Google TTS:

python make_sample_data.py

Prepare your custom data as a JSON file (output.json). Each entry holds the source text, the target text, and an audio_path that points to the source-language speech:

[
    {
        "kor": "서울은 다양한 역사적 유산과 현대적인 매력을 함께 경험할 수 있는 도시입니다.",
        "eng": "Seoul is a city where you can experience both historical heritage and modern charm.",
        "audio_path": "/data/donggukang/seamless_test/s2tt_data/audio/output_0.wav"
    },
    {
        "kor": "경주는 한국에서 가장 오래된 도시 중 하나로서, 문화재와 유적지가 많이 있습니다.",
        "eng": "Gyeongju is one of the oldest cities in Korea with numerous cultural relics and historical sites.",
        "audio_path": "/data/donggukang/seamless_test/s2tt_data/audio/output_1.wav"
    },
    ...
]
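Before converting, it can help to check that every entry has the fields shown above. The following is a minimal sketch, not part of this repository: the function name validate_records and its parameters are hypothetical, and the key names ("kor", "eng", "audio_path") are taken from the example and would need to match your own language codes.

```python
import os

# Keys taken from the example entries above; adjust to your own language codes.
REQUIRED_KEYS = ("kor", "eng", "audio_path")


def validate_records(records, required_keys=REQUIRED_KEYS, check_files=False):
    """Return a list of (index, problem) tuples; an empty list means no problems were found."""
    problems = []
    for i, rec in enumerate(records):
        # Every required key must be present and hold a non-empty string.
        for key in required_keys:
            value = rec.get(key)
            if not isinstance(value, str) or not value.strip():
                problems.append((i, f"missing or empty '{key}'"))
        # Optionally confirm that the audio file exists on disk.
        audio_path = rec.get("audio_path")
        if check_files and isinstance(audio_path, str) and audio_path.strip():
            if not os.path.isfile(audio_path):
                problems.append((i, f"audio file not found: {audio_path}"))
    return problems
```

To use it, load the file with json.load and pass the resulting list to validate_records(records, check_files=True); any returned tuples name the entry index and the problem.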

Alternatively, use a Hugging Face audio dataset; see load_custom_s2tt_hf_dataset in src/seamless_communication/cli/m4t/finetune/dataset.py.

Convert the data to the seamless manifest format:

# --save_dir: where to save the manifest JSON
python src/seamless_communication/cli/m4t/finetune/dataset.py \
    --name "custom" \
    --source_lang "kor" \
    --target_lang "eng" \
    --save_dir "/data/donggukang/data" \
    --json_path "out_json.json"

Edit finetune.py so that it points to your local seamless_communication source tree and selects the GPUs to use:

sys.path.append("/data/donggukang/seamless_test/seamless_communication/src")
os.environ["CUDA_VISIBLE_DEVICES"]="0,3,4,7"
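The two lines above can be wrapped in a small helper so the path and GPU list are set in one place. This is a sketch under assumptions: configure_environment is a hypothetical name and does not exist in finetune.py. CUDA_VISIBLE_DEVICES only takes effect if it is set before CUDA is initialized in the process, so the call should come before any code that touches the GPU.

```python
import os
import sys


def configure_environment(src_dir, gpu_ids):
    """Expose only the given GPUs and make the local source tree importable."""
    # Set this before CUDA is initialized; later changes are ignored by the process.
    os.environ["CUDA_VISIBLE_DEVICES"] = ",".join(str(g) for g in gpu_ids)
    # Avoid adding the same directory twice if the helper is called again.
    if src_dir not in sys.path:
        sys.path.append(src_dir)
    return os.environ["CUDA_VISIBLE_DEVICES"]
```

For example, configure_environment("/data/donggukang/seamless_test/seamless_communication/src", [0, 3, 4, 7]) reproduces the settings shown above. Inside the process the selected GPUs are renumbered from 0, so physical GPU 3 appears as device 1.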

Fine-tune S2TT with FSDP:

python src/seamless_communication/cli/m4t/finetune/finetune.py \
   --mode SPEECH_TO_TEXT \
   --train_dataset /data/donggukang/data/custom_manifest.json  \
   --eval_dataset /data/donggukang/data/custom_manifest.json \
   --batch_size 1 \
   --learning_rate 1e-6 \
   --warmup_steps 100 \
   --max_epochs 10 \
   --patience 5 \
   --model_name seamlessM4T_v2_large \
   --save_model_to ./checkpoint.pt
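As an illustration of what a --patience value usually means, the sketch below shows a common early-stopping rule: training stops once the evaluation loss has failed to improve for that many consecutive evaluations. This is an assumption about the general technique, not a copy of the trainer in this repository, and the class name PatienceTracker is hypothetical.

```python
class PatienceTracker:
    """Early stopping: stop after `patience` evaluations without improvement."""

    def __init__(self, patience):
        self.patience = patience
        self.patience_left = patience
        self.best_loss = None

    def update(self, eval_loss):
        """Record one evaluation loss; return True if training should stop."""
        if self.best_loss is None or eval_loss < self.best_loss:
            # New best loss: remember it and reset the counter.
            self.best_loss = eval_loss
            self.patience_left = self.patience
        else:
            # No improvement: use up one unit of patience.
            self.patience_left -= 1
        return self.patience_left <= 0
```

With --patience 5, a tracker built as PatienceTracker(5) would signal a stop after five evaluations in a row that do not beat the best loss seen so far.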

