
Chinese is not recognized with the default model #79

luobendewugong opened this issue Sep 4, 2024 · 10 comments

@luobendewugong

Hello, thanks for your work. I have two problems with the default model and would like to ask about them:

  1. It can only recognize English. Where do I need to set it to recognize Chinese, or do I need to replace it with another model?
  2. The recognition quality is not good and the spoken answers are not fluent. Do I need to switch to a different TTS model? The default model is actually quite big.

Thank you so much!

@Lbaiall

Lbaiall commented Sep 4, 2024

You should change the model to a different one from the HF Hub.

@andimarafioti
Member

Using the code from this PR: #60

You can call the system with:
python s2s_pipeline.py --recv_host 0.0.0.0 --send_host 0.0.0.0 --lm_model_name meta-llama/Meta-Llama-3.1-8B-Instruct --init_chat_role system --tts melo --stt_model_name openai/whisper-large-v3 --language zh

@andimarafioti
Member

There, Whisper is larger than the distil version (but it works for Chinese). The LLM is larger (but it works for Chinese, and you can swap it for another one). The TTS is smaller than the default (and it works for Chinese).
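
If the 8B LLM is too heavy, the same flags from the command above can be reused with a smaller Chinese-capable chat model from the Hub. A minimal sketch, assuming Qwen/Qwen2-1.5B-Instruct as the substitute (just one example, not something tested in this thread):

```
# Sketch: same invocation as above, only --lm_model_name is swapped.
# Qwen/Qwen2-1.5B-Instruct is an assumed example of a smaller
# Chinese-capable chat model; any similar Hub model should work here.
python s2s_pipeline.py \
  --recv_host 0.0.0.0 \
  --send_host 0.0.0.0 \
  --lm_model_name Qwen/Qwen2-1.5B-Instruct \
  --init_chat_role system \
  --tts melo \
  --stt_model_name openai/whisper-large-v3 \
  --language zh
```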

@andimarafioti andimarafioti self-assigned this Sep 4, 2024
@andimarafioti
Member

Let me know if it works :)

@andimarafioti
Member

I merged the PR for multiple languages, so you should be able to run with the code in main.

@Kong4Git

Kong4Git commented Sep 4, 2024

Hi, thanks for your work. I encountered an issue while running the code from your repository on my Mac. The error I received is as follows: ValueError: Please select a valid model

The error occurs when initializing the LightningWhisperMLX model with the following command:
python s2s_pipeline.py --local_mac_optimal_settings --device mps --lm_model_name meta-llama/Meta-Llama-3.1-8B-Instruct --init_chat_role system --tts melo --stt_model_name openai/whisper-large-v3 --language zh

could you please provide some guidance on what might be causing this issue or suggest any potential solutions?

Thank you very much for your help!

@andimarafioti
Member

You can run it on mac with:

python s2s_pipeline.py  --device mps --lm_model_name meta-llama/Meta-Llama-3.1-8B-Instruct --init_chat_role system --tts melo --stt_model_name openai/whisper-large-v3 --language zh --mode local

But we still haven't made the changes to the MLX classes to support Chinese, so generation will be quite slow.

@andimarafioti
Member

If you want to make the changes, we welcome PRs! Otherwise I'll adapt it in the coming days.

@luobendewugong
Author

Thank you very much, I can use Chinese now, but there are three more things I want to ask:

  1. Modifying 'init_chat_prompt' in 'LLM/language_model.py' doesn't seem to have any effect; no matter how I modify it, the LLM's answers don't change.
  2. '--language None' seems to be related to the language model. I am using qwen2-1.5b; when running without the '--language None' mode, I have to choose the language for the output.
  3. Can I use a model in GGUF format?

@andimarafioti
Member

  1. For the init_chat_prompt to take effect, you also need to set init_chat_role (see the sketch after this list).
  2. We changed it to '--language auto' because we thought it was more intuitive. In any case, it's related to everything: setting '--language auto' makes everything automatic, and setting '--language zh' should make everything Chinese.
  3. Do you mean for the LLM? I think you should be able to. Try it out and report back to me!
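
A minimal sketch of point 1, assuming the CLI exposes an --init_chat_prompt flag mirroring the init_chat_prompt setting in LLM/language_model.py (if it doesn't, editing that file and passing --init_chat_role should have the same effect):

```
# Sketch only: --init_chat_prompt as a CLI flag and the prompt text are
# assumptions, not confirmed in this thread; --init_chat_role must be set
# for the prompt to take effect.
python s2s_pipeline.py \
  --recv_host 0.0.0.0 \
  --send_host 0.0.0.0 \
  --lm_model_name meta-llama/Meta-Llama-3.1-8B-Instruct \
  --init_chat_role system \
  --init_chat_prompt "You are a helpful assistant. Answer in Chinese." \
  --tts melo \
  --stt_model_name openai/whisper-large-v3 \
  --language zh
```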
