Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Two Channel (Stream) Speech Prompt using API #174

Open
1 task done
jlian2 opened this issue Dec 24, 2024 · 1 comment
Open
1 task done

Two Channel (Stream) Speech Prompt using API #174

jlian2 opened this issue Dec 24, 2024 · 1 comment
Labels
question Further information is requested

Comments

@jlian2
Copy link

jlian2 commented Dec 24, 2024

Due diligence

  • I have done my due diligence in trying to find the answer myself.

Topic

The PyTorch implementation

Question

I am doing some inference experiments. I am curious if I can feed two channel speech into model and get response (something the same as dGSLM) without human-machine interaction.

@jlian2 jlian2 added the question Further information is requested label Dec 24, 2024
@jlian2
Copy link
Author

jlian2 commented Dec 24, 2024

I meant it feels to me that moshi is only able to accept user input and return agent output. But it is not able to accept both user and agent input, although it is mentioned in section 3.4.3.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

1 participant