Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Gibberish output for princeton-nlp/Sheared-LLaMA-1.3B with continuous batching #94

Open
pinak-p opened this issue Jul 15, 2024 · 2 comments

Comments

@pinak-p
Copy link

pinak-p commented Jul 15, 2024

I'm seeing gibberish output with princeton-nlp/Sheared-LLaMA-1.3B when using continuous batching with transformers-neuronx and optimum neuron.

aws-neuronx-runtime-discovery 2.9
libneuronxla                  2.0.2335
neuronx-cc                    2.14.213.0+013d129b
optimum-neuron                0.0.24.dev0
torch-neuronx                 2.1.2.2.2.0

optimum-cli export neuron -m princeton-nlp/Sheared-LLaMA-1.3B --batch_size 4 --sequence_length 4096 --num_cores 2 --auto_cast_type fp16 ./neuron-model

https://github.com/huggingface/optimum-neuron/blob/aws_neuron_sdk_2.19/examples/text-generation/generation.py

['<s> One of my fondest memory is Ozi, Ozi Ozi\nPi (Greek: πι) is the symbol of a circle. A circle in geometry is called a "polytope", or "triple cover". In topology, a circle is a polytope of genus zero, or a cover of type one of genus zero. In topology, an object is called a "closed curve".\nA closed curve is called a "surface". Surfaces are defined in terms of π and ρ. The intersection πρ of the two curves at a point x is called a "point', '<s> One of my fondest memory is a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a a', '<s> One of my fondest memory is the F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F F', '<s> One of my fondest memory is the sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks sinks s']

@pinak-p
Copy link
Author

pinak-p commented Jul 15, 2024

@dacorvo

@dacorvo
Copy link

dacorvo commented Jul 18, 2024

This used to work with SDK 2.18.0.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants