WIP: Enabled fp16/bf16 based export and compile for some causalLMs. #775

Draft
quic-dhirajku wants to merge 1 commit into quic:main from quic-dhirajku:custom_dtype_export

@quic-dhirajku
Contributor

Modified compile-command creation so that -convert-to-fp16 is passed only when compiling for the ai100 device. Modified custom-io creation for causalLM so that its entries use bfloat16 or float16, matching the dtype of the loaded model.

Signed-off-by: Dhiraj Kumar Sah <dhirajku@qti.qualcomm.com>
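
For reviewers who want a concrete picture of the two changes described above, here is a minimal sketch, not the code in this PR. Only the -convert-to-fp16 flag, the ai100 device name, and the float16/bfloat16 dtypes come from the description. The function names, the `compiler-binary` placeholder, the `-m=` argument form, the `past_key.N`/`past_value.N` tensor names, and the choice to reject other dtypes are all assumptions made for illustration.

```python
from typing import Dict, List

# dtypes named in the PR description; anything else is out of scope here.
SUPPORTED_DTYPES = ("float16", "bfloat16")


def build_compile_command(onnx_path: str, device: str) -> List[str]:
    """Build a compiler argument list (hypothetical helper).

    "compiler-binary" and the "-m=" form are placeholders, not the real CLI.
    """
    command = ["compiler-binary", f"-m={onnx_path}"]
    # Per the description, the flag is added for ai100 compilation only.
    if device == "ai100":
        command.append("-convert-to-fp16")
    return command


def kv_custom_io(num_layers: int, model_dtype: str) -> Dict[str, str]:
    """Map KV-cache tensor names to the loaded model's dtype (hypothetical helper).

    The tensor naming scheme below is assumed for illustration.
    """
    if model_dtype not in SUPPORTED_DTYPES:
        # Assumption: this sketch simply rejects dtypes the PR does not mention.
        raise ValueError(f"unsupported dtype for custom IO: {model_dtype}")
    custom_io: Dict[str, str] = {}
    for layer in range(num_layers):
        custom_io[f"past_key.{layer}"] = model_dtype
        custom_io[f"past_value.{layer}"] = model_dtype
    return custom_io


if __name__ == "__main__":
    print(build_compile_command("model.onnx", "ai100"))
    print(kv_custom_io(2, "bfloat16"))
```

Keeping the device check in one place means a non-ai100 target never sees the fp16 conversion flag, and deriving the custom-io dtype from a single argument keeps every KV-cache entry consistent with the loaded model.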