Pinned Loading
-
huggingface/optimum-quanto
huggingface/optimum-quanto PublicA pytorch quantization backend for optimum
898 contributions in the last year
Day of Week | March Mar | April Apr | May May | June Jun | July Jul | August Aug | September Sep | October Oct | November Nov | December Dec | January Jan | February Feb | March Mar | ||||||||||||||||||||||||||||||||||||||||
Sunday Sun | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Monday Mon | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuesday Tue | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Wednesday Wed | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Thursday Thu | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Friday Fri | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Saturday Sat |
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More
Activity overview
Contributed to
huggingface/optimum-quanto,
huggingface/optimum-neuron,
huggingface/text-generation-inference
and 25 other
repositories
Loading
Contribution activity
March 2025
Created 2 repositories
-
dacorvo/neuronx-distributed-inference
Python
This contribution was made on Mar 17
-
dacorvo/tgi-neuron-sagemaker-demo
Python
This contribution was made on Mar 12
Created a pull request in huggingface/optimum-neuron that received 5 comments
Add support for phi3 model architecture
What does this PR do?
This allows to run the following models on neuron:
microsoft/phi-4,
microsoft/Phi-3-mini-4k-instruct.
Other phi3
models usi…
+192
−2
lines changed
•
5
comments
Opened 7 other pull requests in 5 repositories
huggingface/optimum-neuron
2
merged
-
Cache granite and phi4 models
This contribution was made on Mar 12
-
Enable back continuous batching for phi3 and cleanup hlo modeling
This contribution was made on Mar 10
huggingface/text-generation-inference
2
merged
-
Update neuron backend
This contribution was made on Mar 11
-
fix(neuron): explicitly install toolchain
This contribution was made on Mar 5
aws/deep-learning-containers
1
open
-
[HuggingFace][Neuronx] Inference - Optimum Neuron 0.1.0 - Neuron sdk 2.21.1 - Transformers to 4.48.3
This contribution was made on Mar 13
awslabs/llm-hosting-container
1
open
-
Add TGI 3.2.0 neuron
This contribution was made on Mar 12
huggingface/optimum-quanto
1
merged
-
Bump minimal pytorch version to 2.6
This contribution was made on Mar 3
Reviewed 13 pull requests in 3 repositories
huggingface/optimum-neuron
10 pull requests
-
chore(test): add test comparing Linear and RowParallelLinear outputs
This contribution was made on Mar 24
-
Fix broken cache for traced models & fix runtime error of diffusion models when batch_size > 1
This contribution was made on Mar 20
-
Adding environment options explanation
This contribution was made on Mar 20
-
latest available tgi dlc uri
This contribution was made on Mar 19
-
Add Whisper for the task "automatic-speech-recognition" w/o. KV cache
This contribution was made on Mar 18
-
More training tests updates
This contribution was made on Mar 12
-
Training remove gpt neo models support
This contribution was made on Mar 12
-
Enable back continuous batching for phi3 and cleanup hlo modeling
This contribution was made on Mar 10
-
Training test refactoring
This contribution was made on Mar 10
-
Add support for phi3 model architecture
This contribution was made on Mar 5
huggingface/optimum-quanto
2 pull requests
-
[tests] enable awq related tests on XPU
This contribution was made on Mar 18
-
Fix error when trying to access
state_dict
after activation quantizationThis contribution was made on Mar 5
huggingface/text-generation-inference
1 pull request
-
feat: add support for HF_HUB_USER_AGENT_ORIGIN to add user-agent Origin field in Hub requests.
This contribution was made on Mar 5
5
contributions
in private repositories
Mar 3 – Mar 21