Stars
📚Modern CUDA Learn Notes with PyTorch: 200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe API (Achieve 98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
TensorFlow2.0 Implementation of Fully Quantized Vision Transformer (IJCAI, 2022)
Code repo for the paper "SpinQuant: LLM quantization with learned rotations"
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
A project demonstrating Lidar related AI solutions, including three GPU accelerated Lidar/camera DL networks (PointPillars, CenterPoint, BEVFusion) and the related libs (cuPCL, 3D SparseConvolution…
A new TensorRT integration. Easy to integrate many tasks
Knowledge Graph, Question Answering System: a medical diagnosis question answering system based on knowledge graphs and vector retrieval
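A sequence labeling tool based on the BIO scheme; can be used for data annotation in tasks such as named entity recognition and event trigger word recognition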
A lightweight NER tool: an NER annotation tool based on Vue & FastAPI, with NER data augmentation