Skip to content
View gpantaz's full-sized avatar

Highlights

  • Pro

Block or report gpantaz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

State-of-the-Art Text Embeddings

Python 16,131 2,558 Updated Mar 5, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,131 770 Updated Mar 1, 2025

Data validation using Python type hints

Python 22,674 2,032 Updated Mar 4, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,533 760 Updated Aug 12, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 33,443 3,778 Updated Mar 3, 2025

Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Python 906 34 Updated Jan 21, 2025

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 16,374 2,358 Updated Mar 4, 2025

Image augmentation for machine learning experiments.

Python 14,519 2,454 Updated Jul 30, 2024

Research code for pixel-based encoders of language (PIXEL)

Python 334 33 Updated Mar 6, 2024

Codebase for CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts

2 Updated Oct 20, 2024

Multilingual Image Captioning Evaluation

Python 2 Updated May 10, 2024

a python framework to build, learn and reason about probabilistic circuits and tensor networks

Python 89 8 Updated Mar 3, 2025

Official implementation of project Honeybee (CVPR 2024)

Python 444 20 Updated May 10, 2024

Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.

Jupyter Notebook 2,685 171 Updated Mar 5, 2025

Athens NLP Summer School 2024 - Lab material

Jupyter Notebook 20 4 Updated Oct 1, 2024

🤖 Machine Learning Summer School Guide

HTML 2,708 306 Updated Feb 28, 2025

Code for the MultipanelVQA benchmark "Muffin or Chihuahua? Challenging Large Vision-Language Models with Multipanel VQA"

Jupyter Notebook 7 Updated Apr 11, 2024

A reading list of up-to-date papers on NLP for Social Good.

298 31 Updated Sep 13, 2023

Code for Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation

Python 4 Updated Aug 17, 2024

An annotated implementation of the Transformer paper.

Jupyter Notebook 6,050 1,281 Updated Apr 7, 2024

Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"

Python 546 25 Updated Aug 16, 2024

Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"

Python 24 2 Updated Jul 31, 2024

Egocentric Video Understanding Dataset (EVUD)

Python 26 3 Updated Jul 4, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 49,158 5,800 Updated Sep 18, 2024

Video+code lecture on building nanoGPT from scratch

Python 3,936 578 Updated Aug 13, 2024

The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”

Python 950 54 Updated Jan 30, 2024

Multimodal language model benchmark, featuring challenging examples

Python 158 9 Updated Dec 18, 2024

Website for hosting the Open Foundation Models Cheat Sheet.

JavaScript 261 19 Updated Jun 26, 2024

The official Meta Llama 3 GitHub site

Python 28,445 3,301 Updated Jan 26, 2025
Next
Showing results