NVIDIA Metropolis Generative AI Workflows

Generative AI such as Large Language Models (LLMs) and Vision Language Models (VLMs) are used to build powerful workflows and agents to solve critical business problems. This repository hosts training materials, reference applications and workflows to build Visual AI Agents using NVIDIA NIM microservices. You can get started for free with no GPU required!

If you find this repository helpful, leave a star and share the link 🙂.

News 📰

10/31 - Technical Blog released for NIM Workflows
10/30 - VILA NIM is available for download from build.nvidia.com
10/1 - NV-CLIP NIM is available for download from build.nvidia.com.
9/25 - Llama 3.2 Vision NIM added to build.nvidia.com. Workflows updated to support Llama 3.2 Vision.
9/24 - Added VIA workflows
9/9 - VILA VLM NIM added to build.nvidia.com
8/21 - Added NVDINOv2 Few Shot Classification workflow
8/16 - Added Structured Text Extraction workflow
8/7 - Added NV-CLIP Semantic Search workflow
6/26 - Added VLM Alert workflow

Getting Started 🚀

To access NIM Microservices, visit build.nvidia.com to create an account and generate an API key. Each new account can receive up to 5,000 free credits. These credits will give you free access to preview APIs that allow you to use state of the art generative AI models such as Llama3.2 Vision, Nemotron, Mistral, VILA and much more with no GPU required!

Prerequisites

Go to build.nvidia.com and use your email to sign up.
After making an account, you can get an API Key by selecting any of the available NIMs then in the example code section, click on "Get API Key" then "Generate Key".

You will then see your API Key that will look something like "nvapi-xxx-xxxxxxxx_xxxxxxxxxxxxxxxx_xxxxxxxxxxx-xxxxxxx-xxxxxxxxxxxxxx". This full key is what will be used in the example notebooks and scripts.

You can now continue to explore the workflows in the next sections.

NIM microservice workflows

NVIDIA NIMs are GPU-accelerated AI models wrapped in an easy to use REST API interface. To help developers get started, each NIM has a hosted preview API that is accessible after generating an API token from build.nvidia.com. The preview APIs can be used for free to develop and experiment with state of the art AI models including LLMs, VLMs, Embedding and CV models.

The workflows listed under this section make use of the preview APIs and do not require a local GPU! You can run these workflows on nearly any computer and each one is quick and easy to launch. Each workflow includes a Jupyter notebook workshop that walks through how to build with NIM Preview APIs in Python.

Follow the links below to start running the workflows:

VIA microservice workflows

NVIDIA VIA Microservices are cloud-native building blocks to build AI agents capable of processing large amounts of live or archived videos and images with Vision-Language Models (VLM).

At a minimum VIA requires a NIM API Key and a local graphics card. A consumer RTX card is enough to get started.

First follow the VIA specific setup steps

VIA Setup

Then you can explore the following VIA workflows from this repository:

Changelog 📝

rel-2.2: Add support for locally hosted VILA NIM
rel-2.1: Add support for Llama 3.2 Vision
rel-2.0: Reorganize repository. Add VIA Workflows. Update READMEs.
rel-1.3.1: Update VLM Alert workflow with VILA
rel-1.3: Add NVDINOv2 Few Shot Classification workflow.
rel-1.2: Add Structured Text Extraction Workflow.
rel-1.1: Add Websocket server output for VLM Alert workflow. Add NV-CLIP Semantic Search workflow.
rel-1.0: Add VLM Alert workflow

Questions, Discussion, and Bugs ❔

If you find any bugs, have questions or want to start a discussion around the workflows feel free to file an issue. Any suggestions, feedback and new ideas are also welcome 🙂.

Related Resources 🔗

Relevant technical blogs that explore building AI Agents. Technical Blogs

Web Pages
Relevant web pages to find more information about NIMs, Metropolis and VIA.

GitHub Repositories
The following GitHub repositories include more examples of how to build with NIM microservices.

NVIDIA Developer Forums For questions and discussions, feel free to post under the appropriate topic on our developer forums.

Contributors ⭐

Thanks to the following people for contributing to our workflows:

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
nim_workflows		nim_workflows
readme_assets		readme_assets
via_workflows		via_workflows
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.md		LICENSE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NVIDIA Metropolis Generative AI Workflows

Table of Contents

News 📰

Getting Started 🚀

Prerequisites

NIM microservice workflows

VIA microservice workflows

Changelog 📝

Questions, Discussion, and Bugs ❔

Related Resources 🔗

Contributors ⭐

Star History

About

Releases

Packages

Languages

License

NVIDIA/metropolis-nim-workflows

Folders and files

Latest commit

History

Repository files navigation

NVIDIA Metropolis Generative AI Workflows

Table of Contents

News 📰

Getting Started 🚀

Prerequisites

NIM microservice workflows

VIA microservice workflows

Changelog 📝

Questions, Discussion, and Bugs ❔

Related Resources 🔗

Contributors ⭐

Star History

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages