New Models/Endpoints/Providers #4922

krrishdholakia · 2024-07-27T17:20:46Z

guleng · 2024-08-01T07:37:48Z

@krrishdholakia I have deployed the text graph model locally, but I am unable to configure it for use in Litellm. Besides Azure and other vendors' locally deployed text graph models, cannot they be used in Litellm?

Can you help me with how to configure it?

seonghobae · 2024-08-01T11:37:22Z

Please checkout Azure GPT-4o-mini ;) https://techcommunity.microsoft.com/t5/ai-azure-ai-services-blog/openai-s-gpt-4o-mini-now-available-in-api-with-vision/ba-p/4200640

guleng · 2024-08-01T11:40:35Z

@seonghobae Does it not support locally deployed Wenshengtu models? Does it only support models from Azure and OpenAI manufacturers?

krrishdholakia · 2024-08-01T16:04:12Z

@guleng - your error message from the provider shows a 404 not found error.

Is your provider openai-compatible? if so use this - https://docs.litellm.ai/docs/providers/openai_compatible
If custom api, use this - https://docs.litellm.ai/docs/providers/custom_llm_server

If error continues, file a separate issue for this - with config.yaml, steps to repro, and error traceback received.

lazyhope · 2024-08-03T06:52:42Z

Github just released a new inference endpoint: https://models.inference.ai.azure.com, it has free tier for popular models like gpt-4o so would be worth adding to litellm.

Marketplace: https://github.com/marketplace/models

krrishdholakia · 2024-08-03T16:13:24Z

added @lazyhope

lazyhope · 2024-08-03T16:35:22Z

added @lazyhope

Awesome! Thank you!

yigitkonur · 2024-09-24T04:35:47Z

Hey Llama 3.1 - 405B is added to AWS Bedrock - model ID is meta.llama3-1-405b-instruct-v1:0

I appreciate if you add support and update docs. Thanks!

krrishdholakia · 2024-09-24T04:42:41Z

it's already supported @yigitkonur

litellm/model_prices_and_context_window.json

Line 4725 in 9a5115d

"meta.llama3-1-405b-instruct-v1:0": {

litellm/litellm/llms/bedrock/chat/converse_handler.py

Line 43 in 9a5115d

"meta.llama3-1-405b-instruct-v1:0",

simonhir · 2024-09-24T05:37:17Z

Is there already a way to add embedding models via azure_ai? In specific cohere.embed-multilingual-v3.0
Couldn't find anything in the documentation or code and the cohere provider seams to not allow setting the api_base.

krrishdholakia · 2024-09-24T05:43:12Z

cohere.embed-multilingual-v3.0

that model name looks like the bedrock one @simonhir

Re: azure ai embedding - not yet, can you create an issue, i'll add it tomorrow

simonhir · 2024-09-24T05:58:23Z

@krrishdholakia thanks for your fast answer.
I created #5861. Hope that helps.

AdithyanI · 2024-09-25T10:03:31Z

@krrishdholakia Google just upgraded the gemini-1.5-pro. There is a new model gemini-1.5-pro-002 in GA. Also the flash. Can we support this please? We are specifically looking for in vertex_ai support. Thank you :)

https://console.cloud.google.com/vertex-ai/publishers/google/model-garden/gemini-1.5-pro-002?hl=en&authuser=1&rapt=AEjHL4O-gl6qi8hhL9DBsAA1wb_wVoUUEkXTkJeARCSgbGPb03zHz3AwlLM98ApP59b3JpuRF68lhNwehNMo6lq0CFkU4lc4AyjAXa-eVDAGfubP1PeTUP4&project=win-compute-407516

jbellis · 2024-09-25T14:21:18Z

Support for gemini embedding models, please. (text-embedding-004 is free!)

ClancyDennis · 2024-10-02T07:40:17Z

Are there any plans to add the /image/edits endpoint currently supported by Dalle-2?

krrishdholakia · 2024-10-02T14:38:27Z

Google just upgraded the gemini-1.5-pro. There is a new model gemini-1.5-pro-002 in GA. Also the flash. Can we support this please?

done -

litellm/model_prices_and_context_window.json

Line 2032 in e19bb55

"gemini-1.5-pro-002": {

Support for gemini embedding models, please. (text-embedding-004 is free!)

@jbellis can you create an issue for this? it would help to know if you're v0 is for google ai studio or vertex ai

Are there any plans to add the /image/edits endpoint currently supported by Dalle-2?

@ClancyDennis didn't realize that, please create an issue, so we can track this.

FireballDWF · 2024-12-25T01:05:39Z

Related to Add Amazon Titan Text Premier #3774

igorlima · 2024-12-27T03:05:47Z

Hey @jbellis and @krrishdholakia!

Support for gemini embedding models, please. (text-embedding-004 is free!) ^⁺
can you create an issue for this? ^⁺

Great news! litellm already has built-in support for Gemini embeddings, though it wasn't highlighted in the documentation until now. 🎉

I've just proposed that with the PR below, which adds Gemini to the documentation page's list of supported embeddings.

Add Gemini Embeddings Support

code snippet

Here's a quick and easy code snippet to show you how to use Gemini embeddings with litellm:

import os
os.environ["GEMINI_API_KEY"] = "xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx"

from litellm import embedding
response = embedding(
  model="gemini/text-embedding-004",
  input=["good morning from litellm"],
)
print(response)

KamilFatkhiev · 2025-01-22T10:32:07Z

Hi @krrishdholakia, are you considering adding support for yandexgpt and gigachat family models?

krrishdholakia · 2025-01-22T15:53:31Z

Hey @KamilFatkhiev the api's look openai compatible - https://developers.sber.ru/docs/ru/gigachat/api/reference/rest/post-chat

so should already work - https://docs.litellm.ai/docs/providers/openai_compatible

KamilFatkhiev · 2025-01-22T16:25:34Z

Hey @KamilFatkhiev the api's look openai compatible - https://developers.sber.ru/docs/ru/gigachat/api/reference/rest/post-chat

so should already work - https://docs.litellm.ai/docs/providers/openai_compatible

Oh, cool! What about yandexgpt? Are the differences too big?
https://yandex.cloud/en-ru/docs/foundation-models/text-generation/api-ref/TextGeneration/completion

krrishdholakia · 2025-01-22T16:38:31Z

@KamilFatkhiev i don't have access to it - but if there's a mock endpoint we can use for testing it should be possible to add it

krrishdholakia pinned this issue Jul 27, 2024

krrishdholakia closed this as completed in 4258295 Aug 3, 2024

krrishdholakia reopened this Aug 3, 2024

Manouchehri mentioned this issue Aug 6, 2024

[Feature] Add gpt-4o-mini pricing for Azure OpenAI #5072

Closed

krrishdholakia unpinned this issue Oct 3, 2024

igorlima mentioned this issue Dec 27, 2024

Update Documentation - Gemini Embedding #7436

Merged

igorlima mentioned this issue Dec 30, 2024

Supporting Vision models from Groq getomni-ai/zerox#65

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New Models/Endpoints/Providers #4922

New Models/Endpoints/Providers #4922

krrishdholakia commented Jul 27, 2024 •

edited

Loading

guleng commented Aug 1, 2024 •

edited

Loading

seonghobae commented Aug 1, 2024

guleng commented Aug 1, 2024 •

edited

Loading

krrishdholakia commented Aug 1, 2024 •

edited

Loading

lazyhope commented Aug 3, 2024

krrishdholakia commented Aug 3, 2024

lazyhope commented Aug 3, 2024

yigitkonur commented Sep 24, 2024

krrishdholakia commented Sep 24, 2024

simonhir commented Sep 24, 2024

krrishdholakia commented Sep 24, 2024

simonhir commented Sep 24, 2024

AdithyanI commented Sep 25, 2024

jbellis commented Sep 25, 2024

ClancyDennis commented Oct 2, 2024

krrishdholakia commented Oct 2, 2024

FireballDWF commented Dec 25, 2024 •

edited

Loading

igorlima commented Dec 27, 2024

KamilFatkhiev commented Jan 22, 2025

krrishdholakia commented Jan 22, 2025

KamilFatkhiev commented Jan 22, 2025

krrishdholakia commented Jan 22, 2025

New Models/Endpoints/Providers #4922

New Models/Endpoints/Providers #4922

Comments

krrishdholakia commented Jul 27, 2024 • edited Loading

guleng commented Aug 1, 2024 • edited Loading

seonghobae commented Aug 1, 2024

guleng commented Aug 1, 2024 • edited Loading

krrishdholakia commented Aug 1, 2024 • edited Loading

lazyhope commented Aug 3, 2024

krrishdholakia commented Aug 3, 2024

lazyhope commented Aug 3, 2024

yigitkonur commented Sep 24, 2024

krrishdholakia commented Sep 24, 2024

simonhir commented Sep 24, 2024

krrishdholakia commented Sep 24, 2024

simonhir commented Sep 24, 2024

AdithyanI commented Sep 25, 2024

jbellis commented Sep 25, 2024

ClancyDennis commented Oct 2, 2024

krrishdholakia commented Oct 2, 2024

FireballDWF commented Dec 25, 2024 • edited Loading

igorlima commented Dec 27, 2024

KamilFatkhiev commented Jan 22, 2025

krrishdholakia commented Jan 22, 2025

KamilFatkhiev commented Jan 22, 2025

krrishdholakia commented Jan 22, 2025

krrishdholakia commented Jul 27, 2024 •

edited

Loading

guleng commented Aug 1, 2024 •

edited

Loading

guleng commented Aug 1, 2024 •

edited

Loading

krrishdholakia commented Aug 1, 2024 •

edited

Loading

FireballDWF commented Dec 25, 2024 •

edited

Loading