Please review this doc. vLLM's server is OpenAI-compatible, meaning you can just use the openai Python library and point base_url at whatever inference server you're running.

from pydantic import BaseModel
import openai

class Testing(BaseModel):
    """
    A class representing a testing schema.
    """
    name: str
    age: int

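# Point the client at any local OpenAI-compatible server (vLLM, LM Studio, etc.)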
openai_client = openai.OpenAI(
    base_url="http://0.0.0.0:1234/v1",
    api_key="dopeness"
)

# Make a request to the local LM Studio server
response = openai_client.beta.chat.completions.parse(
    model="hugging-quants/Llama-3.2-1B-Instruct-Q8_0-GGUF",
    messages=[
        {"role": "system", "content": "You are like so good at whatever you do."},
        {"role": "user", "content": "My name is Cameron and …

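The snippet above is cut off by the page preview. For reference, a minimal complete sketch of the same pattern might look like the following; the user message, port, and api_key are placeholders, and it assumes an OpenAI-compatible server (vLLM, LM Studio, etc.) is running at that base_url with a model that supports structured outputs:

from pydantic import BaseModel
import openai


class Testing(BaseModel):
    """A class representing a testing schema."""
    name: str
    age: int


# Any OpenAI-compatible server works here; the port and api_key are
# whatever your local server expects.
openai_client = openai.OpenAI(
    base_url="http://0.0.0.0:1234/v1",
    api_key="dopeness",
)

# parse() sends the Pydantic model as a JSON schema in response_format
# and validates the model's reply against it.
response = openai_client.beta.chat.completions.parse(
    model="hugging-quants/Llama-3.2-1B-Instruct-Q8_0-GGUF",
    messages=[
        {"role": "system", "content": "You are like so good at whatever you do."},
        # Placeholder user message -- the original post is truncated here.
        {"role": "user", "content": "My name is Cameron and I am 28 years old."},
    ],
    response_format=Testing,
)

# The validated Pydantic object is available on .parsed
testing = response.choices[0].message.parsed
print(testing.name, testing.age)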
Answer selected by cpfiffer