Any way to use local models? #2

Open
Neltherion opened this issue Oct 12, 2024 · 2 comments
Comments

@Neltherion

I'm really tired of OpenAI/Groq/... API keys and having to call their APIs over the network. Is there any way to spin up a local LLM/vision model and direct the queries to it?

@simularlyon
Collaborator

Yes, you can use Ollama for local models.
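A minimal sketch of the Ollama route, under stated assumptions: Ollama exposes an OpenAI-compatible endpoint at http://localhost:11434/v1, so you can first confirm the local server responds with the standard openai client before wiring it into the agent. The model name llava is only an example, and nothing here is project-specific API.

# Minimal sketch: query a locally running Ollama server through its
# OpenAI-compatible endpoint. Assumes `ollama serve` is running and the
# example model has already been pulled with `ollama pull llava`.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:11434/v1",  # Ollama's OpenAI-compatible API
    api_key="ollama",                      # required by the client, ignored by Ollama
)

response = client.chat.completions.create(
    model="llava",  # example local vision model; substitute your own
    messages=[{"role": "user", "content": "Describe what a GUI agent does."}],
)
print(response.choices[0].message.content)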

@saa1605
Collaborator

saa1605 commented Oct 14, 2024

The easiest way in the current setup is to use vLLM (https://docs.vllm.ai/en/latest/serving/openai_compatible_server.html) to load and host a model locally on your machine. After that, you can use LMMEnginevLLM by passing the required engine_params and pointing vLLM_ENDPOINT_URL at the locally hosted server; a sketch putting both steps together follows the list below.

  1. Export the vLLM_ENDPOINT_URL:
export vLLM_ENDPOINT_URL=YOUR_URL_ADDRESS
  2. Specify engine_params to use vllm before instantiating the GraphSearchAgent:
engine_params = {'engine_type': 'vllm', 'model': 'YOUR_HOSTED_MODEL_NAME'}
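Putting the two steps together, a minimal sketch: the endpoint URL, the port, and the GraphSearchAgent import path and constructor arguments shown here are assumptions, not confirmed project API, so check the repository's README for the exact signature.

# Minimal sketch: point the agent at a locally hosted vLLM server.
# Assumes the OpenAI-compatible server was started with something like:
#   vllm serve YOUR_HOSTED_MODEL_NAME --port 8000
import os

# Step 1: tell LMMEnginevLLM where the local server lives.
os.environ["vLLM_ENDPOINT_URL"] = "http://localhost:8000/v1"  # example URL; adjust host/port/path to your server

# Step 2: select the vllm engine before instantiating the agent.
engine_params = {
    "engine_type": "vllm",
    "model": "YOUR_HOSTED_MODEL_NAME",  # must match the model being served by vLLM
}

# Hypothetical instantiation: the import path and constructor arguments of
# GraphSearchAgent may differ in the actual repo; adjust as needed.
from gui_agents.core.AgentS import GraphSearchAgent  # import path is an assumption
agent = GraphSearchAgent(engine_params=engine_params)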
