When I run /RetrievalAugmentedGeneration/examples/developer_rag/chains.py #158

Suiji12 · 2024-08-12T07:12:13Z

My setting about rag-app-text-chatbot.yaml is services:
jupyter-server:
container_name: notebook-server
image: notebook-server:${TAG:-latest}
build:
context: ../../
dockerfile: ./notebooks/Dockerfile.notebooks # replace GPU enabled Dockerfile ./notebooks/Dockerfile.gpu_notebook
ports:
- "8888:8888"
expose:
- "8888"
deploy:
resources:
reservations:
devices:
- driver: nvidia
count: 1
capabilities: [gpu]

chain-server:
container_name: chain-server
image: chain-server:${TAG:-latest}
build:
context: ../../
dockerfile: ./RetrievalAugmentedGeneration/Dockerfile
args:
EXAMPLE_NAME: developer_rag
command: --port 8081 --host 0.0.0.0
environment:
APP_VECTORSTORE_URL: "http://milvus:19530"
APP_VECTORSTORE_NAME: "milvus"
APP_EMBEDDINGS_MODELNAME: ${APP_EMBEDDINGS_MODELNAME:-G:/jjx/moxing/snowflake-arctic-embed-l}
APP_EMBEDDINGS_MODELENGINE: ${APP_EMBEDDINGS_MODELENGINE:-local}
APP_EMBEDDINGS_SERVERURL: ${APP_EMBEDDINGS_SERVERURL:-""}
APP_LLM_SERVERURL: ${APP_LLM_SERVERURL:-""}
APP_LLM_MODELNAME: ${APP_LLM_MODELNAME:-"G:/jjx/moxing/llama-2-13b-chat-hf"}
APP_LLM_MODELENGINE: ${APP_LLM_MODELENGINE:-local}
NVIDIA_API_KEY: ${NVIDIA_API_KEY}
POSTGRES_PASSWORD: ${POSTGRES_PASSWORD:-password}
POSTGRES_USER: ${POSTGRES_USER:-postgres}
POSTGRES_DB: ${POSTGRES_DB:-api}
COLLECTION_NAME: ${COLLECTION_NAME:-developer_rag}
APP_RETRIEVER_TOPK: 4
APP_RETRIEVER_SCORETHRESHOLD: 0.25
OTEL_EXPORTER_OTLP_ENDPOINT: http://otel-collector:4317
OTEL_EXPORTER_OTLP_PROTOCOL: grpc
ENABLE_TRACING: false
APP_TEXTSPLITTER_MODELNAME: Snowflake/snowflake-arctic-embed-l
APP_TEXTSPLITTER_CHUNKSIZE: 506
APP_TEXTSPLITTER_CHUNKOVERLAP: 200
LOGLEVEL: ${LOGLEVEL:-INFO}
ports:
- "8081:8081"
expose:
- "8081"
shm_size: 5gb
deploy:
resources:
reservations:
devices:
- driver: nvidia
count: 1
capabilities: [gpu]

rag-playground:
container_name: rag-playground
image: rag-playground:${TAG:-latest}
build:
context: ../.././RetrievalAugmentedGeneration/frontend/
dockerfile: Dockerfile
command: --port 8090
environment:
APP_SERVERURL: http://chain-server
APP_SERVERPORT: 8081
APP_MODELNAME: ${APP_LLM_MODELNAME:-"meta/llama3-8b-instruct"}
OTEL_EXPORTER_OTLP_ENDPOINT: http://otel-collector:4317
OTEL_EXPORTER_OTLP_PROTOCOL: grpc
ENABLE_TRACING: false
RIVA_API_URI: ${RIVA_API_URI:-}
RIVA_API_KEY: ${RIVA_API_KEY:-}
RIVA_FUNCTION_ID: ${RIVA_FUNCTION_ID:-}
TTS_SAMPLE_RATE: ${TTS_SAMPLE_RATE:-48000}
ports:
- "8090:8090"
expose:
- "8090"
depends_on:
- chain-server

networks:
default:
name: nvidia-rag
What should I do?

Suiji12 · 2024-08-12T07:14:55Z

I met a error:C:\Users\jiaojiaxing.conda\envs\localgpt\python.exe E:\jiaojiaxing\GenerativeAIExamples\RetrievalAugmentedGeneration\examples\developer_rag\chains.py C:\Users\jiaojiaxing.conda\envs\localgpt\lib\site-packages\langchain_nvidia_ai_endpoints_common.py:172: UserWarning: An API key is required for the hosted NIM. This will become an error in the future. warnings.warn( C:\Users\jiaojiaxing.conda\envs\localgpt\lib\site-packages\langchain_nvidia_ai_endpoints_common.py:172: UserWarning: An API key is required for the hosted NIM. This will become an error in the future. warnings.warn( Traceback (most recent call last): File "E:\jiaojiaxing\GenerativeAIExamples\RetrievalAugmentedGeneration\examples\developer_rag\chains.py", line 40, in set_service_context() File "E:\jiaojiaxing\GenerativeAIExamples\RetrievalAugmentedGeneration\common\utils.py", line 115, in wrapper return func(*args_hashable, **kwargs_hashable) File "E:\jiaojiaxing\GenerativeAIExamples\RetrievalAugmentedGeneration\common\utils.py", line 122, in set_service_context llm = LangChainLLM(get_llm(**kwargs)) File "E:\jiaojiaxing\GenerativeAIExamples\RetrievalAugmentedGeneration\common\utils.py", line 115, in wrapper return func(*args_hashable, **kwargs_hashable) File "E:\jiaojiaxing\GenerativeAIExamples\RetrievalAugmentedGeneration\common\utils.py", line 265, in get_llm return ChatNVIDIA(model=settings.llm.model_name, File "C:\Users\jiaojiaxing.conda\envs\localgpt\lib\site-packages\langchain_nvidia_ai_endpoints\chat_models.py", line 243, in init self._client = _NVIDIAClient( File "C:\Users\jiaojiaxing.conda\envs\localgpt\lib\site-packages\langchain_nvidia_ai_endpoints_common.py", line 213, in init raise ValueError( ValueError: Model ensemble is unknown, check available_models Process finished with exit code 1

Suiji12 changed the title ~~When I run /RetrievalAugmentedGeneration/examples/developer_rag/chains~~ When I run /RetrievalAugmentedGeneration/examples/developer_rag/chains.py I Aug 12, 2024

Suiji12 changed the title ~~When I run /RetrievalAugmentedGeneration/examples/developer_rag/chains.py I~~ When I run /RetrievalAugmentedGeneration/examples/developer_rag/chains.py Aug 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

When I run /RetrievalAugmentedGeneration/examples/developer_rag/chains.py #158

When I run /RetrievalAugmentedGeneration/examples/developer_rag/chains.py #158

Suiji12 commented Aug 12, 2024 •

edited

Loading

Suiji12 commented Aug 12, 2024

When I run /RetrievalAugmentedGeneration/examples/developer_rag/chains.py #158

When I run /RetrievalAugmentedGeneration/examples/developer_rag/chains.py #158

Comments

Suiji12 commented Aug 12, 2024 • edited Loading

Suiji12 commented Aug 12, 2024

Suiji12 commented Aug 12, 2024 •

edited

Loading