Embodied Multimodal Knowledge Graph Integration for Open-World Object-Goal Navigation in Embodied Systems

vigorlee/Multimodal--RAG

Multimodal--RAG

Embodied Multimodal Knowledge Graph

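Download the SigLIP vision model: https://huggingface.co/google/siglip-so400m-patch14-384

Edit config/config.json and set vis_model_path (vision model) and model_LLM_path (language model) to the local paths of the downloaded models.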
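The config edit can also be scripted. The following is a minimal sketch, assuming that config/config.json is a JSON object and that vis_model_path and model_LLM_path are top-level keys; the README only names the keys, so the layout is an assumption. The helper name set_model_paths and the example paths are hypothetical.

```python
import json
from pathlib import Path


def set_model_paths(config_file, vis_model_path=None, model_llm_path=None):
    """Update the two path keys named in this README and keep every other key.

    Assumption: both keys live at the top level of the JSON object.
    """
    config_file = Path(config_file)
    # Start from the existing config when there is one, so other settings survive.
    if config_file.exists():
        config = json.loads(config_file.read_text(encoding="utf-8"))
    else:
        config = {}
    if vis_model_path is not None:
        config["vis_model_path"] = str(vis_model_path)
    if model_llm_path is not None:
        config["model_LLM_path"] = str(model_llm_path)
    config_file.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config


# Example call (hypothetical local paths):
# set_model_paths("config/config.json",
#                 vis_model_path="models/siglip-so400m-patch14-384",
#                 model_llm_path="models/MobileVLM_V2-1.7B")
```

Only the keys passed as arguments are changed, so the same helper can be reused when switching between language models.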

conda create -n rag python=3.10

conda activate rag (on conda versions older than 4.4, use source activate rag)

pip install torch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 --index-url https://download.pytorch.org/whl/cu121

pip install -r requirements.txt
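After installation, it can help to confirm that the pinned PyTorch packages are the ones actually present in the environment. The sketch below uses only the standard library; the helper name find_mismatches is hypothetical, and the pins are copied from the pip command above.

```python
from importlib import metadata

# Pins copied from the pip install command in this README.
PINNED = {"torch": "2.1.2", "torchvision": "0.16.2", "torchaudio": "2.1.2"}


def find_mismatches(pinned):
    """Return {package: installed_version_or_None} for packages that differ from the pin."""
    mismatches = {}
    for name, wanted in pinned.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed at all
        # CUDA wheels report versions such as "2.1.2+cu121"; compare the part before "+".
        if installed is None or installed.split("+")[0] != wanted:
            mismatches[name] = installed
    return mismatches


if __name__ == "__main__":
    problems = find_mismatches(PINNED)
    if problems:
        print(f"mismatched or missing packages: {problems}")
    else:
        print("all pinned packages match")
```

This checks versions only; it does not verify that CUDA is usable on the machine.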

Run MobileVLM

Download the model: https://huggingface.co/mtgv/MobileVLM_V2-1.7B

Set model_LLM_path in config/config.json to the downloaded model.

python server.py MobileVLM

Run MiniCPM

Download the model: https://huggingface.co/openbmb/MiniCPM-V

Set model_LLM_path in config/config.json to the downloaded model.

python server.py MiniCPM

Run LLaVA-HD

Download the model: https://huggingface.co/yifanzhang114/SliME-vicuna-7B

Set model_LLM_path in config/config.json to the downloaded model.

python server.py LLaVA-HD
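The three run recipes above follow the same pattern: download a model, point model_LLM_path at it, then start server.py with the model name. A minimal sketch of that pattern follows. It is not part of the repository. The helper names, the models directory, and the use of huggingface_hub for downloading are assumptions; the README only gives the model URLs and the server.py commands.

```python
import subprocess
import sys

# Names accepted by server.py in this README, mapped to their Hugging Face repos.
MODEL_REPOS = {
    "MobileVLM": "mtgv/MobileVLM_V2-1.7B",
    "MiniCPM": "openbmb/MiniCPM-V",
    "LLaVA-HD": "yifanzhang114/SliME-vicuna-7B",
}


def server_command(model_name):
    """Build the `python server.py <name>` command for one of the listed models."""
    if model_name not in MODEL_REPOS:
        raise ValueError(f"unknown model {model_name!r}; expected one of {sorted(MODEL_REPOS)}")
    return [sys.executable, "server.py", model_name]


def download_model(model_name, target_root="models"):
    """Fetch the model snapshot into a local folder (assumed layout: models/<repo name>)."""
    # Imported here so the rest of the sketch works without the package;
    # requires `pip install huggingface_hub`.
    from huggingface_hub import snapshot_download

    repo_id = MODEL_REPOS[model_name]
    local_dir = f"{target_root}/{repo_id.split('/')[-1]}"
    return snapshot_download(repo_id=repo_id, local_dir=local_dir)


def run_server(model_name):
    """Start server.py from the repository root; raises if the process exits with an error."""
    subprocess.run(server_command(model_name), check=True)


# Example (run from the repository root, after setting model_LLM_path):
# path = download_model("MobileVLM")
# run_server("MobileVLM")
```

Keeping the name-to-repo table in one place makes it harder to start server.py with a name that does not match the model configured in model_LLM_path.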
