What's Changed
This release introduces a number of exciting updates and improvements:
- Parallel LLM Calls: Replaced llama-index with the LangChain Expression Language (LCEL), enabling parallel LLM calls for increased efficiency.
- ChromaDB Integration: Transitioned from FAISS to ChromaDB to leverage metadata filtering and improve speed.
- Query Enhancer Optimization: Improved the query enhancer to operate with a single LLM call.
- Modular RAG Pipeline: Split the RAG pipeline into three distinct modules: query enhancement, retrieval, and response synthesis, for improved clarity and maintenance.
- Parent Document Retrieval: Introduced parent document retrieval within the retrieval module, so that responses draw on the fuller context surrounding each matched chunk.
- Sub-query Answering: Added sub-query answering capabilities in the response synthesis module to handle complex queries more effectively.
- API Restructuring: Redesigned the API into separate routers for retrieval, database, and chat operations.
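
As a rough illustration of the three-module split and of answering sub-queries in parallel, here is a minimal sketch. It uses plain `asyncio` rather than the framework wandbot actually uses, and every name in it (`fake_llm`, `enhance_query`, `retrieve`, `synthesize`, `run_pipeline`, `EnhancedQuery`) is hypothetical and not part of the wandbot API. The LLM and the retriever are stubbed out so the example runs on its own.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class EnhancedQuery:
    question: str
    sub_queries: list[str]


async def fake_llm(prompt: str) -> str:
    # Hypothetical stand-in for a real LLM call.
    await asyncio.sleep(0.01)
    return f"answer({prompt})"


async def enhance_query(question: str) -> EnhancedQuery:
    # Module 1: query enhancement. A real enhancer would derive sub-queries
    # from a single LLM call; this stub just splits on " and ".
    parts = [p.strip() for p in question.split(" and ") if p.strip()]
    return EnhancedQuery(question=question, sub_queries=parts)


async def retrieve(query: EnhancedQuery) -> dict[str, list[str]]:
    # Module 2: retrieval. Returns one placeholder document per sub-query.
    return {sq: [f"doc for {sq}"] for sq in query.sub_queries}


async def synthesize(query: EnhancedQuery, context: dict[str, list[str]]) -> str:
    # Module 3: response synthesis. Sub-queries are answered concurrently;
    # asyncio.gather returns results in the order the calls were passed in.
    answers = await asyncio.gather(
        *(fake_llm(f"{sq} | {context[sq][0]}") for sq in query.sub_queries)
    )
    return " ".join(answers)


async def run_pipeline(question: str) -> str:
    query = await enhance_query(question)
    context = await retrieve(query)
    return await synthesize(query, context)


result = asyncio.run(
    run_pipeline("how do I log metrics and how do I resume a run")
)
print(result)
```

Because each stage only consumes the previous stage's output, any one of them can be swapped or tested in isolation, which is the maintenance benefit the modular split is aiming for.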
Evaluation Results
| wandbot version | Comment | Response accuracy |
|---|---|---|
| 1.0.0 | our baseline wandbot | 53.78% |
| 1.1.0 | improvement over baseline; in production for the longest | 72.45% |
| 1.2.0 | our new enhanced wandbot | 81.63% |
Full Changelog: v1.1.0...v1.2.0