24 Nov 14:22

danielfleischer

1a6ab10

v3.1.1 Latest

What's Changed

Relax dependencies, add Streaming Callback by @dnoliver in #71
OpenVINO Serialization fix by @danielfleischer in #73

New Contributors

@dnoliver made their first contribution in #71

Full Changelog: v3.1.0...v3.1.1

Contributors

dnoliver and danielfleischer

07 Nov 14:21

danielfleischer

v3.1.0

8cf1762

v3.1.0

What's Changed

Update llava.py by @mosheber in #54
Remove indexing function by @mosheber in #55
IPEX benchmarking fix by @peteriz in #58
Removing Handlers with Phi3.5 Support by @mosheber in #59
replaced list[str] with List[str] by @mosheber in #67
Adding files for multi modal pipeline by @mosheber in #68
Lazy initialization of OVModel by @danielfleischer in #66
update protobuf version to 5.28.3 by @mosheber in #70
Update one link in nutrition_data.json by @bilgeyucel in #72

New Contributors

@bilgeyucel made their first contribution in #72

Full Changelog: v3.0.2...v3.1.0

Contributors

peteriz, danielfleischer, and 2 other contributors

09 Jul 09:49

peteriz

v3.0.2

722860d

v3.0.2

What's Changed

Fix IPEX embedders performance by @peteriz in #52
Fix support python versions by @peteriz in #53

Full Changelog: v3.0.1...v3.0.2

Contributors

peteriz

02 Jul 13:38

peteriz

v3.0.1

07ceba6

v3.0.1

What's Changed

Gaudi Generator by @mosheber in #50
Adding pypi packaging support by @peteriz in #51

Full Changelog: v3.0...v3.0.1

Contributors

peteriz and mosheber

22 May 15:13

danielfleischer

v3.0

8087aea

v3.0.0

Compatibility with Haystack v2

⚡ All our classes are now compatible with 🤖 Haystack v2, including the example notebooks and yaml pipeline configurations.
💻 We based our demos on the Chainlit UI library; examples include RAG chat with multi-modality! 🖼️

❤️ Feel free to report any issue, bug or question!

24 Dec 16:24

danielfleischer

v2.0

373e546

v2.0.0

fastRAG 2.0: Let's do RAG Efficiently 🔥

fastRAG 2.0 includes new highly-anticipated efficiency-oriented components, an updated chat-like demo experience with multi-modality and improvements to existing components.

The library now uses Intel optimizations, namely Intel extensions for PyTorch (IPEX), 🤗 Optimum Intel and 🤗 Optimum-Habana, to run as efficiently as possible on Intel® Xeon® Processors and Intel® Gaudi® AI accelerators.

🚀 Intel Habana Gaudi 1 and Gaudi 2 Support

fastRAG is the first RAG framework to support Habana Gaudi accelerators for running LLMs efficiently; more details here.

🌀 Running LLMs with the ONNX Runtime and LlamaCPP Backends

Added support for running quantized LLMs on ONNX Runtime and LlamaCPP, for higher efficiency and speed in your RAG pipelines.

⚡ CPU Efficient Embedders

We added support for running bi-encoder embedders and cross-encoder rankers as efficiently as possible on Intel CPUs using Intel-optimized software.

We integrated the optimized embedders into the following two components:

QuantizedBiEncoderRanker - bi-encoder ranker; encodes the documents provided in the input and re-orders them according to their similarity to the query.
QuantizedBiEncoderRetriever - bi-encoder retriever; encodes documents into vectors for a given vector store engine.
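
As a rough illustration of what bi-encoder ranking means, the sketch below encodes the query and each document independently and re-orders the documents by cosine similarity. It is not the fastRAG API: `rank_documents` and `toy_embed` are hypothetical names, and the word-count embedding is only a stand-in for a real (quantized) embedding model.

```python
import math
from typing import Callable, List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_documents(
    query: str,
    documents: List[str],
    embed: Callable[[str], List[float]],
) -> List[Tuple[str, float]]:
    """Encode query and documents separately, then sort by similarity (descending)."""
    query_vec = embed(query)
    scored = [(doc, cosine(query_vec, embed(doc))) for doc in documents]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


def toy_embed(text: str) -> List[float]:
    """Hypothetical stand-in for an embedding model: counts of a few fixed words."""
    vocab = ["gaudi", "xeon", "retriever", "ranker"]
    words = text.lower().split()
    return [float(words.count(w)) for w in vocab]


docs = ["xeon ranker", "gaudi retriever", "ranker ranker xeon"]
ranked = rank_documents("gaudi retriever", docs, toy_embed)
for doc, score in ranked:
    print(f"{score:.3f}  {doc}")
```

Because the query and the documents are encoded independently, document vectors can be computed once and stored, which is what makes the same encoder usable for retrieval against a vector store.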

⏳ REPLUG

An implementation of REPLUG, a technique for ensemble prompting: the LLM is prompted with each retrieved document in parallel, and the resulting next-token predictions are combined for better results.
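
The combination step can be sketched as a weighted average of per-document next-token distributions. This is a minimal sketch under assumptions, not fastRAG's implementation: the function name `replug_ensemble` is hypothetical, the distributions are hand-written toy values rather than model outputs, and the weights are assumed to be a softmax over retrieval scores.

```python
import math
from typing import Dict, List


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def replug_ensemble(
    retrieval_scores: List[float],
    per_doc_next_token_probs: List[Dict[str, float]],
) -> Dict[str, float]:
    """Mix next-token distributions, one per retrieved document.

    Each distribution is assumed to come from prompting the LLM with a single
    document prepended to the query; weights are softmax-normalized scores.
    """
    weights = softmax(retrieval_scores)
    combined: Dict[str, float] = {}
    for weight, probs in zip(weights, per_doc_next_token_probs):
        for token, p in probs.items():
            combined[token] = combined.get(token, 0.0) + weight * p
    return combined


# Toy values (assumptions): two retrieved documents and a two-token vocabulary.
scores = [2.0, 1.0]
dists = [
    {"Paris": 0.9, "Lyon": 0.1},  # next-token distribution given document 1
    {"Paris": 0.4, "Lyon": 0.6},  # next-token distribution given document 2
]
combined = replug_ensemble(scores, dists)
best = max(combined, key=combined.get)
print(best, combined)
```

In a real pipeline this mixing would be repeated at every decoding step, with the per-document forward passes run as a batch.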

🏆 New Demos

We updated our demos (and demo page) with two new demos that offer a chat-like experience and multi-modal RAG.

🐠 Enhancements

Added documentation for most models and components, containing examples and notebooks ready to run!
Support for the Fusion-in-Decoder (FiD) model using a dedicated invocation layer.
Various bug fixes and compatibility updates supporting the Haystack framework.

Full Changelog: v1.3.0...v2.0

20 Jun 12:06

danielfleischer

v1.3.0

81b7a1a

v1.3.0

What's Changed

ColBERT Upstream Updates by @danielfleischer in #19

Full Changelog: v1.2.1...v1.3.0

Contributors

danielfleischer

13 Jun 10:57

peteriz

v1.2.1

bf4df83

v1.2.1

What's Changed

Update plaid_colbert_pipeline.ipynb by @mosheber in #17
Update colbert.py by @mosheber in #18

Full Changelog: v1.2.0...v1.2.1

Contributors

mosheber

21 May 11:05

danielfleischer

v1.2.0

af86bc2

v1.2.0: New: Retrieval Augmented Generation with LLM

Retrieval Augmented Generation with LLM Demo (#16)

- Added a new RAG + prompt + LLM UI (demo).
- Added an example config and notebook.
- Updated main README with "updates" sub-section.
- Updated `run_demo.py` to include all the options to run a demo (UI, UI + service, UI + <user_defined_service>)

Releases: IntelLabs/fastRAG
