Add Redis semantic cache feature #758

lordofthejars · 2024-07-18T10:25:53Z

One of the things it is going to be interesting to have out of the box about AI is the semantic cache of requests.

Actually, it could be used in any method but according to a recent study, 31% of queries to LLM can be cached (or, in other words, 31% of the queries are contextually repeatable), which can significantly improve response time in GenAI apps.

I created a simple example that implements this with Redis: https://github.com/lordofthejars-ai/quarkus-langchain-examples/tree/main/semantic-cache

Do you think it might be interesting to integrate this into Quarkus Cache system for example as Redis-semantic-cache or something like this?

geoand · 2024-07-18T10:27:59Z

I think @iocanel and @andreadimaio were thinking of something similar

andreadimaio · 2024-07-18T10:32:59Z

Yes, what we have here #659 is a concept of semantic cache.
The idea is to have something very similar to ChatMemory, so you can extend the default implementation (in-memory) with other products like Redis or something else.

lordofthejars · 2024-07-18T10:35:22Z

Great, feel free to take a view in my example, you'll see that code to do it is not complex, some configuration parameters that's true. Calculating Keys is easy as by default Quarkus cache offers the interface to override the creation of keys. The problem is in the code to check if it is a cache miss or not.

geoand · 2024-07-18T10:36:01Z

Thanks for the input.

Closing as duplicate in light of the conversation above.

geoand closed this as not planned Won't fix, can't repro, duplicate, stale Jul 18, 2024

geoand added duplicate This issue or pull request already exists enhancement New feature or request labels Jul 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Redis semantic cache feature #758

Add Redis semantic cache feature #758

lordofthejars commented Jul 18, 2024

geoand commented Jul 18, 2024

andreadimaio commented Jul 18, 2024 •

edited

Loading

lordofthejars commented Jul 18, 2024

geoand commented Jul 18, 2024

Add Redis semantic cache feature #758

Add Redis semantic cache feature #758

Comments

lordofthejars commented Jul 18, 2024

geoand commented Jul 18, 2024

andreadimaio commented Jul 18, 2024 • edited Loading

lordofthejars commented Jul 18, 2024

geoand commented Jul 18, 2024

andreadimaio commented Jul 18, 2024 •

edited

Loading