perf(flux2): optimize cache locking in Klein encoder to fix #7513 by girlyoulookthebest · Pull Request #8863 · invoke-ai/InvokeAI

girlyoulookthebest · 2026-02-08T09:16:26Z

Summary

Addresses a race condition in the ModelCache where models are evicted prematurely on low-VRAM systems (Issue #7513).

Refactored invoke() to hold a single ExitStack for the entire duration of the node's execution. This ensures the Text Encoder remains locked in RAM while the conditioning data is being saved.
Reordered the loading sequence in _encode_prompt. The Text Encoder is now locked immediately upon retrieval, closing the window where it could be evicted while the Tokenizer is being fetched.
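The locking order described above can be sketched with a toy cache. This is an illustration only, not the code from this PR: `ToyCache`, `locked`, `make_room`, and `encode_and_save` are hypothetical names, and the real ModelCache API may differ. The sketch assumes a lock is a context manager and that eviction skips locked entries.

```python
from contextlib import ExitStack, contextmanager


class ToyCache:
    """Hypothetical stand-in for a model cache (not InvokeAI's ModelCache)."""

    def __init__(self):
        self.locks = {}    # model name -> number of active locks
        self.evicted = []  # names of evicted models, in eviction order

    @contextmanager
    def locked(self, name):
        # Hold a lock on `name` for the lifetime of the context.
        self.locks[name] = self.locks.get(name, 0) + 1
        try:
            yield name
        finally:
            self.locks[name] -= 1

    def make_room(self, name):
        # Simulate memory pressure: evict `name` unless it is locked.
        if self.locks.get(name, 0) > 0:
            return False
        self.evicted.append(name)
        return True


def encode_and_save(cache):
    # One ExitStack spans the whole operation, so every lock taken here
    # is released only after the conditioning has been "saved".
    with ExitStack() as stack:
        # Lock the text encoder as soon as it is retrieved.
        encoder = stack.enter_context(cache.locked("text_encoder"))
        # Memory pressure while the tokenizer is being fetched.
        evicted_during_fetch = cache.make_room("text_encoder")
        tokenizer = stack.enter_context(cache.locked("tokenizer"))
        conditioning = f"{encoder}+{tokenizer}"
        # Memory pressure while the conditioning data is being saved.
        evicted_during_save = cache.make_room("text_encoder")
    return conditioning, evicted_during_fetch, evicted_during_save
```

In this sketch both eviction attempts inside the `with` block fail because the encoder is locked, and the encoder becomes evictable again only after the stack exits.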

Related Issues / Discussions

Issue #7513

QA Instructions

Use a system with tight VRAM where cache eviction is likely.
Run a generation using the Flux2 Klein text encoder.
In the developer log, confirm that the warning no longer appears.
For performance, confirm that the text encoder is loaded once and stays in RAM for the duration of the node, rather than being reloaded from disk repeatedly.

Merge Plan

Safe to merge immediately. This is an architectural optimization restricted to the flux2_klein_text_encoder node and does not affect other pipelines.

Checklist

The PR has a short but descriptive title, suitable for a changelog
Tests added / updated (if applicable)
❗Changes to a redux slice have a corresponding migration
Documentation added / updated (if applicable)
Updated What's New copy (if doing a release after this PR)

perf(flux2): optimize model loading order to prevent cache eviction (fixes invoke-ai#7513)

de028d2

girlyoulookthebest requested review from JPPhoto, blessedcoolant, dunkeroni and lstein as code owners February 8, 2026 09:16

github-actions bot added the python (PRs that change python files) and invocations (PRs that change invocations) labels Feb 8, 2026

Update flux2_klein_text_encoder.py

35cd1b7

girlyoulookthebest wants to merge 2 commits into invoke-ai:main from girlyoulookthebest:perf/flux2-klein-cache-optimize
