Eval bug: Slow model loading w/ mmap #10478

hg0428 · 2024-11-25T03:53:21Z

Name and Version

all recent versions

Which operating systems do you know to be affected?

Mac

GGML backends

Metal

Hardware

Apple Silicon. M3 Max

Model

Any big model such as Mixtral 8x7b.

Steps to Reproduce

Just load it with mmap and notice how much slower it is compared to without mmap.
See original issue here: #9244 (comment)

First Bad Commit

idk

Relevant log output

it just loads for a long time.

xgdgsc · 2024-12-06T08:35:32Z

How many ram do you have? Is the second run also same slow? Otherwise it' s expected mmap behavior.

hg0428 · 2024-12-09T02:53:02Z

How many ram do you have? Is the second run also same slow? Otherwise it' s expected mmap behavior.

I have 36gb ram (32 available to GPU). I tested with multiple large models, such as Mixtral 8x7b q4_k_m. Yes, it is still slow on the second run.
I don't mean just a little slow. It's >20x slower. If this is expected mmap behavior then it should be disabled for large models because I don't like having to wait minutes for it to load.

xgdgsc · 2024-12-09T12:05:33Z

Or just use a frontend like Ollama and Continue extension to set the config of useMmap for each model https://docs.continue.dev/reference.

hg0428 · 2024-12-09T13:03:58Z

Or just use a frontend like Ollama and Continue extension to set the config of useMmap for each model https://docs.continue.dev/reference.

I already know to disable it; I'm just saying I don't think it should be this slow just from mmap. With smaller models (~<40b q4) the difference is negligible.

github-actions · 2025-01-24T01:07:10Z

This issue was closed because it has been inactive for 14 days since being marked as stale.

hg0428 · 2025-01-24T02:04:11Z

This was not completed!

hg0428 added the bug-unconfirmed label Nov 25, 2024

github-actions bot added the stale label Jan 9, 2025

github-actions bot closed this as completed Jan 24, 2025

hg0428 mentioned this issue Jan 24, 2025

Eval bug: Slow model loading w/ mmap #11382

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Eval bug: Slow model loading w/ mmap #10478

Eval bug: Slow model loading w/ mmap #10478

hg0428 commented Nov 25, 2024

xgdgsc commented Dec 6, 2024

hg0428 commented Dec 9, 2024

xgdgsc commented Dec 9, 2024

hg0428 commented Dec 9, 2024

github-actions bot commented Jan 24, 2025

hg0428 commented Jan 24, 2025

Eval bug: Slow model loading w/ mmap #10478

Eval bug: Slow model loading w/ mmap #10478

Comments

hg0428 commented Nov 25, 2024

Name and Version

Which operating systems do you know to be affected?

GGML backends

Hardware

Model

Steps to Reproduce

First Bad Commit

Relevant log output

xgdgsc commented Dec 6, 2024

hg0428 commented Dec 9, 2024

xgdgsc commented Dec 9, 2024

hg0428 commented Dec 9, 2024

github-actions bot commented Jan 24, 2025

hg0428 commented Jan 24, 2025