updates for vLLM==0.5.4 #82
http_server_task = loop.create_task(run_http_server(engine, args, model_config))
grpc_server_task = loop.create_task(
    run_grpc_server(
        engine, args, disable_log_stats=engine_args.disable_log_stats
Note that I've gotten rid of the `disable_log_stats` argument, as the `_force_log` coroutine is scheduled as part of the `lifespan` attribute of the FastAPI server.
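For readers unfamiliar with the pattern, here is a minimal sketch of scheduling a periodic logging coroutine from a lifespan-style async context manager. It uses only the standard library so it runs on its own. `force_log`, the `app` dictionary, and the intervals are hypothetical stand-ins, not the actual `_force_log` implementation or the server object from this PR.

```python
import asyncio
from contextlib import asynccontextmanager, suppress


async def force_log(log_lines, interval=0.01):
    # Hypothetical stand-in for a stats logger: appends one entry per interval.
    while True:
        await asyncio.sleep(interval)
        log_lines.append("stats")


@asynccontextmanager
async def lifespan(app):
    # Startup: schedule the periodic logger as a background task.
    task = asyncio.create_task(force_log(app["log_lines"]))
    app["task"] = task
    try:
        yield
    finally:
        # Shutdown: cancel the task and wait until it has actually stopped.
        task.cancel()
        with suppress(asyncio.CancelledError):
            await task


async def main():
    # Assumption: a plain dict plays the role of the application object.
    app = {"log_lines": []}
    async with lifespan(app):
        await asyncio.sleep(0.05)  # the server would be serving requests here
    return app


app = asyncio.run(main())
```

Because the task's lifetime is tied to the context manager, the caller that starts the server no longer needs to pass a logging flag through; whether logging runs can be decided where the task is created.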
Codecov Report
Attention: Patch coverage is
Additional details and impacted files
@@            Coverage Diff             @@
##             main      #82      +/-   ##
==========================================
- Coverage   62.73%   61.13%   -1.60%
==========================================
  Files          18       20       +2
  Lines        1280     1181      -99
  Branches      227      209      -18
==========================================
- Hits          803      722      -81
+ Misses        399      383      -16
+ Partials       78       76       -2
if (device_type := engine.engine.device_config.device.type) == "cuda":
    logger.info(memory_summary(engine.engine.device_config.device))
else:
    logger.warning("Cannot print device usage for device type: %s", device_type)
We cannot print the memory summary anymore, as this method is not available from the engine client.
Thanks @dtrifiro! It's nice that we can remove so much of the duplicated code now
Parametrization is disabled because of this error
adaptations for vLLM==0.5.4