
Setting num-gpus to 0 causes a division by 0 error #83

Open
maxdebayser opened this issue Aug 6, 2024 · 1 comment
Labels
kind/bug Something isn't working

Comments

@maxdebayser
Contributor

This is a very low-priority bug; I'm just taking notes here for the backlog.

If you run python -m vllm_tgis_adapter --num-gpus 0 --device cpu, it causes this error:

Traceback (most recent call last):
  File "<frozen runpy>", line 198, in _run_module_as_main
  File "<frozen runpy>", line 88, in _run_code
  File "/home/mbayser/IBMProjects/FoundationModels/inference/vllm-tgis-adapter/build/__editable__.vllm_tgis_adapter-0.0.0-py3-none-any/vllm_tgis_adapter/__main__.py", line 59, in <module>
    engine = AsyncLLMEngine.from_engine_args(
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/mbayser/IBMProjects/FoundationModels/inference/vllm/build/__editable__.vllm-0.5.4+cpu-cp311-cp311-linux_x86_64/vllm/engine/async_llm_engine.py", line 462, in from_engine_args
    engine_config = engine_args.create_engine_config()
                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/mbayser/IBMProjects/FoundationModels/inference/vllm/build/__editable__.vllm-0.5.4+cpu-cp311-cp311-linux_x86_64/vllm/engine/arg_utils.py", line 865, in create_engine_config
    return EngineConfig(
           ^^^^^^^^^^^^^
  File "<string>", line 15, in __init__
  File "/home/mbayser/IBMProjects/FoundationModels/inference/vllm/build/__editable__.vllm-0.5.4+cpu-cp311-cp311-linux_x86_64/vllm/config.py", line 1644, in __post_init__
    self.model_config.verify_with_parallel_config(self.parallel_config)
  File "/home/mbayser/IBMProjects/FoundationModels/inference/vllm/build/__editable__.vllm-0.5.4+cpu-cp311-cp311-linux_x86_64/vllm/config.py", line 273, in verify_with_parallel_config
    if total_num_attention_heads % tensor_parallel_size != 0:
       ~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~
ZeroDivisionError: integer modulo by zero

num-gpus should be ignored completely if the device is cpu. But even if the device is gpu, there should be a validation error instead of a zero division error.
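For illustration, here is a minimal sketch of the kind of guard that could replace the bare modulo check. The function name and parameters are borrowed from the traceback above; the exact signature inside vllm differs (the real method lives on ModelConfig and takes a ParallelConfig object), so treat this as a hypothetical standalone version:

```python
def verify_with_parallel_config(
    total_num_attention_heads: int,
    tensor_parallel_size: int,
    device: str,
) -> None:
    """Sketch of a validation that fails with a clear error message
    instead of a ZeroDivisionError when tensor_parallel_size is 0."""
    if device == "cpu":
        # Tensor parallelism (num-gpus) is irrelevant on CPU: skip the check.
        return
    if tensor_parallel_size <= 0:
        raise ValueError(
            "tensor_parallel_size must be a positive integer, "
            f"got {tensor_parallel_size}"
        )
    if total_num_attention_heads % tensor_parallel_size != 0:
        raise ValueError(
            f"Total number of attention heads ({total_num_attention_heads}) "
            f"must be divisible by tensor_parallel_size ({tensor_parallel_size})."
        )
```

With this guard, --num-gpus 0 --device cpu would be silently accepted, and --num-gpus 0 on a GPU device would raise a ValueError explaining the constraint rather than an integer-modulo-by-zero crash.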

@maxdebayser maxdebayser added the kind/bug Something isn't working label Aug 6, 2024
@dtrifiro
Contributor

dtrifiro commented Aug 8, 2024

This looks like an upstream issue tbh
