-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add mistral backend #5
Conversation
from benchmark_llm_serving.io_classes import QueryOutput, QueryInput | ||
|
||
|
||
class BackEnd(): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why this class is defined in first place?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In order to force the declaration of some methods in the class which inherits from BackEnd, and to declare common methods as well.
args.prompt_length = input_length | ||
args.output_length = output_length | ||
args.output_file = os.path.join(raw_results_folder, f"kv_cache_profile_input_{input_length}_output_{output_length}.json") | ||
if backend.backend_name == "happy_vllm": |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why this "if" statement is needed?
This PR add the mistral backend.
It also fixes some incompatibilities introduced with the upgrade of vLLM in happy_vLLM