O3 Mini support #1709
Conversation
> [!IMPORTANT]
> Add O3 Mini model support and update token handling in LLM configurations and API handlers.
>
> - **Behavior**:
>   - Add support for `OPENAI_O3_MINI` in `config_registry.py` with `max_completion_tokens=16384`, `temperature=None`, and `reasoning_effort="high"`.
>   - Update `llm_api_handler_with_router_and_fallback()` in `api_handler_factory.py` to include reasoning tokens in the `completion_tokens` calculation.
> - **Configuration**:
>   - Rename `max_output_tokens` to `max_completion_tokens` in `router.py`, `api_handler_factory.py`, and `config_registry.py`.
>   - Add `reasoning_effort` and `temperature` to `LLMConfig` and `LLMRouterConfig` in `models.py`.
> - **Misc**:
>   - Update `get_api_parameters()` in `api_handler_factory.py` to include `reasoning_effort` and `temperature` if present.
>
> <sup>This description was created by [Ellipsis](https://www.ellipsis.dev) for a8a4cc46d507592d53b376afa2eccd6a2765bf9d. It will automatically update as commits are pushed.</sup>
👍 Looks good to me! Incremental review on 94bcff5 in 4 minutes and 38 seconds
More details
- Looked at 134 lines of code in 3 files.
- Skipped 0 files when reviewing.
- Skipped posting 10 drafted comments based on config settings.
1. `skyvern/forge/sdk/api/llm/api_handler_factory.py:159`
   - Draft comment: Ensure logging at info level for `reasoning_tokens` is appropriate; info may clutter production logs.
   - Reason this comment was not posted: Marked as duplicate.
2. `skyvern/forge/sdk/api/llm/api_handler_factory.py:396`
   - Draft comment: Parameter keys updated (`max_completion_tokens`, `temperature`, `reasoning_effort`) should be consistent with model definitions.
   - Reason this comment was not posted: Marked as duplicate.
3. `skyvern/forge/sdk/api/llm/config_registry.py:87`
   - Draft comment: New provider `OPENAI_O3_MINI` config: verify that required and optional fields align with the provider's capabilities.
   - Reason this comment was not posted: Marked as duplicate.
4. `skyvern/forge/sdk/api/llm/models.py:39`
   - Draft comment: Renaming from `max_output_tokens` to `max_completion_tokens` looks consistent; ensure all usages in the project are updated.
   - Reason this comment was not posted: Marked as duplicate.
5. `skyvern/forge/sdk/api/llm/api_handler_factory.py:159`
   - Draft comment: Combining `reasoning_tokens` with `completion_tokens`: ensure this aggregation is intended and won't double-count. Consider logging only nonzero values.
   - Reason this comment was not posted: Comment did not seem useful.
6. `skyvern/forge/sdk/api/llm/api_handler_factory.py:396`
   - Draft comment: Updated API parameters now use `max_completion_tokens` and conditionally add `temperature` and `reasoning_effort`. Verify that any consumers relying on the old keys are updated accordingly.
   - Reason this comment was not posted: Comment did not seem useful.
7. `skyvern/forge/sdk/api/llm/config_registry.py:87`
   - Draft comment: New `OPENAI_O3_MINI` config: `temperature` is set to `None` and `reasoning_effort` to `"high"`. Ensure downstream handling respects that temperature is not applicable for O-models.
   - Reason this comment was not posted: Comment did not seem useful.
8. `skyvern/forge/sdk/api/llm/config_registry.py:83`
   - Draft comment: Token field renaming: `max_output_tokens` has been replaced with `max_completion_tokens`. Confirm that all code (e.g., cost calculations) now references the updated field.
   - Reason this comment was not posted: Marked as duplicate.
9. `skyvern/forge/sdk/api/llm/models.py:39`
   - Draft comment: `LLMConfig` update: renamed `max_output_tokens` to `max_completion_tokens` and added `temperature` and `reasoning_effort`. Verify that default values from `SettingsManager` are appropriate.
   - Reason this comment was not posted: Marked as duplicate.
10. `skyvern/forge/sdk/api/llm/models.py:77`
    - Draft comment: `LLMRouterConfig` update: `max_output_tokens` is now `max_completion_tokens`, along with new fields. Ensure consistency with `LLMConfig` and check downstream usage.
    - Reason this comment was not posted: Marked as duplicate.
Workflow ID: wflow_xUZOas1iHu7KKVBn
You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.
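The token aggregation questioned in draft comments 1 and 5 can be modeled in isolation. This is a sketch of the arithmetic only, not the real handler: the field layout assumes an OpenAI-style usage payload, where reasoning tokens are commonly reported under `completion_tokens_details` and may already be included in `completion_tokens`, which is exactly the double-counting risk the reviewer raises.

```python
def total_completion_tokens(usage: dict) -> int:
    """Fold reasoning tokens into the completion count, as the PR describes.

    `usage` is assumed to look like an OpenAI-style usage payload:
    {"completion_tokens": int,
     "completion_tokens_details": {"reasoning_tokens": int}}.
    """
    completion = usage.get("completion_tokens", 0)
    reasoning = usage.get("completion_tokens_details", {}).get("reasoning_tokens", 0)
    # Caveat from the review: if the provider already counts reasoning
    # tokens inside completion_tokens, this sum double-counts them.
    return completion + reasoning


usage = {
    "completion_tokens": 500,
    "completion_tokens_details": {"reasoning_tokens": 120},
}
print(total_completion_tokens(usage))  # 620
```

Keeping the two counts separate (as draft comment 5 suggests) would preserve clearer usage analytics; summing them is simpler but only safe when the provider reports the counts disjointly.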
👍 Looks good to me! Reviewed everything up to 94bcff5 in 5 minutes and 0 seconds
More details
- Looked at 134 lines of code in 3 files.
- Skipped 0 files when reviewing.
- Skipped posting 9 drafted comments based on config settings.
1. `skyvern/forge/sdk/api/llm/api_handler_factory.py:159`
   - Draft comment: Ensure the addition of `reasoning_tokens` to `completion_tokens` is intended. Also, revisit the TODO comment to fully support reasoning tokens.
   - Reason this comment was not posted: Comment did not seem useful.
2. `skyvern/forge/sdk/api/llm/api_handler_factory.py:396`
   - Draft comment: The parameter key update to `max_completion_tokens` reflects model changes. Confirm that downstream APIs expect this key.
   - Reason this comment was not posted: Comment did not seem useful.
3. `skyvern/forge/sdk/api/llm/config_registry.py:83`
   - Draft comment: The rename of `max_output_tokens` to `max_completion_tokens` is consistent. Verify that all dependent logic uses the new property name.
   - Reason this comment was not posted: Marked as duplicate.
4. `skyvern/forge/sdk/api/llm/models.py:39`
   - Draft comment: Renaming `max_output_tokens` to `max_completion_tokens` is consistent. Ensure documentation and usage elsewhere also reflect this change.
   - Reason this comment was not posted: Marked as duplicate.
5. `skyvern/forge/sdk/api/llm/api_handler_factory.py:159`
   - Draft comment: Consider whether summing `reasoning_tokens` with `completion_tokens` is the desired metric. Since the TODO indicates further work on token types, it might be useful to retain them separately for clearer usage analytics.
   - Reason this comment was not posted: Marked as duplicate.
6. `skyvern/forge/sdk/api/llm/api_handler_factory.py:396`
   - Draft comment: The updated `get_api_parameters` now uses `max_completion_tokens` (and adds `temperature` and `reasoning_effort`). Ensure all consumers and documentation reflect this change.
   - Reason this comment was not posted: Confidence changes required: 33%.
7. `skyvern/forge/sdk/api/llm/config_registry.py:87`
   - Draft comment: New configuration `OPENAI_O3_MINI` is added with `temperature` set to `None` and `reasoning_effort` set to `"high"`. Verify that using `OPENAI_API_KEY` and these specific parameters fit the provider's requirements.
   - Reason this comment was not posted: Confidence changes required: 33%.
8. `skyvern/forge/sdk/api/llm/config_registry.py:81`
   - Draft comment: Changing `max_output_tokens` to `max_completion_tokens` across configurations is consistent. Confirm that related tests and documentation are updated accordingly.
   - Reason this comment was not posted: Confidence changes required: 33%.
9. `skyvern/forge/sdk/api/llm/models.py:39`
   - Draft comment: The attribute change from `max_output_tokens` to `max_completion_tokens`, along with adding `temperature` and `reasoning_effort`, should be verified against API expectations and provider docs.
   - Reason this comment was not posted: Confidence changes required: 33%.
Workflow ID: wflow_wffN1KFYVMHDPVGs
You can customize Ellipsis with 👍 / 👎 feedback, review rules, user-specific overrides, quiet mode, and more.
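The conditional parameter assembly that several drafts above describe for `get_api_parameters()` can be sketched as follows. The dict-based config is a simplification of the real `LLMConfig` object, and the key names come from the PR description; the real function's signature may differ.

```python
def get_api_parameters(config: dict) -> dict:
    """Build provider call parameters from a config mapping.

    Always emits max_completion_tokens; adds temperature and
    reasoning_effort only when they are set, so models that do not
    support a parameter (e.g. temperature on O-series) never see it.
    """
    params = {"max_completion_tokens": config["max_completion_tokens"]}
    if config.get("temperature") is not None:
        params["temperature"] = config["temperature"]
    if config.get("reasoning_effort") is not None:
        params["reasoning_effort"] = config["reasoning_effort"]
    return params


print(get_api_parameters({
    "max_completion_tokens": 16384,
    "temperature": None,
    "reasoning_effort": "high",
}))  # {'max_completion_tokens': 16384, 'reasoning_effort': 'high'}
```

Dropping `None`-valued keys entirely, rather than forwarding them, is what keeps the O3 Mini config (where `temperature=None`) from sending an unsupported parameter to the provider.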
> [!IMPORTANT]
> Add support for O3 Mini model with reasoning tokens handling and update configurations and models accordingly.
>
> - Add `OPENAI_O3_MINI` in `config_registry.py` with `max_completion_tokens=16384`, `temperature=None`, and `reasoning_effort="high"`.
> - Update `llm_api_handler_with_router_and_fallback()` in `api_handler_factory.py` to handle `reasoning_tokens` by adding them to `completion_tokens`.
> - Add `reasoning_effort` to `LLMConfig` and `LLMRouterConfig` in `models.py`.
> - Rename `max_output_tokens` to `max_completion_tokens` in `LLMConfig` and `LLMRouterConfig` in `models.py`.
> - Update `get_api_parameters()` in `api_handler_factory.py` to include `reasoning_effort` if present.
>
> This description was created by Ellipsis for 94bcff5. It will automatically update as commits are pushed.
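The `max_output_tokens` to `max_completion_tokens` rename called out repeatedly in the drafts above is the kind of change that fails loudly at call sites. A minimal illustration, with `LLMRouterConfig` reduced to a hypothetical dataclass (the real class in `models.py` carries more fields):

```python
from dataclasses import dataclass


@dataclass
class LLMRouterConfig:
    """Hypothetical, reduced stand-in for the class in models.py."""

    model_name: str
    max_completion_tokens: int  # renamed from max_output_tokens in this PR


cfg = LLMRouterConfig(model_name="o3-mini", max_completion_tokens=16384)

# Any consumer still reading the old attribute name breaks immediately,
# which is why the drafts insist every usage, test, and doc be updated.
assert cfg.max_completion_tokens == 16384
assert not hasattr(cfg, "max_output_tokens")
```

An `AttributeError` at access time is the best case; sites that build parameter dicts with the old string key would fail more quietly, which is what the cost-calculation draft comment warns about.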