fix: add maxTokens to serve mode #1280

samirtahir91 · 2024-10-23T13:53:26Z

📑 Description

We need to set maxTokens for serve mode, this is not set by default like it is with k8sgpt auth, which defaults to 2048.
Backends like google gemini and googlevertexai fail without setting maxTokens when running in serve mode (so including k8sgpt operator).

This PR adds a new environment variable K8SGPT_MAX_TOKENS which you can set for serve, the default is set to 2048 if unset

✅ Checks

My pull request adheres to the code style of this project
My code requires changes to the documentation
I have updated the documentation as required
All the tests have passed

ℹ Additional Information

The k8sgpt operator will need to be updated to set maxTokens and providerId also in the k8sgpt CRD spec - injecting the env vars - see k8sgpt-ai/k8sgpt-operator#545

samirtahir91 · 2024-10-23T14:03:18Z

@AlexsJones Could you review please?

cmd/serve/serve.go

Signed-off-by: samir-tahir <[email protected]>

samirtahir91 · 2024-10-27T15:05:13Z

@AlexsJones - Are you happy to merge?

AlexsJones

Thanks for this, I think it's a good first step

samirtahir91 · 2024-10-31T10:08:50Z

Thanks for this, I think it's a good first step

No worries 😃, once merged can you review the change for the operator to adopt this in k8sgpt-ai/k8sgpt-operator#545

samirtahir91 · 2024-11-04T06:23:48Z

@AlexsJones Can you merge this please?

codecov · 2024-11-04T07:47:04Z

Codecov Report

Attention: Patch coverage is 0% with 12 lines in your changes missing coverage. Please review.

Project coverage is 34.65%. Comparing base (173e4dc) to head (9f68d30).
Report is 27 commits behind head on main.

Files with missing lines	Patch %	Lines
cmd/serve/serve.go	0.00%	12 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1280      +/-   ##
==========================================
- Coverage   34.76%   34.65%   -0.12%     
==========================================
  Files          94       95       +1     
  Lines        6342     6417      +75     
==========================================
+ Hits         2205     2224      +19     
- Misses       4046     4100      +54     
- Partials       91       93       +2

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

cmd/serve/serve.go

samirtahir91 requested review from a team as code owners October 23, 2024 13:53

samirtahir91 force-pushed the fix/add-max-tokens-to-serve branch from c5473f7 to 7305e8a Compare October 23, 2024 13:57

samirtahir91 force-pushed the fix/add-max-tokens-to-serve branch from 6815774 to 52b3784 Compare October 23, 2024 17:31

samirtahir91 mentioned this pull request Oct 23, 2024

fix: add providerId, maxTokens and topk to ai spec k8sgpt-ai/k8sgpt-operator#545

Merged

4 tasks

AlexsJones requested changes Oct 24, 2024

View reviewed changes

cmd/serve/serve.go Outdated Show resolved Hide resolved

fix: add maxTokens to serve mode

e7067b8

Signed-off-by: samir-tahir <[email protected]>

samirtahir91 force-pushed the fix/add-max-tokens-to-serve branch from 52b3784 to e7067b8 Compare October 24, 2024 06:25

samirtahir91 requested a review from AlexsJones October 24, 2024 20:46

samirtahir91 added 2 commits October 30, 2024 11:23

Merge branch 'main' into fix/add-max-tokens-to-serve

bc4bdad

Merge branch 'main' into fix/add-max-tokens-to-serve

5a84185

AlexsJones approved these changes Oct 31, 2024

View reviewed changes

AlexsJones and others added 2 commits October 31, 2024 08:32

Merge branch 'main' into fix/add-max-tokens-to-serve

afac15b

Merge branch 'main' into fix/add-max-tokens-to-serve

6ee2085

Merge branch 'main' into fix/add-max-tokens-to-serve

45f72b7

matthisholleville reviewed Nov 4, 2024

View reviewed changes

cmd/serve/serve.go Show resolved Hide resolved

samirtahir91 added 2 commits November 12, 2024 07:25

Merge branch 'main' into fix/add-max-tokens-to-serve

1357523

Merge branch 'main' into fix/add-max-tokens-to-serve

9f68d30

AlexsJones merged commit a50375c into k8sgpt-ai:main Nov 12, 2024
6 checks passed

github-actions bot mentioned this pull request Nov 12, 2024

chore(main): release 0.3.47 #1320

Merged

BrewTestBot mentioned this pull request Dec 2, 2024

k8sgpt 0.3.47 Homebrew/homebrew-core#199759

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: add maxTokens to serve mode #1280

fix: add maxTokens to serve mode #1280

samirtahir91 commented Oct 23, 2024 •

edited

Loading

samirtahir91 commented Oct 23, 2024

samirtahir91 commented Oct 27, 2024

AlexsJones left a comment

samirtahir91 commented Oct 31, 2024

samirtahir91 commented Nov 4, 2024

codecov bot commented Nov 4, 2024 •

edited

Loading

fix: add maxTokens to serve mode #1280

fix: add maxTokens to serve mode #1280

Conversation

samirtahir91 commented Oct 23, 2024 • edited Loading

📑 Description

✅ Checks

ℹ Additional Information

samirtahir91 commented Oct 23, 2024

samirtahir91 commented Oct 27, 2024

AlexsJones left a comment

Choose a reason for hiding this comment

samirtahir91 commented Oct 31, 2024

samirtahir91 commented Nov 4, 2024

codecov bot commented Nov 4, 2024 • edited Loading

Codecov Report

samirtahir91 commented Oct 23, 2024 •

edited

Loading

codecov bot commented Nov 4, 2024 •

edited

Loading