[Feature]: Deep dive alerts #795

matthisholleville · 2023-12-19T14:30:02Z

Checklist

I've searched for similar issues and couldn't find anything matching
I've discussed this feature request in the K8sGPT Slack and got positive feedback

Is this feature request related to a problem?

No

Problem Description

When monitoring my Kubernetes clusters, I often receive alerts due to a pod being in a crash loop or other issues for example. Investigating these alerts requires repetitive tasks such as log analysis, pod description, etc. K8sgpt allows me to anticipate some alerts by continuously analyzing my objects, but could we do even better?

Solution Description

Who hasn't dreamed of having a tool that can perform an initial investigation into one or more alerts during oncall rotation ? The solution I propose is to leverage OpenAI's assistant system (or a similar pattern) to transition from a proactive mode to a reactive mode specific to an alert. The architecture would be as follows:

AlertManager sends one or more alerts to the K8SGPT operator via http.
K8SGPT uses the OpenAI assistant system to determine which functions (or executors) it needs to execute to doing error analysis.
With the results of the functions (or executors), we conduct an analysis and provide the user with an initial investigation of the error.

Benefits

The benefits would be multiple:

Improved on-call process: Alerts received at night (when we are not operational at 100%) are automatically investigated.
Reduction of repetitive tasks and the possibility of errors (accidentally running a command in the middle of the night).
Automatic initiation and drafting of a post-mortem based on the analysis.

Potential Drawbacks

The drawbacks could be the compatibility of the solution with all AI systems supported by K8SGPT. Given that the concept of OpenAI's assistant has just been introduced, it needs to be verified whether this concept will become a standard in the future. Otherwise, the system may need to be "complexified" to use a pattern router. Another solution could be to offer this functionality exclusively to OpenAI users for the time being.

Additional Information

I have already tested this solution, and it has provided significant value for simple alerts. If the idea seems promising, the next step would be to test it on more complex alerts.

AlexsJones · 2023-12-29T11:08:04Z

This is a really cool idea. I suppose my worry is putting too much dependency on OpenAI.
Could we feature gate this and have a generic implementation for when other backends support it? Perhaps an optional set of backend commands?

github-project-automation bot added this to Backlog Dec 19, 2023

github-project-automation bot moved this to Todo in Backlog Dec 19, 2023

AlexsJones added enhancement New feature or request good first issue Good for newcomers help wanted Extra attention is needed labels Dec 29, 2023

This was referenced Jan 8, 2024

feat: deep dive alerts #851

Closed

feat: add alert option in AnalyzeRequest k8sgpt-ai/schemas#21

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: Deep dive alerts #795

[Feature]: Deep dive alerts #795

matthisholleville commented Dec 19, 2023

AlexsJones commented Dec 29, 2023

[Feature]: Deep dive alerts #795

[Feature]: Deep dive alerts #795

Comments

matthisholleville commented Dec 19, 2023

Checklist

Is this feature request related to a problem?

Problem Description

Solution Description

Benefits

Potential Drawbacks

Additional Information

AlexsJones commented Dec 29, 2023