Configure sane default value for grpc_next_upstream_tries #12090

mattb18 · 2024-10-02T17:45:13Z

I'd like ingress-nginx to set a sane default value for grpc_next_upstream_tries. There is currently no value set for this directive, meaning that the default nginx value is used, which in this case is zero. Ideally, this value should also be configurable via an annotation.

I've recently experienced this issue, which resulted in an infinite retry (every 5 seconds due to timeouts) to an upstream that was removed mid request. To make it worse, this turned into an infinite retry with no timeout once the IP was used by another pod which was not listening on the corresponding port, resulting in millions of requests attempts a minute.

As per the ticket, configuring grpc_next_upstream_tries to a sane value (3 in my case) resolves this issue, as the request is tried only 3 times before giving up, rather than retrying infinitely.

A similar change was made for proxy_next_upstream_tries in #6553, to resolve #5425.

Would it make sense to also set grpc_next_upstream_tries to 3 or another similar value?

Alternatively, it would be good to set the default to 0, and allow configuring via an annotation (to prevent any potential breaking changes). This way we can avoid needing to configure server snippets annotations.

I'm happy to create a PR if the consensus for this requests is positive.

The text was updated successfully, but these errors were encountered:

k8s-ci-robot · 2024-10-02T17:45:21Z

This issue is currently awaiting triage.

If Ingress contributors determines this is a relevant issue, they will accept it by applying the triage/accepted label and provide further guidance.

The triage/accepted label can be added by org members by writing /triage accepted in a comment.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

siddheshgarud · 2024-10-29T09:14:55Z

/assign

giner · 2024-10-31T05:31:20Z

Here is a Terraform snippet with a temporary workaround for nginx ingress on MicroK8s

resource "kubernetes_config_map_v1_data" "ingress_workaround" {
  metadata {
    namespace = "ingress"

    name = "nginx-load-balancer-microk8s-conf"
  }

  data = {
    "http-snippet" = "grpc_next_upstream_tries 3;"
  }
}

mattb18 added the kind/feature Categorizes issue or PR as related to a new feature. label Oct 2, 2024

k8s-ci-robot added needs-triage Indicates an issue or PR lacks a `triage/foo` label and requires one. needs-priority labels Oct 2, 2024

mattb18 changed the title ~~Configure sane defaullt values for grpc_next_upstream_tries~~ Configure sane default value for grpc_next_upstream_tries Oct 2, 2024

k8s-ci-robot assigned siddheshgarud Oct 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Configure sane default value for grpc_next_upstream_tries #12090

Configure sane default value for grpc_next_upstream_tries #12090

mattb18 commented Oct 2, 2024 •

edited

Loading

k8s-ci-robot commented Oct 2, 2024

siddheshgarud commented Oct 29, 2024

giner commented Oct 31, 2024

Configure sane default value for grpc_next_upstream_tries #12090

Configure sane default value for grpc_next_upstream_tries #12090

Comments

mattb18 commented Oct 2, 2024 • edited Loading

k8s-ci-robot commented Oct 2, 2024

siddheshgarud commented Oct 29, 2024

giner commented Oct 31, 2024

mattb18 commented Oct 2, 2024 •

edited

Loading