Skip to content

[Bug]: Get endpointslice - reconcile loop stormΒ #9334

@Vormillion

Description

@Vormillion

Version

4.0.1

What Kubernetes platforms are you running on?

EKS Amazon

Steps to reproduce

I have ingress with 57 hosts and each host has 2 paths.
Almost during whole day ~54 hosts does not have any active endpoints as environments are scaled down.

When I'm restarting ingress controller pods, it takes ages to reach ready status, because controller is checking each endpointslice multiple time, just for single domain I've counted it 347 times in logs before configuration is fully loaded.
It's adding delay between 10-60 seconds for readiness and of course sometimes pod is restarting due to readiness failures.

I suspect issue is there:

podEndps, external, err = lbc.getEndpointsForIngressBackend(&path.Backend, svc)

NOTE: It's working fine without any pefromance issues with Ingress-Nginx.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugAn issue reporting a potential bugneeds triageAn issue that needs to be triaged

    Type

    No type

    Projects

    Status

    Todo β˜‘

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions