OOMKill for the Pod when NIC runs out of memory #7622

Open
@AlexFenlon

Hello,

I've been performing some testing with Nginx v4.0.1 this morning to verify the fix for this ticket (issue also captured in #6752 and #4370).

I'm afraid to report that v4.0.1 does not resolve the issue reported.

I deployed Nginx with a low memory limit of 125Mi so that I could trigger an OOMKill with some modest traffic. After the OOMKill, the pod was unable to restart. We see the same familiar error message:

with a debug log level:

{"time":"2025-02-12T08:37:10.48020867Z","level":"DEBUG","msg":"Unable to fetch version: error getting client: Get \"http://config-version/configVersion\": dial unix /var/lib/nginx/nginx-config-version.sock: connect: connection refused"}

{"// Comment": "PREVIOUS REPEATED MESSAGE ~ 1500 times" }

{"time":"2025-02-12T08:37:10.789984148Z","level":"DEBUG","msg":"Unable to fetch version: error getting client: Get \"http://config-version/configVersion\": dial unix /var/lib/nginx/nginx-config-version.sock: connect: connection refused"}
2025/02/12 08:37:10 [emerg] 18#18: bind() to unix:/var/lib/nginx/nginx-config-version.sock failed (98: Address already in use)
2025/02/12 08:37:10 [emerg] 18#18: bind() to unix:/var/lib/nginx/nginx-502-server.sock failed (98: Address already in use)
2025/02/12 08:37:10 [emerg] 18#18: bind() to unix:/var/lib/nginx/nginx-418-server.sock failed (98: Address already in use)
2025/02/12 08:37:10 [notice] 18#18: try again to bind() after 500ms
{"time":"2025-02-12T08:37:10.79009325Z","level":"DEBUG","msg":"Unable to fetch version: error getting client: Get \"http://config-version/configVersion\": dial unix /var/lib/nginx/nginx-config-version.sock: connect: connection refused"}

{"// Comment": "PREVIOUS REPEATED MESSAGE ~ 4000 times" }

{"time":"2025-02-12T08:37:11.289973867Z","level":"DEBUG","msg":"Unable to fetch version: error getting client: Get \"http://config-version/configVersion\": dial unix /var/lib/nginx/nginx-config-version.sock: connect: connection refused"}
2025/02/12 08:37:10 [emerg] 18#18: bind() to unix:/var/lib/nginx/nginx-config-version.sock failed (98: Address already in use)
2025/02/12 08:37:10 [emerg] 18#18: bind() to unix:/var/lib/nginx/nginx-502-server.sock failed (98: Address already in use)
2025/02/12 08:37:10 [emerg] 18#18: bind() to unix:/var/lib/nginx/nginx-418-server.sock failed (98: Address already in use)
2025/02/12 08:37:10 [notice] 18#18: try again to bind() after 500ms
{"time":"2025-02-12T08:37:11.29015717Z","level":"DEBUG","msg":"Unable to fetch version: error getting client: Get \"http://config-version/configVersion\": dial unix /var/lib/nginx/nginx-config-version.sock: connect: connection refused"}

{"// Comment": "AND SO ON" }

Originally posted by @MarkTopping in #4604

Metadata

Assignees

No one assigned

    Labels

    backlog (Pull requests/issues that are backlog items), bug (An issue reporting a potential bug)

    Type

    No type

    Projects

    Status

    Prioritized backlog

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests
