Description
Description:
I'm experiencing a critical issue with CloudStack 4.20 where all API endpoints become unresponsive approximately every 10 days. The only temporary resolution is to restart the CloudStack management server.
Observed Behavior:
API requests timeout/fail completely after ~10 days of uptime
No explicit ERROR messages in logs prior to outage
Found an unusually large INFO-level log entry (3MB per line) that might be relevant
Attached log file: [filename.log] (Please ensure you actually attach the file via GitHub interface)
Environment:
CloudStack Version: 4.20.0.0
OS:Ubuntu 24.04
Steps to Reproduce:
Start CloudStack management server
Operate normally for ~10 days
API services become unavailable without obvious triggers
Expected Behavior:
API endpoints should remain available continuously without requiring manual restarts.
Additional Context:
The large INFO-level log entry repeats periodically (full content attached)
No observed resource exhaustion (CPU/MEM) before outages
Problem persists across multiple maintenance windows
Troubleshooting Attempted:
Reviewed standard error logs - no smoking gun
Monitored system resources - no apparent bottlenecks
Server restart temporarily resolves the issue
Request:
Please help investigate:
Potential memory leaks or thread blocking in the 4.20 codebase
Significance of the oversized INFO log entries
Update to Original Issue:
Further analysis of the oversized INFO log reveals repetitive entries related to createVPCOffering API calls. The JSON payload in these logs appears to be abnormally large (3MB per line) and contains repetitive configuration data.
Key Log Excerpt Pattern:
INFO [c.c.a.ApiServlet] (qtp123456789-42:) {cmd="createVPCOffering", ... JSON payload (3MB) ...}
Metadata
Metadata
Assignees
Type
Projects
Status