Skip to content

Memory usage spikes on appliance with 100+ providers #22900

Open
@jaywcarman

Description

@jaywcarman

Issue Description

Memory usage periodically spikes until out of memory. Eventually something crashes and a bunch of workers are killed. Things start back up and the process repeats.

Memory used (%) since last reboot on 2024-01-31 until 2024-02-15:

image

Environment

  • Appliance running in IBM Cloud VPC (16 vCPU | 160 GiB | 32 Gbps)
  • VERSION=4.2.0 (IM)
  • BUILD=20230913103031_b15dc60
  • Providers:
vmdb_production=# SELECT COUNT(*),type 
vmdb_production-# FROM ext_management_systems 
vmdb_production-# WHERE type LIKE '%::CloudManager' OR type LIKE '%::InfraManager'
vmdb_production-# GROUP BY type;
 count |                               type                               
-------+------------------------------------------------------------------
    57 | ManageIQ::Providers::IbmCloud::PowerVirtualServers::CloudManager
    48 | ManageIQ::Providers::IbmCloud::VPC::CloudManager
     1 | ManageIQ::Providers::IbmPowerHmc::InfraManager
     3 | ManageIQ::Providers::IbmPowerVc::CloudManager
     1 | ManageIQ::Providers::Openstack::CloudManager
     1 | ManageIQ::Providers::Openstack::InfraManager
(6 rows)

Logs

I saved off journalctl entries for the following events. The log files are too large to upload to GitHub (also, they may contain sensitive information). I can provide access to them as-needed.

Memory usage spikes ~10% on 2024-02-08 11:10

image

journalctl save: journalctl_20240208105500-20240208130500.txt

Memory usage drop on 2024-02-14 00:10

image

journalctl save: journalctl_20240214000000-20240214003000.txt

Memory usage spikes ~10% 2024-02-15 12:55

image

journalctl save: journalctl_20240214125000-20240214130500.txt
top_output save: top_output_20240215.log

From the top output it looks like used stays flat while buff/cache increases:

$  grep -n "^MiB Mem" top_output_20240215.log
~~snip~~
488291:MiB Mem : 160910.6 total,  37859.1 free, 104039.2 used,  19012.3 buff/cache
489176:MiB Mem : 160910.6 total,  37666.9 free, 104048.2 used,  19195.5 buff/cache
490071:MiB Mem : 160910.6 total,  34829.9 free, 104047.5 used,  22033.1 buff/cache
490963:MiB Mem : 160910.6 total,  31841.1 free, 104259.1 used,  24810.4 buff/cache
491862:MiB Mem : 160910.6 total,  29232.8 free, 104038.6 used,  27639.2 buff/cache
492757:MiB Mem : 160910.6 total,  26424.3 free, 104029.0 used,  30457.3 buff/cache
493650:MiB Mem : 160910.6 total,  23600.8 free, 104035.0 used,  33274.8 buff/cache
494550:MiB Mem : 160910.6 total,  21883.7 free, 103876.1 used,  35150.8 buff/cache
495446:MiB Mem : 160910.6 total,  21620.7 free, 103817.4 used,  35472.6 buff/cache
~~snip~~
``

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions