Skip to content

Memory leak using RandomForestClassifier and PCA #1881

Open
@cannolis

Description

@cannolis

Describe the bug
I am encountering a persistent memory leak when using RandomForestClassifier and PCA from the sklearnex library. With each iteration of my loop, the memory usage increases by approximately 20MB, which significantly impacts performance during large-scale data processing.

To Reproduce
Steps to reproduce the behavior:

  1. Setup the environment with sklearnex installed.
  2. Initialize and configure RandomForestClassifier and PCA.
  3. Run a loop where RandomForestClassifier and PCA are used on the data.
  4. Observe the memory usage growth with each iteration.

Expected behavior
I expect the memory usage to remain stable or return to the baseline after each iteration, ensuring efficient performance during large-scale data processing.

Environment:
• OS: Windows 10
• Compiler: PyCharm
• Version: 2024.1.2 Professional Edition

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions