The dataset consists of records collected from a private OpenStack-based cloud operated by the Distributed Systems Laboratory at the Federal University of Campina Grande, Brazil, over approximately one year of operation.
This repository contains sample data collection scripts and R Markdown notebooks for reproducing plots based on the dataset.
The data is made available under the Creative Commons Attribution 4.0 International (CC BY 4.0) license. By downloading or using it, you agree to the terms of this license.
If you use the data for a publication or project, please cite the accompanying dataset:
Marques, Paola; Mendes, Mariana; Pereira, Thiago Emmanuel; Farias, Giovanni (2025),
“Dataset on Resource Allocation and Usage for a Private Cloud”, Mendeley Data,
V1, doi: 10.17632/trvb5k4x5m.1
The dataset is available on Mendeley Data as a compressed archive.
After downloading the ZIP file, you can extract it using:

    unzip "Dataset on Resource Allocation and Usage for a Private Cloud.zip"

The dataset is organized into seven CSV files, each representing a distinct data table:

| Field | Description |
|---|---|
| timestamp | Collection time in UNIX epoch seconds |
| project_id | Unique project identifier (UUID) |
| quota_ram | RAM quota available to the project (MB) |
| quota_vcpu | vCPU quota available to the project |
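
As a minimal sketch of how this table might be read, the snippet below parses rows with the four fields listed above and converts the UNIX epoch `timestamp` to a UTC datetime. The sample rows, the placeholder project IDs, and the helper name `read_quotas` are invented for illustration, and the quota values are assumed to be integers.

```python
import csv
import io
from datetime import datetime, timezone

# Hypothetical sample rows that follow the schema above; all values are invented.
SAMPLE_QUOTAS = """timestamp,project_id,quota_ram,quota_vcpu
1700000000,11111111-1111-1111-1111-111111111111,51200,20
1700003600,11111111-1111-1111-1111-111111111111,102400,40
"""


def read_quotas(handle):
    """Parse quota rows, converting numeric fields and the epoch timestamp."""
    rows = []
    for row in csv.DictReader(handle):
        rows.append({
            # Epoch seconds are interpreted as UTC to avoid local-time drift.
            "time": datetime.fromtimestamp(int(row["timestamp"]), tz=timezone.utc),
            "project_id": row["project_id"],
            "quota_ram_mb": int(row["quota_ram"]),   # assumed to be an integer
            "quota_vcpu": int(row["quota_vcpu"]),    # assumed to be an integer
        })
    return rows


quotas = read_quotas(io.StringIO(SAMPLE_QUOTAS))
for q in quotas:
    print(q["time"].isoformat(), q["quota_ram_mb"], "MB RAM,", q["quota_vcpu"], "vCPUs")
```

To read a real file, the `io.StringIO(...)` argument can be swapped for `open(path, newline="")`, where `path` points at the extracted CSV.
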

| Field | Description |
|---|---|
| id | Unique row identifier |
| timestamp | Collection time in UNIX epoch seconds |
| project_id | Unique project identifier (UUID) |
| ram_allocated | RAM quota allocated per project (MB) |
| vcpu_allocated | vCPU quota allocated per project |
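
The allocation fields can be compared against the quota fields to estimate how much of its quota each project has allocated. The sketch below joins the two tables on `(timestamp, project_id)`. It assumes both tables were sampled at the same timestamps, which the dataset description does not guarantee, and it uses invented sample rows with shortened placeholder project IDs. The function name `allocation_ratios` is hypothetical.

```python
import csv
import io

# Hypothetical sample rows; IDs are shortened placeholders, not real UUIDs.
QUOTAS = """timestamp,project_id,quota_ram,quota_vcpu
1700000000,proj-a,51200,20
1700000000,proj-b,8192,4
"""
ALLOCATIONS = """id,timestamp,project_id,ram_allocated,vcpu_allocated
1,1700000000,proj-a,25600,5
2,1700000000,proj-b,8192,3
"""


def allocation_ratios(quota_handle, allocation_handle):
    """Return allocated/quota ratios for rows present in both tables."""
    quotas = {}
    for row in csv.DictReader(quota_handle):
        key = (row["timestamp"], row["project_id"])
        quotas[key] = (int(row["quota_ram"]), int(row["quota_vcpu"]))

    ratios = []
    for row in csv.DictReader(allocation_handle):
        key = (row["timestamp"], row["project_id"])
        if key not in quotas:
            continue  # no quota sample at this exact timestamp
        quota_ram, quota_vcpu = quotas[key]
        if quota_ram <= 0 or quota_vcpu <= 0:
            continue  # skip non-positive quotas to avoid meaningless ratios
        ratios.append({
            "timestamp": int(row["timestamp"]),
            "project_id": row["project_id"],
            "ram_ratio": int(row["ram_allocated"]) / quota_ram,
            "vcpu_ratio": int(row["vcpu_allocated"]) / quota_vcpu,
        })
    return ratios


ratios = allocation_ratios(io.StringIO(QUOTAS), io.StringIO(ALLOCATIONS))
for r in ratios:
    print(r["project_id"], r["ram_ratio"], r["vcpu_ratio"])
```

If the two tables turn out to be sampled at different times, matching each allocation row to the most recent earlier quota row would be a more robust join than the exact match used here.
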

| Field | Description |
|---|---|
| id | Unique row identifier |
| timestamp | Collection time in UNIX epoch seconds |
| user_id | Unique user identifier (UUID) |
| project_id | Unique project identifier (UUID) |

| Field | Description |
|---|---|
| id | Unique row identifier |
| timestamp | Collection time in UNIX epoch seconds |
| server_id | Unique server identifier (UUID) |
| flavor_id | Unique flavor identifier (UUID) |

| Field | Description |
|---|---|
| id | Unique row identifier |
| timestamp | Collection time in UNIX epoch seconds |
| server_id | Unique server identifier (UUID) |
| user_id | Unique user identifier (UUID) |
| project_id | Unique project identifier (UUID) |
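
Because every row carries a collection timestamp, the same server may appear in more than one collection round (an assumption based on the schema, not a documented property). The sketch below therefore counts distinct `server_id` values per project rather than rows. The sample data and the name `servers_per_project` are illustrative only.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample rows; IDs are shortened placeholders, not real UUIDs.
OWNERSHIP = """id,timestamp,server_id,user_id,project_id
1,1700000000,srv-1,user-1,proj-a
2,1700000000,srv-2,user-2,proj-a
3,1700003600,srv-1,user-1,proj-a
4,1700003600,srv-3,user-3,proj-b
"""


def servers_per_project(handle):
    """Count distinct servers seen for each project across all timestamps."""
    servers = defaultdict(set)
    for row in csv.DictReader(handle):
        # A set removes repeats of the same server across collection rounds.
        servers[row["project_id"]].add(row["server_id"])
    return {project: len(ids) for project, ids in servers.items()}


counts = servers_per_project(io.StringIO(OWNERSHIP))
print(counts)
```
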

| Field | Description |
|---|---|
| id | Unique row identifier |
| timestamp | Collection time in UNIX epoch seconds |
| server_id | Unique server identifier (UUID) |
| vcpu_utilization | vCPU utilization (%) |
| ram_utilization | Memory (RAM) utilization (%) |
| host_id | Compute host identifier |
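
A common first step with the utilization fields is to average them per server. The following sketch does that with the standard library, using invented sample rows and a hypothetical helper `mean_utilization`. It assumes the percentage columns parse as floats and skips rows where either value is empty.

```python
import csv
import io
from collections import defaultdict
from statistics import mean

# Hypothetical sample rows; IDs are shortened placeholders, not real UUIDs.
UTILIZATION = """id,timestamp,server_id,vcpu_utilization,ram_utilization,host_id
1,1700000000,srv-1,10.0,40.0,host-1
2,1700003600,srv-1,30.0,60.0,host-1
3,1700000000,srv-2,80.0,20.0,host-2
4,1700003600,srv-2,,,host-2
"""


def mean_utilization(handle):
    """Return {server_id: (mean vCPU %, mean RAM %)} over all valid samples."""
    samples = defaultdict(lambda: {"vcpu": [], "ram": []})
    for row in csv.DictReader(handle):
        if not row["vcpu_utilization"] or not row["ram_utilization"]:
            continue  # assumption: empty cells mark missing measurements
        samples[row["server_id"]]["vcpu"].append(float(row["vcpu_utilization"]))
        samples[row["server_id"]]["ram"].append(float(row["ram_utilization"]))
    return {
        server: (mean(values["vcpu"]), mean(values["ram"]))
        for server, values in samples.items()
    }


averages = mean_utilization(io.StringIO(UTILIZATION))
for server, (vcpu_pct, ram_pct) in averages.items():
    print(server, vcpu_pct, ram_pct)
```
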

| Field | Description |
|---|---|
| flavor_id | Unique flavor identifier (UUID) |
| flavor_name | Name of the flavor |
| vcpu | Number of vCPUs defined in the flavor |
| ram | RAM capacity defined in the flavor (MB) |
| disk | Disk capacity defined in the flavor (GB) |
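
The flavor table can be joined with the server-to-flavor table through `flavor_id` to recover the resources assigned to each server. The sketch below keeps the most recent flavor observed for each server and looks up its vCPU, RAM, and disk values. The sample rows, flavor names, and the function name `latest_server_resources` are invented for illustration.

```python
import csv
import io

# Hypothetical sample rows; IDs and flavor names are placeholders.
FLAVORS = """flavor_id,flavor_name,vcpu,ram,disk
flv-small,example.small,1,2048,20
flv-large,example.large,4,8192,80
"""
SERVER_FLAVORS = """id,timestamp,server_id,flavor_id
1,1700000000,srv-1,flv-small
2,1700000000,srv-2,flv-large
3,1700003600,srv-1,flv-large
"""


def latest_server_resources(flavor_handle, mapping_handle):
    """Map each server to the resources of its most recently observed flavor."""
    flavors = {row["flavor_id"]: row for row in csv.DictReader(flavor_handle)}

    latest = {}  # server_id -> (timestamp, flavor_id)
    for row in csv.DictReader(mapping_handle):
        ts = int(row["timestamp"])
        server = row["server_id"]
        if server not in latest or ts >= latest[server][0]:
            latest[server] = (ts, row["flavor_id"])

    resources = {}
    for server, (_, flavor_id) in latest.items():
        flavor = flavors.get(flavor_id)
        if flavor is None:
            continue  # flavor missing from the flavor table
        resources[server] = {
            "flavor_name": flavor["flavor_name"],
            "vcpu": int(flavor["vcpu"]),
            "ram_mb": int(flavor["ram"]),
            "disk_gb": int(flavor["disk"]),
        }
    return resources


resources = latest_server_resources(io.StringIO(FLAVORS), io.StringIO(SERVER_FLAVORS))
for server, spec in sorted(resources.items()):
    print(server, spec)
```

In this sample, `srv-1` is observed with two flavors, so only the later one is reported.
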
The dataset described here is the one archived on Mendeley Data; this repository provides the supporting scripts and documentation.