Skip to content

A curated collection of research papers and publicly available datasets for storage workload analysis and performance research.

Notifications You must be signed in to change notification settings

Kritshekhar/storage-workloads

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

5 Commits
Β 
Β 

Repository files navigation

πŸ“‚ Production Storage Workloads and Research Papers

A curated collection of research papers and publicly available datasets for storage workload analysis and performance research.


πŸ“… 2024

πŸ“ Venue πŸ“„ Paper πŸ“Š Dataset / Trace
SYSTOR Space-efficient FTL for Mobile Storage via Tiny Neural Nets Mobile Application I/O Traces
ASPLOS Thesios: Synthesizing Accurate Counterfactual I/O Traces from I/O Samples Google Synthesized I/O Traces
Thesios
FAST Baleen: ML Admission & Prefetching for Flash Caches Baleen

πŸ“… 2023

πŸ“ Venue πŸ“„ Paper πŸ“Š Dataset / Trace
FAST Perseus: A Fail-Slow Detection Framework for Cloud Storage Systems Alibaba NVME Fail-Slow
SOSP FIFO Queues are All You Need for Cache Eviction MetaKV key-value traces
SOSP FIFO Queues are All You Need for Cache Eviction MetaCDN object traces

πŸ“… 2021

πŸ“ Venue πŸ“„ Paper πŸ“Š Dataset / Trace
TOS SSD-based Workload Characteristics and Their Performance Implications YCSB RocksDB SSD

πŸ“… 2020

πŸ“ Venue πŸ“„ Paper πŸ“Š Dataset / Trace
OSDI The CacheLib Caching Engine: Design and Experiences at Scale -
OSDI A Large-Scale Analysis of Hundreds of In-Memory Cache Clusters at Twitter Twitter
Memcached
IISWC An In-Depth Analysis of Cloud Block Storage Workloads in Large-Scale Production Alibaba Block Traces
IPDPSW Recorder 2.0: Efficient Parallel I/O Tracing and Analysis HPC Application I/O Traces
ATC OSCA: An Online-Model Based Cache Allocation Scheme in Cloud Block Storage Systems Tencent Block
Hotstorage It’s Time to Revisit LRU vs. FIFO IBM Object Store

πŸ“… 2018

πŸ“ Venue πŸ“„ Paper πŸ“Š Dataset / Trace
ICS Demystifying Cache Policies for Photo Stores at Scale: A Tencent Case Study Tencent Photo Cache

πŸ“… 2017

πŸ“ Venue πŸ“„ Paper πŸ“Š Dataset / Trace
SYSTOR Understanding storage traffic characteristics on enterprise virtual desktop infrastructure Systor '17 Traces

πŸ“… 2015

πŸ“ Venue πŸ“„ Paper πŸ“Š Dataset / Trace
FAST Analysis of the ECMWF Storage Landscape ECMWF Traces

πŸ“… 2016

πŸ“ Venue πŸ“„ Paper πŸ“Š Dataset / Trace
FAST Slacker: Fast Distribution with Lazy Docker Containers Slacker Traces

πŸ“… 2010

πŸ“ Venue πŸ“„ Paper πŸ“Š Dataset / Trace
TOS I/O Deduplication: Utilizing content similarity to improve I/O performance FIU

πŸ“… 2009

πŸ“ Venue πŸ“„ Paper πŸ“Š Dataset / Trace
Computer Networks Wikipedia Workload Analysis for Decentralized Hosting Wikipedia Dumps

πŸ“… 2008

πŸ“ Venue πŸ“„ Paper πŸ“Š Dataset / Trace
FAST Write Off-Loading: Practical Power Management for Enterprise Storage MSR Cambridge
IWQoS Statistics and Social Network of YouTube Videos YouTube Dataset

πŸ“Œ Contributing

If you have additional papers or datasets to include, feel free to submit a pull request or open an issue!

πŸ“œ License

This repository is maintained for academic and research purposes. Please check individual papers and datasets for their respective licenses and terms of use.

About

A curated collection of research papers and publicly available datasets for storage workload analysis and performance research.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •