Commit cf5626f

Fix title streaming (#3150)
* fixing title
* title first
1 parent d5bd6a5 commit cf5626f

2 files changed: +3 -3 lines changed
_blog.yml

Lines changed: 1 addition & 1 deletion
@@ -6888,7 +6888,7 @@
     - hub
 
 - local: streaming-datasets
-  title: "Streaming datasets at scale"
+  title: "Streaming datasets: 100x More Efficient"
   author: andito
   thumbnail: /blog/assets/streaming_datasets/streaming_datasets.png
   date: Oct 27, 2025

streaming-datasets.md

Lines changed: 2 additions & 2 deletions
@@ -1,5 +1,5 @@
 ---
-title: Streaming datasets: 100x More Efficient
+title: "Streaming datasets: 100x More Efficient"
 thumbnail: /blog/assets/streaming_datasets/streaming_datasets.png
 authors:
 - user: andito
@@ -9,6 +9,7 @@ authors:
 - user: merve
 ---
 
+# Streaming datasets: 100x More Efficient
 
 ## TLDR
 
@@ -20,7 +21,6 @@ authors:
 
 <img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/streaming-dark.gif" width="800" height="auto" alt="Visualization of a dataset being streamed">
 
-## Streaming datasets: 100x More Efficient
 
 Loading data, especially at the terabyte scale, is a major pain in any machine learning workflow. We suffered this while training [SmolLM3](https://huggingface.co/blog/smollm3), at one point we had to wait 3 hours before each run to download enough data.
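
The commit message does not say why the front-matter title was quoted. A likely reason is that YAML does not allow `: ` (colon followed by a space) inside an unquoted plain scalar, so the old `title:` line would be rejected by a strict parser. The sketch below is a hypothetical lint helper, not something in this repository; the name `unquoted_colon_values` and the regex are assumptions made for illustration, and it only checks simple one-line `key: value` pairs.

```python
import re

# Hypothetical helper (not part of the repository): matches simple
# one-line "key: value" pairs, optionally preceded by a list dash.
_KEY_VALUE = re.compile(r"^\s*(?:-\s+)?([\w-]+):[ \t]+(.*\S)\s*$")


def unquoted_colon_values(front_matter: str) -> list[str]:
    """Return keys whose unquoted value contains ': '."""
    flagged = []
    for line in front_matter.splitlines():
        match = _KEY_VALUE.match(line)
        if not match:
            continue  # separators such as '---' and bare keys like 'authors:'
        key, value = match.groups()
        # A value counts as quoted when it starts and ends with the same quote.
        quoted = len(value) >= 2 and value[0] in "\"'" and value[-1] == value[0]
        if not quoted and ": " in value:
            flagged.append(key)
    return flagged


before = (
    "title: Streaming datasets: 100x More Efficient\n"
    "thumbnail: /blog/assets/streaming_datasets/streaming_datasets.png\n"
    "authors:\n"
    "- user: andito\n"
)
after = before.replace(
    "title: Streaming datasets: 100x More Efficient",
    'title: "Streaming datasets: 100x More Efficient"',
)

print(unquoted_colon_values(before))
print(unquoted_colon_values(after))
```

Run against the two versions of the front matter in this diff, the helper flags `title` in the old version and nothing in the new one. A real check would more reasonably load the front matter with a YAML parser and report its error.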
