Skip to content

feat(embed): add tqdm progress bar to chunk stage, clean up embed logging#108

Open
yichuan-w wants to merge 1 commit into
mainfrom
feat/progress-bars
Open

feat(embed): add tqdm progress bar to chunk stage, clean up embed logging#108
yichuan-w wants to merge 1 commit into
mainfrom
feat/progress-bars

Conversation

@yichuan-w

Copy link
Copy Markdown
Contributor

Summary

  • chunk.py: Add tqdm progress bar for article-level chunking. Users now see real-time progress during Stage 2:
    Chunking: 50%|█████     | 4/8 [00:00<00:00, 5.88it/s]
    
  • embed_cpu.py: Remove redundant logger.info("Embedded %d/%d") every 10 items — tqdm already shows this info and the log lines break the progress bar display.

Closes #99.

Test plan

  • Rendered 8 Wikipedia pages, chunked into 64 chunks — tqdm progress bar displays correctly
  • All 9 existing tests pass (test_chunk.py + test_local_source_text.py)
  • Lint clean (ruff check + format)

…ging

- chunk.py: add tqdm progress bar for article-level chunking so users
  see real-time progress during Stage 2 (e.g. "Chunking: 50%|█████ | 4/8")
- embed_cpu.py: remove redundant "Embedded X/Y" logger.info every 10
  items — tqdm already shows this and the log lines break the progress
  bar display

Tested with 8 Wikipedia pages (64 chunks produced).
@vercel

vercel Bot commented Jun 27, 2026

Copy link
Copy Markdown

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
web Ready Ready Preview, Comment Jun 27, 2026 1:12pm

@yichuan-w

Copy link
Copy Markdown
Contributor Author

Maybe embedd is bottleneck needing to add progress bar

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Enhancement] Add overall progress bar to pixelrag index build

1 participant