feat(ui): add TUI workflow for crawler raw-data management and cross-device sync confidence #17

Description

@2233admin

What happened

The current workflow is heavy on CLI scripts and scattered. For crawler/collector data (recipe.run plus collector scripts), raw outputs (raw/...) and sync state (state.json/heartbeat files) are created across multiple recipe-specific folders. In practice, the project has no standard interface for seeing which data sources are active, what was crawled recently, what is still pending, and whether the raw data is complete.

Because there is no unified management surface, using the same project on another device is risky. I can move the vault via Baidu Netdisk, Quark (夸克), or a NAS, but I still have no clear way to verify that the raw crawler inputs and collector state are the "same dataset" on both devices.
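To make the "same dataset" check concrete, here is a minimal sketch of one possible approach: hash every file under a raw tree, then compare the resulting manifests between two devices. This is not existing project code. The function names (`build_manifest`, `manifest_digest`, `diff_manifests`) are hypothetical, and it assumes the raw tree is an ordinary directory of files that can be read in full.

```python
import hashlib
from pathlib import Path
from typing import Dict, List


def file_sha256(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large raw payloads are not loaded at once."""
    h = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def build_manifest(raw_root: Path) -> Dict[str, str]:
    """Map each file's POSIX-style relative path to its SHA-256 (hypothetical helper)."""
    return {
        p.relative_to(raw_root).as_posix(): file_sha256(p)
        for p in sorted(raw_root.rglob("*"))
        if p.is_file()
    }


def manifest_digest(manifest: Dict[str, str]) -> str:
    """Collapse a manifest into one digest that is cheap to compare across devices."""
    h = hashlib.sha256()
    for rel, digest in sorted(manifest.items()):
        h.update(f"{rel}\0{digest}\n".encode("utf-8"))
    return h.hexdigest()


def diff_manifests(local: Dict[str, str], remote: Dict[str, str]) -> Dict[str, List[str]]:
    """List paths that are missing locally, extra locally, or differ in content."""
    shared = set(local) & set(remote)
    return {
        "missing": sorted(set(remote) - set(local)),
        "extra": sorted(set(local) - set(remote)),
        "changed": sorted(k for k in shared if local[k] != remote[k]),
    }
```

Each device could write its manifest next to the vault before syncing. If the two `manifest_digest` values match, the raw inputs are byte-identical; if not, `diff_manifests` shows which files to re-sync or re-crawl.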

What I expected

A dedicated management interface (a standard TUI that is part of the repo workflow) for crawler/raw-data and recipe sync operations that can:

  • list all installed collectors and their last sync status
  • show raw material health (latest raw payloads, size/age, and state pointers)
  • trigger/re-run collector jobs from one place
  • show whether local raw data is ready for cross-device parity before syncing
  • let me mark a collector workflow as "safe to mirror" across devices
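As a rough illustration of the data such a TUI could show for the first two items, the sketch below scans collector folders and reports raw payload count, total size, age of the newest payload, and whether a state file is present. The layout is an assumption, not the repo's confirmed structure: it supposes one folder per collector containing a `raw/` subtree and a `state.json`. `CollectorHealth` and `scan_collectors` are hypothetical names.

```python
import time
from dataclasses import dataclass
from pathlib import Path
from typing import List, Optional


@dataclass
class CollectorHealth:
    name: str                      # collector folder name
    has_state: bool                # True if <collector>/state.json exists (assumed layout)
    raw_files: int                 # number of files under <collector>/raw/
    raw_bytes: int                 # total size of those files
    newest_age_s: Optional[float]  # seconds since the newest raw file changed, None if no files


def scan_collectors(root: Path, now: Optional[float] = None) -> List[CollectorHealth]:
    """Summarize raw-data health for every collector folder directly under root."""
    now = time.time() if now is None else now
    report = []
    for folder in sorted(p for p in root.iterdir() if p.is_dir()):
        raw_dir = folder / "raw"
        files = [f for f in raw_dir.rglob("*") if f.is_file()] if raw_dir.is_dir() else []
        stats = [f.stat() for f in files]
        newest = max((s.st_mtime for s in stats), default=None)
        report.append(
            CollectorHealth(
                name=folder.name,
                has_state=(folder / "state.json").is_file(),
                raw_files=len(files),
                raw_bytes=sum(s.st_size for s in stats),
                newest_age_s=None if newest is None else max(0.0, now - newest),
            )
        )
    return report
```

A TUI table could render one row per `CollectorHealth`, flagging collectors with no state file or with raw data older than a chosen threshold as not yet safe to mirror.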

Steps to reproduce

  1. Configure and run one or more recipe collectors (e.g. recipe.run for source-to-vault recipes).
  2. Confirm new artifacts are written under the recipe outputs and raw/ trees.
  3. Open a second machine or a fresh shell session and try to do the same management operations from one place.
  4. You still have to remember and run per-recipe scripts/commands manually, with no single interface for job status, raw-data inventory, or parity checks.

Additional context

The repo already has recipe-based collectors, and those docs describe each source, but orchestration/state visibility is per-recipe and disconnected. This request is not about a single missing command; it is about a unified raw-data management UX and cross-device sync confidence layer.
