Add PGS to SRT OCR conversion feature #701

mikeSGman · 2025-10-15T07:01:04Z

Add PGS to SRT OCR Conversion Feature

Summary

This PR adds support for converting image-based PGS (Presentation Graphic Stream) subtitles to text-based SRT format using OCR (Optical Character Recognition). This feature enables users to extract Blu-ray subtitles as editable text files.

Motivation

PGS subtitles are image-based and cannot be edited or searched. Many users want to:

Edit subtitle timing or text content
Search subtitle content
Reduce file sizes (text SRT vs. image SUP)
Use subtitles with devices/players that don't support PGS

Features

User-Facing Changes

Dropdown Menu for PGS Subtitles
- PGS subtitle tracks now show a dropdown menu with two options:
  - "Extract as .sup (image - fast)" - Instant extraction of image-based subtitles
  - "Convert to .srt (OCR - 3-5 min)" - OCR conversion to text-based subtitles
Settings Panel
- New checkbox: "Enable PGS to SRT OCR conversion"
- Dependency status display showing which OCR tools are installed
- Link to installation instructions for missing dependencies
Smart Dependency Detection
- Auto-detects Tesseract OCR on all drives (C:, D:, E:, etc.)
- Checks Windows registry for Tesseract installation
- Auto-detects MKVToolNix (mkvmerge)
- Auto-detects pgsrip Python package
- Gracefully falls back to .sup extraction if dependencies are missing
User-Friendly Error Messages
- Clear instructions for installing missing dependencies
- Platform-specific installation commands (Windows, Linux, macOS)
- Links to download pages for manual installation

Technical Implementation

Files Modified

fastflix/models/config.py
- Added find_ocr_tool() function to locate Tesseract, mkvmerge, and pgsrip
- Searches system PATH, environment variables, common install locations, and Windows registry
- Added config fields: enable_pgs_ocr, tesseract_path, mkvmerge_path, pgsrip_path
fastflix/widgets/background_tasks.py
- Added _check_pgsrip_dependencies() method to verify all required tools
- Added _convert_sup_to_srt() method to perform OCR conversion
- Handles language code conversion (ISO 639-2/T 3-letter → ISO 639-1 2-letter)
- Sets environment variables (PATH and TESSERACT_CMD) for pytesseract
- Added use_ocr parameter to ExtractSubtitleSRT class
fastflix/widgets/panels/subtitle_panel.py
- Modified the Extract button for PGS tracks to show a dropdown menu
- Conditionally enables OCR option based on settings and dependencies
- Shows helpful tooltips when OCR option is disabled
fastflix/widgets/settings.py
- Added PGS OCR settings checkbox with tooltip
- Added update_ocr_dependency_status() method to show dependency status
- Displays checkmarks for installed dependencies
- Shows link to wiki for installation instructions when dependencies are missing
FastFlix_Windows_OneFile.spec
- Added pgsrip and its dependencies to the PyInstaller build
- Ensures OCR libraries are bundled in compiled Windows executable

Dependencies

Tesseract OCR - OCR engine for text recognition
MKVToolNix (mkvmerge) - Required by pgsrip for subtitle extraction
pgsrip - Python library for PGS subtitle OCR conversion
- Automatically installs: pytesseract, opencv-python, numpy, pysrt, babelfish, cleanit

Key Design Decisions

Two-Step Process: First extract .sup file using FFmpeg, then convert with pgsrip
- Separates FFmpeg operations from OCR operations
- Allows fallback to .sup extraction if OCR fails
- Provides better error handling
Language Code Conversion: Automatically converts ISO 639-2/T (eng) to ISO 639-1 (en)
- pgsrip expects 2-letter language codes in filenames
- Maintains compatibility with FastFlix's 3-letter language codes
Environment Variable Management: Sets both TESSERACT_CMD and PATH
- TESSERACT_CMD points to tesseract.exe
- PATH includes Tesseract directory for subprocess calls
- Fixes issue where pytesseract can't find Tesseract on Windows
Automatic Cleanup: Deletes .sup file after successful .srt conversion
- Keeps only the text-based .srt file
- Reduces clutter in the output directory

Testing

Tested Scenarios

✅ Windows with Tesseract on D: drive (non-standard location)
✅ PGS subtitle extraction and OCR conversion
✅ Language code conversion (eng → en)
✅ Dependency detection and status display
✅ Settings UI enable/disable functionality
✅ Dropdown menu for PGS tracks
✅ Fallback to .sup when dependencies are missing
✅ Error handling and user-friendly messages

Test Results

Successfully converted Blade (1998) Blu-ray PGS subtitles to SRT
Conversion time: ~46 seconds for a feature film
Output: Clean .srt file with proper timing

Testing Checklist

Test on Windows with Tesseract in the default location (C:\Program Files)
Test on Linux with apt-installed dependencies
Test on macOS with Homebrew-installed dependencies
Test with Spanish, French, and other language subtitles
Test with missing dependencies (verify error messages)
Test dropdown menu behavior with OCR enabled/disabled
Test PyInstaller build includes pgsrip dependencies
Verify settings persist after restart

Installation Instructions for Users

Windows

# Install Tesseract OCR
# Download installer from: https://github.com/UB-Mannheim/tesseract/wiki

# Install MKVToolNix
# Download installer from: https://mkvtoolnix.download/downloads.html

# Install pgsrip (from FastFlix virtual environment)
pip install pgsrip

Linux

sudo apt install tesseract-ocr mkvtoolnix
pip install pgsrip

macOS

brew install tesseract mkvtoolnix
pip install pgsrip

Breaking Changes

None. This is a purely additive feature that's disabled by default.

Migration Guide

No migration needed. Existing users will see the new option after updating and installing dependencies.

Future Enhancements

Potential improvements for future PRs:

Support for more OCR languages via Tesseract language packs
Batch conversion of multiple PGS tracks
OCR accuracy tuning options
Progress bar for long OCR operations
Integration with online OCR services as a fallback

Related Issues

Closes #[issue-number] (if applicable)

Screenshots

Checklist

Code follows project style guidelines
Comments added for complex logic
No debug/console.log statements
User-facing strings use translation function t()
Error handling for all external tool calls
Graceful degradation when dependencies are missing
Platform-specific code tested on Windows
Config fields have sensible defaults
Feature is opt-in (disabled by default)

- Add dropdown menu for PGS subtitle tracks with OCR option - Auto-detect Tesseract OCR on all drives and Windows registry - Add settings panel with dependency status display - Support for converting image-based PGS to editable SRT - Handles language code conversion and environment setup - Includes comprehensive error handling and user guidance

cdgriffith

Thank you so much for this wonderful addition!

I have a few tweaks suggested. If you are up for doing them let me know, otherwise I can merge it and work on it as well as you set up a great feature I would love to add!

If you do add more please also run the pre-commit checks so it passes linting:

pre-commit install
pre-commit run --all-files

fastflix/models/config.py

fastflix/widgets/background_tasks.py

mikeSGman · 2025-10-18T21:18:18Z

Thank you so much for this wonderful addition!

I have a few tweaks suggested. If you are up for doing them let me know, otherwise I can merge it and work on it as well as you set up a great feature I would love to add!

If you do add more please also run the pre-commit checks so it passes linting:
pre-commit install
pre-commit run --all-files

Ah, that's how you do it - thank you. Will do for future commits, and go back to see if I can do it on this PR also.

mikeSGman · 2025-10-19T19:27:43Z

Hi @cdgriffith - My earlier commit missed the BabelLanguage patch for dual 2- and 3-letter ISO code handling. This push includes that fix; verified locally, and all pre-commit hooks are passing.

Based on our threaded convo, I think I've addressed all issues:

    - Use environment variables for Windows tool detection instead of
      scanning all drives (LOCALAPPDATA, PROGRAMFILES, PROGRAMFILES(X86))
    - Remove pgsrip_path config field and use pgsrip Python API directly
    - Update dependency checks to use importlib for pgsrip library
    - Fix BabelLanguage to handle both 2-letter and 3-letter ISO codes
    - Update error messages and installation instructions

- Use environment variables for Windows tool detection instead of scanning all drives (LOCALAPPDATA, PROGRAMFILES, PROGRAMFILES(X86)) - Remove pgsrip_path config field and use pgsrip Python API directly - Update dependency checks to use importlib for pgsrip library - Fix BabelLanguage to handle both 2-letter and 3-letter ISO codes - Update error messages and installation instructions All changes pass pre-commit linting checks.

The glob pattern was failing when filenames contained brackets like [imdbid-tt0187738] because glob interprets [] as character classes. Changed to detect newly created .srt files by comparing before/after directory listings instead of using filename-based glob patterns. Fixes false error for files like "Blade II (2002) [imdbid-tt0187738].mkv"

Include package metadata for pgsrip, pytesseract, and babelfish in the Windows builds to fix 'No package metadata was found' error when running OCR conversion from the compiled executable.

Add collect_data_files('babelfish') to bundle ISO language code data files needed by babelfish at runtime.

Add copy_metadata('cleanit') for pgsrip dependency.

Add collect_data_files('cleanit') to bundle YAML config files needed by cleanit at runtime.

Add copy_metadata('trakit') for pgsrip dependency.

mikeSGman · 2025-10-20T03:26:45Z

@cdgriffith - Ok, I think we're finally there. The working screenshots I showed in #701 (comment) were based on running via Python in a Windows command prompt. After that, I realized we needed a bunch more work and tweaks to get it working OOB via the compiled binary. So, commits f5ddccc through 4f8e347 are just that. It works beautifully now. I think it's finally ready for your ACK/NACK. Sorry for all the noise, it's been a long time since I've done a PR - but I hope this brings some extra functionality and usefulness for someone out there. I, for one, use SRT subtitles alongside every MKV I stream via Jellyfin. Without them, it's a transcode every time I start a movie just to render the Blu-ray subtitles natively. HTH. YMMV.

Include pgsrip, pytesseract, babelfish, cleanit, trakit, opencv-python, and pysrt in project dependencies to fix Windows build error where PyInstaller's copy_metadata() could not find package metadata for packages that weren't installed during the build process.

mikeSGman · 2025-10-30T02:56:38Z

Summary of Changes in aacb011

Fixed Windows build error where PyInstaller couldn't find package metadata for OCR dependencies.

Changes:

Added OCR dependencies to pyproject.toml:

pgsrip>=0.1.0
pytesseract>=0.3.0
babelfish>=0.6.0
cleanit>=0.4.0
trakit>=0.2.0
opencv-python>=4.8.0
pysrt>=1.1.0

Updated uv.lock with the new dependencies
Added WINDOWS_BUILD.md documentation for building on Windows

Why this fixes the build:

The spec files use copy_metadata() for these OCR packages, but they weren't being installed during the Windows build because they weren't
in pyproject.toml. The CI runs uv sync --frozen which only installs declared dependencies. Now these packages will be installed and their
metadata will be available to PyInstaller.

Include all babelfish.converters submodules (alpha2, alpha3b, alpha3t, name, opensubtitles) in PyInstaller hidden imports to fix 'No module named babelfish.converters.alpha2' error during OCR conversion.

Add mkvtoolnix directory to PATH environment variable so pgsrip can find mkvextract executable when performing OCR conversion. This fixes the 'mkvextract command not found' error.

Change working directory to video folder and use relative filename when calling pgsrip to avoid issues with special characters (parentheses, brackets) in Windows paths that may cause mkvextract to fail.

cdgriffith · 2025-10-30T05:14:52Z

FastFlix_Windows_Installer.spec

@@ -1,5 +1,5 @@
 # -*- mode: python ; coding: utf-8 -*-
-from PyInstaller.utils.hooks import collect_submodules
+from PyInstaller.utils.hooks import collect_submodules, copy_metadata, collect_data_files


I did not know about those functions, handy!

Thanks! Just testing some final changes. I had to deal with detection for Subtitle Edit's tesseract installations. It works locally, testing a build now.

Check AppData/Roaming/Subtitle Edit for Tesseract installations, parse version numbers from directory names (e.g., Tesseract550), and automatically select the newest version. This ensures modern Tesseract versions are detected even when multiple versions exist.

Initialize PATH environment variables for tesseract and mkvextract at application startup before any subprocesses are spawned. This ensures frozen PyInstaller executables can properly pass environment to subprocesses spawned by pgsrip library.

Set TEMP and TMP environment variables to standard temp directory to ensure pgsrip can create temporary folders correctly when running from frozen PyInstaller executable.

Override pgsrip's temp folder creation to work correctly in frozen PyInstaller executables. pgsrip's MediaPath.create_temp_folder() doesn't work properly when frozen, so we create our own temp folder if the one provided doesn't exist.

Ensure the monkey-patch is applied before importing Mkv class to prevent pgsrip from capturing the original read_data method in its lambda closures. This should fix PyInstaller temp folder issue.

Move the pgsrip monkey-patch to setup_ocr_environment() which runs at application startup, before any pgsrip imports. This ensures the patch is applied before pgsrip's lambda closures are created, fixing temp folder creation in PyInstaller frozen executables.

Move patch_pgsrip_for_pyinstaller() to run AFTER environment variables are set up, in case pgsrip import requires the environment to be configured first.

Simplify code back to working state from source. PyInstaller exe issue is a known pgsrip bug that needs to be fixed upstream. Feature works perfectly when running from source.

Add documentation explaining that PGS to SRT OCR conversion works from source but fails in PyInstaller builds due to pgsrip temp folder bug. Include workaround instructions and requirements.

Implement OCR conversion for PGS (Presentation Graphic Stream) subtitles to SRT format using pgsrip library with auto-detection of required tools. Features: - Auto-detect Tesseract OCR from PATH or Subtitle Edit installations - Auto-detect MKVToolNix (mkvextract/mkvmerge) from standard locations - Support for multiple language codes (2-letter, 3-letter, names) - Automatic cleanup of temporary .sup files after conversion - Works when running FastFlix from source Known limitation: Due to an upstream issue in pgsrip v0.1.12, this feature does not work in PyInstaller-built executables. Users needing PGS OCR should run FastFlix from source with: python -m fastflix Dependencies added: - pgsrip (OCR engine for PGS subtitles) - pytesseract (Tesseract OCR Python wrapper) - babelfish (language code handling) - cleanit, trakit (metadata handling) - opencv-python, pysrt (image/subtitle processing)

mikeSGman · 2025-10-31T03:39:06Z

I give up, I simply can't figure out how to get it to work on a compiled binary (it compiles cleanly - but the srt extraction fails). It works perfectly from source, though, so that's good enough for my usecase.

cdgriffith · 2025-11-04T18:07:52Z

Hey @mikeSGman can you re-open this, I'd like to merge it to dev and play around with it to see if I can get the build working for ya! This is a great feature and would love to have it as part of the standard build

mikeSGman · 2025-11-09T20:50:01Z

I tried to, but it says:

mikeSGman · 2025-11-09T20:50:29Z

It might be because I squashed my commits in my source branch, but I still have the code if it's useful can give it to you.

mikeSGman · 2025-12-22T03:41:17Z

GitHub won’t let me reopen this PR because the source branch history was rewritten. I opened a new PR with the same changes here: #709.

cdgriffith changed the base branch from master to develop October 18, 2025 18:50

cdgriffith reviewed Oct 18, 2025

View reviewed changes

fastflix/models/config.py Outdated Show resolved Hide resolved

fastflix/models/config.py Outdated Show resolved Hide resolved

fastflix/models/config.py Outdated Show resolved Hide resolved

fastflix/widgets/background_tasks.py Outdated Show resolved Hide resolved

mikeSGman force-pushed the feature/pgs-to-srt-ocr branch from c067e1f to cc88b50 Compare October 19, 2025 19:13

mikeSGman force-pushed the feature/pgs-to-srt-ocr branch from cc88b50 to 1c6c486 Compare October 19, 2025 19:37

mikeSGman added 6 commits October 19, 2025 19:57

Add pgsrip metadata to PyInstaller builds

5dd7627

Include package metadata for pgsrip, pytesseract, and babelfish in the Windows builds to fix 'No package metadata was found' error when running OCR conversion from the compiled executable.

Include babelfish data files in PyInstaller builds

c1d63d1

Add collect_data_files('babelfish') to bundle ISO language code data files needed by babelfish at runtime.

Include cleanit metadata in PyInstaller builds

39b1a5f

Add copy_metadata('cleanit') for pgsrip dependency.

Include cleanit data files in PyInstaller builds

964ce3c

Add collect_data_files('cleanit') to bundle YAML config files needed by cleanit at runtime.

Include trakit metadata in PyInstaller builds

4f8e347

Add copy_metadata('trakit') for pgsrip dependency.

mikeSGman added 5 commits October 30, 2025 03:02

Add babelfish converter submodules as hidden imports

9bd98ea

Include all babelfish.converters submodules (alpha2, alpha3b, alpha3t, name, opensubtitles) in PyInstaller hidden imports to fix 'No module named babelfish.converters.alpha2' error during OCR conversion.

Add MKVToolNix directory to PATH for pgsrip

fdee985

Add mkvtoolnix directory to PATH environment variable so pgsrip can find mkvextract executable when performing OCR conversion. This fixes the 'mkvextract command not found' error.

Run pgsrip from video directory to avoid Windows path issues

c7fcaa1

Change working directory to video folder and use relative filename when calling pgsrip to avoid issues with special characters (parentheses, brackets) in Windows paths that may cause mkvextract to fail.

Add test script and use POSIX paths for pgsrip

61e9735

Update test script with tesseract/mkvextract paths

d967c82

cdgriffith reviewed Oct 30, 2025

View reviewed changes

mikeSGman added 7 commits October 30, 2025 05:30

Fix tesseract path for Subtitle Edit installation

5060f88

Use Tesseract 5.5.0 for testing

613c64f

Add detection test script

54376c9

Add debug logging for pgsrip

90a64ba

Enable keep_temp_files for debugging PyInstaller temp folder issue

4560654

mikeSGman added 10 commits October 30, 2025 06:33

Ensure TEMP/TMP env vars are set for PyInstaller

211b08b

Set TEMP and TMP environment variables to standard temp directory to ensure pgsrip can create temporary folders correctly when running from frozen PyInstaller executable.

Remove invalid keep_temp_files parameter

8ecb3ad

Fix pgsrip monkey-patch to apply before Mkv import

953c967

Ensure the monkey-patch is applied before importing Mkv class to prevent pgsrip from capturing the original read_data method in its lambda closures. This should fix PyInstaller temp folder issue.

Apply pgsrip patch after environment setup

b7884e4

Move patch_pgsrip_for_pyinstaller() to run AFTER environment variables are set up, in case pgsrip import requires the environment to be configured first.

Add debug output to verify pgsrip patch is applied

1ba9941

Revert to simpler pgsrip usage - works from source

ddaae55

Simplify code back to working state from source. PyInstaller exe issue is a known pgsrip bug that needs to be fixed upstream. Feature works perfectly when running from source.

Document known PyInstaller limitation for PGS OCR

835607e

Add documentation explaining that PGS to SRT OCR conversion works from source but fails in PyInstaller builds due to pgsrip temp folder bug. Include workaround instructions and requirements.

mikeSGman force-pushed the feature/pgs-to-srt-ocr branch from a66fdfb to 2f89be5 Compare October 31, 2025 03:26

mikeSGman closed this Oct 31, 2025

mikeSGman deleted the feature/pgs-to-srt-ocr branch October 31, 2025 03:39

mikeSGman mentioned this pull request Oct 31, 2025

Support external subtitles (SRT UTF-8) with same name as source file to be automatically imported and tagged #706

Open

mikeSGman mentioned this pull request Dec 22, 2025

Add PGS to SRT OCR subtitle extraction feature #709

Open

Uh oh!

Add PGS to SRT OCR conversion feature #701

Add PGS to SRT OCR conversion feature #701

Uh oh!

Conversation

mikeSGman commented Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Add PGS to SRT OCR Conversion Feature

Summary

Motivation

Features

User-Facing Changes

Technical Implementation

Files Modified

Dependencies

Key Design Decisions

Testing

Tested Scenarios

Test Results

Testing Checklist

Installation Instructions for Users

Windows

Linux

macOS

Breaking Changes

Migration Guide

Future Enhancements

Related Issues

Screenshots

Checklist

Uh oh!

cdgriffith left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mikeSGman commented Oct 18, 2025

Uh oh!

mikeSGman commented Oct 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mikeSGman commented Oct 20, 2025

Uh oh!

mikeSGman commented Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cdgriffith Oct 30, 2025

Choose a reason for hiding this comment

Uh oh!

mikeSGman Oct 30, 2025

Choose a reason for hiding this comment

Uh oh!

mikeSGman commented Oct 31, 2025

Uh oh!

cdgriffith commented Nov 4, 2025

Uh oh!

mikeSGman commented Nov 9, 2025

Uh oh!

mikeSGman commented Nov 9, 2025

Uh oh!

mikeSGman commented Dec 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mikeSGman commented Oct 15, 2025 •

edited

Loading

mikeSGman commented Oct 19, 2025 •

edited

Loading

mikeSGman commented Oct 30, 2025 •

edited

Loading