Subtitle OCR

Convert bitmap subtitle sources into editable subtitles you can review, clean, and export.

Subtitle OCR

Overview

Subtitle OCR converts bitmap subtitle sources into editable text subtitles. Import PGS tracks from media containers, standalone SUP files, or standalone VobSub IDX/SUB pairs, then review each decoded cue against its source image.

Works with other tools

Some outputs can be opened again in another MediaFlow workspace.

Merge can use audio, subtitle, and video tracks saved from Extraction.
Translation can use subtitles saved from Audio to Subs, Video OCR, or Subtitle OCR.
Rename can organize generated files; File Information can inspect media used by other tools.

Demo video

Watch a short product demo for this tool before following the detailed steps.

Formats

Use this section to confirm what you can bring in and what you can save.

Supported imports

Import media files with PGS bitmap subtitle tracks, standalone SUP subtitle files, or standalone VobSub IDX/SUB pairs.

MKVM2TSVOBSUPIDX/SUB
Exports

Export the selected OCR version as a standard text subtitle file.

ASSSRTVTT

How to use

Follow the annotated captures to find the right controls inside the desktop app.

1
Import subtitle sources

Add supported media files, SUP files, or IDX/SUB pairs. For media containers, choose the embedded bitmap subtitle tracks you want to process.

2
Run local OCR

Choose the OCR model and GPU option, then start the run. MediaFlow decodes bitmap cues and shows recognized text as processing progresses.

3
Review cues and cleanup

Compare each bitmap preview with the recognized text. Enable AI cleanup when you want OCR errors corrected and duplicate consecutive cues merged.

4
Edit or retry

Edit cue text directly, select versions, retry OCR when needed, or run cleanup again without losing the review history.

5
Export a version

Open the version dialog, preview the output, then copy or export the selected version as ASS, SRT, or WebVTT.

Screenshots

Use these annotated captures to recognize the workspace and key areas of the tool.