Capture2Text

Contents

What is Capture2Text?

Capture2Text enables users to quickly OCR a portion of the screen using a keyboard shortcut. The resulting text will be saved to the clipboard by default.

Conceptual illustration:

Download

The latest version can be found on the Capture2Text download page hosted by SourceForge. Source code is included.

How to Launch (no installation required)

  1. Unzip the contents of the zip file. Make sure that there are no Asian or other non-ASCII characters in the path where you unzipped it. Don't unzip it to the "Program Files" directory (this will avoid issues related to write privileges).
  2. Double-click on Capture2Text.exe. You should see the Capture2Text icon on the bottom-right of your screen (though it might be hidden in which case you will have to click on the "Show hidden icons" arrow).

OCR

Capture2Text can OCR the following languages:

Afrikaans (afr) Greek (ell) Odiya (ori)
Albanian (sqi) Gujarati (guj) Panjabi (pan)
Amharic (amh) Haitian (hat) Persian (fas)
Ancient Greek (grc) Hebrew (heb) Polish (pol)
Arabic (ara) Hindi (hin) Portuguese (por)
Assamese (asm) Hungarian (hun) Pushto (pus)
Azerbaijani (aze) Icelandic (isl) Romanian (ron)
Basque (eus) Indic (inc) Russian (rus)
Belarusian (bel) Indonesian (ind) Sanskrit (san)
Bengali (ben) Inuktitut (iku) Serbian (srp)
Bosnian (bos) Irish (gle) Sinhala (sin)
Bulgarian (bul) Italian (ita) Slovak (slk)
Burmese (mya) Japanese (jpn) Slovenian (slv)
Catalan (cat) Javanese (jav) Spanish (spa)
Cebuano (ceb) Kannada (kan) Swahili (swa)
Central Khmer (khm) Kazakh (kaz) Swedish (swe)
Cherokee (chr) Kirghiz (kir) Syriac (syr)
Chinese - Simplified (chi_sim) Korean (kor) Tagalog (tgl)
Chinese - Traditional (chi_tra) Kurukh (kru) Tajik (tgk)
Croatian (hrv) Lao (lao) Tamil (tam)
Czech (ces) Latin (lat) Telugu (tel)
Danish (dan) Latvian (lav) Thai (tha)
Dutch (nld) Lithuanian (lit) Tibetan (bod)
Dzongkha (dzo) Macedonian (mkd) Tigrinya (tir)
English (eng) Malay (msa) Turkish (tur)
Esperanto (epo) Malayalam (mal) Uighur (uig)
Estonian (est) Maltese (mlt) Ukrainian (ukr)
Finnish (fin) Marathi (mar) Urdu (urd)
Frankish (frk) Math/Equations (equ) Uzbek (uzb)
French (fra) Middle English (1100-1500) (enm) Vietnamese (vie)
Galician (glg) Middle French (1400-1600) (frm) Welsh (cym)
Georgian (kat) Nepali (nep) Yiddish (yid)
German (deu) Norwegian (nor)

By default only English, French, German, Italian, Japanese and Spanish are installed.

How to install additional languages:

  1. Download the appropriate OCR language dictionary.
  2. Open the ".zip" file you just downloaded with 7-Zip or similar decompression software.
  3. Drag all files contained within the zip file to this path in the Capture2Text folder: Capture2Text\Utils\tesseract\tessdata
  4. Restart Capture2Text

How to OCR:

  1. Position your mouse at the top-left corner of the text that you want to OCR.
  2. Press the OCR key (Windows Key + Q) to begin an OCR capture.
  3. Move your mouse to resize the blue box over the text that you want to OCR.
  4. Press the OCR key again or left-click to complete the OCR capture. The OCR'd text will be placed in the clipboard.

To cancel an OCR capture, press ESC.

To move the entire OCR capture box, hold down the right mouse button and drag.

To nudge the OCR capture box, use the arrow keys.

To toggle the active OCR capture box corner, press the space bar.

To change the OCR language, right-click the Capture2Text tray icon, select the OCR Language option and then select the desired language.

To quickly switch between 3 languages, use the OCR language quick access keys: Windows Key + 1, Windows Key + 2, and Windows Key + 3.

When Chinese or Japanese is selected, you should specify the text direction (vertical/horizontal/auto) using the text direction key: Windows Key + W. If auto is selected, horizontal will be used when the capture width is more than twice the height, otherwise vertical will be used. The text direction also affects how furigana is stripped from Japanese text.

(For Japanese) When OCR pre-processing is enabled, by default, Capture2Text will attempt to strip out furigana. You may disable this behavior in "Preferences... -> OCR -> Strip Furigana".

Using the Preferences dialog, you can change the following OCR settings:

Output Options

By default, the OCR'd text will be placed on the clipboard.

You also have 3 more ways to output the text.

To send the text to a pop-up window you can right-click the Capture2Text tray icon and select Show Popup Window.

To send the text to whichever textbox currently contains the blinking cursor/I-beam, right-click the Capture2Text tray icon and select Send to Cursor.

(Advanced) To send the text directly to a window/control (for example, Notepad++), first fill in the Send to Control settings in the Preferences dialog.

Using the Preferences dialog, you can change the following output settings:

Configuration

Right-click the Capture2Text tray icon in the bottom-right of your screen and then select the "Preferences..." option to bring up the Preferences dialog.

Substitutions

Sometimes Capture2Text consistently makes the same OCR mistakes such as recognizing an "M" as "I\/|".

By editing the substitutions.txt file in the Capture2Text directory, you may tell Capture2Text to substitute one text string for another text string.

Just find the appropriate language section and add one substitution per line in this format:
from_text = to_text

Example (adding 3 substitutions to the English section):

English:
I\/| = M
>< = X
some%space%text = some_text

To create a substitution regardless of language, add the substitution to the "All:" section.

Special tokens and escape characters:

%space% Space character
%tab% Tab character
%eq% Equals (=)
%perc% Percent sign (%)
%lf% Linefeed character (\n)
%cr% Carriage return character (\r)

You may disable a substitution by adding a "#" in front.

When done editing substitutions.txt, either restart Capture2Text or switch language for the substitutions to take effect.

Command Line Options

You may OCR the screen via command line by calling Capture2Text in this format:

Capture2Text.exe x1 y1 x2 y2 [output_file]

Required Arguments:
x1 - X1-Coordinate of the screen
y1 - Y1-Coordinate of the screen
x2 - X2-Coordinate of the screen
y2 - Y2-Coordinate of the screen

Optional Arguments:
output_file - The OCR'd text will be written to this file if specified.

Capture2Text will read settings.ini to determine settings such as OCR language and output options (clipboard, popup, etc.).

Examples:
Capture2Text.exe 10 152 47 321 output.txt
Capture2Text.exe 10 152 47 321

Note: Make sure that you wait for Capture2Text to finish processing before attempting to start a new instance of Capture2Text. Batch files automatically do this. If you are typing directly into the console window, be sure to use the "start /wait" command. For example:

start /wait Capture2Text.exe 10 152 47 321 output.txt

Troubleshooting & FAQ

How to Run Capture2Text from Source Code

  1. Download and install AutoHotkey 32-bit Unicode from: http://ahkscript.org/download/
  2. Copy the .ahk files from SourceCode/Capture2Text_AHK_Script to the root Capture2Text folder (the same folder that contains Capture2Text.exe)
  3. Double-click Capture2Text.ahk

Acknowledgments/Tools Used

Tesseract OCR engine
Leptonica Image processing and analysis library
ScreenCapture AutoHotKey screen capture script

Related Tools for Japanese Language Learners