App/OCR の履歴(No.5)

目次†

各国語毎の言語ファイルは別パッケージになっている。必要に応じて追加インストールする。

   tesseract <input-image-file> <out-file> -l <lang> -psm <number>

-psm <number>

Page segmentation modes:

0 : Orientation and script detection (OSD) only.

言語データ：osd.traineddata 必須
OSD は Orientation and script detection の略
言語の種類やテキストの傾斜角度等を xxxxx.osd に出力

1 : Automatic page segmentation with OSD.

言語データ：osd.traineddata 必須
回転(90度・180度・270度)したテキストも認識する

2 : Automatic page segmentation, but no OSD, or OCR.

テキストの傾斜角度にかかわる情報を画面に標準出力する。
ファイルは何も作成されない。

3 : Fully automatic page segmentation, but no OSD. (Default)

デフォルトのオプション。
水平テキスト（横書き文書）と垂直テキスト（縦書き文書）に対応
傾斜角度のあるテキストは非対応

5 : Assume a single uniform block of vertically aligned text.

ユニフォームブロックの垂直テキストと仮定する。

6 : Assume a single uniform block of text.

ユニフォームブロックのテキストと仮定する。
デフォルトよりも認識率が良い。

7 : Treat the image as a single text line.

単一のテキスト行として扱う。
画像内に複数行が存在する場合は、画像内の文字すべてが誤認識する。

8 : Treat the image as a single word.

単語として扱う。行に複数の単語がある場合は、
単語間の空白は削られて出力される。「太陽　お月様」 → 「太陽お月様」
画像内に複数行が存在する場合は、画像内の文字すべてが誤認識する。

9 : Treat the image as a single word in a circle.

画像を円の中の一つの単語として扱う。（丸囲みのことか？）

10 : Treat the image as a single character.

1文字として扱う。2文字以上ある場合は、すべてが誤認識する。

11 : Sparse text. Find as much text as possible in no particular order.

色の薄いテキスト。できるだけ多くのテキストを探す。
デフォルトで探せなかったテキストも認識する。

12 : Sparse text with OSD.

言語データ：osd.traineddata 必須
上記11番、及び、
回転(90度・180度・270度)したテキストも認識する

13 : Raw line. Treat the image as a single text line,
```
bypassing hacks that are Tesseract-specific.
```