Use this action to extract text from an image. The OCR action supports over 92 languages and can read typed or printed text, handwriting, and multi-byte characters such as Japanese, Chinese, and Korean. For the full list, see What languages are supported?. This action can scan PNG, PDF, JPEG, and JPG files.
If you use this action to scan handwritten documents, OCR accuracy can vary dramatically based on the penmanship or handwriting style. This is a limitation of current OCR technology.
✅ Heads-up: This action is unable to extract text entered into a PDF file itself, such as a user-fillable form.
This is a premium action. By default, a premium action is not available in a team as there is an additional fee associated with it. Please contact Catalytic Support to begin working with this action, or check the Premium Actions article.
Since the OCR action extracts any text from an image, it’s easy to use in processes that handle incoming paper documents. For example, if a paper purchase order is scanned and submitted as an image, pull useful information from it with OCR instead of retyping it manually.
This action works great in a Workflow that consistently scans similar images, such as purchase orders. When incoming images are similar, the output fields will be easy to integrate into other actions.
The latest version supports scanning images up to 7.5 MB and PDFs up to 2,000 pages or 100 MB.
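As a rough illustration of the limits above, the following sketch checks a file's type and size before it is submitted to a Workflow. It is not part of the product: `check_upload` and the constants are hypothetical names, the 7.5 MB image figure is taken from the sentence above, and 1 MB is assumed to mean 1,000,000 bytes. The 2,000-page PDF limit is not checked here.

```python
from pathlib import Path

# Limits quoted in this article; adjust if your team's version differs.
# Assumption: 1 MB = 1,000,000 bytes.
MAX_IMAGE_BYTES = int(7.5 * 1_000_000)
MAX_PDF_BYTES = 100 * 1_000_000
SUPPORTED_IMAGE_TYPES = {".png", ".jpeg", ".jpg"}


def check_upload(filename: str, size_bytes: int) -> str:
    """Return "ok", or a short reason the file would likely be rejected."""
    suffix = Path(filename).suffix.lower()
    if suffix == ".pdf":
        limit = MAX_PDF_BYTES
    elif suffix in SUPPORTED_IMAGE_TYPES:
        limit = MAX_IMAGE_BYTES
    else:
        return f"unsupported file type: {suffix or '(none)'}"
    if size_bytes > limit:
        return f"too large: {size_bytes} bytes exceeds {limit} bytes"
    return "ok"
```

For example, `check_upload("purchase-order.png", 5_000_000)` passes, while a 150 MB PDF or a `.tiff` file is flagged before it ever reaches the OCR step.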
- The image to be scanned, no greater than 10 MB. This will most likely be a reference field to an image uploaded in another field.
- Select True or False from the drop-down to choose whether to output a full OCR data table in addition to the normal scanned output text.
  - Select True to include the data table with a row for each word in the document.
  - Leave blank or select False to only output the horizontal and vertical scans.
- To help keep output fields organized, choose a prefix that will be added to the beginning of each output field name.
The horizontal scan and vertical scan fields are the full text extracted from the document. Both should return similar results, but in a different order or layout. The difference between the horizontal and vertical scan is the direction in which the OCR action scans the document.
If Include output data table is TRUE, the action will also output a data table with more details. The data table includes a row for each block the OCR scan finds.
This action can output a very large amount of text, depending on the document. If an output text field exceeds the system field size limit, a fix task is created. The Workflow system field size limit is 128 kB, or about 82 solid pages of text.
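To get a feel for the 128 kB limit, here is a minimal sketch that measures a piece of text the way a byte-based limit would. The function name is hypothetical, and two things are assumed rather than documented: that 1 kB means 1,000 bytes, and that field size is counted in UTF-8 bytes rather than characters.

```python
# Assumption: 1 kB = 1,000 bytes and size is measured on UTF-8 encoded text.
FIELD_LIMIT_BYTES = 128 * 1000


def fits_in_field(text: str, limit: int = FIELD_LIMIT_BYTES) -> bool:
    """Return True if the UTF-8 encoded text is within the byte limit."""
    return len(text.encode("utf-8")) <= limit
```

Under these assumptions, multi-byte scripts such as Japanese, Chinese, and Korean reach the limit with fewer characters than plain ASCII text, because each character takes several bytes when encoded.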
This action may generate multiple fields. To help keep output fields organized, the return field name above will be added to the beginning of each of the output field names, separated by two dashes. The output fields are:
- Equals either True or False.
- Provides the raw results of the horizontal OCR scan.
- Provides the raw results of the vertical OCR scan.
- If Include output data table is TRUE, the data table ID is output in this field. There is a data table row for each block extracted from the image. The different columns of each row are details for the block, such as which block is directly left or right of it. The columns for this table are:
Result Result Description
the block of text the word belongs to
the block directly to the left of the current block
the block directly to the right of the current block
the block directly above the current block
the block directly below the current block
the page number of the word within the PDF/document
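The naming rule for the output fields listed above (prefix, two dashes, then the field name) can be sketched as follows. The prefix `ocr` and the field names used here are placeholders chosen for illustration; the actual field names come from the action's own output.

```python
def output_field_name(prefix: str, field: str) -> str:
    """Join the chosen prefix and a field name with two dashes."""
    return f"{prefix}--{field}"


# Hypothetical field names, for illustration only.
example_fields = ["horizontal-scan", "vertical-scan"]
example_names = [output_field_name("ocr", field) for field in example_fields]
```

With the placeholder values above, `example_names` holds `ocr--horizontal-scan` and `ocr--vertical-scan`, which is the form to look for when referencing these outputs in later actions.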
If something’s not working as expected, or you’re looking for suggestions, check through the options below.
The OCR action is able to detect a wide variety of languages, and multiple languages within each scan.
The following languages are considered fully supported.
|Language||English language name|
|Русский (старая орфография)||Russian|
The following languages are considered to be partially supported.
|Language||English language name|
|Αρχαία ελληνικά||Ancient Greek|
|Azərbaycan (qədim yazı)||Azerbaijani|
|Kreyòl Ayisyen||Haitian Creole|
This action is unable to extract text entered into a PDF file itself, such as a user-fillable form. This is a limitation of the PDF file type and OCR technology.
If it is possible to export the PDF as another file type, the user-filled forms are typically “flattened” into the file. For example, export the PDF as a PNG, then run it through OCR.