This feature creates a separate text file containing all of the textual elements of your printed document. These text files are often paired with the image or other output files when stored in archival systems, so that the files can be searched and retrieved using their text.


The text extraction feature is not Optical Character Recognition (OCR) software. Only straight text is extracted, and the formatting of the text file may not exactly match the original document.


Text Extraction

Enable Text Extraction

Enable this check box to turn on text extraction.

If you leave the field below empty, a file with a .txt extension is created in the same directory as the output file, with the same name as the output file.

If you want to create the text file with a particular name, enter the full path to the file you want to use, or use the Browse... button to locate a file on disk.
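As a rough illustration, the naming rule described above can be sketched in Python. The function name text_file_path and the example paths are hypothetical, and the sketch assumes that the .txt extension replaces the output file's own extension; the driver may instead append it.

```python
from pathlib import PureWindowsPath


def text_file_path(output_file, custom_path=""):
    """Return the path the extracted text would be written to (illustrative only)."""
    # A path entered in the field takes priority over the default name.
    if custom_path:
        return PureWindowsPath(custom_path)
    # Assumption: same directory and base name as the output file,
    # with the extension swapped for .txt.
    return PureWindowsPath(output_file).with_suffix(".txt")


print(text_file_path(r"C:\Output\invoice.pdf"))
```

A script that collects the text files for an archival system could use a helper like this to find the text file that belongs to each output file.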


Choose the layout for the text in your file.

Physical - attempts to match the format of the text in the original file.

Raw - saves the text in the order in which it was sent to the driver. This may not be the same order as the text in the original file.

None - No formatting is attempted. All text is written to the file in the order in which it is received from the printing application.


Choose the encoding format and end-of-line encoding for your text file. Choose the line ending that matches the operating system on which the text file will be used.

UTF-8, UTF-16 - 8-bit and 16-bit Unicode encoding

ANSI - uses the current ANSI code page

Windows - lines end with the carriage return line feed (CRLF, \r\n) used by Windows.

Unix - lines end with the line feed (LF, \n) used by UNIX.

Mac - lines end with the carriage return (CR, \r) used by Macintosh.
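If you read the text file from a script, the encoding you pass must match the one chosen here. The following is a minimal Python sketch, not part of the driver. The function name read_extracted_text is hypothetical, and the codec names are Python's, assumed to correspond to the options above ("utf-8-sig" or "utf-16" for the Unicode choices, and a code page name such as "cp1252" for ANSI).

```python
def read_extracted_text(path, encoding="utf-8-sig"):
    """Read an extracted text file and normalise every line ending to LF."""
    # newline=None enables universal newlines, so the Windows (CRLF),
    # Unix (LF) and Mac (CR) line endings are all returned as a single LF.
    with open(path, "r", encoding=encoding, newline=None) as handle:
        return handle.read()
```

Reading with universal newlines means the same script works whichever line ending was selected.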

Emit Page Breaks

Enable this check box to insert a page break, or form feed (\f), in your text file for every page in your original document.
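With page breaks enabled, the text can be divided back into pages by splitting on the form feed character. The sketch below is illustrative Python only. The function name split_pages is hypothetical, and because it is not stated whether a form feed also follows the last page, the sketch tolerates both cases.

```python
def split_pages(text):
    """Split extracted text into a list of pages at each form feed."""
    pages = text.split("\f")
    # If the text ends with a form feed, the split leaves an empty
    # final element; drop it so it is not counted as a page.
    if pages and pages[-1] == "":
        pages.pop()
    return pages


pages = split_pages("Page one text\fPage two text\f")
print(len(pages))
```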

Enable Control Strings

Enable this check box to turn on support for control strings. While you can still use control strings, this feature is being deprecated. It is retained in the driver for backwards compatibility. See Using Control Strings in the Deprecated Features section for more information.

Prefix, Suffix

By default, the prefix '~0%' and the suffix '?' are used to recognize a control string pattern in a printed document. Some applications break this string apart and print each character separately, making it impossible for the driver to detect the control string. This feature allows you to set the control string prefix and suffix to a single Unicode character value each. By using this feature, products like Crystal Reports and Adobe Reader can be configured so that the driver recognizes their control strings.
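To show why a prefix and suffix are needed, here is a small Python sketch of matching text that sits between the two markers. It is not the driver's implementation. The function name find_control_strings and the control string body "ABC" are hypothetical, since the format of the text between the markers is not described here.

```python
import re


def find_control_strings(text, prefix="~0%", suffix="?"):
    """Return the text found between each prefix and the next suffix."""
    # re.escape keeps characters such as '?' from being treated as
    # regular expression operators.
    pattern = re.compile(re.escape(prefix) + r"(.*?)" + re.escape(suffix))
    return pattern.findall(text)


# Default markers, with a made-up control string body.
print(find_control_strings("Invoice ~0%ABC? total"))
# Single-character markers, as this setting allows.
print(find_control_strings("Invoice \u00a7ABC\u00b6 total", "\u00a7", "\u00b6"))
```

A single-character marker cannot be split across separately printed pieces the way the three-character default prefix can, which is the situation this setting is meant to address.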


