The Text Extraction tab will create a separate text file containing all of the textual elements of your printed document. These text files are often paired with the files when stored in archival systems to allow searching and retrieval of the files using textual data.
The text extraction feature was not designed to be Optical Character Recognition (OCR) software. Only straight text will be extracted and formatting of the text file may not be exact.
By default this setting is disabled. Enabling this setting means PDF Image Printer will extract text and save the created text file in the same directory and with the same name as the output file.
Choose the layout for the text in your file.
•Physical - Attempts to match the format of the text in the original file.
•Raw - Saves the text in the order in which it is was sent to the driver. This may not be the same order as the text in the original file.
•None - No formatting is attempted. All text is written to the file in the order in which it is received from the printing application.
Choose the encoding format for your text file.
•UTF-16 - uses 16-bit Unicode encoding
•UTF-8 - uses 8-bit Unicode encoding
•ANSI - uses the current ANSI code page
Choose the end-of-line encoding for your text file. Depending on the operating system the text file will be used on, you may need to choose the appropriate line return code.
•Windows - lines end with the carriage return line feed (CRLF, \r\n) used by Windows
•Mac - lines end with the carriage return (CR, \r) used by Macintosh
•Unix - lines end with the line feed (LF, \n) used by UNIX.
By default this setting is enabled. Enabling this setting means PDF Image Printer will insert a page break, or form feed (\f) in your text file for every page in your original document.
See Also: Extracting Text From the Created File