Create TIFF and Extract Text From Images Using OCR

December 6, 2023, by Robert Massart

You can use TIFF Image Printer and Raster Image Printer to effortlessly extract text from images by printing your images or scanned PDF documents. With just one step, you can create TIFF images and extract the text from pages into an editable text file, making it easy to modify the content as needed.

If you have a scanned PDF document and need to create searchable PDF files, see Convert PDF to Searchable PDF with OCR instead.

What is OCR?

OCR (Optical Character Recognition) searches for and recognizes text (characters) on scanned pages or images and extracts it as digital text. Outside factors such as image quality, the font used, and any image background on the pages will all affect the quality of the OCR results.

You can save the text output from the OCR process as hOCR, Text, or ALTO files. In the profile's OCR tab, you choose which of these file types to create, and you can generate all of them at once if you want.

We’re using the TIFF Image Printer below to extract text from images, but the steps are the same for the Raster Image Printer. With Raster Image Printer, this works for every output image format, including TIFF, PNG, and JPEG.

Create the Extract Text From Images Profile

To start, open the Dashboard by double-clicking on the desktop shortcut for your printer.

Launch TIFF Image Printer Dashboard

The Dashboard gives you access to license information, printers, and resources and, most importantly, lets you create, copy, and edit profiles.

Select Edit & Create Profiles to open the Profile Manager, where you can create a new profile.

Open the Profile Manager to create a new profile.

Find the system profile named Color Optimized TIFF and create a copy of it using the copy icon in the lower left. The steps shown here apply to any system profile. You can also create custom profiles with the Add a profile button.

Copy an existing profile to create our text extraction profile.

Configure OCR For The Extract Text From Images Profile

Give your new profile a name and a description. Next, go to the OCR tab and turn on OCR (Optical Character Recognition). Running OCR on each page can be time-consuming, so it is disabled by default.

Enable OCR for the new profile so we can extract text from images.

Next, choose which OCR text files to create. There are three to choose from, and you can select to create more than one type.

  • hOCR is an XHTML file containing the text extracted from the page. It also stores format and layout information and a score for how confident the OCR engine is in each match.
  • Text creates a UTF-8 text file containing only the extracted text.
  • ALTO is similar to hOCR but stores the information as XML following the Analyzed Layout and Text Object specification.
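
To illustrate what the hOCR option gives you beyond plain text, here is a minimal Python sketch that reads words and their confidence scores from hOCR markup. It assumes the file follows the common hOCR convention of `ocrx_word` spans whose `title` attribute carries an `x_wconf` value. The exact markup PEERNET writes is not shown in this article, so treat the class and property names, the sample markup, and the threshold of 80 as assumptions to check against your own output.

```python
from html.parser import HTMLParser


def parse_wconf(title):
    """Return the x_wconf value from an hOCR title string, or None if absent."""
    for prop in title.split(";"):
        parts = prop.split()
        if len(parts) == 2 and parts[0] == "x_wconf":
            return float(parts[1])
    return None


class HocrWordParser(HTMLParser):
    """Collect (text, confidence) pairs from ocrx_word spans."""

    def __init__(self):
        super().__init__()
        self.words = []    # finished (text, confidence) pairs
        self._depth = 0    # > 0 while inside a word span (counts nested tags)
        self._text = []    # text fragments of the word being read
        self._conf = None  # confidence of the word being read

    def handle_starttag(self, tag, attrs):
        if self._depth:    # nested tag inside a word, e.g. <strong>
            self._depth += 1
            return
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if tag == "span" and "ocrx_word" in classes:
            self._depth = 1
            self._text = []
            self._conf = parse_wconf(attrs.get("title") or "")

    def handle_data(self, data):
        if self._depth:
            self._text.append(data)

    def handle_endtag(self, tag):
        if self._depth:
            self._depth -= 1
            if self._depth == 0:  # the word span itself just closed
                self.words.append(("".join(self._text).strip(), self._conf))


def hocr_words(hocr_text):
    """Parse hOCR markup and return a list of (word, confidence) tuples."""
    parser = HocrWordParser()
    parser.feed(hocr_text)
    parser.close()
    return parser.words


# Invented sample markup, for illustration only.
SAMPLE_HOCR = """<div class='ocr_page' title='bbox 0 0 800 600'>
<span class='ocr_line' title='bbox 36 92 300 116'>
<span class='ocrx_word' title='bbox 36 92 96 116; x_wconf 95'>Hello</span>
<span class='ocrx_word' title='bbox 110 92 180 116; x_wconf 62'><strong>wor1d</strong></span>
</span></div>"""

# Flag words the engine was unsure about (threshold chosen arbitrarily).
for word, conf in hocr_words(SAMPLE_HOCR):
    flag = "  <- review" if conf is not None and conf < 80 else ""
    print(f"{word}\t{conf}{flag}")
```

A list of low-confidence words like this can help you decide which pages to proofread by hand. The Text option discards that information.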

For our example, we chose Text OCR. We only want to extract the text from the page and don’t care about the layout or positioning on the page.
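
If layout does matter for your use case, ALTO output can be read with any XML parser. The sketch below is a rough Python example that lists each text line with its position on the page. It assumes the `TextLine` and `String` elements and the `CONTENT`, `HPOS`, and `VPOS` attributes defined by the ALTO schema. The sample document and its namespace version are invented for illustration, and the function name `alto_lines` is hypothetical.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip an XML namespace such as '{http://...}TextLine' down to 'TextLine'."""
    return tag.rsplit("}", 1)[-1]


def alto_lines(alto_xml):
    """Return (hpos, vpos, text) for every TextLine in an ALTO document string."""
    root = ET.fromstring(alto_xml)
    lines = []
    for element in root.iter():
        if local_name(element.tag) != "TextLine":
            continue
        # Each recognized word is a String element; its text is in CONTENT.
        words = [
            child.get("CONTENT", "")
            for child in element
            if local_name(child.tag) == "String"
        ]
        lines.append((element.get("HPOS"), element.get("VPOS"), " ".join(words)))
    return lines


# Invented sample document, for illustration only.
SAMPLE_ALTO = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v3#">
  <Layout><Page><PrintSpace><TextBlock>
    <TextLine HPOS="36" VPOS="92" WIDTH="264" HEIGHT="24">
      <String CONTENT="Invoice" HPOS="36" VPOS="92" WIDTH="90" HEIGHT="24" WC="0.96"/>
      <SP/>
      <String CONTENT="2023" HPOS="140" VPOS="92" WIDTH="60" HEIGHT="24" WC="0.91"/>
    </TextLine>
  </TextBlock></PrintSpace></Page></Layout>
</alto>"""

for hpos, vpos, text in alto_lines(SAMPLE_ALTO):
    print(f"({hpos}, {vpos}) {text}")
```

When reading a file from disk rather than a string, `ET.parse(path).getroot()` is the safer choice because it honors the encoding declared in the file.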

Select what type of OCR text file to create.

Lastly, choose which languages to look for on the page. You must select at least one language. The more languages to match against, the longer the OCR process will take. If you have documents with mixed languages, select all languages used.

PEERNET Image Printers can recognize Arabic, English, French, German, Hebrew, Hindi, Italian, and Spanish, with additional languages available to download.

Choose which languages the OCR process will recognize.

Saving and Using the New Printer Profile

With your OCR settings configured, you can now Save the changes to your new profile. Click the Back arrow to return to the main screen of the Profile Manager and then close it.

We have our new profile. Let’s set it as the default profile TIFF Image Printer uses when printing. To do this, return to the TIFF Image Printer Dashboard and select Manage Printers to open Printer Management.

Open the Printer Management screen.

The Printer Management screen lists all copies of your TIFF Image Printer and the profile each one uses when creating files.

Next to the printer name, use the drop-down list to set your new OCR profile as the default profile. Here, we named the new profile OCR Color Optimized TIFF, so we select that. This profile creates multipage TIFF images. With OCR enabled, the printer also scans each page and saves the recognized text as a separate text file alongside the TIFF image.

Set the printer to use our new extract text from images profile.

Select the Save icon to save your changes to the printer settings.

Save the changes to the printer settings.

Close Printer Management and the Dashboard.

Close the Dashboard.

Convert Scanned PDF to TIFF and Extract Text From Images

Open the document you want to convert to TIFF and extract text from. Here, we opened a scanned PDF in Adobe Reader. You can do the same with TIFF, PNG, and other image files.

Each page in our document is an image. We want to recognize and save the text in this file as we create the TIFF images. You can tell that a page is a scanned image because you cannot select any text on it, only an area of the page, as shown below.

Open a scanned PDF or an image you want to extract the text from.

Select File – Print from your application, and select TIFF Image Printer 12 from the list of printers. Then click Print to send the document to the printer.

Print the PDF or image to TIFF Image Printer.

Printing your document will prompt you to choose the name and location of your new TIFF image and OCR text file. The OCR process saves the extracted text files with the same base name and location as the new image.

Leave the profile OCR Color Optimized TIFF selected in the Save as type field.

Click Save to create your TIFF image and OCR text file.

Choose the name and location of your TIFF and OCR extracted text file.

And we are done. That is all there is to extracting text from images using the PEERNET Image Printers. Looking at our new TIFF image and OCR text file, we can see that the text file contains the text extracted from the image.

TIFF Image is saved with a matching text file with the extracted text.
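
Because the extracted text is saved with the same base name and location as the image, a script can find it from the TIFF path alone. The following Python sketch shows one way to do that. The `.txt` extension, the example path, and the helper name `read_ocr_text` are assumptions made for illustration, so adjust them to match the files your profile actually produces.

```python
from pathlib import Path


def read_ocr_text(tiff_path, text_suffix=".txt"):
    """Return the OCR text saved next to a TIFF, or None if no text file exists.

    text_suffix is an assumption; change it if your text files use another extension.
    """
    text_path = Path(tiff_path).with_suffix(text_suffix)
    if not text_path.is_file():
        return None
    # The Text option writes UTF-8; "utf-8-sig" also accepts a leading byte order mark.
    return text_path.read_text(encoding="utf-8-sig")


# Hypothetical path, for illustration only.
text = read_ocr_text("C:/Scans/invoice.tif")
if text is None:
    print("No OCR text file found next to the image.")
else:
    print(text[:200])
```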
Last modified May 17, 2024.