Skip to main content

How to fix OCR issues

Updated over 11 months ago

How to use PDF Multitool OCR Analyzer

If you are dealing with scanned PDFs and the extracted text is incomplete or inaccurate, you can use the PDF Multitool OCR Analyzer to find the best combination of filters to improve the extractor's results. Here's a step-by-step guide on how to use the PDF Multitool:

  1. First, download the free version of PDF Multitool from here.

  2. Next, load your document into the multitool.

  3. Then, in the left navigation menu, select OCR Analyzer.

  4. Choose the OCR Language and OCR Resolution and click Go.

  5. Copy the recommended filters in the results.

  6. Select your preferred extraction method on the left navigation (e.g. Extract as CSV).

  7. Under Optical Character Recognition, click on the Edit button for the OCR Filters.

  8. Click on the Add button and select the recommended filter(s).

  9. Once all recommended filters have been selected, click OK.

  10. Click Preview to check the result.

  11. If you're satisfied with the outcome, go to the Profile for PDF.co and API Server tab.

  12. Click on Copy as payload for PDF.co or API Server.

  13. Finally, paste this as a value to the profiles parameter.

For a demo on how to use this tool, watch this video: https://youtu.be/NSyyohNNe6E

Check out this page for more information on how to use the PDF Multitool https://bytescout.com/products/pdfmultitool/index.html

Did this answer your question?