Knowledgebase
Extra words in PDF files
Posted by Alexey Sokolov (AIT) on 27 June 2012 12:04 PM

A problem with extra words can be caused by graphics in your .pdf file. AnyCount’s word count engine will count every graphic element as one word. Thus if your .pdf file is recognized, then the ‘PDF Graphic Recognition’ function should be disabled. You can access the default settings of this file format, by selecting All formats > Acrobat > PDF in the ribbon.

Also, please find some suggestions below to provide fully correct counting of unrecognized PDF files:

1. Avoid all signatures, pictures which do not contain text and stamps. delete all such objects in your unrecognized documents to get fully right results.
2. If you have pages in documents which have rotated text, please, rotate these pages to establish normal view of it. Pages with rotated text will be counted incorrectly.
3. Resolution and quality of images in .PDF files influence text recognition quality. So, make sure that you have a normal resolution and quality of pages of your unrecognized PDF file.

(0 vote(s))
Helpful
Not helpful

Comments (0)