Scan to Word OCR Converter is designed to convert scanned image to editable Word or RTF document. The application supports various languages, including English, French, German, Italian, Spanish and Portuguese. It also allows you to select page range. If the password of a PDF document is inputted in Scan to Word OCR Converter, the encrypted PDF file can be converted to Word or RTF document.
Please see the interface of Scan to Word OCR Converter in Figure1. The list on the left is for listing all the PDF documents or image files to be converted. To add files, you can use the "Add File (s)" button below the list or drag all the files which need to be converted into the list. You can also right click the mouse and choose "Add files" option in dropdown list. To clear some files in the list, you can choose the files in the list, then click "Remove" button below the list. The "Remove All" button is used for clearing the file list.
There are four group boxes on the right. In "Output Options" group box, there are 13 options in the combo box for your choice. Options from 1 to 7 are for different output layout formats. Options from 8 to 13 are the six kinds of languages which are supported by the application. You can choose "MS Word Document (*.DOC)" and "Rich Text Format (*.RTF)" as output formats in "Output Formats" group box. If you want to convert a section of PDF document to Word or RTF, you can select the page range in "Page Range" group box. If not, just click "All Pages" radio button. The application also allows you to convert encrypted PDF document to Word or RTF and you just need to input the user password of PDF document into "PDF Password" group box. You can choose "View after convert" to browse the output documents after conversion or not.
If you want to convert image files to Word or RTF documents, Scan to Word OCR Converter can recognize the languages which are supported by the application in the images. For example, there are two image files with Portuguese in them, with some easy clicks, the application can convert the images to editable Word or RTF documents and recognize the characters in them.
The following is the list of OCR & non-OCR Output Options:
1. Original layout without text boxes (Best),
2. Text only (No Images),
3. Original layout with text boxes (Fastest),
4. Flow text with text boxes,
5. Exact layout with text boxes,
6. Flow text without text boxes,
7. Continuous text without text boxes,
8. OCR PDF and Image file (Language: English),
9. OCR PDF and Image file (Language: French),
10. OCR PDF and Image file (Language: German),
11. OCR PDF and Image file (Language: Italian),
12. OCR PDF and Image file (Language: Spanish),
13. OCR PDF and Image file (Language: Portuguese),
Please click "Add File(s)" button to add image files or drag all the files into file list directly. For knowing it is Portuguese in the images, you should choose the thirteenth option-- "OCR PDF and Image file (Language: Portuguese)" in "Output Options" group box. Then choose one output format in "Output Formats" such as "MS Word Document (*.DOC)". Then please click "Convert" button to specify the directory for output documents in "Browse for Folder" dialog box and click "OK" button to run the conversion. Please see the conversion progress in Figure 2.
Several seconds later, you can browse the output Word documents in the specified location. Please see the comparison of one original image and newly created document in Figure3.
It is worthy to mention that if the resolutions of original image are high enough, the languages in the images will be converted to editable ones. On the contrary, the recognized languages can not be edited.