OCR technology, in full optical character recognition technology, Scanning and comparison technique intended to identify printed text or numerical data. OCR technology avoids the need to retype already printed material for data entry. OCR technology software attempts to identify characters by comparing shapes to those stored in the software library. The OCR technology software tries to identify words using character proximity and will try to reconstruct the original page layout. High accuracy can be obtained by using sharp, clear scans of high-quality originals.
VeryPDF's PNM to Plain Text OCR Converter Command Line is a Command Line application that lets you use OCR technology accurately and quickly to convert PNM to plain text singly or in batches and other formats of image to plain text files singly or in batches through command line and supported parameters in both 32 bits and 64 bits Windows systems. VeryPDF's PNM to Plain Text OCR Converter Command Line also enable you to convert scanned PDF to plain text singly or in batches flawlessly with OCR technology and command line.
PNM to Plain Text OCR Converter Command Line has following features:
Download and Purchase PNM to Plain Text OCR Converter Command Line
Version |
Quantity |
Price (USD) |
Download |
|
PNM to Plain Text OCR Converter Command Line |
1 Server License | 195/each | ||
1 Developer License | 1495/each | |||
OCR Language Packs |
|
Free |
Free |
Notice: default package of PNM to Plain Text OCR Converter Command Line contains OCR technology only supporting English as default. However you can download more OCR language packs at here.
PNM to Plain Text OCR Converter Command Line Options:
-------------------------------------------------------
Usage: pdf2txtocr.exe [options] <PDF-file> <Text-file>
-firstpage <int> : first PDF page to convert
-lastpage <int> : last PDF page to convert
-res <int> : set resolution, the
unit is DPI (default is 300 dpi)
-ownerpwd <string> : set owner password for encrypted PDF file
-userpwd <string> : set user password for encrypted PDF file
-layout :
maintain original physical layout
-noc
: don't insert page breaks 0x0C between pages in text file
-bitcount <int> : set color depth when render PDF page to
image data, it can be set 1, 8, 24, default is 8bit
-ocr
: enable OCR function for scanned PDF file
-lang <string> : choose the language for OCR engine
-text <string> : add additional text at end of each text
page, this parameter supports the following variables:
%PageNumber% : current page number
%PageCount% : total page count of PDF file
-$ <string> : input your License Key
Examples:
pdf2txtocr.exe C:\in.pdf C:\out.txt
pdf2txtocr.exe -firstpage 1 -lastpage 1 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -res 300 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ownerpwd 123 -userpwd 456 C:\in.pdf C:\out.txt
pdf2txtocr.exe -layout C:\in.pdf C:\out.txt
pdf2txtocr.exe -noc C:\in.pdf C:\out.txt
pdf2txtocr.exe C:\in.tif C:\out.txt
pdf2txtocr.exe C:\in.jpg C:\out.txt
pdf2txtocr.exe C:\in.bmp C:\out.txt
pdf2txtocr.exe C:\in.png C:\out.txt
pdf2txtocr.exe -ocr -lang eng C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -bitcount 1 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -bitcount 8 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -bitcount 24 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -lang deu C:\in.pdf C:\out.txt
pdf2txtocr.exe -lang deu C:\in.tif C:\out.txt
pdf2txtocr.exe -text "PageText %PageNumber% of %PageCount%" C:\in.pdf C:\out.txt
Following command line will OCR all PDF
files in D:\temp\ folder to text files:
for %F in (D:\temp\*.pdf) do pdf2txtocr.exe -ocr -lang deu "%F" "%~dpnF.txt"
Following command line will OCR all PDF
files in D:\temp\ folder and subdirectories to text files:
for /r D:\temp %F in (*.pdf) do pdf2txtocr.exe -ocr "%F" "%~dpnF.txt"
Following command line will OCR all PDF
files from D:\temp\ folder and output text files to C:\test folder:
for %F in (D:\temp\*.pdf) do pdf2txtocr.exe -ocr "%F" "C:\test\%~nF.txt""
See Also:
DocConverter COM
Component (+HTML2PDF.exe): Convert HTML, DOC, RTF, XLS, PPT, TXT etc.
files to PDF files, it is depend on
PDFcamp Printer
product.
PDF to Text
Converter: Convert PDF files to plain text files.
HTML
Converter: Convert HTML files to TIF, TIFF, JPG, JPEG, GIF, PNG, BMP, PCX,
TGA, JP2 (JPEG2000), PNM, etc. formats.
PDF to Image
Converter: Convert PDF files to TIF, TIFF, JPG, GIF, PNG, BMP, EMF, PCX, TGA
formats.
Image to
PDF Converter: Convert 40+ image formats to PDF files.
Email Us: support@verypdf.com
Search By Keywords:
MNG TO HTML ::
MNG TO ASCII ::
MNG TO HTM ::
MNG TO TEXT DOCUMENT ::
MNG TO DOCUMENT ::
MNG TO DOC ::
MNG TO EDITABLE DOCUMENT ::
MNG TO EDITABLE DOC ::
MNG TO DOCX ::
MNG TO WORD ::
MNG TO OFFICE ::
MNG TO OPENOFFICE ::
MNG TO XML ::
MNG TO EDITABLE WORD ::
PIC TO TXT ::
PIC TO TEXT ::
PIC TO PLAIN TEXT ::
PIC TO RTF ::
PIC TO HTML ::
PIC TO ASCII ::
VeryPDF.com
|
VeryDOC.com |
VeryPCL.com |
Links |
Contact
Copyright © 2002- VeryPDF.com, Inc. All rights reserved.
Send comments about this site to the
webmaster.