Convert pictures and scanned PDF files to text documents with a command line OCR tool
About OCR technology
Optical Character Recognition (OCR) is widely used in businesses and other organizations to examine scanned images of printed text and translate the characters into files that can be edited, such as Text or Word documents. Because OCR automates text recognition, it removes most of the manual retyping that converting machine-printed documents used to require. OCR is also used to read characters, numbers, and symbols in documents that contain images, print marks, and logos.
About VeryPDF's PIC to Text Document OCR Converter Command Line
VeryPDF's PIC to Text Document OCR Converter Command Line uses OCR technology
to recognize characters in picture files of various formats and in scanned PDF
files, and writes them to text files, one file at a time or in batches. The
command line application is handy for batch processing from scripts, and its
options also make it convenient to control by hand. Optical Character
Recognition (OCR) is a visual recognition process that turns printed or written
text into an electronic character-based file. A document that is scanned and
converted into a PDF document gives character recognition software page images
in which it can interpret each character image and assign it an electronic
character, which can then be placed in an editable format such as a Text or
Word document.
Download and Purchase PIC to Text Document OCR Converter Command Line
Version | Quantity | Price (USD) | Download
PIC to Text Document OCR Converter Command Line | 1 Server License | 195/each |
PIC to Text Document OCR Converter Command Line | 1 Developer License | 1495/each |
OCR Language Packs | | Free | Free
Note: The default package of PIC to Text Document OCR Converter Command Line includes OCR support for English only. However, you can download more OCR language packs here.
PIC to Text Document OCR Converter Command Line has the following features:
PIC to Text Document OCR Converter Command Line Options:
Email: support@verypdf.com
-------------------------------------------------------
Usage: pdf2txtocr.exe [options] <PDF-file> <Text-file>
-firstpage <int>   : first PDF page to convert
-lastpage <int>    : last PDF page to convert
-res <int>         : set resolution, the unit is DPI (default is 300 dpi)
-ownerpwd <string> : set owner password for encrypted PDF file
-userpwd <string>  : set user password for encrypted PDF file
-layout            : maintain original physical layout
-noc               : don't insert page breaks 0x0C between pages in text file
-bitcount <int>    : set color depth when render PDF page to image data, it can be set 1, 8, 24, default is 8bit
-ocr               : enable OCR function for scanned PDF file
-lang <string>     : choose the language for OCR engine
-text <string>     : add additional text at end of each text page, this parameter supports the following variables:
                     %PageNumber% : current page number
                     %PageCount%  : total page count of PDF file
-$ <string>        : input your License Key
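When the tool is driven from a script, the options above can be assembled into an argument list instead of a hand-typed string. The following Python sketch is only an illustration: the function name build_command and its parameter names are hypothetical, it covers only some of the documented options, and it assumes that the order of options before the two file names does not matter.

```python
from typing import List, Optional


def build_command(
    input_file: str,
    output_file: str,
    *,
    exe: str = "pdf2txtocr.exe",
    first_page: Optional[int] = None,
    last_page: Optional[int] = None,
    resolution: Optional[int] = None,
    layout: bool = False,
    no_page_breaks: bool = False,
    bitcount: Optional[int] = None,
    ocr: bool = False,
    lang: Optional[str] = None,
) -> List[str]:
    """Build: pdf2txtocr.exe [options] <input-file> <Text-file>.

    Hypothetical helper; it only arranges flags listed in the usage text.
    """
    # The usage text says -bitcount can be set to 1, 8, or 24.
    if bitcount is not None and bitcount not in (1, 8, 24):
        raise ValueError("bitcount must be 1, 8, or 24")

    args = [exe]
    if first_page is not None:
        args += ["-firstpage", str(first_page)]
    if last_page is not None:
        args += ["-lastpage", str(last_page)]
    if resolution is not None:
        args += ["-res", str(resolution)]
    if layout:
        args.append("-layout")
    if no_page_breaks:
        args.append("-noc")
    if bitcount is not None:
        args += ["-bitcount", str(bitcount)]
    if ocr:
        args.append("-ocr")
    if lang is not None:
        args += ["-lang", lang]
    # The input and output files always come last, as in the usage line.
    args += [input_file, output_file]
    return args
```

The returned list can be passed to subprocess.run(args, check=True) on a machine where pdf2txtocr.exe is installed; passing a list rather than a single string avoids quoting problems with paths that contain spaces.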
Examples:
pdf2txtocr.exe C:\in.pdf C:\out.txt
pdf2txtocr.exe -firstpage 1 -lastpage 1 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -res 300 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ownerpwd 123 -userpwd 456 C:\in.pdf C:\out.txt
pdf2txtocr.exe -layout C:\in.pdf C:\out.txt
pdf2txtocr.exe -noc C:\in.pdf C:\out.txt
pdf2txtocr.exe C:\in.tif C:\out.txt
pdf2txtocr.exe C:\in.jpg C:\out.txt
pdf2txtocr.exe C:\in.bmp C:\out.txt
pdf2txtocr.exe C:\in.png C:\out.txt
pdf2txtocr.exe -ocr -lang eng C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -bitcount 1 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -bitcount 8 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -bitcount 24 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -lang deu C:\in.pdf C:\out.txt
pdf2txtocr.exe -lang deu C:\in.tif C:\out.txt
pdf2txtocr.exe -text "PageText %PageNumber% of %PageCount%" C:\in.pdf C:\out.txt
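Unless -noc is given, the text file contains a page break character 0x0C (form feed) between pages, so a script can recover the per-page text by splitting on that character. The Python sketch below is a minimal illustration; the function name split_pages is hypothetical, and whether the tool also writes a form feed after the last page is an assumption that the code handles either way.

```python
from typing import List

FORM_FEED = "\x0c"  # the 0x0C page break described for the -noc option


def split_pages(text: str) -> List[str]:
    """Split converted text into a list of pages on form feed characters."""
    pages = text.split(FORM_FEED)
    # If the text ends with a form feed (or is empty), the last entry is
    # blank; drop it so the list length matches the number of pages.
    if pages and pages[-1].strip() == "":
        pages.pop()
    return pages
```

For example, the contents of C:\out.txt can be read into a string and passed to split_pages. The encoding of the output file is not stated on this page, so choose it according to what the tool produces on your system.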
The following command line will OCR all PDF files in the D:\temp\ folder to text
files (when it is run from a .bat file instead of typed at the prompt, write %%F in place of %F):
for %F in (D:\temp\*.pdf) do pdf2txtocr.exe -ocr -lang deu "%F" "%~dpnF.txt"
The following command line will OCR all PDF files in the D:\temp\ folder and its
subdirectories to text files:
for /r D:\temp %F in (*.pdf) do pdf2txtocr.exe -ocr "%F" "%~dpnF.txt"
The following command line will OCR all PDF files in the D:\temp\ folder and
write the text files to the C:\test folder:
for %F in (D:\temp\*.pdf) do pdf2txtocr.exe -ocr "%F" "C:\test\%~nF.txt"
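The same three batch patterns can also be planned from a script. The Python sketch below only builds the command lists and does not run them; the function name plan_batch and its parameters are hypothetical, and the -ocr flag is passed to every file as in the for loops above.

```python
from pathlib import Path
from typing import List, Optional, Union

PathLike = Union[str, Path]


def plan_batch(
    src_dir: PathLike,
    dest_dir: Optional[PathLike] = None,
    recursive: bool = False,
    exe: str = "pdf2txtocr.exe",
) -> List[List[str]]:
    """Return one pdf2txtocr.exe command (as an argument list) per PDF file."""
    src = Path(src_dir)
    # recursive=True mirrors "for /r", otherwise only the folder itself is scanned.
    pdfs = sorted(src.rglob("*.pdf") if recursive else src.glob("*.pdf"))

    commands = []
    for pdf in pdfs:
        if dest_dir is None:
            # Same folder and base name as the PDF, like "%~dpnF.txt".
            out = pdf.with_suffix(".txt")
        else:
            # Base name only, placed in dest_dir, like "C:\test\%~nF.txt".
            out = Path(dest_dir) / (pdf.stem + ".txt")
        commands.append([exe, "-ocr", str(pdf), str(out)])
    return commands
```

Each returned list can be handed to subprocess.run on a machine where the tool is installed. Note that combining recursive=True with dest_dir flattens the folder tree, so PDF files with the same name in different subfolders would map to the same text file.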
People who viewed this software also viewed:
PDF to Image Converter: Convert PDF files to TIF, TIFF, JPG, GIF, PNG, BMP, EMF, PCX, TGA formats.
DocConverter COM Component (+HTML2PDF.exe): Convert HTML, DOC, RTF, XLS, PPT, TXT etc. files to PDF files; it depends on the PDFcamp Printer product.
Image to PDF Converter: Convert 40+ image formats to PDF files.
PDF to HTML Converter: Convert PDF files to HTML documents.
PDF to Text Converter: Convert PDF files to plain text files.
PDF to Vector Converter: Convert PDF files to PS, EPS, WMF, EMF, XPS, PCL, HPGL, SWF, SVG, etc. vector files.
Copyright © 2002- VeryPDF.com, Inc. All rights reserved.