About OCR Technology (Optical Character Recognition)
OCR (optical character recognition) is the branch of computer science that involves reading text from paper and translating the images into a form that the computer can manipulate (for example, ASCII codes). An OCR system lets you take a book or a magazine article, feed it directly into an electronic file, and then edit that file in a word processor. All OCR systems include an optical scanner for reading text and software for analyzing the images. Most OCR systems use a combination of hardware (specialized circuit boards) and software to recognize characters, although some inexpensive systems do it entirely in software. Advanced OCR systems can read text in a large variety of fonts, but they still have difficulty with handwritten text.
VeryPDF's PCX to Text Document OCR Converter Command Line is a Windows command line utility that converts PCX images and scanned PDF files to text files, one at a time or in batches.
It also converts other image formats to text documents, e.g., JPG to DOC, BMP to text, TIFF to DOC, and TIF to RTF.
All of these conversions, including batch conversion of scanned PDF, PCX, and other image files, are performed through OCR technology.
Download and Purchase PCX to Text Document OCR Converter Command Line
Version | Quantity | Price (USD) | Download
PCX to Text Document OCR Converter Command Line | 1 Server License | 195/each |
PCX to Text Document OCR Converter Command Line | 1 Developer License | 1495/each |
OCR Language Packs | | Free | Free
Note: The default package of PCX to Text Document OCR Converter Command Line includes only the English OCR language for command line conversions. However, you can download additional OCR language packs here.
Features and Abilities of PCX to Text Document OCR Converter Command Line:
Supported Options of PCX to Text Document OCR Converter Command Line:
-------------------------------------------------------
Usage: pdf2txtocr.exe [options] <PDF> <Text>
-firstpage <int>   : first PDF page to convert
-lastpage <int>    : last PDF page to convert
-res <int>         : set resolution, the unit is DPI (default is 300 dpi)
-ownerpwd <string> : set owner password for encrypted PDF file
-userpwd <string>  : set user password for encrypted PDF file
-layout            : maintain original physical layout
-noc               : don't insert page breaks 0x0C between pages in text file
-bitcount <int>    : set color depth when render PDF page to image data, it can be set 1, 8, 24, default is 8bit
-ocr               : enable OCR function for scanned PDF file
-lang <string>     : choose the language for OCR engine
-text <string>     : add additional text at end of each text page, this parameter supports the following variables:
                     %PageNumber% : current page number
                     %PageCount%  : total page count of PDF file
-$ <string>        : input your License Key
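When the converter is driven from a script rather than typed by hand, it can help to assemble the argument list in one place. The following is a minimal Python sketch, not part of the product: the function name and its parameter names are hypothetical, only flags listed in the usage text above are emitted, and placing the options before the two file paths simply follows the pattern of the usage line.

```python
from typing import List, Optional


def build_pdf2txtocr_args(
    src: str,
    dst: str,
    *,
    exe: str = "pdf2txtocr.exe",   # assumed to be on PATH or given as a full path
    first_page: Optional[int] = None,
    last_page: Optional[int] = None,
    res: Optional[int] = None,
    layout: bool = False,
    no_page_breaks: bool = False,
    bitcount: Optional[int] = None,
    ocr: bool = False,
    lang: Optional[str] = None,
) -> List[str]:
    """Build an argument list using only flags from the usage text."""
    # The usage text says -bitcount accepts 1, 8, or 24.
    if bitcount is not None and bitcount not in (1, 8, 24):
        raise ValueError("bitcount must be 1, 8, or 24")

    args = [exe]
    if first_page is not None:
        args += ["-firstpage", str(first_page)]
    if last_page is not None:
        args += ["-lastpage", str(last_page)]
    if res is not None:
        args += ["-res", str(res)]
    if layout:
        args.append("-layout")
    if no_page_breaks:
        args.append("-noc")
    if bitcount is not None:
        args += ["-bitcount", str(bitcount)]
    if ocr:
        args.append("-ocr")
    if lang is not None:
        args += ["-lang", lang]
    # Input and output paths come last, as in the usage line.
    args += [src, dst]
    return args


print(build_pdf2txtocr_args(r"C:\in.pdf", r"C:\out.txt", ocr=True, lang="eng"))
```

On a Windows machine where the tool is installed, the returned list could be passed to `subprocess.run(args, check=True)`. Passing a list rather than a single string avoids manual quoting of paths that contain spaces.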
Examples:
pdf2txtocr.exe C:\in.pdf C:\out.txt
pdf2txtocr.exe -firstpage 1 -lastpage 1 C:\in.pdf C:\out.doc
pdf2txtocr.exe -ocr -res 300 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ownerpwd 123 -userpwd 456 C:\in.pdf C:\out.docx
pdf2txtocr.exe -layout C:\in.pdf C:\out.txt
pdf2txtocr.exe -noc C:\in.pdf C:\out.rtf
pdf2txtocr.exe C:\in.tif C:\out.txt
pdf2txtocr.exe C:\in.jpg C:\out.txt
pdf2txtocr.exe C:\in.bmp C:\out.txt
pdf2txtocr.exe C:\in.png C:\out.txt
pdf2txtocr.exe -ocr -lang eng C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -bitcount 1 C:\in.pdf C:\out.docx
pdf2txtocr.exe -ocr -bitcount 8 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -bitcount 24 C:\in.pdf C:\out.doc
pdf2txtocr.exe -ocr -lang deu C:\in.pdf C:\out.txt
pdf2txtocr.exe -lang deu C:\in.tif C:\out.doc
pdf2txtocr.exe -text "PageText %PageNumber% of %PageCount%" C:\in.pdf C:\out.txt
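The description of the -noc option implies that, unless -noc is given, pages in the output text file are separated by the page break character 0x0C (form feed). A script that post-processes the output can use that character to recover individual pages. The sketch below is an illustration in Python under that assumption; the function name is hypothetical, and because the page does not say whether a page break also follows the last page, an empty final chunk is simply dropped.

```python
from typing import List

FORM_FEED = "\x0c"  # the 0x0C page-break character named in the -noc description


def split_pages(text: str) -> List[str]:
    """Split converter output into per-page strings on form feed characters."""
    pages = text.split(FORM_FEED)
    # Drop an empty trailing chunk (present if the text ends with a page break).
    if pages and pages[-1].strip() == "":
        pages.pop()
    return pages


sample = "page one\x0cpage two\x0c"
for number, page in enumerate(split_pages(sample), start=1):
    print(number, page)
```

In practice the text would be read from the converted file first, using whatever encoding the output actually has; the encoding is not stated on this page.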
The following command line will OCR all PDF files in the D:\temp\ folder to text files:
for %F in (D:\temp\*.pdf) do pdf2txtocr.exe -ocr -lang deu "%F" "%~dpnF.txt"
The following command line will OCR all PDF files in the D:\temp\ folder and its subdirectories to text files:
for /r D:\temp %F in (*.pdf) do pdf2txtocr.exe -ocr "%F" "%~dpnF.txt"
The following command line will OCR all PDF files in the D:\temp\ folder and write the text files to the C:\test folder:
for %F in (D:\temp\*.pdf) do pdf2txtocr.exe -ocr "%F" "C:\test\%~nF.txt"
These commands are written for the command prompt; inside a batch (.bat) file, write %%F instead of %F.
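The same batch pattern can be expressed in a script when more control is needed, for example to log or skip files. The following Python sketch mirrors the output naming of the loops above: `%~dpnF.txt` (same folder, extension replaced) and `C:\test\%~nF.txt` (fixed output folder). The helper names are hypothetical, and the sketch only builds the command lists; it does not run the converter.

```python
from pathlib import PureWindowsPath
from typing import List, Optional


def output_path(src: str, out_dir: Optional[str] = None) -> str:
    # Without out_dir: same drive, folder, and base name with a .txt
    # extension, like "%~dpnF.txt" in the cmd loops.
    # With out_dir: base name only, placed in that folder, like
    # "C:\test\%~nF.txt".
    p = PureWindowsPath(src)
    if out_dir is None:
        return str(p.with_suffix(".txt"))
    return str(PureWindowsPath(out_dir) / (p.stem + ".txt"))


def ocr_command(src: str, out_dir: Optional[str] = None) -> List[str]:
    # Only the -ocr flag from the usage text is used here.
    return ["pdf2txtocr.exe", "-ocr", src, output_path(src, out_dir)]


print(ocr_command(r"D:\temp\report.pdf"))
print(ocr_command(r"D:\temp\report.pdf", r"C:\test"))
```

On Windows, the input files could be gathered with `pathlib.Path(r"D:\temp").glob("*.pdf")`, or with `rglob("*.pdf")` to include subdirectories, and each command passed to `subprocess.run`.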
View Other Tools here:
PDF to Image Converter: Convert PDF files to TIF, TIFF, JPG, GIF, PNG, BMP, EMF, PCX, TGA formats.
DocConverter COM Component (+HTML2PDF.exe): Convert HTML, DOC, RTF, XLS, PPT, TXT, etc. files to PDF files; it depends on the PDFcamp Printer product.
Image to PDF Converter: Convert 40+ image formats to PDF files.
PDF to HTML Converter: Convert PDF files to HTML documents.
PDF to Text Converter: Convert PDF files to plain text files.
PDF to Vector Converter: Convert PDF files to PS, EPS, WMF, EMF, XPS, PCL, HPGL, SWF, SVG, etc. vector files.
Email Us: support@verypdf.com
Copyright © 2002- VeryPDF.com, Inc. All rights reserved.