Home PDF2TXT Sample Support Document Component

PDF to HTM OCR Converter Command Line -- Convert PDF to HTM and scanned PDF to HTM with OCR technology

VeryPDF's PDF to HTM OCR Converter Command Line is able to help you convert PDF to HTM and scanned PDF to HTM through OCR technology with command line. VeryPDF's PDF to HTM OCR Converter Command Line is a great tool for Windows command line users to convert image to HTM, too, e.g., TIF, BMP, PNG etc.. singly or in batches. VeryPDF's PDF to HTM OCR Converter Command Line supports various PDF files and image files conversions for HTM files and other formats files, e.g., RTF, TXT, DOC etc..

Download and Purchase PDF to HTM OCR Converter Command Line

Version	Quantity	Price (USD)	Download	Buy All
PDF to HTM OCR Converter Command Line	1 Server License	195/each
PDF to HTM OCR Converter Command Line	1 Developer License	1495/each
OCR Language Packs		Free		Free

Note: The default package of PDF to HTM OCR Converter Command Line only include OCR technology for language English. However you can download more OCR language packs at here.

What is OCR technology?

Optical Character Recognition (OCR) is a visual recognition process that turns printed or written text into an electronic character-based file. A document that is scanned and converted into a PDF document provides the basis for which character recognition software may interpret each character image on the PDF and assign it an electronic character-based file that can then be entered into an editable format, such as a Text or Word document.

What is OCR technology? What is OCR? OCR Technology

PDF to HTM OCR Converter Command Line has following features:

Able to run on both 32 bits and 64 bits Win95/98/ME/NT/2000/XP/2003/Vista/7 systems;
Does NOT need Adobe Acrobat or free Acrobat Reader software;
OCR technology lets you select language flexibly, e.g., English, German, French, Spanish, Italian and many Languages else;
OCR technology engine provides you with 92% faster speed than other OCR software;
Convert scanned PDF to HTM files with OCR technology and command line singly or in batches;
Convert scanned image to HTM files with OCR technology;
Convert normal PDF to HTM and password protected PDF to HTM files singly or in batches with or without OCR technology;
Support page selection, OCR single, range or all pages during conversions from PDF to HTM and scanned PDF to HTM files with command line;
Supports command line operation for manual use or inclusion in scripts;
Enable you to extract text from PDF to HTM accurately with command line and OCR technology.

PDF to HTM OCR Converter Command Line Options:
-------------------------------------------------------
Usage: pdf2txtocr.exe [options] <PDF-file> <Text-file>
-firstpage <int>   : first PDF page to convert
-lastpage <int>    : last PDF page to convert
-res <int>         : set resolution, the unit is DPI (default is 300 dpi)
-ownerpwd <string> : set owner password for encrypted PDF file
-userpwd <string> : set user password for encrypted PDF file
-layout            : maintain original physical layout
-noc               : don't insert page breaks 0x0C between pages in text file
-bitcount <int>    : set color depth when render PDF page to image data, it can be set 1, 8, 24, default is 8bit
-ocr               : enable OCR function for scanned PDF file
-lang <string>     : choose the language for OCR engine
-text <string>     : add additional text at end of each text page, this parameter supports the following variables:
    %PageNumber%   : current page number
    %PageCount%    : total page count of PDF file
-$ <string>        : input your License Key
Examples:
pdf2txtocr.exe C:\in.pdf C:\out.htm
pdf2txtocr.exe -firstpage 1 -lastpage 1 C:\in.pdf C:\out.htm
pdf2txtocr.exe -ocr -res 300 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ownerpwd 123 -userpwd 456 C:\in.pdf C:\out.htm
pdf2txtocr.exe -layout C:\in.pdf C:\out.txt
pdf2txtocr.exe -noc C:\in.pdf C:\out.txt
pdf2txtocr.exe C:\in.tif C:\out.txt
pdf2txtocr.exe C:\in.jpg C:\out.txt
pdf2txtocr.exe C:\in.bmp C:\out.txt
pdf2txtocr.exe C:\in.png C:\out.txt
pdf2txtocr.exe -ocr -lang eng C:\in.pdf C:\out.htm
pdf2txtocr.exe -ocr -bitcount 1 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -bitcount 8 C:\in.pdf C:\out.txt
pdf2txtocr.exe -ocr -bitcount 24 C:\in.pdf C:\out.htm
pdf2txtocr.exe -ocr -lang deu C:\in.pdf C:\out.txt
pdf2txtocr.exe -lang deu C:\in.tif C:\out.txt
pdf2txtocr.exe -text "PageText %PageNumber% of %PageCount%" C:\in.pdf C:\out.txt

Following command line will OCR all PDF files in D:\temp\ folder to text files:
for %F in (D:\temp\*.pdf) do pdf2txtocr.exe -ocr -lang deu "%F" "%~dpnF.txt"

Following command line will OCR all PDF files in D:\temp\ folder and subdirectories to text files:
for /r D:\temp %F in (*.pdf) do pdf2txtocr.exe -ocr "%F" "%~dpnF.txt"

Following command line will OCR all PDF files from D:\temp\ folder and output text files to C:\test folder:
for %F in (D:\temp\*.pdf) do pdf2txtocr.exe -ocr "%F" "C:\test\%~nF.txt""

View Other Tools here Also:

Image to PDF Converter: Convert 40+ image formats to PDF files.
PDF to HTML Converter: Convert PDF files to HTML documents.
PDF to Text Converter: Convert PDF files to plain text files.
PDF to Vector Converter: Convert PDF files to PS, EPS, WMF, EMF, XPS, PCL, HPGL, SWF, SVG, etc. vector files.
PDF to Image Converter: Convert PDF files to TIF, TIFF, JPG, GIF, PNG, BMP, EMF, PCX, TGA formats.
DocConverter COM Component (+HTML2PDF.exe): Convert HTML, DOC, RTF, XLS, PPT, TXT etc. files to PDF files, it is depend on PDFcamp Printer product.

More Products at VeryPDF

Email: support@verypdf.com

Search By Keywords:
TIF TO DOCX :: TIF TO WORD :: TIF TO OFFICE :: TIF TO OPENOFFICE :: TIF TO XML :: TIF TO EDITABLE WORD :: TIFF TO TXT :: TIFF TO TEXT :: TIFF TO PLAIN TEXT :: TIFF TO RTF :: TIFF TO HTML :: TIFF TO ASCII :: TIFF TO HTM :: TIFF TO TEXT DOCUMENT :: TIFF TO DOCUMENT :: TIFF TO DOC :: TIFF TO EDITABLE DOCUMENT :: TIFF TO EDITABLE DOC :: TIFF TO DOCX :: TIFF TO WORD ::