RE: [program-l] Re: Seeking c++ programmer to compile Windows binary of free Tesseract OCR

  • From: "Manish Agrawal" <manish10@xxxxxxxxx>
  • To: <program-l@xxxxxxxxxxxxx>
  • Date: Tue, 2 Mar 2010 21:49:09 +0530

Hey Jamal,
I uploaded the wrong zip earlier. Apologies for that.
Here is the link for an updated zip that contains the test-images folder:
http://www.sendspace.com/file/ttt7lh

Let me know if this improves anything.
Also, I am running into issues with the latest tesseract 3.0.0. That
codebase doesn't build with libtiff included. I'll try a little more and let
you know if I can make any progress.

-Manish 

-----Original Message-----
From: program-l-bounce@xxxxxxxxxxxxx [mailto:program-l-bounce@xxxxxxxxxxxxx]
On Behalf Of Jamal Mazrui
Sent: Tuesday, March 02, 2010 12:23 AM
To: program-l@xxxxxxxxxxxxx
Cc: programmingblind@xxxxxxxxxxxxx
Subject: [program-l] Re: Seeking c++ programmer to compile Windows binary of
free Tesseract OCR

I am not finding the test-images subdirectory.  When I tried on a couple 
of .tif files I had, I get the following error (Windows XP):

Tessedit: tesseract.exe - Application Error
The instruction at "0x6fd8cf1d" referenced memory at "0x00000245". The 
memory could not be "read".
Click on OK to terminate the program
Click on CANCEL to debug the program
OK Cancel

Jamal
On Mon, 1 Mar 2010, Manish Agrawal wrote:

> Date: Mon, 1 Mar 2010 23:11:21 +0530
> From: Manish Agrawal <manish10@xxxxxxxxx>
> Reply-To: program-l@xxxxxxxxxxxxx
> To: program-l@xxxxxxxxxxxxx, programmingblind@xxxxxxxxxxxxx
> Subject: [program-l] Re: Seeking c++ programmer to compile Windows binary of
>     free Tesseract OCR
> 
> You can download the zip from:
> http://www.sendspace.com/file/k6fsjq
>
> Before running this, make sure that you have libtiff3.dll somewhere on your
> path.
> For this, you'll need to download the setup exe from
> http://gnuwin32.sourceforge.net/packages/tiff.htm
>
> (Use the first setup download, the one without sources.)
> Then right-click My Computer | Properties | Advanced | Environment Variables
> and, in the Path user variable, append C:\Program Files\GnuWin32\bin or
> wherever you installed the package in the step above.
>
> Instead of trying with pdf2txt as you suggested in another mail, can you
> first try:
> Tesseract.exe test-images\test.tiff output-filename
>
> Run it from the bin.dbg folder that you extract from the sendspace link at the top.
>
> -Manish
> -----Original Message-----
> From: program-l-bounce@xxxxxxxxxxxxx [mailto:program-l-bounce@xxxxxxxxxxxxx]
> On Behalf Of Jamal Mazrui
> Sent: Monday, March 01, 2010 10:26 PM
> To: programmingblind@xxxxxxxxxxxxx; Program-L@xxxxxxxxxxxxx
> Subject: [program-l] Re: Seeking c++ programmer to compile Windows binary of
> free Tesseract OCR
>
> Thanks for your work on this.  I hope you also saw another message I
> just posted describing the Tesseract tests I am trying.
>
> One accessible way of sharing large files is the following web site:
>
> Blind File Sharing : Internet file storage and delivery for the visually
> impaired
>
> http://www.blindfilesharing.com/
>
> I'm copying another blind programming related list because some
> people there are also working on this, so I hope we can share info
> and collaborate as much as possible.
>
> Jamal
>
> On Mon, 1 Mar 2010, Manish Agrawal wrote:
>
>> Date: Mon, 1 Mar 2010 22:12:59 +0530
>> From: Manish Agrawal <manish10@xxxxxxxxx>
>> Reply-To: programmingblind@xxxxxxxxxxxxx
>> To: programmingblind@xxxxxxxxxxxxx
>> Subject: RE: [program-l] Seeking c++ programmer to compile Windows binary of
>>     free Tesseract OCR
>>
>> Jamal, I have version 2.0.3 working with libtiff included. The bin.dbg file
>> after zipping comes to about 22 MB. Is there somewhere I can upload it for
>> you before I try this with version 3.0?
>> I tested this with just some tiff file I had lying around. What can I do to
>> verify that this has the compression you need?
>>
>> -Manish
>>
>> -----Original Message-----
>> From: programmingblind-bounce@xxxxxxxxxxxxx
>> [mailto:programmingblind-bounce@xxxxxxxxxxxxx] On Behalf Of Jamal Mazrui
>> Sent: Monday, March 01, 2010 8:55 PM
>> To: programmingblind@xxxxxxxxxxxxx
>> Subject: RE: [program-l] Seeking c++ programmer to compile Windows binary of
>> free Tesseract OCR
>>
>> At this point, I think the 3.0 pre-release would be preferable since it
>> includes other OCR enhancements.  Certainly, however, if you cannot get
>> that working but can get the earlier stable release working, that
>> would be valuable progress.
>>
>> Thanks for your interest
>> Jamal
>> On Mon, 1 Mar 2010, Manish Agrawal wrote:
>>
>>> Date: Mon, 1 Mar 2010 19:36:48 +0530
>>> From: Manish Agrawal <manish10@xxxxxxxxx>
>>> Reply-To: programmingblind@xxxxxxxxxxxxx
>>> To: programmingblind@xxxxxxxxxxxxx
>>> Subject: RE: [program-l] Seeking c++ programmer to compile Windows binary of
>>>     free Tesseract OCR
>>>
>>> I can give it a try. I worked with c++ some years ago and am not sure if
>>> I'll be able to deliver the goods.
>>> There appear to be two different versions, 2.0.4 (which appears to be the
>>> stable version) and 3.0.0 (current/prerelease). Which one do you want built
>>> with libtiff included?
>>>
>>> -Manish
>>> -----Original Message-----
>>> From: programmingblind-bounce@xxxxxxxxxxxxx
>>> [mailto:programmingblind-bounce@xxxxxxxxxxxxx] On Behalf Of Jamal Mazrui
>>> Sent: Monday, March 01, 2010 5:53 PM
>>> To: programmingblind@xxxxxxxxxxxxx
>>> Subject: Fwd: [program-l] Seeking c++ programmer to compile Windows binary
>>> of free Tesseract OCR
>>>
>>> A couple people from another list said they would try to do this, but as
>>> far as I know, no working binary is available yet.  Can any C++
>>> programmers here tackle the problem?
>>>
>>> Jamal
>>>
>>>
>>> -------- Forwarded Message --------
>>> Subject: [program-l] Seeking c++ programmer to compile Windows binary of
>>> free Tesseract OCR
>>> Date: Sat, 20 Feb 2010 10:52:55 -0500
>>> From: Jamal Mazrui <empower@xxxxxxxxx>
>>> Reply-To: program-l@xxxxxxxxxxxxx
>>> To: program-l@xxxxxxxxxxxxx, programmingblind@xxxxxxxxxxxxx
>>>
>>> I think Google Tesseract is generally considered to be the best free
>>> OCR engine at this time.  It is available in cross-platform C++ source
>>> code.  A Windows binary is available, but as far as I can tell, it does
>>> not incorporate a free TIF library that would support more common .tif
>>> formats than the default, uncompressed, minimalist TIF format that the
>>> available binary is limited to.
>>>
>>> However, a Microsoft Visual C++ 2008 project file is also available for
>>> compiling Tesseract to Windows binary form, and supposedly, the TIF
>>> library can be incorporated in such a solution.  I do not know C++, so
>>> am hoping that another programmer can build this for us.
>>>
>>> A starting page for Tesseract is the following:
>>>
>>> ReadMe - tesseract-ocr - Important information all Tesseract users need
>>> to know. - Project Hosting on Google Code
>>>
>>> http://code.google.com/p/tesseract-ocr/wiki/ReadMe
>>>
>>> Jamal
>>>
>>> ** To leave the list, click on the immediately-following link:-
>>> ** [mailto:program-l-request@xxxxxxxxxxxxx?subject=unsubscribe]
>>> ** If this link doesn't work then send a message to:
>>> ** program-l-request@xxxxxxxxxxxxx
>>> ** and in the Subject line type
>>> ** unsubscribe
>>> ** For other list commands such as vacation mode, click on the
>>> ** immediately-following link:-
>>> ** [mailto:program-l-request@xxxxxxxxxxxxx?subject=faq]
>>> ** or send a message, to
>>> ** program-l-request@xxxxxxxxxxxxx with the Subject:- faq
>>>
>>> __________
>>> View the list's information and change your settings at
>>> //www.freelists.org/list/programmingblind