[bksvol-discuss] Re: For Speech Users

  • From: "EVAN REESE" <mentat3@xxxxxxxxxxx>
  • To: <bksvol-discuss@xxxxxxxxxxxxx>
  • Date: Fri, 13 Jun 2008 23:37:37 -0400

After Lissi did the prodigious amount of work to put all of the diacritical 
marks into the four books by J. R. R. Tolkien that she validated -- and 
there are a lot of them, I assure you -- I did some experimenting and 
discovered that adding a few foreign languages in the recognition process in 
K1000 will deliver those marks into the original scan. Well, most of them, 
anyway. OCR is not perfect, and so the diacritical marks will not be 
perfectly rendered either. Still, my experiments show that most of them will 
be included in the scan, which will save a tremendous amount of work in the 
future Tolkien books I have in mind to scan.

I am glad that others do this work, if it is not in the original submission, 
and I am equally glad that some people appreciate having the same text as 
sighted readers have access to. The author put them in the book, so I 
believe they should be reproduced as faithfully as we can. I used to be much 
more casual about such things, but not anymore. And I also think that at 
least some speech software is aware of them and pronounces words with them 
accordingly.

So the work is definitely not useless, though admittedly it is tedious for 
validators working with books containing them but the scans of which don't 
have them. True, it is very unlikely that other books written in English 
will be riddled with them, as Tolkien's writing is, <smile>, but it can 
still be a good deal of work to put them in where they were in the book.

Adding French and the Germans in the FineReader engine, which is what I 
generally use, seems to render the acutes, umlauts and circumflexes pretty 
well, which are the primary ones I need.  Of course, the better the 
recognition overall, the better everything will be recognized, including the 
diacritical marks. So optimizing is always a good thing. Adding more 
languages is supposed to slow down the recognition process, and I think it 
does, but not egregiously, at least not for my computer. And considering the 
amount of work it will save down the road, especially for the Tolkien books, 
it is well worth investing a little extra recognition time up front.

Evan

----- Original Message ----- 
From: "Grandma Cindy" <popularplace@xxxxxxxxx>
To: <bksvol-discuss@xxxxxxxxxxxxx>
Sent: Friday, June 13, 2008 9:46 PM
Subject: [bksvol-discuss] Re: For Speech Users


>I do it, too. A thrill of fear went through me when  you suggested that 
>maybe it was useless--but then I remembered that we have sighted members, 
>too.
>
> When I find a word or name that needs a diacritical mark (I think that's 
> what they're called) I change it and then use replace all to change all 
> the names or words without it to the word that has it. If it is a proper 
> name that could also be an ordinary word, I match case.
>
> I hope my explanation is clear and helpful. I'm not really sure how clear 
> it is. I wish I could write like Lissi and Mayre and some of the rest of 
> you.
>
> G.Cndy
>
>
>
> ***WISH LIST (CALLED REQUESTED ADDITIONS TO THE BOOKSHARE COLLECTION)IS 
> AVAILABLE AT
> http://people.delphiforums.com/jamiecalton/Book_Requests.htm
> http://www.friendsofbookshare.org/
> http://studentpages.alma.edu/~07jmyate/book_requests.htm
>
> A LIST OF BOOKS CURRENTLY BEING SCANNED IS AVAILABLE AT 
> http://people.delphiforums.com/jamiecalton/scanning.html
>
> Jake's site for useful links: http://www.jbrownell.com/bkslinks.html
>
>
> --- On Fri, 6/13/08, Gary Petraccaro <garyp130@xxxxxxxxxxx> wrote:
>
>> From: Gary Petraccaro <garyp130@xxxxxxxxxxx>
>> Subject: [bksvol-discuss] Re: For Speech Users
>> To: bksvol-discuss@xxxxxxxxxxxxx
>> Date: Friday, June 13, 2008, 2:47 PM
>> I do it where I can.  If you're wasting your time, so am
>> I.
>>
>> ----- Original Message ----- 
>> From: "Marilyn Beasley"
>> <mbeasley@xxxxxxxxxxxxx>
>> To: <bksvol-discuss@xxxxxxxxxxxxx>
>> Sent: Friday, June 13, 2008 9:33 AM
>> Subject: [bksvol-discuss] For Speech Users
>>
>>
>> >> I hope you can answer a question for me.
>> >>
>> >> I'm on Page 119 of a 700-page biography of a
>> world
>> >> traveler.  All languages are included in proper
>> nouns of
>> >> people and places.  I have been going through the
>> tedious
>> >> work of inserting special characters:  the E's
>> with accents
>> >> going this way and that; a, e, i, o, and u with
>> two little dots
>> >> on top, N's with tildes on top, etc.
>> >>
>> >> Am i wasting my time because this is not
>> necessary, maybe
>> >> it even makes matters worse?
>> >>
>> >> I just want to make the best scan possible.
>> >>
>> >> Thanks so much,
>> >> Marilyn
>> >>
>> >>
>> >
>> > To unsubscribe from this list send a blank Email to
>> > bksvol-discuss-request@xxxxxxxxxxxxx
>> > put the word 'unsubscribe' by itself in the
>> subject line.  To get a list
>> > of available commands, put the word 'help' by
>> itself in the subject line.
>> >
>>
>>
>> --------------------------------------------------------------------------------
>>
>>
>>
>> No virus found in this incoming message.
>> Checked by AVG.
>> Version: 8.0.100 / Virus Database: 270.3.0/1498 - Release
>> Date: 6/11/2008
>> 7:13 PM
>>
>>  To unsubscribe from this list send a blank Email to
>> bksvol-discuss-request@xxxxxxxxxxxxx
>> put the word 'unsubscribe' by itself in the subject
>> line.  To get a list of available commands, put the word
>> 'help' by itself in the subject line.
>
>
>
> To unsubscribe from this list send a blank Email to
> bksvol-discuss-request@xxxxxxxxxxxxx
> put the word 'unsubscribe' by itself in the subject line.  To get a list 
> of available commands, put the word 'help' by itself in the subject line.
> 


 To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line.  To get a list of 
available commands, put the word 'help' by itself in the subject line.

Other related posts: