[bksvol-discuss] Re: diacritical marks and language

  • From: "siss52" <siss52@xxxxxxxxxxx>
  • To: <bksvol-discuss@xxxxxxxxxxxxx>
  • Date: Sat, 14 Jun 2008 08:23:51 -0500

Hi Cindy,

Did you try putting British rather than English?  I mean, U.S. English is 
considered English by computers.  (smile)  Just a thought.

Sue S.

----- Original Message ----- 
From: "Grandma Cindy" <popularplace@xxxxxxxxx>
To: <bksvol-discuss@xxxxxxxxxxxxx>
Sent: Saturday, June 14, 2008 1:37 AM
Subject: [bksvol-discuss] diacritical marks and language


I tried putting English UK in my spell-check for validating books in English 
rather than U.S. English but it didn't work. I haven't tried the others.

But many English writers, especially when writing about the upper or 
educated classes, throw a lot of French and even some German words. Lord 
Peter Views the Body had both, and a phrase or two of Greek. I tried coying 
and pasting the Greek letters but I gave up and instead wrote the letter out 
in English. I did that some time ago, too, with a science book of some sort 
that I validated that had a lot of Greek. I've downloaded the Greek alphabet 
so it's handy.

G.Cindy

***WISH LIST (CALLED REQUESTED ADDITIONS TO THE BOOKSHARE COLLECTION)IS 
AVAILABLE AT
http://people.delphiforums.com/jamiecalton/Book_Requests.htm
http://www.friendsofbookshare.org/
http://studentpages.alma.edu/~07jmyate/book_requests.htm

A LIST OF BOOKS CURRENTLY BEING SCANNED IS AVAILABLE AT 
http://people.delphiforums.com/jamiecalton/scanning.html

Jake's site for useful links: http://www.jbrownell.com/bkslinks.html


--- On Fri, 6/13/08, EVAN REESE <mentat3@xxxxxxxxxxx> wrote:

> From: EVAN REESE <mentat3@xxxxxxxxxxx>
> Subject: [bksvol-discuss] Re: For Speech Users
> To: bksvol-discuss@xxxxxxxxxxxxx
> Date: Friday, June 13, 2008, 8:37 PM
> After Lissi did the prodigious amount of work to put all of
> the diacritical
> marks into the four books by J. R. R. Tolkien that she
> validated -- and
> there are a lot of them, I assure you -- I did some
> experimenting and
> discovered that adding a few foreign languages in the
> recognition process in
> K1000 will deliver those marks into the original scan.
> Well, most of them,
> anyway. OCR is not perfect, and so the diacritical marks
> will not be
> perfectly rendered either. Still, my experiments show that
> most of them will
> be included in the scan, which will save a tremendous
> amount of work in the
> future Tolkien books I have in mind to scan.
>
> I am glad that others do this work, if it is not in the
> original submission,
> and I am equally glad that some people appreciate having
> the same text as
> sighted readers have access to. The author put them in the
> book, so I
> believe they should be reproduced as faithfully as we can.
> I used to be much
> more casual about such things, but not anymore. And I also
> think that at
> least some speech software is aware of them and pronounces
> words with them
> accordingly.
>
> So the work is definitely not useless, though admittedly it
> is tedious for
> validators working with books containing them but the scans
> of which don't
> have them. True, it is very unlikely that other books
> written in English
> will be riddled with them, as Tolkien's writing is,
> <smile>, but it can
> still be a good deal of work to put them in where they were
> in the book.
>
> Adding French and the Germans in the FineReader engine,
> which is what I
> generally use, seems to render the acutes, umlauts and
> circumflexes pretty
> well, which are the primary ones I need.  Of course, the
> better the
> recognition overall, the better everything will be
> recognized, including the
> diacritical marks. So optimizing is always a good thing.
> Adding more
> languages is supposed to slow down the recognition process,
> and I think it
> does, but not egregiously, at least not for my computer.
> And considering the
> amount of work it will save down the road, especially for
> the Tolkien books,
> it is well worth investing a little extra recognition time
> up front.
>
> Evan
>
> ----- Original Message ----- 
> From: "Grandma Cindy"
> <popularplace@xxxxxxxxx>
> To: <bksvol-discuss@xxxxxxxxxxxxx>
> Sent: Friday, June 13, 2008 9:46 PM
> Subject: [bksvol-discuss] Re: For Speech Users
>
>
> >I do it, too. A thrill of fear went through me when
> you suggested that
> >maybe it was useless--but then I remembered that we
> have sighted members,
> >too.
> >
> > When I find a word or name that needs a diacritical
> mark (I think that's
> > what they're called) I change it and then use
> replace all to change all
> > the names or words without it to the word that has it.
> If it is a proper
> > name that could also be an ordinary word, I match
> case.
> >
> > I hope my explanation is clear and helpful. I'm
> not really sure how clear
> > it is. I wish I could write like Lissi and Mayre and
> some of the rest of
> > you.
> >
> > G.Cndy
> >
> >
> >
> > ***WISH LIST (CALLED REQUESTED ADDITIONS TO THE
> BOOKSHARE COLLECTION)IS
> > AVAILABLE AT
> >
> http://people.delphiforums.com/jamiecalton/Book_Requests.htm
> > http://www.friendsofbookshare.org/
> >
> http://studentpages.alma.edu/~07jmyate/book_requests.htm
> >
> > A LIST OF BOOKS CURRENTLY BEING SCANNED IS AVAILABLE
> AT
> >
> http://people.delphiforums.com/jamiecalton/scanning.html
> >
> > Jake's site for useful links:
> http://www.jbrownell.com/bkslinks.html
> >
> >
> > --- On Fri, 6/13/08, Gary Petraccaro
> <garyp130@xxxxxxxxxxx> wrote:
> >
> >> From: Gary Petraccaro <garyp130@xxxxxxxxxxx>
> >> Subject: [bksvol-discuss] Re: For Speech Users
> >> To: bksvol-discuss@xxxxxxxxxxxxx
> >> Date: Friday, June 13, 2008, 2:47 PM
> >> I do it where I can.  If you're wasting your
> time, so am
> >> I.
> >>
> >> ----- Original Message ----- 
> >> From: "Marilyn Beasley"
> >> <mbeasley@xxxxxxxxxxxxx>
> >> To: <bksvol-discuss@xxxxxxxxxxxxx>
> >> Sent: Friday, June 13, 2008 9:33 AM
> >> Subject: [bksvol-discuss] For Speech Users
> >>
> >>
> >> >> I hope you can answer a question for me.
> >> >>
> >> >> I'm on Page 119 of a 700-page
> biography of a
> >> world
> >> >> traveler.  All languages are included in
> proper
> >> nouns of
> >> >> people and places.  I have been going
> through the
> >> tedious
> >> >> work of inserting special characters:
> the E's
> >> with accents
> >> >> going this way and that; a, e, i, o, and
> u with
> >> two little dots
> >> >> on top, N's with tildes on top, etc.
> >> >>
> >> >> Am i wasting my time because this is not
> >> necessary, maybe
> >> >> it even makes matters worse?
> >> >>
> >> >> I just want to make the best scan
> possible.
> >> >>
> >> >> Thanks so much,
> >> >> Marilyn
> >> >>
> >> >>
> >> >
> >> > To unsubscribe from this list send a blank
> Email to
> >> > bksvol-discuss-request@xxxxxxxxxxxxx
> >> > put the word 'unsubscribe' by itself
> in the
> >> subject line.  To get a list
> >> > of available commands, put the word
> 'help' by
> >> itself in the subject line.
> >> >
> >>
> >>
> >>
> --------------------------------------------------------------------------------
> >>
> >>
> >>
> >> No virus found in this incoming message.
> >> Checked by AVG.
> >> Version: 8.0.100 / Virus Database: 270.3.0/1498 -
> Release
> >> Date: 6/11/2008
> >> 7:13 PM
> >>
> >>  To unsubscribe from this list send a blank Email
> to
> >> bksvol-discuss-request@xxxxxxxxxxxxx
> >> put the word 'unsubscribe' by itself in
> the subject
> >> line.  To get a list of available commands, put
> the word
> >> 'help' by itself in the subject line.
> >
> >
> >
> > To unsubscribe from this list send a blank Email to
> > bksvol-discuss-request@xxxxxxxxxxxxx
> > put the word 'unsubscribe' by itself in the
> subject line.  To get a list
> > of available commands, put the word 'help' by
> itself in the subject line.
> >
>
>
>  To unsubscribe from this list send a blank Email to
> bksvol-discuss-request@xxxxxxxxxxxxx
> put the word 'unsubscribe' by itself in the subject
> line.  To get a list of available commands, put the word
> 'help' by itself in the subject line.



 To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line.  To get a list of 
available commands, put the word 'help' by itself in the subject line.

 To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line.  To get a list of 
available commands, put the word 'help' by itself in the subject line.

Other related posts: