[bksvol-discuss] Re: Rank spelling & Spell-check

  • From: "Scott Berry" <sberry@xxxxxxxxxxx>
  • To: <bksvol-discuss@xxxxxxxxxxxxx>
  • Date: Thu, 28 Dec 2006 08:37:20 -0600

Right and it should be noted that I think there are a couple of different ways to rank the spelling also.


Scott


----- Original Message ----- From: "Pratik Patel" <pratikp1@xxxxxxxxx>
To: <bksvol-discuss@xxxxxxxxxxxxx>
Sent: Thursday, December 28, 2006 6:50 AM
Subject: [bksvol-discuss] Re: Rank spelling & Spell-check


I'm afraid Rank spelling is much better than spell check as it saves you
heaps of time when you process the entire file rather than doing it one page
at a time.

Rank spelling takes all spelling mistakes, and ranks them according to the
frequency of occurrence. A misspelled word appearing 50 times in a file get put toward the top of the list above a mispelled word that appears 49 times,
which, in turn, appears a word that is misspelled 48 times or less, etc.
All other words are listed alphabetically.  This allows you to deal with
mistakes in bulk. So, for example, if the OCR program decided that it wants to put the word diere instead of the word there, you can tell rank spelling
to replace each occurrence wit the word there rather than having to go
through every single instance. This saves you a lot of time. This has cut
off hours on my scan/clean ups.

Pratik

-----Original Message-----
From: bksvol-discuss-bounce@xxxxxxxxxxxxx
[mailto:bksvol-discuss-bounce@xxxxxxxxxxxxx] On Behalf Of Jamie Yates
Sent: Thursday, December 28, 2006 6:52 AM
To: bksvol-discuss@xxxxxxxxxxxxx
Subject: [bksvol-discuss] Re: Rank spelling & Spell-check

So spell check with word is just as good as rank
spelling?

I just hear everyone say the first thing they do is
rank spelling and I feel like I'm missing out on
something really good smile

When I spell check something I've scanned, and
Omnipage spell checks it for me when I scan (plus I
spell check it again with word) I find that the words
that are supposed to be together but have a space are
usually due to zoning errors in the ocr process.

I like to manually draw the zones for ocr because if I
let omnipage do it, 90% of the time it does a great
job and doesn't ocr junk like the center of the book
or my arm as I hold down the spine. But sometimes it
does and then it inserts crap so I draw a text zone
for it to ocr. But even then sometimes it will have a
word like this man ually. And spell check will say
ually is not a word, and sure it is, and then I just
fix it. But I do that as I scan. I guess I like to
check each page as I scan.

I need to go back to bed. I woke up at 3 when my
husband and son got up so they could go fishing and
could not go back to sleep. Now I'm tired! I hope they
are catching some good fish up there (up at Tippy Dam
in Wellston, not too far from Manistee). It's about a
90 minute drive each way I think so they should be
fishing in the dark still, and maybe snow. It was
snowing here when they left but it had been raining so
our snow probably won't stay. Up there they might have
snow. They don't care though; they have waders and
waterproof coats and all that crap smile. Santa
brought my son $70 in Bass Pro Shops gift cards which
he happily spent on fishing crap uh I mean stuff.

More than y'all wanted to know I'm sure! I guess I
ramble when I'm tired.




Jamie in Michigan

To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line. To get a list of
available commands, put the word 'help' by itself in the subject line.

To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line. To get a list of available commands, put the word 'help' by itself in the subject line.



To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line.  To get a list of 
available commands, put the word 'help' by itself in the subject line.

Other related posts: