[bksvol-discuss] Re: Finding Soft Hyphens

I was behind on my email. I did look ahead at several answers to your question before I sent mine, but clearly not at enough of them, since the suggestion about changing the margins did eliminate most of your soft hyphens. Thanks for your polite response to my late, out-of-the-barn message.


Misha

Lora wrote:
Yes, and there are regular hyphens, too.  These are definitely soft.  First,
JAWS says they are.  Not always reliable, but a start.  Second, I can find
regular dashes.  Third, I can't find these by any normal means, and had to
do the caret dash trick to find them.


-----Original Message-----
From: bksvol-discuss-bounce@xxxxxxxxxxxxx
[mailto:bksvol-discuss-bounce@xxxxxxxxxxxxx] On Behalf Of Mike
Sent: Wednesday, March 19, 2008 7:43 PM
To: bksvol-discuss@xxxxxxxxxxxxx
Subject: [bksvol-discuss] Re: Finding Soft Hyphens

Are you sure they are soft hyphens?  They may be real hyphens.  Soft hyphens
should only be at the ends of lines.  Back in the 1960s and before, many
publishers split words with hyphens when the whole word didn't fit at the
end of the line to make the right margin look straighter.  When these books
are scanned, these words split by hyphens can end up in the middle of the
line since the lines are not necessarily the same width as in the book.
Have you tried to search for regular hyphens?

Misha

Lora wrote:
I'm validating a book that has numerous words broken up by soft hyphens. I'd like to fix these, especially because they're not words at the end of lines or pages, just random words in the middle of various lines. I tried selecting the soft hyphen and copying it into the Find box so I could locate other occurrences, but it won't copy and paste. Is there an easy way to find this soft hyphen?

This book poses another problem. There are lots of the common scanos in it. For instance, 1 for I, rime for time, etc. I've fixed the ones I found as I skimmed the book, but many of these won't register in the spell checker. I suppose it means I should read them through. The real tricky one is "me" for "the." I suspect, if the OCR did that, it did some other funky stuff for the letters th.

This is a long book. It's Buddhism in Action. Knowing I can be a slow validator, should I return it, or take the time to read through it? I'm interested in the book, and will gladly read it cover to cover, but at over 400 pages, this could take me a while. The scan is generally excellent, but there are lots of these little OCR quirks. Advice is welcome.

Thanks,
Lora
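
For anyone who would rather hunt for these outside the word processor, here is a minimal Python sketch. It assumes the book has been saved as plain Unicode text and that the soft hyphens survived the export as the Unicode SOFT HYPHEN character (U+00AD); some exports drop or convert them, so that is an assumption to check first. The function names are made up for illustration. Regular hyphens are left alone.

```python
SOFT_HYPHEN = "\u00ad"  # Unicode SOFT HYPHEN, normally invisible mid-line


def find_soft_hyphens(text):
    """Return a (line_number, column) pair, both 1-based, for each soft hyphen."""
    hits = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        for col, ch in enumerate(line, start=1):
            if ch == SOFT_HYPHEN:
                hits.append((line_no, col))
    return hits


def strip_soft_hyphens(text):
    """Remove every soft hyphen; ordinary '-' characters are not touched."""
    return text.replace(SOFT_HYPHEN, "")


if __name__ == "__main__":
    # Made-up sample text: two soft hyphens and one regular hyphen.
    sample = "The medi\u00adtation hall was well-lit.\nNo prob\u00adlem here."
    for line_no, col in find_soft_hyphens(sample):
        print(f"soft hyphen at line {line_no}, column {col}")
    print(strip_soft_hyphens(sample))
```

Listing the positions first, rather than stripping everything in one pass, makes it possible to spot-check a few words before changing the whole file.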

 To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line.  To get a list of
available commands, put the word 'help' by itself in the subject line.
