[bksvol-discuss] Re: removing extraneous linebreaks

  • From: "Mayrie ReNae" <mayrierenae@xxxxxxxxx>
  • To: <bksvol-discuss@xxxxxxxxxxxxx>
  • Date: Sun, 8 Feb 2009 05:24:48 -0800

HI E.

        Here's the pertinent part of that note.


        The character string that denotes a paragraph mark in K1000 is \n.
Very often text is OCRed with extra paragraph marks in it.  You can do a
find and replace to get rid of these.  For each letter of the alphabet, you
can do the following:
In the find box of the find and replace dialogue type \na (that is back
slash, n a) 
You want to make sure to pay attention to case sensitivity.
In the replace box type (space a). 
This will join lines of text that OCRed into two paragraphs that shouldn't
have been separate paragraphs.
Make sure to use lower case letters or you'll trash all paragraph marks and
be in a world of curses! 
I do this with every letter of the alphabet in lower case.
 
Also, very often dialogue gets condensed into one paragraph where it
shouldn't be.
You can search for " " (quotation mark, space, quotation mark) and replace
with "\n" that is (quotation mark, backslash n, quotation mark). This will
separate dialogue that didn't get separated by the OCR process. 

Mayrie

-----Original Message-----
From: bksvol-discuss-bounce@xxxxxxxxxxxxx
[mailto:bksvol-discuss-bounce@xxxxxxxxxxxxx] On Behalf Of E.
Sent: Sunday, February 08, 2009 4:40 AM
To: bksvol-discuss@xxxxxxxxxxxxx
Subject: [bksvol-discuss] removing extraneous linebreaks

I never got a posting talking about how to remove extraneous 
linebreaks Mary. Please re-send it.

E.
        
        

 To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line.  To get a list of
available commands, put the word 'help' by itself in the subject line.

 To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line.  To get a list of 
available commands, put the word 'help' by itself in the subject line.

Other related posts: