[bksvol-discuss] Re: removing extraneous linebreaks

I think Carrie was just having a flashback to her programming days.
The \ key is the one used in many programming languages as an escape character, indicating the next key is a ctrl character. You're right though, the ^p is what word uses as their token for ctrl characters, and in this case a paragraph indicator, unless you look directly at the raw data for a RTF, where it appears as the \p.

Dave

At 09:46 AM 2/8/2009, you wrote:
Hi Carrie,

        I thought the paragraph mark in Word was ^p.  Are there two ways to
search for this?

Mayrie



-----Original Message-----
From: bksvol-discuss-bounce@xxxxxxxxxxxxx
[mailto:bksvol-discuss-bounce@xxxxxxxxxxxxx] On Behalf Of Carrie Karnos
Sent: Sunday, February 08, 2009 6:26 AM
To: bksvol-discuss@xxxxxxxxxxxxx
Subject: [bksvol-discuss] Re: removing extraneous linebreaks

The same thing applies to MS Word, except that its paragraph mark is \p. In
MS Word, \n is a column break.

Please be a bit careful with changing "quote, space, quote" to "quote,
paragraph mark, quote" because occasionally the "quote, space, quote" will
be part of a list of quoted items, as in the example: My favorite Jane
Austen movies are "Emma," "Northanger Abbey," and "Persuasion."

Carrie


________________________________

From: Mayrie ReNae <mayrierenae@xxxxxxxxx>
To: bksvol-discuss@xxxxxxxxxxxxx
Sent: Sunday, February 8, 2009 5:24:48 AM
Subject: [bksvol-discuss] Re: removing extraneous linebreaks

HI E.

    Here's the pertinent part of that note.


    The character string that denotes a paragraph mark in K1000 is \n.
Very often text is OCRed with extra paragraph marks in it.  You can do a
find and replace to get rid of these.  For each letter of the alphabet, you
can do the following:
In the find box of the find and replace dialogue type \na (that is back
slash, n a)
You want to make sure to pay attention to case sensitivity.
In the replace box type (space a).
This will join lines of text that OCRed into two paragraphs that shouldn't
have been separate paragraphs.
Make sure to use lower case letters or you'll trash all paragraph marks and
be in a world of curses!
I do this with every letter of the alphabet in lower case.

Also, very often dialogue gets condensed into one paragraph where it
shouldn't be.
You can search for " " (quotation mark, space, quotation mark) and replace
with "\n" that is (quotation mark, backslash n, quotation mark). This will
separate dialogue that didn't get separated by the OCR process.

Mayrie

-----Original Message-----
From: bksvol-discuss-bounce@xxxxxxxxxxxxx
[mailto:bksvol-discuss-bounce@xxxxxxxxxxxxx] On Behalf Of E.
Sent: Sunday, February 08, 2009 4:40 AM
To: bksvol-discuss@xxxxxxxxxxxxx
Subject: [bksvol-discuss] removing extraneous linebreaks

I never got a posting talking about how to remove extraneous
linebreaks Mary. Please re-send it.

E.



To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line.  To get a list of
available commands, put the word 'help' by itself in the subject line.

To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line.  To get a list of
available commands, put the word 'help' by itself in the subject line.




 To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line. To get a list of available commands, put the word 'help' by itself in the subject line.

To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line.  To get a list of 
available commands, put the word 'help' by itself in the subject line.

Other related posts: