[bksvol-discuss] Re: A search to help find split paragraphs

  • From: "Judy s." <cherryjam@xxxxxxxxxxxxxxxx>
  • To: bksvol-discuss@xxxxxxxxxxxxx
  • Date: Wed, 10 Feb 2010 21:10:52 -0600

I usually use a variation of the 27-step process that I turned into a macro for myself; I think I posted it a while ago. But I've used this one also, depending on the book. smile.


Judy

Melissa Smith wrote:
Judy, do you do this in addition to or in place of the 27 step process described in the manual?

Thanks,

Melissa


Judy s. wrote:
I'm always frustrated with the split paragraphs, too. Here's a way I've come up with to search for those split paragraphs that can't be distinguished by a " " (double quote space double quote), using Word.

You can't do this search globally. You have to search for each instance one at a time and check it, then go to the next one. This is because there are paragraphs in a scanned book that legitimately begin with a lower case letter. For example, if the text at the top of a page is a continuation of a paragraph that started on the preceding page, that text will correctly start with a lower case letter. Lines of poetry can start with a lower case letter, and so can some sections when an author is using quoted material.

I do this search before I do any formatting clean-up of a book, because sometimes it can biff special paragraph formatting you've added.

First, I replace all paragraph marks with something unique, because I use a wildcard search and you can't use a wildcard search in Word that involves paragraph marks. So:

In the Find box enter: ^p
In the replace box enter three instances of a character that is not likely to appear in the book. A good choice is $$$, that is, dollar dollar dollar.
Then execute a "replace all."
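For anyone who wants to try the same idea outside Word, here is a minimal Python sketch of this first step. It assumes the book is plain text with one paragraph per line, and the names pick_placeholder and mark_paragraphs are made up for illustration. It also checks that the marker does not already occur in the text, which the Word steps leave to your judgment.

```python
def pick_placeholder(text, candidates=("$$$", "@@@", "~~~")):
    """Return the first candidate marker that does not occur in the text."""
    for marker in candidates:
        if marker not in text:
            return marker
    raise ValueError("every candidate marker already appears in the text")


def mark_paragraphs(text, marker):
    """Replace each paragraph break (newline) with the marker,
    like Find ^p / Replace $$$ in Word."""
    return text.replace("\n", marker)


# This sample already contains $$$, so the sketch falls back to @@@.
book = "The cat sat\non the mat.\nIt cost $$$ to buy."
marker = pick_placeholder(book)
marked = mark_paragraphs(book, marker)
```

If the chosen marker were already in the book, the final "replace all" would turn those real characters into paragraph marks, which is why the check is worth doing first.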

Now you are going to actually look for paragraphs that are split. To do this, you use a special kind of search, using the "use wildcards" box in the Find and Replace dialogue box. In the Find and Replace dialogue box, click on the button that is marked "More." This will expand the options that are available to include a new list of Search Options. In the list of Search Options, check the box for "use wildcards" (you can also do this while in the Find box by typing alt U, that is, alt plus the letter U).

With the "use wildcards" box checked, enter the following term into the find box:

Search : $$$([a-z])

Then begin your find by either clicking "find" or using the keyboard (alt f).

Examine each instance it finds. It may be a paragraph that correctly starts with a lower case letter because it is a continuation of a paragraph from a previous page, or because it is the beginning of an indented quotation (sometimes publishers do that). Other than in those circumstances, you will almost always find that it is the start of a split paragraph. In those cases, I replace the $$$ with a space. If it isn't a split paragraph, I leave it alone and continue my search.
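The search-and-review step can be sketched in Python as well. This is only an illustration: it uses a regular expression in place of Word's wildcard search, the names find_candidates and join_at are hypothetical, and the $$$ marker is escaped with re.escape because the dollar sign has a special meaning in regular expressions.

```python
import re


def find_candidates(marked_text, marker="$$$", context=20):
    """Yield (position, snippet) for each marker followed by a lower case letter."""
    pattern = re.compile(re.escape(marker) + r"(?=[a-z])")
    for match in pattern.finditer(marked_text):
        start = max(0, match.start() - context)
        end = min(len(marked_text), match.end() + context)
        yield match.start(), marked_text[start:end]


def join_at(marked_text, position, marker="$$$"):
    """Replace the marker at the given position with a single space."""
    return marked_text[:position] + " " + marked_text[position + len(marker):]


marked = "The cat sat$$$on the mat.$$$A new paragraph."
hits = list(find_candidates(marked))

# Only the first marker is followed by a lower case letter, so there is one hit.
# After reading the snippet and deciding it is a real split, join it.
fixed = join_at(marked, hits[0][0])
```

Each hit still needs a person to look at the snippet and decide, just as in Word. Joining shifts the positions of everything after it, so either work through the hits from last to first or run the search again after each join.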

When you've gone through the entire book, and corrected all instances of split paragraphs, you must then put all the paragraph marks back in. To do this:

First, make sure that you have unchecked the option for a wildcard search in the search and replace dialogue box. Then,
In the find box, enter $$$
In the replace box, enter ^p
Execute "replace all."
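For completeness, the restore step looks like this in a Python sketch. It assumes the same plain-text setup with one paragraph per line, and restore_paragraphs is a made-up name.

```python
def restore_paragraphs(marked_text, marker="$$$"):
    """Turn every remaining marker back into a paragraph break (newline)."""
    restored = marked_text.replace(marker, "\n")
    # Sanity check: no marker should be left behind after the replacement.
    assert marker not in restored
    return restored


# One split paragraph was already joined with a space; the other break is real.
reviewed = "The cat sat on the mat.$$$A new paragraph."
restored = restore_paragraphs(reviewed)
paragraphs = restored.split("\n")
```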

That's it!

Judy s.

To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line. To get a list of available commands, put the word 'help' by itself in the subject line.

