[duxuser] Re: unwanted italics

  • From: "Susan" <chrn3292@xxxxxxx>
  • To: <duxuser@xxxxxxxxxxxxx>
  • Date: Thu, 27 Aug 2009 11:57:45 -0500

Hi John,

 

Actually OmniPage has been doing the styles bit for at least the
last few versions, so it's more like ABBYY is following that
lead. As with any software, you have pros and cons to each, and
it's necessary to decide which one best serves your needs.

 

Applying a particular "style sheet" or using a particular
template won't solve the problem unless you are certain the font
styles are not used within the document, or you've modified those
font styles so there isn't any unwanted emphasis if you do keep
them. You can delete the font styles once you don't need them (if
you use them to do some basic formatting). 

 

As I indicated, your best option may be to save to Word 2007 docx
format and then save back to doc. What I didn't mention in my
earlier post is that you will still have some paragraph styles
that may come through, but those don't cause the problems (or
least not that I've encountered yet). It's the font styles that
create the emphasis problems for us in DBT. 

 

I'm still trying to muddle through different issues in both
FineReader and OmniPage. It's certainly a work in progress. I'll
post updates to this problem if I come across anything else. 

 

Susan

 

 

 

From: duxuser-bounce@xxxxxxxxxxxxx
[mailto:duxuser-bounce@xxxxxxxxxxxxx] On Behalf Of John Blake
Sent: Thursday, August 27, 2009 1:57 AM
To: duxuser@xxxxxxxxxxxxx
Subject: [duxuser] Re: unwanted italics

 

Hi Susan,

 

Thank you so much for your information. I have "recently"
attended a 'Styles' Training session hosted by one of our our
Regional Technicians' bases back in July (seems years ago now). I
think that once I can get to grips with Styles I assume that once
I have a dedicated style set up for different students and apply
'their' Style sheet the problem will be eradicated?!

 

It was very interesting to hear that OmniPage has gone down the
same sorry route - perhaps ABBYY ought to sell their 'Sprint'
version without styles!

 

Best Regards,

John.

 

PS - 11 out of 10 for your most excellent response! 

 

  _____  

From: duxuser-bounce@xxxxxxxxxxxxx
[mailto:duxuser-bounce@xxxxxxxxxxxxx] On Behalf Of Susan
Sent: 27 August 2009 06:18
To: duxuser@xxxxxxxxxxxxx
Subject: [duxuser] Re: unwanted italics

The "rest of the story" is pretty murky. Jean's problem wasn't
what I was expecting. In fact, I couldn't duplicate her problem
with her files. Her doc file came from a pdf file, and she
doesn't know the details of how it was pulled from pdf, so we
can't sort it out. So, Jean's particular issue is still a
mystery. 

 

Here's what I can tell you. 

 

1) Yes, ABBYY FineReader 9 is one of the issues. Earlier versions
are OK. FR 9 now assigns font styles, and so text that is
emphasized in some way will have a unique numbered font style
assigned. In the processing of cleaning up your file in Word, you
may decide to get rid of unnecessary emphasis, such as text that
is recognized as bold, but in fact is just a heavier font in the
text. You get the file to look the way you want it, but when you
open the file in DBT, you find you have all the emphasis you
worked at getting rid of. The solution is to either delete the
font style in Word or redefine the style. In other words, you
need to understand how to use and modify styles correctly, vs.
manually changing text in the document.  If you want to avoid
this problem altogether, choose the option to save the file to
Word 2007 docx, as font styles do not come through in those
files. Note, sending the file to Word 2007 keeps the font styles,
so the file must be saved as docx. Once you have the file open in
Word, you'll need to save the file back to the doc option so you
can open it in DBT. Obviously another option is to save as txt,
but for many situations that's going to create more work in the
long run. 

 

Don't automatically decide to avoid the styles altogether by
going the docx route if you understand styles and find/replace.
Font styles can come in handy for quickly formatting a document. 

 

2) OmniPage also assigns styles, and I get sporadic problems with
those files, but at this point I can't pinpoint specifics. I
can't remember what version OmniPage introduced styles, so if you
use a version older than 16, make sure you check out what you are
dealing with in the options. Recent versions do have an option to
not save styles, but I'm not finding consistent results one way
or the other with that option checked. Again, if you are finding
emphasis issues and you are using a recent version of OmniPage,
try saving to docx and then back to doc.

 

3) If you copy/paste text from the internet, you can also run
into this same problem. Italics are styled as "Emphasis" and bold
is styled as "Strong." Let's say you don't want to keep the bold.
There are a couple of different ways you may do that globally,
but if you don't deal with the Strong style by either redefining
it or deleting the Strong style from that document, you'll still
end up with emphasis when you open the file in DBT. 

 

In communicating with Peter Sullivan at Duxbury about this issue,
he informs me that DBT is doing what it's designed to do. 

 

That's what I know for the "rest of the story" right now, but I'm
guessing there is more to learn. 

 

Susan

 

 

 

From: duxuser-bounce@xxxxxxxxxxxxx
[mailto:duxuser-bounce@xxxxxxxxxxxxx] On Behalf Of John Blake
Sent: Wednesday, August 26, 2009 5:29 AM
To: duxuser@xxxxxxxxxxxxx
Subject: [duxuser] Re: unwanted italics
Importance: High

 

Aloha Betsy,

 

I too would be interested to know if the Scanner Software is
ABBYY Fine Reader 9.0 Professional - even if I change the default
'Style/s' to Arial and get rid of all the others there is some
strange 'hard coding' going on between ABBYY and Word which does
not show up until you use the Swift macro to transfer the
document from word to Duxbury!

 

Best regards,

John.

 

  _____  

From: duxuser-bounce@xxxxxxxxxxxxx
[mailto:duxuser-bounce@xxxxxxxxxxxxx] On Behalf Of Betsy Whitney
Sent: 19 August 2009 00:20
To: duxuser@xxxxxxxxxxxxx
Subject: [duxuser] Re: unwanted italics

Please let us all know "the rest of the story.
Betsy
At 01:15 PM 8/18/2009, you wrote:

Hello Susan, 

The file was originally a PDF. I believe it was scanned into (or
pasted) into an RTF and then cleaned up for e-text. I took the
RTF and converted it to a DOC, and then put it into my file with
the DBT template. 
 
I'll copy the rtf and doc files tomorrow if you'd like to see
them. I couldn't find any underlying reasons, but I'd love to
know your suspicions. 
Thanks. 
Jean

Other related posts: