[bksvol-discuss] Re: Stripper and Preliminary Pages

  • From: "Evan Reese" <mentat1@xxxxxxxxxxxxxx>
  • To: <bksvol-discuss@xxxxxxxxxxxxx>
  • Date: Tue, 16 May 2006 13:55:46 -0700

I apparently automatically assumed that there would be the same number of 
titles in both.  I just figured that if the same problem was causing both to 
have multiple titles, then the number would be the same.  I'm still not sure 
why that wouldn't be the case, unless it's a different bug in each instance?

But you're right that it isn't a big deal, especially not compared to some 
other things which have been talked about on these lists.  And no, it doesn't 
happen anywhere except at the top of the file.

  ----- Original Message ----- 
  From: Gerald Hovas 
  To: bksvol-discuss@xxxxxxxxxxxxx 
  Sent: Tuesday, May 16, 2006 12:10 PM
  Subject: [bksvol-discuss] Re: Stripper and Preliminary Pages


  Hey, there's no reason why the HTML file can't have a different number than 
the BRF file.  The important thing is that the HTML files seem to be 
consistent, and from what you're saying, the BRF files are consistent, too.

   

  I sent Bookshare a message about the problem and copied Pratik since he's 
looking into the BRF issue.  I wouldn't expect to see anything done about it 
any time soon, though, because it's not really hurting anything, unless you 
guys are saying that it's happening in multiple places in the BRF files.  From 
what you've said,  though, my understanding is that it's only happening at the 
top of the BRF files, and who among us doesn't skip down to the beginning of 
the prologue or chapter one when they begin reading?

   

  BTW, Pratik says he's still looking into the BRF issue to make sure we've got 
a similar problem as to what I'm seeing in the HTML files.

   

  Gerald

   


------------------------------------------------------------------------------

  From: bksvol-discuss-bounce@xxxxxxxxxxxxx 
[mailto:bksvol-discuss-bounce@xxxxxxxxxxxxx] On Behalf Of Evan Reese
  Sent: Tuesday, May 16, 2006 1:51 PM
  To: bksvol-discuss@xxxxxxxxxxxxx
  Subject: [bksvol-discuss] Re: Stripper and Preliminary Pages

   

  But only four?  I'm quite sure there are six in the .brf file, I did not 
miscount.

   

    ----- Original Message ----- 

    From: Gerald Hovas 

    To: bksvol-discuss@xxxxxxxxxxxxx 

    Sent: Tuesday, May 16, 2006 11:22 AM

    Subject: [bksvol-discuss] Re: Stripper and Preliminary Pages

     

    Interesting.  I checked the HTML file, and it has multiple titles at the 
top of it as well.  It has the title, followed by the author and title, 
followed by the title twice.  Looks like Bookshare has a problem with the tool 
which adds the legal notice at the top of the books.

     

    Pratik and I'll check into it.

     

    Gerald


----------------------------------------------------------------------------

    From: bksvol-discuss-bounce@xxxxxxxxxxxxx 
[mailto:bksvol-discuss-bounce@xxxxxxxxxxxxx] On Behalf Of Evan Reese
    Sent: Tuesday, May 16, 2006 1:07 PM
    To: bksvol-discuss@xxxxxxxxxxxxx
    Subject: [bksvol-discuss] Re: Stripper and Preliminary Pages

     

    The version of "Knife of Dreams" has six titles at the top.  First there's 
the centered title, then the author's name, then five more titles, some 
centered, some just indented two spaces in the brf file I have.  They appear to 
alternate between being centered and indented two spaces, but there are six of 
them, then one more after the legal boilerplate.

     

      ----- Original Message ----- 

      From: Gerald Hovas 

      To: bksvol-discuss@xxxxxxxxxxxxx 

      Sent: Monday, May 15, 2006 9:13 AM

      Subject: [bksvol-discuss] Re: Stripper and Preliminary Pages

       

      Jill,

       

      No, I don't.  That's the first I've heard about the problem.  Would you 
mind sending me a few names of some books which have the problem so I can ask 
about it?

       

      Gerald

       


--------------------------------------------------------------------------

      From: bksvol-discuss-bounce@xxxxxxxxxxxxx 
[mailto:bksvol-discuss-bounce@xxxxxxxxxxxxx] On Behalf Of Jill O'Connell
      Sent: Sunday, May 14, 2006 10:38 PM
      To: bksvol-discuss@xxxxxxxxxxxxx
      Subject: [bksvol-discuss] Re: Stripper and Preliminary Pages

       

      Gerald, to you happen to know why we are seeing the book title five or 
six times in a row in the newer books; at least this is the case in braille. 

        ----- Original Message ----- 

        From: Gerald Hovas 

        To: bksvol-discuss@xxxxxxxxxxxxx 

        Sent: Sunday, May 14, 2006 8:28 PM

        Subject: [bksvol-discuss] Re: Stripper and Preliminary Pages

         

        Monica,

         

        Whoever designed the Stripper seems to have taken the title page into 
consideration.  I wouldn't worry about this page.  The only pages at the 
beginning of the book where it might be necessary to worry about something 
getting stripped would be the headings on the pages of the Table of Contents 
and the Acknowledgements.  I've been treating these headings like chapter 
headings and protecting them.

         

        As for the roman numeral pages at the beginning of books, I'd treat 
them the same as the headers and page numbers in the rest of the book.  The 
Stripper seems to have been designed to handle roman numerals as page numbers.  
At least I haven't noticed any difference in how the Stripper treats them.  The 
evidence that the Stripper seems to understand them is that it's not adding 
page numbers to these pages in the HTML files.  I've recently noticed that the 
Stripper seems to have a problem with blank pages where a page number has been 
added, making the page number the only thing on the page.  I've spotted two 
page numbers on these pages in the HTML files, indicating that the Stripper 
didn't recognize the page number and didn't strip it.  The Stripper appears to 
be storing a page number in the SMIL file based on what it expected to find but 
didn't, and then adding that guess back to that page when the HTML file is 
created resulting in two page numbers for the page.  As I said, I haven't 
noticed a second set of page numbers on the pages with roman numerals, so the 
Stripper appears to understand that they are page numbers and treats them 
accordingly.  However, I haven't been looking that close at them, though, 
because I don't recall having worked on a book which had them.  My advice would 
be to treat them as if they were arabic page numbers, then download the book 
once it makes it into the collection and see what happened.  I'll try to pay 
closer attention when I validate a book with roman numerals to see exactly what 
is happening.

         

        HTH

         

        Gerald

         


------------------------------------------------------------------------

        From: bksvol-discuss-bounce@xxxxxxxxxxxxx 
[mailto:bksvol-discuss-bounce@xxxxxxxxxxxxx] On Behalf Of Monica Cortada
        Sent: Sunday, May 14, 2006 7:26 PM
        To: bksvol-discuss@xxxxxxxxxxxxx
        Subject: [bksvol-discuss] Stripper and Preliminary Pages

         

        What would be the ideal treatment of the first few pages of a book like 
the title page?  Should a placeholder header be put here for the stripper to 
strip?

         

        Also, what should be done with the roman numeral page numbers at the 
beginning of a book?  

         

        Sorry to have so many questions and thanks for your patience.

         

        Monica in Maryland


------------------------------------------------------------------------

        No virus found in this incoming message.
        Checked by AVG Anti-Virus.
        Version: 7.1.392 / Virus Database: 268.5.6/339 - Release Date: 5/14/2006

Other related posts: