[bksvol-discuss] Re: improving a book in the collection

  • From: "Pratik Patel" <pratikp1@xxxxxxxxx>
  • To: <bksvol-discuss@xxxxxxxxxxxxx>
  • Date: Mon, 23 Jan 2006 19:39:50 -0500

Gerald, Jake, Julie et al,

If HTML is rendered correctly according to specified standards, then it
should contain no page breaks.  That is the reason why I would imagine
Bookshare material, hwen converted to HTML, does not preserve page breaks.
And, submitting a HTML file that contains page break chars from editors such
as MS Word only results in the stripper/other automated tool removing
extraneous characters.  Word often relies on a combination of nonstandard
HTML and Microsoft version of bad CSS so you might find that page numbers
are preserved in Word version of HTML.  Indeed this topic needs to be on
Bookshare's agenda.  The suitability of HTML as a consistent submission
format becomes a large question.

Pratik

Pratik Patel
Director, CUNY Assistive technology Services (CATS)
The City University of New York
pratik.patel@xxxxxxxxxxxx

-----Original Message-----
From: bksvol-discuss-bounce@xxxxxxxxxxxxx
[mailto:bksvol-discuss-bounce@xxxxxxxxxxxxx] On Behalf Of Gerald Hovas
Sent: Monday, January 23, 2006 5:27 PM
To: bksvol-discuss@xxxxxxxxxxxxx
Subject: [bksvol-discuss] Re: improving a book in the collection

Jake,

I was just about to bring that issue up.  I've checked a few books in the
past using Word, and the process that Bookshare uses to convert to HTML
appears to strip the page breaks.  I know that the HTML document I've been
working on for the last two or three months using Word still has the Page
Breaks, so I'm almost certain that Bookshare strips them when processing the
book.

Since Page Breaks are a requirement for adding a book to the collection, the
method of converting HTML files to RTF that volunteers used in the past is
probably not viable anymore for improving books.  The only way that it would
be is if none of the headers were recognized by the Stripper, and the page
breaks could be put in by hand.  Rather than converting the HTML file to
RTF, I recommend writing Gustavo and asking him to kick a book back to the
Step 1 page if someone wishes to improve a book.

I have a topic or two that I'm sending to Janice, so I'll add this one to
the message.

Gerald

-----Original Message-----
From: bksvol-discuss-bounce@xxxxxxxxxxxxx
[mailto:bksvol-discuss-bounce@xxxxxxxxxxxxx]On Behalf Of Jake Brownell
Sent: Monday, January 23, 2006 3:55 PM
To: bksvol-discuss@xxxxxxxxxxxxx
Subject: [bksvol-discuss] Re: improving a book in the collection


Hi Julie,
    Has someone verified that this process keeps page breaks? I'm concerned
because HTML shouldn't really have any concept of a page break.

If this hasn't been verified then we'll want to attempt to verify it or find
an alternative method.

Thanks,
Jake
----- Original Message -----
From: "Julie Morales" <mercy421@xxxxxxxxx>
To: <bksvol-discuss@xxxxxxxxxxxxx>
Sent: Monday, January 23, 2006 3:48 PM
Subject: [bksvol-discuss] Re: improving a book in the collection


> HI, Lissi. Take the HTML file and convert it to RTF. I think you can do
> this
> in Word but have never done it. Anyway, at that point, you can work with
> it
> as you would any other RTF file and submit it with BSO in the file name so
> admin knows it's a replacement. Good luck! take care.
> Julie Morales
> Life is a gift from God. What we do with it is our gift to Him.
> ----- Original Message -----
> From: "Estelnalissi" <airadil@xxxxxxxxxxxxx>
> To: <bksvol-discuss@xxxxxxxxxxxxx>
> Sent: Monday, January 23, 2006 3:28 PM
> Subject: [bksvol-discuss] improving a book in the collection
>
>
> Dear Kelly, or anyone who wants to jump in,
>
> I've discovered a book in the collection rated good but with many errors.
> I
> have a print edition and would be happy to make the corrections and upload
> it as a pso improved copy. I'm not sure I got those initials right, but
> would check before uploading so the file would be named correctly.
>
> My question is that I don't know what format I'm working in, as in rtf,
> etc.
> When I download a book that's already in the collection, I get the htm
> version and about 4 others, the last of which I can listen to with Jaws.
> Is
> there a way to convert the book to RTF or do I keep it in the format in
> which I'm reading?
>
> I hope this question makes sense. Thanks so much.
>
> Always With Love,
>
> Lissi
> ----- Original Message -----
>
>
> To unsubscribe from this list send a blank Email to
> bksvol-discuss-request@xxxxxxxxxxxxx
> put the word 'unsubscribe' by itself in the subject line.  To get a list
> of
> available commands, put the word 'help' by itself in the subject line.
>
>
> To unsubscribe from this list send a blank Email to
> bksvol-discuss-request@xxxxxxxxxxxxx
> put the word 'unsubscribe' by itself in the subject line.  To get a list
> of available commands, put the word 'help' by itself in the subject line.
>
>
>
> --
> No virus found in this incoming message.
> Checked by AVG Free Edition.
> Version: 7.1.375 / Virus Database: 267.14.21/236 - Release Date: 1/20/2006
>
>

 To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line.  To get a list of
available commands, put the word 'help' by itself in the subject line.

 To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line.  To get a list of
available commands, put the word 'help' by itself in the subject line.


 To unsubscribe from this list send a blank Email to
bksvol-discuss-request@xxxxxxxxxxxxx
put the word 'unsubscribe' by itself in the subject line.  To get a list of 
available commands, put the word 'help' by itself in the subject line.

Other related posts: