[freedict] Re: [urgent] call for testing
- From: Sebastian Humenda <shumenda@xxxxxx>
- To: freedict@xxxxxxxxxxxxx
- Date: Wed, 30 Dec 2020 09:07:42 +0100
Hi Einhard
Einhard Leichtfuß schrieb am 28.12.2020, 12:48 +0100:
On 28/12/2020 11:24, Sebastian Humenda wrote:
You can give it a try. However, I now use a SAX streaming parser that doesn't
preserve the original XML 1-to-1 (escaping, etc.) and hence I would expect a
lot of noise.
Are the old TEI results available somewhere? Regenerating them might
take a while. Though I should be able to exploit parallelism massively.
It is probably, but for completeness sake:
https://freedict.org/download.
There you can either download the TEI source releases individually or
programmatically using wget from
https://download.freedict.org/dictionaries.
And last, but not least, they are also listed in the API :).
Appreciated, thanks. BTW, I'm releasing deu-eng and eng-deu soon, you are
aware of the time it takes to build it :).
OK. By the way, I contacted my thesis advisor; they did not have any
major criticism, in particular none that I could use to improve my code.
Great.
Answering your 2nd e-mail:
Checked (43/165; notes below):
Thanks so much, that's a real help.
i> * I did not see no pronunciations for translations. I assume this is
correct.
I modelled the old behaviour here, I had no ambitions, especially time-wise,
do do anything more elaborate. That said, we would need to port
teiaddphonetics to libespeakng from a espeakngng process invocation, otherwise
the conversion would take too long. I also haven't tested what the stylesheets
would say about <pron/> in translations. In any case, feel free to open an
issue.
Specific notes (likely unrelated to teiaddphonetics):
* deu-rus: Last entry "ǃXóõ" is not a german word.
Please patch in the repo :).
* eng-bul, eng-nor: embedded surrounding slashes in (presumably)
non-autogenerated entries. This results in double surrounding
Again, if you want, you can push a change to master. Please add the
explanation in the change set of the TEI header.
* eng-deu (old / -pron; not -phonetics): last few entries'
pronunciation seems wrong.
I have to test the old ones then for problems, but be assured that your new
dictionary is about to being released.
Cheers
Sebastian
Attachment:
signature.asc
Description: PGP signature
Other related posts: