Continuing from the alphanumeric start (notes below):
eng-pol-pron.tei
eng-por-pron.tei
eng-rom-pron.tei
eng-rus-pron.tei
eng-spa-pron.tei
eng-srp-pron.tei
eng-swe-pron.tei
eng-swh-pron.tei
eng-tur-pron.tei
epo-eng-pron.tei
fin-bul-pron.tei
fin-deu-pron.tei
fin-ell-pron.tei
fin-eng-pron.tei
fin-fra-pron.tei
fin-ita-pron.tei
fin-jpn-pron.tei
fin-lat-pron.tei
fin-nld-pron.tei
fin-nor-pron.tei
fin-pol-pron.tei
fin-por-pron.tei
fin-swe-pron.tei
fra-bre-pron.tei
fra-bul-pron.tei
fra-deu-pron.tei
fra-ell-pron.tei
fra-eng-pron.tei
fra-fin-pron.tei
fra-ita-pron.tei
fra-jpn-pron.tei
fra-lat-pron.tei
fra-nld-pron.tei
fra-pol-pron.tei
fra-por-pron.tei
fra-rus-pron.tei
fra-spa-pron.tei
fra-swe-pron.tei
fra-tur-pron.tei
gla-deu-pron.tei
gle-eng-pron.tei
gle-pol-pron.tei
hrv-eng-pron.tei
hun-eng-pron.tei
isl-eng-pron.tei
ita-bul-pron.tei
ita-deu-pron.tei
ita-ell-pron.tei
ita-eng-pron.tei
ita-fin-pron.tei
ita-jpn-pron.tei
ita-pol-pron.tei
ita-por-pron.tei
ita-rus-pron.tei
ita-swe-pron.tei
ita-tur-pron.tei
jpn-deu-pron.tei
jpn-eng-pron.tei
jpn-fra-pron.tei
jpn-rus-pron.tei
kur-deu-pron.tei
kur-eng-pron.tei
kur-tur-pron.tei
lat-deu-pron.tei
lat-eng-pron.tei
lit-eng-pron.tei
mkd-bul-pron.tei
nld-deu-pron.tei
nld-eng-pron.tei
nld-fin-pron.tei
nld-fra-pron.tei
nld-ita-pron.tei
nld-lat-pron.tei
nld-lit-pron.tei
nld-por-pron.tei
nld-rus-pron.tei
nld-spa-pron.tei
nld-swe-pron.tei
Specific notes:
* eng-pol: only contains references to other TEI files, no entries.
(cannot check)
* fra-bre: has a lot of empty <pron> tags (<pron></pron>) at the end.
* I assume these are not generated, but preexisting.
* jpn-*: I can't really help. Length checking does not seem to work.
Unsure whether these are OK.
* Most unexpected: first entry from jpn-eng:
<orth>ヽ</orth>
<pron>(en)dʒˈapəniːz(ja)lˈe̞tə</pron>
* jpn-rus: Some <pron> tags are empty (incl. 1st, 4th; i.e.,
"<pron></pron>").
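A minimal sketch for listing the headwords that carry an empty <pron>, as in the fra-bre and jpn-rus cases above. It assumes the usual entry/form/orth/pron layout and matches elements by local name, so the TEI namespace does not matter; the function name `empty_pron_headwords` is made up for illustration and is not part of the FreeDict tools.

```python
import xml.etree.ElementTree as ET


def _local(tag):
    """Strip a '{namespace}' prefix from an element tag, if present."""
    return tag.rsplit("}", 1)[-1]


def empty_pron_headwords(tei_xml):
    """Return the first <orth> of every <entry> that has an empty <pron>."""
    hits = []
    for entry in ET.fromstring(tei_xml).iter():
        if _local(entry.tag) != "entry":
            continue
        orths = [e.text or "" for e in entry.iter() if _local(e.tag) == "orth"]
        prons = [e for e in entry.iter() if _local(e.tag) == "pron"]
        # A <pron> counts as empty if it has no text or only whitespace.
        if any(not (p.text or "").strip() for p in prons):
            hits.append(orths[0] if orths else "")
    return hits
```

The function takes the file content as a string, e.g. `empty_pron_headwords(open(path, encoding="utf-8").read())`.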
Specific notes (most likely problems with espeak-ng):
* nld-<some>: "à" is provided with pronunciation "ˌaːɣrˈaːvə", also
when part of an expression.
* nld-eng: "à", "à pari"
* pron: "ˌaːɣrˈaːvə", "ˌaːɣrˈaːvə pˈaːri"
* nld-ita, nld-lit, nld-por, nld-rus, nld-spa: "à propos"
* pron: "ˌaːɣrˈaːvə prˈoːpɔs"
Less specific notes:
* More dictionaries containing embedded slashes:
ita-bul, ita-ell, ita-fin, ita-jpn, ita-pol, ita-rus, ita-swe,
ita-tur, nld-fin, nld-ita, nld-lat, nld-lit, nld-por, nld-rus,
nld-spa, nld-swe
* ita-ell further contains '(/.../' in at least one instance (first
entry).
Regards,
Einhard
On 28/12/2020 15:55, Karl Bartel wrote:
Looks great! I checked the following (starting from the end) without
finding any problems:
spa-ast-pron.tei.xz
spa-deu-pron.tei.xz
spa-eng-pron.tei.xz
spa-por-pron.tei.xz
srp-eng-pron.tei.xz
swe-bul-pron.tei.xz
swe-deu-pron.tei.xz
swe-ell-pron.tei.xz
swe-eng-pron.tei.xz
swe-fin-pron.tei.xz
swe-fra-pron.tei.xz
swe-ita-pron.tei.xz
swe-jpn-pron.tei.xz
swe-lat-pron.tei.xz
swe-nor-pron.tei.xz
swe-pol-pron.tei.xz
swe-por-pron.tei.xz
swe-rus-pron.tei.xz
swe-spa-pron.tei.xz
swe-tur-pron.tei.xz
swh-eng-pron.tei.xz
swh-pol-pron.tei.xz
tur-deu-pron.tei.xz
tur-eng-pron.tei.xz
Two other thoughts I had when looking at the pronunciations:
* In deu-eng-pron.tei.xz, there are entries where additional information is
given in parentheses (e.g. "(Buch) Stelle"). The part in parens should not
be included in the pronunciation, IMO.
* It would make sense to add pronunciations to the translations too, for
those cases where I start in my native language.
But that's probably nothing you don't know already.
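For the parenthesized annotations mentioned above, a minimal sketch of stripping them from a headword before it is handed to the transcription step. This is only an illustration, not how teiaddphonetics does or should do it; `strip_annotations` is a hypothetical helper, and nested parentheses are not handled.

```python
import re

# Matches one non-nested parenthesized group, e.g. "(Buch)".
_PAREN = re.compile(r"\([^()]*\)")


def strip_annotations(headword):
    """Drop parenthesized annotations and collapse the leftover whitespace."""
    return " ".join(_PAREN.sub(" ", headword).split())
```

With this, "(Buch) Stelle" would be transcribed as just "Stelle".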
Karl
On Sun, Dec 27, 2020 at 10:59 PM Einhard Leichtfuß <alguien@xxxxxxxxxxxxx>
wrote:
Hi,
in brief: checked *swe-fra*; seems fine
On 27/12/2020 20:54, Sebastian Humenda wrote:
Hi all,
As the deadline for the Debian freeze is approaching, which also affects
derivatives such as Mint and Ubuntu, I have started working on a fix for
#25 of our tools repository [^1].
TL;DR: please help out until 29/12/2020 20:00 UTC, if possible.
[^1]: https://github.com/freedict/tools/issues/25
Thanks for the work!
Unfortunately, I had to rewrite the teiaddphonetics script and hence I would
like to request your help. Please grab at least one dictionary and check the
IPA pronunciation generated by the script. No knowledge of IPA required. If
you are short on time, just grab one, but if possible, check more than one
dictionary. The deadline is 29/12/2020, 20:00 UTC.
The already processed files are at
https://download.freedict.org/generated/tmp. Grab either the *.c5.gz or the
xxx-yyy-phonetics.tei file.
Shouldn't most/all of the generated phonetics remain unchanged (except
where they did not exist before, as with deu-eng)? I.e., can we not simply
diff the output of the old and new script?
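A rough sketch of such a comparison, assuming both outputs are TEI files with the usual entry/form/orth/pron layout. The names `pron_map` and `changed_prons` are made up for illustration. Headwords without a <pron> in the old output are skipped, and for homographs only the first entry is compared.

```python
import xml.etree.ElementTree as ET


def _local(tag):
    """Strip a '{namespace}' prefix from an element tag, if present."""
    return tag.rsplit("}", 1)[-1]


def _first_text(entry, name):
    """Text of the first descendant called `name`, or None if there is none."""
    for elem in entry.iter():
        if _local(elem.tag) == name:
            return elem.text or ""
    return None


def pron_map(tei_xml):
    """Map each headword to the pronunciation of its first entry."""
    result = {}
    for entry in ET.fromstring(tei_xml).iter():
        if _local(entry.tag) != "entry":
            continue
        orth = _first_text(entry, "orth")
        if orth is not None:
            # setdefault keeps the first entry when a headword repeats.
            result.setdefault(orth, _first_text(entry, "pron"))
    return result


def changed_prons(old_xml, new_xml):
    """Return {headword: (old, new)} where an existing pronunciation changed."""
    old, new = pron_map(old_xml), pron_map(new_xml)
    return {
        hw: (old[hw], new[hw])
        for hw in old.keys() & new.keys()
        if old[hw] is not None and old[hw] != new[hw]
    }
```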
Instructions:
0. Errors are never within the transcribed IPA, the only problem could be
that the IPA does not match the headword. So even if you don't know IPA,
just guessing from the characters, it can be guessed whether the right
word was transcribed.
1. Please check at least the first and the last entry.
2. Please check at least one entry in the middle.
3. Come back to the list with dictionaries that you have checked.
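Steps 1 and 2 could be sketched roughly as follows: pull the first, middle, and last entry out of a TEI file and show headword and pronunciation side by side. This assumes the usual entry/form/orth/pron layout; `sample_entries` is a made-up name and not part of the FreeDict tools.

```python
import xml.etree.ElementTree as ET


def _local(tag):
    """Strip a '{namespace}' prefix from an element tag, if present."""
    return tag.rsplit("}", 1)[-1]


def _first_text(entry, name):
    """Text of the first descendant called `name`, or None if there is none."""
    for elem in entry.iter():
        if _local(elem.tag) == name:
            return elem.text or ""
    return None


def sample_entries(tei_xml):
    """Return (headword, pronunciation) for the first, middle and last entry."""
    entries = [e for e in ET.fromstring(tei_xml).iter()
               if _local(e.tag) == "entry"]
    if not entries:
        return []
    # A set avoids duplicates when the file has fewer than three entries.
    picks = sorted({0, len(entries) // 2, len(entries) - 1})
    return [(_first_text(entries[i], "orth"), _first_text(entries[i], "pron"))
            for i in picks]
```

Printing the returned pairs should be enough to judge by eye whether the right word was transcribed.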
Your help is really welcome.
For now, I have looked at *swe-fra* (TEI). Looks fine. I intend to
check a few more soon.
Regards,
Einhard
PS: I'll also get back to some other e-mails concerning deu-eng soon.
--
FreeDict - Free And Open Dictionaries
Manage your subscription at https://www.freelists.org/list/freedict
Wiki: https://github.com/freedict/fd-dictionaries/wiki
Web: http://freedict.org