Hi Einhard
Einhard Leichtfuß schrieb am 31.10.2020, 13:41 +0100:
I have submitted my thesis [1] by now, accompanied by a usable ding2tei
importer program.
Unless anyone objects, I will proceed to merge the ding2tei-haskell
branch with master.
I am unsure what should happen with the current deu-eng and eng-deu
Spanish-German dictionary
-------------------------
The Spanish-German "Ding"-formatted dictionary is not truly supported,
since the syntax is actually somewhat different.
There exists a sed script that allows it to be translated to TEI.
However, this is not to be seen as a solution. Instead, the main
program should be adapted.
Efficiency
----------
The memory usage is heavy (4.5 GiB). Runtime is fine (approx. 3 minutes
including compilation of the code).
In contrast, the FreeDict tools, when applied to the huge resulting TEI
dictionaries, take a lot more time: Runtime peaked at about 24 hours for
Phonetics (teiaddphonetics)
---------------------------
Teiaddphonetics unfortunately still fails due to some uncommon character
combinations, see the corresponding issue on GitHub [3].
Review & Comments
-----------------
- Code & Documentation.
I expect to get some valuable critique on the code by the people
evaluating my thesis. I shall try to use that in order to improve my code.
- TEI.
An earlier version of the TEI result was briefly reviewed by Sebastian.
I have made according changes and discussed most further changes with
him. - Thanks a lot for all the help!
Further development
-------------------
I intend to improve the importer further. However, this is not my top
priority right now.
Attachment:
signature.asc
Description: PGP signature