After an upstream fix, I can now properly distinguish all inflected forms
for German verbs. My suggestion is to use the following list of forms in
this order for all Germans verbs where the data is available. I will take
care of other languages once we agree on German forms.
other_written mood number person tense
-------------------- -------------- ---------- ---------- ----------
ich stehle IndicativeMood Singular First Present
du stiehlst IndicativeMood Singular Second Present
er/sie/es stiehlt IndicativeMood Singular Third Present
ich stahl IndicativeMood Singular First Past
ich stähle SubjunctiveMoo Singular First Past
ich stöhle SubjunctiveMoo Singular First Past
stiehl! ImperativeMood Singular Second Present
stehlt! ImperativeMood Plural Second Present
ich habe gestohlen IndicativeMood Singular First Perfect
Does this look good?
Is there a standardized way to encode the other columns? The RDF data for
them belongs to the "olia" namespace. I could add that name space to the
TEI files and add those as attributes.
Karl
On Sun, May 24, 2020 at 1:58 PM Karl Bartel <karl@karl.berlin> wrote:
Ok, I was oversimplifying for the sake of not dissecting each entry :).Of
course you are right. How are you going to proceed? Are you going toinclude
all forms or do you wait for things to change at DBnary/Wiktionary?
I'll dig a bit more in the dbnary data to see if there's a way to
select on the entries we're interested right now. This can take a
while, since it is a lot of data and different wiktionaries probably
have some variation in their structure.
Karl