The Annotated types table

The Annotated types table lists all annotated types and their properties. It is based on the annotated version of the underlying corpus, i.e., each token has been morphosyntactically analyzed. Annotation consists of

  • the assignment of a part-of-speech tag (tagging)
  • the assignment of a lemma (canonical form, headword) (lemmatisation)

to each token. Each entry in the Annotated types table is defined by its unique combination of three components: the type (orthographic form), the part-of-speech tag, and the lemma.

For example, singen occurs as an infinitival verb (tagged VVINF), but also as a finite verb (first or third person plural, tagged VVFIN).

The third constitutive component of an annotated type is its Lemma. The corpus type singen and the inflected forms singt, sangen, etc. are all mapped to the lemma singen.

Any two annotated types differ in at least one of the three components.
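
The identity of an annotated type can be pictured as a three-part record. The following is a minimal sketch in Python, not the table's actual schema: the class name AnnotatedType, its field names, and the helper differing_components are hypothetical, and the tag VVFIN for singt is an illustrative assumption.

```python
from typing import NamedTuple


class AnnotatedType(NamedTuple):
    """Hypothetical record for one annotated type (field names are assumed)."""
    form: str   # the orthographic type
    pos: str    # the part-of-speech tag
    lemma: str  # the lemma


def differing_components(a: AnnotatedType, b: AnnotatedType) -> list:
    """Return the names of the components in which two annotated types differ."""
    return [name for name, x, y in zip(AnnotatedType._fields, a, b) if x != y]


infinitive = AnnotatedType("singen", "VVINF", "singen")
finite = AnnotatedType("singen", "VVFIN", "singen")

# A set keeps each unique combination once; the repeated entry collapses.
entries = {
    infinitive,
    finite,
    AnnotatedType("singt", "VVFIN", "singen"),   # tag assumed for illustration
    AnnotatedType("singen", "VVINF", "singen"),  # same combination as `infinitive`
}

print(differing_components(infinitive, finite))
print(len(entries))
```

Here the two entries for singen share form and lemma and differ only in the part-of-speech tag, which is enough to make them distinct annotated types.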

The Annotated types table provides access to annotated type frequencies. For example, it provides separate frequencies for singen/VVINF as an infinitive (16.5/mil.) and for singen/VVFIN as a finite verb (4.7/mil.). The frequency of the purely orthographically defined type singen, which can be obtained from the Types table, is the sum of the two annotated type frequencies: 21.2/mil. So the Annotated types table provides more specific frequency information than the Types table.
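
The relation between the two tables can be sketched as a simple aggregation. This is an illustrative Python sketch only: the row layout and the function name type_frequencies are assumptions, and the two frequencies are the ones quoted above.

```python
from collections import defaultdict

# Hypothetical rows: (type, PoS tag, lemma, frequency per million tokens).
# Only the two frequencies come from the text; the layout is assumed.
ANNOTATED_TYPES = [
    ("singen", "VVINF", "singen", 16.5),
    ("singen", "VVFIN", "singen", 4.7),
]


def type_frequencies(rows):
    """Sum annotated type frequencies for each orthographic type."""
    totals = defaultdict(float)
    for form, _pos, _lemma, freq in rows:
        totals[form] += freq
    return dict(totals)


totals = type_frequencies(ANNOTATED_TYPES)
print(f"singen: {totals['singen']:.1f}/mil.")  # prints "singen: 21.2/mil."
```

Collapsing the rows by orthographic form discards the tag distinction, which is why the Types table can only report the combined figure.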

The Annotated types table offers the following filters or variables for output:

In addition to that, the Annotated types table has all the filters from the Types and Lemmata tables: