Welcome to the public corpus documentation site of the Society for Danish Language and Literature, DSL.
The site is maintained by Jørg Asmussen, DSL.
This site comprises all kinds of documentation and other writings related to the gathering of comprehensive LGP corpora of modern Danish carried out at DSL. These corpora are mainly used by the ordnet.dk project, cf. ordnet.dk publications.
This site also comprises a number of NLP resources. Some of these resources are exclusively available at ja-korpus.dsl.lan which means that they are accessible only from within the dsl domain (DSL only). Other resources come as password protected zip files (password). Some are freely available (free).
This documentation paves the ground for the corpus work at DSL.
Some of the resources listed below are available for public download. However, most of them require a password to unzip. To obtain a password, please send a mail to firstname.lastname@example.org with a brief description of the purpose(s) you intend to use the resources for.
If you download resources from this site you agree to to the following conditions:
Due to copyright reasons, the corpora listed below comprise sentences or shorter excerpts in arbitrary order. They do not contain full texts.