Corpus Resources & DocumentationJørg Asmussen @ DSL

Corpus Documentation

This page lists technical reports, papers, manuals, and other writings related to the gathering of comprehensive LGP corpora of modern Danish carried out at DSL. Corpora are mainly used by lexicographers at the ordnet.dk project, cf. ordnet.dk publications.

General

The following documents pave the ground for the corpus work carried out at DSL. They were written as part of the DK-CLARIN Project to establish a framework for structuring and processing large text corpora.

Manuals

Specific

The following papers describe specific aspects of working with large corpora.