
The Corpus of The Danish Dictionary (DDO) contains 40 million running words from the period 1983-92, distributed over more than 43,000 text samples taken from a broad variety of text types and media. Each text sample is accompanied by a number of carefully determined types of information about the sample itself and about its author or sender. A detailed description of these information types, and of the composition of the DDO Corpus in general, can be found in Norling-Christensen & Asmussen 1998: The Corpus of the Danish Dictionary (Lexikos 8, Stellenbosch).
The DDO Corpus was established as part of the project The Danish Dictionary and was compiled in the period 1991-93. It has since served as the primary source during the compilation of The Danish Dictionary. The downloadable version amounts to 36 million running words, because samples totalling approx. 4 million words were omitted owing to usage restrictions imposed by the text suppliers. In addition, the information about each individual text sample has been simplified, and the information about the author or sender of the text has been reduced.
The DDO Corpus is downloadable as a Semaskop module from the DSL website.
The corpus module consists of three components: ddo1.exe, ddo2.exe, ddo3.exe.
All three components must be downloaded. Each component is installed in the same way as the corpus tool Semaskop.
The corpus module does not work until all three components and the corpus tool Semaskop itself have been installed.
The order in which the different parts are installed is unimportant.
Because the corpus module is large, downloading it over a dial-up telephone modem is not recommended.
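Since the module only works once all three components are present, it can be useful to confirm that the download is complete before installing. The following is a minimal sketch in Python, not part of Semaskop or the DSL distribution; the function name missing_components and the idea of a single download folder are assumptions made for illustration. Only the three file names come from the list above.

```python
from pathlib import Path

# The three component files named in the installation notes.
REQUIRED_COMPONENTS = ("ddo1.exe", "ddo2.exe", "ddo3.exe")


def missing_components(download_dir):
    """Return the names of required component files not found in download_dir.

    Hypothetical helper: it assumes all components were saved to one folder.
    It checks only that each file exists, not that it is complete or valid.
    """
    folder = Path(download_dir)
    return [name for name in REQUIRED_COMPONENTS if not (folder / name).is_file()]
```

Calling missing_components with the path of the download folder returns an empty list when all three files are there; otherwise it lists the files that still need to be downloaded. As the installation order is unimportant, the check does not depend on any particular sequence.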