
The PAROLE Corpus is a text corpus consisting of more than 1,500 text extracts amounting to a total of 250,000 running words. Each word has been supplied with information on part of speech and inflection.
The PAROLE Corpus was compiled at DSL within a project subsidized by the EU called PAROLE (Preparatory Action for linguistic Resources Organization for Language Engineering) during the period 1996-1998. The project was motivated by the EU Commission's desire to provide collections of structured written electronic texts (corpora) for all EU languages, as well as having morphological and syntactic word databases derived from these available for national and international language technology research and industry.
You can download the PAROLE Corpus from DSL's website along with instructions in Danish (comprehensive) and in English (brief).
PAROLE documentation - Danish (PDF)