Skip to main content

Leipzig Corpora Collection

The Leipzig Corpora Collection provides different tools and data for download, which are protected by copyright. For more details please refer to our terms of usage. (https://wortschatz.uni-leipzig.de/en/usage).

The corpora are automatically collected from carefully selected public sources without considering in detail the content of the contained text. No responsibility is taken for the content of the data. In particular, the views and opinions expressed in specific parts of the data remain exclusively with the authors.

If you use one of these corpora in your work we kindly ask you to cite this paper as: D. Goldhahn, T. Eckart and U. Quasthoff: Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages. In: Proceedings of the 8th International Language Resources and Evaluation (LREC'12), 2012.

Any data provided by Projekt Deutscher Wortschatz are subject to copyright. Permission for use is granted free of charge solely for non-commercial personal and scientific purposes licensed under the Creative Commons License CC BY-NC. Any use that exceeds the means of query provided by the WWW-Interface, any automated queries (except using our RESTful Webservices) and any commercial use of the data obtained is forbidden without explicit written permission by the copyright owner. All corpora provided for download are licensed under CC BY. If you are interested in larger data sets, please contact us.

Data og ressourcer

Nøgleord

Yderligere info

URI https://data.gov.dk/catalogue/lang-resources/langresources-MiscDatasets.rdf/leipzig-corpora-collection3
Destinationsside http://wortschatz.uni-leipzig.de/en/download/Danish
Høstes af Datavejviser Nej
Udgivelsesdato
Seneste ændringsdato
Opdateringsfrekvens ubekendt
Dækningsperiode  / 
Emne(r)
  • 16.05.07 Sprog og retskrivning
  • Uddannelse, kultur og sport
Adgangsrettigheder offentlig
Overholder
Proveniensudsagn
Dokumentation