
NST Danish ASR Database (16 kHz) – reorganized

This database was created by Nordic Language Technology for the development of automatic speech recognition and dictation in Danish. In this updated version, the organization of the data has been altered to make the database more user-friendly. In the original version of the material, the files were organized in a specific folder structure with meaningful folder names. The file names, however, were not meaningful, and some files in different folders had identical names. This proved impractical, since users had to keep the original folder structure in order to use the data. The files have been renamed so that each file name is unique and meaningful regardless of the folder structure. The original metadata files were in spl format; these have been converted to JSON format. The metadata files are anonymized, and the text encoding has been converted from ANSI to UTF-8. Metadata and transcriptions are also available as CSV files. See the documentation file for a full description of the data and the changes that have been made to the database.
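As an illustration of the encoding change described above, the following is a minimal sketch of re-encoding a text file to UTF-8. It is not the procedure actually used for this database. "ANSI" usually refers to a Windows code page; the sketch assumes Windows-1252 (`cp1252`), which the description does not confirm. The function names and file names are hypothetical.

```python
import tempfile
from pathlib import Path


def reencode_to_utf8(src: Path, dst: Path, source_encoding: str = "cp1252") -> None:
    """Decode src with source_encoding and write the same text to dst as UTF-8."""
    # Work on raw bytes so that line endings are carried over unchanged.
    raw = src.read_bytes()
    dst.write_bytes(raw.decode(source_encoding).encode("utf-8"))


def roundtrip_demo(sample: str = "Rødgrød med fløde") -> str:
    """Write sample as cp1252, re-encode it, and return the UTF-8 text read back."""
    with tempfile.TemporaryDirectory() as tmp:
        src = Path(tmp) / "legacy.txt"  # hypothetical file name
        dst = Path(tmp) / "utf8.txt"    # hypothetical file name
        src.write_bytes(sample.encode("cp1252"))  # assumed source encoding
        reencode_to_utf8(src, dst)
        return dst.read_text(encoding="utf-8")


if __name__ == "__main__":
    print(roundtrip_demo())
```

The reorganized database already ships its metadata in UTF-8, so a step like this would only matter when working with the original release.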

Data and resources

Keywords

Additional info

URI: https://data.gov.dk/dataset/lang/https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-55/
Landing page: https://www.nb.no/sprakbanken/ressurskatalog/oai-nb-no-sbr-55/#resource-common-info
Harvested by: Datavejviser
Publication date: 11-03-2022
Last modified date: 11-03-2022
Update frequency: never
Coverage period:  / 
Subject(s):
  • 16.05.07.05 Language development
  • Education, culture and sport
Access rights: public
Conforms to:
Provenance statement:

Originally developed by Nordisk Språkteknologi (Norway), 1990. The National Library of Norway took over responsibility for the resource when Nordisk Språkteknologi went bankrupt in 2003.

Documentation