Skip to main content

DanPASS-korpus (Danish Phonetically Annotated Spontaneous Speech)

The DanPASS corpus was developed for research and applied research purposes. It consists of of non-scripted monologues and dialogues, recorded by 27 speakers, comprising a total of 73,227 running words, corresponding to 9 h and 46 min of speech. The monologues were recorded as one-way communication with an unseen partner where the speaker performed three different tasks: (s)he described a network consisting of various geometrical shapes in various colours, (s)he guided the listener through four different routes in a virtual city map, and (s)he instructed the listener how to build a house from its individual pieces. The dialogues are replicas of the HCRC map tasks. Annotation is performed in Praat. The sound files are segmented into prosodic phrases, words, and syllables. The files are supplied, in separate interval tiers, with an orthographical representation, detailed part-of-speech tags, simplified part-of-speech tags, a phonemic notation, a semi-narrow phonetic notation, a symbolic representation of the pitch relation between each stressed and post-tonic syllable, and a symbolic representation of the phrasal intonation.

An extensive description and documentation of the corpus and its numerous resources can be found at https://danpass.hum.ku.dk.

The corpus was presented at the 5th International Conference on Language Resources and Evaluation, Genova 24-24 May 2006.

Note that to open the sound files you need a password. Contact the publisher via email.

Data og ressourcer

Nøgleord

Yderligere info

URI https://data.gov.dk/dataset/lang/78e6f282-9214-4d42-8dff-6cf3180ed4cc
Destinationsside https://danpass.hum.ku.dk/
Høstes af Datavejviser
Udgivelsesdato 01-01-2006
Seneste ændringsdato 01-03-2016
Opdateringsfrekvens uregelmæssig
Dækningsperiode  / 
Emne(r)
  • 16.05.07 Sprog og retskrivning
  • Uddannelse, kultur og sport
Adgangsrettigheder offentlig
Overholder
Proveniensudsagn
Dokumentation