The IberSPEECH 2020 Challenge starts!
Zaragoza, March 1, 2020
IBERSPEECH 2020
Valladolid, November 18-20, 2020
The multimodal diarization and identification evaluation consists of segmenting broadcast audiovisual documents according to a closed set of speakers and faces, and linking the segments that originate from the same speaker and face. For this evaluation, a list of characters to recognize will be provided. All other characters in the audiovisual document will be discarded for evaluation purposes. For each segment, system outputs must state who is speaking and who appears in the image, restricted to the characters on the list. For each character, a set of face pictures and a short audiovisual document will be provided.
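As an informal illustration of the closed-set rule described above, the following Python sketch shows one way a system might represent its per-segment output and drop any character that is not on the provided list. This is not the official submission format, which is not specified here; the names `Segment` and `restrict_to_targets`, the use of seconds for timestamps, and the character names are all hypothetical.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, List, Optional


@dataclass(frozen=True)
class Segment:
    """One time span of a document (hypothetical layout, not an official format)."""
    start: float                 # segment start, in seconds (assumed unit)
    end: float                   # segment end, in seconds (assumed unit)
    speaker: Optional[str]       # who is speaking, or None if nobody is identified
    faces: FrozenSet[str]        # who is visible in the image


def restrict_to_targets(segments: Iterable[Segment],
                        targets: FrozenSet[str]) -> List[Segment]:
    """Keep only labels for characters on the target list.

    Labels for characters outside the list are removed; a segment that has
    no target speaker and no target face left is dropped entirely.
    """
    kept = []
    for seg in segments:
        speaker = seg.speaker if seg.speaker in targets else None
        faces = seg.faces & targets
        if speaker is None and not faces:
            continue
        kept.append(Segment(seg.start, seg.end, speaker, frozenset(faces)))
    return kept


if __name__ == "__main__":
    # Hypothetical character list and system hypotheses.
    targets = frozenset({"alice", "bob"})
    hypotheses = [
        Segment(0.0, 2.5, "alice", frozenset({"alice", "carol"})),
        Segment(2.5, 4.0, "carol", frozenset({"carol"})),
        Segment(4.0, 6.0, None, frozenset({"bob"})),
    ]
    for seg in restrict_to_targets(hypotheses, targets):
        print(seg)
```

In this sketch the speaker and the visible faces are kept as separate fields because a segment may show several listed characters while only one of them, or none, is speaking.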
This year, an optional scene diarization task is also proposed. Scene diarization consists of segmenting broadcast audiovisual documents according to a closed set of descriptors: day, night, urban, rural, summer, winter, indoor, outdoor and multiscreen.
The calendar for the Albayzín Multimodal Diarization and Identification evaluation is: