[ We apologise for the duplicate posting of this announcement ]
*********************************************************************************************************************************************************
ELRA
European Language Resources Association
ELRA News
*********************************************************************************************************************************************************
We are happy to announce new resources available via ELRA in the
framework of the Aurora project.
*** AURORA PROJECT DATABASE ***
[ELRA is releasing 5 databases made within the ETSI STQ-AURORA DSR
working group.]
AURORA PROJECT DATABASE 2.0
The Aurora project is releasing a revised version of the Noisy TI digits
database to follow on the work of ETSI. This CD set is a replacement for
the previous set (version 1.0 consisted of 2 CDs, while version 2.0 now
consists of 4 CDs).
This database is intended for the evaluation of front-end feature
extraction algorithms in background noise, but may also be used more
widely by speech researchers to evaluate and compare the performance of
noise-robust speech recognition algorithms.
Compared to version 1.0 the changes are as follows:
1) The files are restored to the energy level of the original speech
in the TI digits database.
2) One of the noise types added to the speech (the babble noise) has
been changed.
3) There is an additional test set in which the noises are mismatched
to those used in the training set.
4) There is a convolutional distortion test.
5) There is a clean training set.
The CD ROM will be used for the next round of ETSI Aurora standards
evaluation.
AURORA PROJECT DATABASE 3.0- SUBSET OF SPEECHDAT-CAR FINNISH DATABASE
This database is a subset of the SpeechDat-Car database in the Finnish
language, which was collected as part of the European Union funded
SpeechDat-Car project. It contains isolated and connected Finnish digits
spoken in the following driving conditions inside a car:
1. 0 km/hr with the car engine on
2. 40-60 km/hr with the car windows closed
3. 40-60 km/hr with the car windows open
4. 100-120 km/hr with no music in the background
5. 100-120 km/hr with music in the background
The database also contains the software needed to run simulations
using Entropic's HTK, which has been adopted as the "standard" HMM
recogniser for the Aurora standard evaluation.
AURORA PROJECT DATABASE 3.0- SUBSET OF SPEECHDAT-CAR DANISH DATABASE
The Aurora project was originally set up to establish a worldwide
standard for the feature extraction software which forms the core of the
front-end of a DSR (Distributed Speech Recognition) system. ETSI formally
adopted this activity as work items 007 and 008. The two work items within
ETSI are:
- ETSI DES/STQ WI007: Distributed Speech Recognition - Front-End Feature
  Extraction Algorithm & Compression Algorithm
- ETSI DES/STQ WI008: Distributed Speech Recognition - Advanced Feature
  Extraction Algorithm
This database is a subset of the SpeechDat-Car database in the Danish
language, which was collected as part of the European Union funded
SpeechDat-Car project. It contains isolated and connected Danish digits
spoken in the following noise and driving conditions inside a car:
1. High speed good road
2. Low speed rough road
3. Stopped with motor running
4. Town traffic
AURORA PROJECT DATABASE 3.0- SUBSET OF SPEECHDAT-CAR GERMAN DATABASE
This database is a subset of the SpeechDat-Car database in the German
language, which was collected as part of the European Union funded
SpeechDat-Car project. It contains isolated and connected German digits
spoken in the following noise and driving conditions inside a car:
1. High speed good road
2. Low speed rough road
3. Stopped with motor running
4. Town traffic
AURORA PROJECT DATABASE 3.0- SUBSET OF SPEECHDAT-CAR SPANISH DATABASE
This database is a subset of the SpeechDat-Car database in the Spanish
language, which was collected as part of the European Union funded
SpeechDat-Car project. It contains isolated and connected Spanish digits
spoken in the following noise and driving conditions inside a car:
1. Quiet environment: stopped with motor running.
2. Low noise: town traffic + low speed rough road.
3. High noise: high speed good road.