Corpora: ELRA News 2/2

From: Valerie Mapelli (mapelli@elda.fr)
Date: Tue Jun 13 2000 - 16:21:35 MET DST


    [ We apologise for the duplicate posting of this announcement ]
    ___________________________________________________________
                                    ELRA
                    European Language Resources Association
                                   ELRA News
    ___________________________________________________________

     
                         *** AURORA Project Database ***
     
    ELRA is releasing two databases made within the ETSI STQ-AURORA DSR
    working group.
    _______________________________________
    AURORA Project Database 2.0
    _______________________________________
     
    The Aurora project is releasing a revised version of the Noisy TI digits
    database to follow on the work of ETSI. This CD set is a replacement for
    the previous set (version 1.0 consisted of 2 CDs while version 2.0 now
    consists of 4 CDs).

    This database is intended for the evaluation of front-end feature
    extraction algorithms in background noise but may also be used
    more widely by speech researchers to evaluate and compare the performance
    of noise robust speech recognition algorithms.
     
    Compared to version 1.0 the changes are as follows:
    1) The files are restored to the energy level of the original speech
    in the TI digits database.
    2) One of the noise types added to the speech (the babble noise) has
    been changed.
    3) There is an additional test set in which the noises are mismatched to
    those used in the training set.
    4) There is a convolutional distortion test.
    5) There is a clean training set.
     
    The CD ROM will be used for the next round of ETSI Aurora standards
    evaluation.

    _______________________________________
    AURORA Project Database 3.0 - Subset of SpeechDat-Car
    Finnish database
    _______________________________________

    This database is a subset of the Finnish SpeechDat-Car database,
    which was collected as part of the European Union-funded
    SpeechDat-Car project. It contains isolated and connected
    Finnish digits spoken in the following driving conditions inside a car:

    1. 0 km/hr with the car engine on
    2. 40-60 km/hr with the car windows closed
    3. 40-60 km/hr with the car windows open
    4. 100-120 km/hr with no music in the background
    5. 100-120 km/hr with music in the background
     
    The database also contains the software needed to run simulations
    using Entropic's HTK, which has been adopted as the "standard"
    HMM recogniser for the Aurora standard evaluation.

    =====================================
    For further information, please contact:

          ELRA/ELDA                      Tel    +33 01 43 13 33 33
          55-57 rue Brillat-Savarin      Fax    +33 01 43 13 33 30
          F-75013 Paris, France          E-mail mapelli@elda.fr
     
    or visit the online catalogue on our Web site:
     
          http://www.icp.grenet.fr/ELRA/home.html
          or http://www.elda.fr
    =====================================
     


