Corpora: ELRA New Resources

From: Valerie Mapelli (mapelli@elda.fr)
Date: Tue Aug 08 2000 - 17:45:43 MET DST

  • Next message: Constantin ORASAN: "Corpora: Agents for NLP"

    [ We apologise for the duplicate posting of this announcement ]
    ___________________________________________________________
                                    ELRA
                    European Language Resources Association
                                   ELRA News
    ___________________________________________________________

                         *** ELRA NEW RESOURCES ***

    We are happy to announce a new resource available via ELRA:

    _______________________________________
    ELRA-S0085 BABEL Bulgarian Database
    _______________________________________

    The BABEL Database is a speech database that was produced
    by a research consortium funded by the European Union
    under the COPERNICUS programme (COPERNICUS Project
    1304). The project began in March 1995 and was completed
    in December 1998. The objective was to create a database of
    languages of Central and Eastern Europe in parallel to the
    EUROM1 databases produced by the SAM Project (funded by
    the ESPRIT programme).

    The BABEL consortium included six partners from Central
    and Eastern Europe (who had the major responsibility of
    planning and carrying out the recording and labelling) and six
    from Western Europe (whose role was mainly to advise and in
    some cases to act as host to BABEL researchers). The five
    databases collected within the project concern the Bulgarian,
    Estonian, Hungarian, Polish, and Romanian languages.

    The Bulgarian database consists of the basic "common" set which is:

    - Many Talker Set: 30 males, 30 females; each to read twice
    the five blocks of numbers (each of which contains 10 numbers),
    3 connected passages and one “filler” passage.
    - Few Talker Set: 5 males, 5 females, selected from the above
    group: each to read 5 times the blocks of numbers, 15 connected
    passages and 2 “filler” passages, and 5 repetitions of the lists of
    monosyllables.
    - Very Few Talker Set: 1 male, 1 female, selected from Few
    Talker set: each to read blocks of monosyllables in carrier sentences
    and five repetitions of the context words.

    And the extension part: semi-spontaneous answers to questions:
    the answers were recorded by the 10 Few Talker Set speakers.

    The other languages will be available soon.

    =====================================
    For further information, please contact:

         ELRA/ELDA Tel +33 01 43 13 33 33
         55-57 rue Brillat-Savarin Fax +33 01 43 13 33 30
         F-75013 Paris, France E-mail mapelli@elda.fr

    or visit the online catalogue on our Web site:

         http://www.icp.grenet.fr/ELRA/home.html
         or http://www.elda.fr
    =====================================



    This archive was generated by hypermail 2b29 : Tue Aug 08 2000 - 17:46:45 MET DST