Corpora: Re: Unicode

From: LITTLECHILD Peter (peter.littlechild@swift.com)
Date: Tue Dec 12 2000 - 13:43:24 MET

  • Next message: Chapman, Wendy: "Corpora: learning regular expressions: responses"

    "Mcenery, Tony" wrote:
    > So while I think Unicode is the way for corpus work to go in the future,
    > treading that path with non-alphabetic writing systems at this moment in time
    > is somewhat difficult.

    I had a quick look at Unicode as a way of solving the problem of accented characters in Welsh. My first impressions were that I
    would have to throw away my favourite editors and have all my Perl and Balise programs re-written by rocket scientists.

    But maybe that's too pessimistic..

    <from>
    <name>peter littlechild</name>
    <section>publishing tools and technology</section>
    <dept>user documentation</dept>
    <firm>s.w.i.f.t. sc</firm>
    </from>



    This archive was generated by hypermail 2b29 : Tue Dec 12 2000 - 13:43:58 MET