gecko-dev/intl/unicharutil/tools
smontagu%smontagu.org 5e9cd949e0 Update data files to Unicode 5.0. Bug 345156, r=jshin, sr=jag 2006-07-30 12:50:15 +00:00
..
data
.cvsignore
MUTTUCData.txt
Makefile.in Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-18 14:21:17 +00:00
README.txt bug 210502 : update the normalization data file to 4.1.0 and add README.txt on the way to generate data files : r=smontagu, sr=dbaron 2005-04-05 23:00:13 +00:00
UCDATAREADME.txt
ccmapbin.pl bug 224337 : fix misalgined accesss in pre-composed CCMap (on RISC architectures) : r/sr=brendan, a=dbaron 2003-12-15 00:32:16 +00:00
format.txt
gencasetable.pl Correct a typo in gencasetable.pl and regenerate casetable.h. Bug 337976, r=jshin, sr=jag 2006-05-15 14:07:03 +00:00
gencattable.pl Update data files to Unicode 5.0. Bug 345156, r=jshin, sr=jag 2006-07-30 12:50:15 +00:00
gentransliterate.pl Improve detection of numbers in the transliterator. Bug 345024, r=jshin, sr=rbs 2006-07-20 12:15:51 +00:00
ucgendat.c

README.txt

* How to generate various properties files in intl/unicharutils/tables and
  header files in intl/unicharutils/src
   ( written by Jungshik Shin for bug 210502 
     https://bugzilla.mozilla.org/show_bug.cgi?id=210502 on 2005-04-05 )

1. Grab the latest version of idnkit at http://www.nic.ad.jp/en/idn/index.html
   (http://www.nic.ad.jp/ja/idn/idnkit/download/index.html )

2. There are three files we need in the kit:
   generate_normalize_data.pl, UCD.pm and SparseMap.pm

3. a. Download the following Unicode data files  :
     CaseFolding.txt,CompositionExclusions.txt, 
     SpecialCasing.txt, UnicodeData.txt

   b. Rename UnicodeData.txt to UnicodeData-Latest.txt

   The latest version is, as of this writing, in 
   ftp://ftp.unicode.org/Public/4.1.0/ucd

4. a. Run generate_normalize_data.pl and save the output to a temporary file
   b. Edit the file 
      - remove the case folding part (search for 'Lowercase' and delete 
        all the lines following it) because we have separate scripts for that, 
      - replace 'unsigned short' and 'unsigned long' with 'PRUnichar' and 
        'PRUint32'
   c. Replace the actual source part (after the license) of 
      intl/unicharutil/src/normalization_data.h with the file you edited.

5. Generate casetable.h and cattable.h with  gencasetable.pl and gencattable.pl
   Just running them will put casetable.h and cattable.h in the right place.