ruby/enc/trans
Martin Dürst 369ff79394 add encoding conversion from/to CESU-8
Add encoding conversion (transcoding) from UTF-8 to CESU-8
and back. CESU-8 is an encoding similar to UTF-8, but it encodes
code points above U+FFFF as a surrogate pair, with each surrogate
then encoded as if it were a UTF-8 code point. This
preserves the same binary sorting order as in UTF-16. It is
also somewhat similar (although not exactly identical) to an
encoding used internally by Java.

This completes issue #15995.

enc/trans/cesu_8.trans: Add encoding conversion from/to CESU-8
test/ruby/test_transcode.rb: Add tests for above
2019-07-14 10:58:50 +09:00
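
The surrogate scheme described in the commit message above can be sketched by hand. This is a minimal illustration, not the code in cesu_8.trans: the helper names `cesu8_bytes` and `three_byte_form` are hypothetical, and the input is assumed to be a valid UTF-8 string.

```ruby
# Hypothetical helper: lay out one 16-bit unit in the three-byte UTF-8 pattern.
def three_byte_form(unit)
  [
    0xE0 | (unit >> 12),          # lead byte carries the top 4 bits
    0x80 | ((unit >> 6) & 0x3F),  # continuation byte, middle 6 bits
    0x80 | (unit & 0x3F)          # continuation byte, low 6 bits
  ]
end

# Hypothetical helper: return the CESU-8 byte sequence for a valid UTF-8 string.
def cesu8_bytes(str)
  str.each_codepoint.flat_map do |cp|
    if cp > 0xFFFF
      # Split the code point into a UTF-16 surrogate pair ...
      v    = cp - 0x10000
      high = 0xD800 | (v >> 10)
      low  = 0xDC00 | (v & 0x3FF)
      # ... and encode each surrogate as if it were a code point of its own.
      [high, low].flat_map { |s| three_byte_form(s) }
    else
      # Code points up to U+FFFF are encoded exactly as in UTF-8.
      [cp].pack("U").bytes
    end
  end
end

hex = cesu8_bytes("\u{10000}").map { |b| "%02X" % b }.join(" ")
puts hex  # prints "ED A0 80 ED B0 80"

# U+10000 sorts before U+FFFF in CESU-8, matching UTF-16 code unit order.
puts (cesu8_bytes("\u{10000}") <=> cesu8_bytes("\u{FFFF}"))  # prints -1
```

In a Ruby build that includes this commit, `"\u{10000}".encode("CESU-8").bytes` would be expected to give the same six bytes, though that call is not exercised in the sketch.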
CP
EMOJI
GB replace copyrights by explanatory text in data files for GB2312/GB12345 mappings 2017-09-01 10:22:09 +00:00
JIS * remove trailing spaces. 2016-03-16 12:42:16 +00:00
ucm
big5-hkscs-tbl.rb
big5-uao-tbl.rb
big5.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
cesu_8.trans add encoding conversion from/to CESU-8 2019-07-14 10:58:50 +09:00
chinese.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
cp850-tbl.rb
cp852-tbl.rb
cp855-tbl.rb
cp949-tbl.rb
ebcdic.trans follow the change of the name 2015-12-15 13:11:33 +00:00
emoji-exchange-tbl.rb
emoji.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
emoji_iso2022_kddi.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
emoji_sjis_docomo.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
emoji_sjis_kddi.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
emoji_sjis_softbank.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
escape.trans remove trailing spaces 2014-05-22 10:58:08 +00:00
euckr-tbl.rb euckr-tbl.rb: euro and registered signs 2014-08-19 13:22:46 +00:00
gb18030-tbl.rb
gb18030.trans remove trailing spaces 2014-05-22 10:58:08 +00:00
gbk-tbl.rb
gbk.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
ibm437-tbl.rb
ibm737-tbl.rb
ibm775-tbl.rb
ibm852-tbl.rb
ibm855-tbl.rb
ibm857-tbl.rb
ibm860-tbl.rb
ibm861-tbl.rb
ibm862-tbl.rb
ibm863-tbl.rb
ibm865-tbl.rb
ibm866-tbl.rb
ibm869-tbl.rb
iso-8859-1-tbl.rb
iso-8859-2-tbl.rb
iso-8859-3-tbl.rb
iso-8859-4-tbl.rb
iso-8859-5-tbl.rb
iso-8859-6-tbl.rb
iso-8859-7-tbl.rb
iso-8859-8-tbl.rb
iso-8859-9-tbl.rb
iso-8859-10-tbl.rb
iso-8859-11-tbl.rb
iso-8859-13-tbl.rb
iso-8859-14-tbl.rb
iso-8859-15-tbl.rb
iso-8859-16-tbl.rb * enc/trans/iso-8859-16-tbl.rb: add ISO-8859-16 converter. 2011-12-09 10:27:37 +00:00
iso2022.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
japanese.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
japanese_euc.trans * enc/trans/japanese_euc.trans, test/ruby/test_transcode.rb, 2013-06-02 14:36:41 +00:00
japanese_sjis.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
koi8-r-tbl.rb
koi8-u-tbl.rb
korean.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
maccroatian-tbl.rb
maccyrillic-tbl.rb
macgreek-tbl.rb
maciceland-tbl.rb
macroman-tbl.rb
macromania-tbl.rb
macturkish-tbl.rb
macukraine-tbl.rb
newline.trans
single_byte.trans single_byte.trans: dead code 2016-10-28 14:18:52 +00:00
tis-620-tbl.rb
transdb.c
utf8_mac-tbl.rb * enc/trans/utf8_mac-tbl.rb: fix r42789. 2013-09-04 06:40:39 +00:00
utf8_mac.trans * enc/trans/utf8_mac-tbl.rb: fix r42789. 2013-09-04 06:40:39 +00:00
utf_16_32.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
windows-874-tbl.rb
windows-1250-tbl.rb
windows-1251-tbl.rb
windows-1252-tbl.rb
windows-1253-tbl.rb
windows-1254-tbl.rb
windows-1255-tbl.rb Update windows-1255 table 2016-10-28 15:14:32 +00:00
windows-1256-tbl.rb
windows-1257-tbl.rb