ruby/enc/trans
Martin Dürst 369ff79394 add encoding conversion from/to CESU-8
Add encoding conversion (transcoding) from UTF-8 to CESU-8
and back. CESU-8 is an encoding similar to UTF-8, but it encodes
code points above U+FFFF as two surrogates, each of which is
then encoded as if it were a UTF-8 code point. This
preserves the same binary sorting order as UTF-16. It is
also similar, although not identical, to an
encoding used internally by Java.
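
To make the description above concrete, here is a minimal hand-written sketch of the CESU-8 byte layout in plain Ruby. It is not the generated transcoder from enc/trans/cesu_8.trans; the helper names `cesu8_bytes` and `three_byte_form` are hypothetical and exist only for illustration, and the input is assumed to be a valid UTF-8 string.

```ruby
# Illustrative only: hand-rolled CESU-8 encoding, not the table-driven
# transcoder added by this commit. Helper names are hypothetical.

# Encode one 16-bit unit in the three-byte UTF-8 pattern
# 1110xxxx 10xxxxxx 10xxxxxx (applied here to surrogates as well).
def three_byte_form(unit)
  [0xE0 | (unit >> 12), 0x80 | ((unit >> 6) & 0x3F), 0x80 | (unit & 0x3F)]
end

# Return the CESU-8 bytes for a valid UTF-8 string as an array of integers.
def cesu8_bytes(str)
  str.each_codepoint.flat_map do |cp|
    if cp > 0xFFFF
      # Split into a UTF-16 surrogate pair, then encode each surrogate
      # as if it were an ordinary code point.
      v = cp - 0x10000
      high = 0xD800 | (v >> 10)
      low  = 0xDC00 | (v & 0x3FF)
      three_byte_form(high) + three_byte_form(low)
    else
      # Code points up to U+FFFF keep their normal UTF-8 bytes.
      [cp].pack("U").bytes
    end
  end
end

hex = cesu8_bytes("\u{10400}").map { |b| format("%02X", b) }.join(" ")
puts hex  # ED A0 81 ED B0 80
```

In this layout U+10000 (ED A0 80 ED B0 80) sorts before U+FFFF (EF BF BF), the same relative order as in UTF-16, whereas plain UTF-8 orders them the other way around. On a Ruby build that includes this commit, `"\u{10400}".encode("CESU-8").bytes` would be expected to yield the same six bytes, though that has not been checked here.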

This completes issue #15995.

enc/trans/cesu_8.trans: Add encoding conversion from/to CESU-8
test/ruby/test_transcode.rb: Add tests for above
2019-07-14 10:58:50 +09:00
..
CP * tool/transcode-tblgen.rb: add table generator from Citrus maps. 2008-08-31 16:23:04 +00:00
EMOJI * enc/trans/EMOJI/*.src, enc/trans/emoji*, enc/x-emoji.c, test/ruby/enc/test_emoji.rb, tool/enc-emoji-citrus-gen.rb, tool/enc-emoji4unicode.rb, tool/jisx0208.rb, tool/test/test_jisx0208.rb: new encodings to support emoji charsets, which are used by Japanese mobile phones [ruby-dev:40528]. Thanks Yoji Shidara for a lot of contribution. 2010-03-09 09:15:42 +00:00
GB replace copyrights by explanatory text in data files for GB2312/GB12345 mappings 2017-09-01 10:22:09 +00:00
JIS * remove trailing spaces. 2016-03-16 12:42:16 +00:00
ucm Add missing tables. 2010-11-22 11:35:55 +00:00
big5-hkscs-tbl.rb * enc/trans/big5-hkscs-tbl.rb: Update table as HKSCS-2008. 2010-11-24 16:40:38 +00:00
big5-uao-tbl.rb * enc/big5.c, enc/trans/big5.trans, enc/trans/big5-uao-tbl.rb, 2009-11-17 08:56:11 +00:00
big5.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
cesu_8.trans add encoding conversion from/to CESU-8 2019-07-14 10:58:50 +09:00
chinese.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
cp850-tbl.rb * enc/trans/single_byte.trans, cp850-tbl.rb, cp852-tbl.rb, 2008-12-09 09:39:25 +00:00
cp852-tbl.rb * enc/trans/single_byte.trans, cp850-tbl.rb, cp852-tbl.rb, 2008-12-09 09:39:25 +00:00
cp855-tbl.rb * enc/trans/single_byte.trans, cp850-tbl.rb, cp852-tbl.rb, 2008-12-09 09:39:25 +00:00
cp949-tbl.rb * tool/build-transcode: new file. 2008-08-05 12:32:13 +00:00
ebcdic.trans follow the change of the name 2015-12-15 13:11:33 +00:00
emoji-exchange-tbl.rb * enc/trans/EMOJI/*.src, enc/trans/emoji*, enc/x-emoji.c, test/ruby/enc/test_emoji.rb, tool/enc-emoji-citrus-gen.rb, tool/enc-emoji4unicode.rb, tool/jisx0208.rb, tool/test/test_jisx0208.rb: new encodings to support emoji charsets, which are used by Japanese mobile phones [ruby-dev:40528]. Thanks Yoji Shidara for a lot of contribution. 2010-03-09 09:15:42 +00:00
emoji.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
emoji_iso2022_kddi.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
emoji_sjis_docomo.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
emoji_sjis_kddi.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
emoji_sjis_softbank.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
escape.trans remove trailing spaces 2014-05-22 10:58:08 +00:00
euckr-tbl.rb euckr-tbl.rb: euro and registered signs 2014-08-19 13:22:46 +00:00
gb18030-tbl.rb * enc/trans/gb18030-tbl.rb: Fix omission of C1 region in code table 2009-12-03 11:29:33 +00:00
gb18030.trans remove trailing spaces 2014-05-22 10:58:08 +00:00
gbk-tbl.rb * enc/trans/gbk-tbl.rb: Add euro sign. [ruby-core:33094] 2010-11-08 00:50:13 +00:00
gbk.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
ibm437-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
ibm737-tbl.rb * enc/trans/ibm737-tbl.rb: greek code page. fixes #4738 2011-05-19 15:58:09 +00:00
ibm775-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
ibm852-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
ibm855-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
ibm857-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
ibm860-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
ibm861-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
ibm862-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
ibm863-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
ibm865-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
ibm866-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
ibm869-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
iso-8859-1-tbl.rb * enc/trans/single_byte.trans: refactoring to make it easier 2008-10-30 05:47:01 +00:00
iso-8859-2-tbl.rb * tool/build-transcode: new file. 2008-08-05 12:32:13 +00:00
iso-8859-3-tbl.rb * tool/build-transcode: new file. 2008-08-05 12:32:13 +00:00
iso-8859-4-tbl.rb * tool/build-transcode: new file. 2008-08-05 12:32:13 +00:00
iso-8859-5-tbl.rb * tool/build-transcode: new file. 2008-08-05 12:32:13 +00:00
iso-8859-6-tbl.rb * tool/build-transcode: new file. 2008-08-05 12:32:13 +00:00
iso-8859-7-tbl.rb * tool/build-transcode: new file. 2008-08-05 12:32:13 +00:00
iso-8859-8-tbl.rb * tool/build-transcode: new file. 2008-08-05 12:32:13 +00:00
iso-8859-9-tbl.rb * tool/build-transcode: new file. 2008-08-05 12:32:13 +00:00
iso-8859-10-tbl.rb * tool/build-transcode: new file. 2008-08-05 12:32:13 +00:00
iso-8859-11-tbl.rb * tool/build-transcode: new file. 2008-08-05 12:32:13 +00:00
iso-8859-13-tbl.rb * tool/build-transcode: new file. 2008-08-05 12:32:13 +00:00
iso-8859-14-tbl.rb * tool/build-transcode: new file. 2008-08-05 12:32:13 +00:00
iso-8859-15-tbl.rb * tool/build-transcode: new file. 2008-08-05 12:32:13 +00:00
iso-8859-16-tbl.rb * enc/trans/iso-8859-16-tbl.rb: add ISO-8859-16 converter. 2011-12-09 10:27:37 +00:00
iso2022.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
japanese.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
japanese_euc.trans * enc/trans/japanese_euc.trans, test/ruby/test_transcode.rb, 2013-06-02 14:36:41 +00:00
japanese_sjis.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
koi8-r-tbl.rb * enc/trans/single_byte.trans, cp850-tbl.rb, cp852-tbl.rb, 2008-12-09 09:39:25 +00:00
koi8-u-tbl.rb * enc/trans/single_byte.trans, cp850-tbl.rb, cp852-tbl.rb, 2008-12-09 09:39:25 +00:00
korean.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
maccroatian-tbl.rb * enc/trans/single_byte.trans, maccroatioan-tbl.rb, 2008-10-31 09:07:21 +00:00
maccyrillic-tbl.rb * enc/trans/single_byte.trans, maccroatioan-tbl.rb, 2008-10-31 09:07:21 +00:00
macgreek-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
maciceland-tbl.rb * enc/trans/single_byte.trans, maccroatioan-tbl.rb, 2008-10-31 09:07:21 +00:00
macroman-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
macromania-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
macturkish-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
macukraine-tbl.rb * enc/trans/single_byte.trans, macgreek-tbl.rb, macroman-tbl.rb, 2008-11-11 05:26:20 +00:00
newline.trans * enc/trans/newline.trans (fun_so_universal_newline): generate \n 2009-11-22 19:15:55 +00:00
single_byte.trans single_byte.trans: dead code 2016-10-28 14:18:52 +00:00
tis-620-tbl.rb * enc/trans/single_byte.trans, cp850-tbl.rb, cp852-tbl.rb, 2008-12-09 09:39:25 +00:00
transdb.c * encoding.c (enc_init_db): moved to enc/encdb.c. 2008-04-07 06:51:33 +00:00
utf8_mac-tbl.rb * enc/trans/utf8_mac-tbl.rb: fix r42789. 2013-09-04 06:40:39 +00:00
utf8_mac.trans * enc/trans/utf8_mac-tbl.rb: fix r42789. 2013-09-04 06:40:39 +00:00
utf_16_32.trans Supports static linking of extensions and encodings again. 2012-05-16 05:39:06 +00:00
windows-874-tbl.rb * enc/trans/single_byte.trans: adding WINDOWS-wwww encodings 2008-10-19 09:15:37 +00:00
windows-1250-tbl.rb * enc/trans/single_byte.trans: adding WINDOWS-wwww encodings 2008-10-19 09:15:37 +00:00
windows-1251-tbl.rb * enc/trans/single_byte.trans: adding WINDOWS-wwww encodings 2008-10-19 09:15:37 +00:00
windows-1252-tbl.rb * enc/trans/single_byte.trans: added windows-1252 2008-10-14 04:37:10 +00:00
windows-1253-tbl.rb * enc/trans/single_byte.trans: adding WINDOWS-wwww encodings 2008-10-19 09:15:37 +00:00
windows-1254-tbl.rb * enc/trans/single_byte.trans: adding WINDOWS-wwww encodings 2008-10-19 09:15:37 +00:00
windows-1255-tbl.rb Update windows-1255 table 2016-10-28 15:14:32 +00:00
windows-1256-tbl.rb * enc/trans/single_byte.trans: adding WINDOWS-wwww encodings 2008-10-19 09:15:37 +00:00
windows-1257-tbl.rb * enc/trans/single_byte.trans: adding WINDOWS-wwww encodings 2008-10-19 09:15:37 +00:00