github/ruby - ruby

Граф коммитов

Автор	SHA1	Сообщение	Дата
Lars Kanis	d403591b34	Add string encoding IBM720 alias CP720 (#3803 ) The mapping table is generated from the ICU project: https://github.com/unicode-org/icu/blob/master/icu4c/source/data/mappings/ibm-720_P100-1997.ucm Fixes bug 16233 : https://bugs.ruby-lang.org/issues/16233	2020-11-22 22:23:40 +09:00
卜部昌平	490010084e	sed -i '/rmodule.h/d'	2020-08-27 16:42:06 +09:00
卜部昌平	756403d775	sed -i '/r_cast.h/d'	2020-08-27 15:03:36 +09:00
卜部昌平	0da2a3f1fc	sed -i '\,2/extern.h,d'	2020-08-27 14:07:49 +09:00
Kazuhiro NISHIYAMA	946cd6c534	Use https instead of http	2020-07-28 19:51:54 +09:00
Jeremy Evans	ddd9704ae9	Encode ' as ' when using encode(xml: :attr) Fixes [Bug #16922]	2020-07-10 09:34:08 -07:00
卜部昌平	9e41a75255	sed -i 's\|ruby/impl\|ruby/internal\|' To fix build failures.	2020-05-11 09:24:08 +09:00
卜部昌平	d7f4d732c1	sed -i s\|ruby/3\|ruby/impl\|g This shall fix compile errors.	2020-05-11 09:24:08 +09:00
Nobuyoshi Nakada	b7e1eda932	Suppress warnings by gcc 10.1.0-RC-20200430 * Folding results should not be empty. If `OnigCodePointCount(to->n)` were 0, `for` loop using `fn` wouldn't execute and `ncs` elements are not initialized. ``` enc/unicode.c:557:21: warning: 'ncs[0]' may be used uninitialized in this function [-Wmaybe-uninitialized] 557 \| for (i = 0; i < ncs[0]; i++) { \| ~~~^~~ ``` * Cast to `enum yytokentype` Additional enums for scanner events by ripper are not included in `yytokentype`. ``` ripper.y:7274:28: warning: implicit conversion from 'enum <anonymous>' to 'enum yytokentype' [-Wenum-conversion] ```	2020-05-04 12:28:24 +09:00
卜部昌平	9e6e39c351	Merge pull request #2991 from shyouhei/ruby.h Split ruby.h	2020-04-08 13:28:13 +09:00
Kazuki Tsujimoto	b25ef4bf70	Suppress warnings: reserved for numbered parameter	2020-04-05 18:24:59 +09:00
Nobuyoshi Nakada	21d0b40de2	Added tooldir variable	2020-04-05 09:26:57 +09:00
卜部昌平	115fec062c	more on NULL versus functions. Function pointers are not void*. See also `ce4ea956d2` `8427fca49b`	2020-02-07 14:24:19 +09:00
卜部昌平	0c2d731ef2	update dependencies	2019-12-26 20:45:12 +09:00
卜部昌平	5e22f873ed	decouple internal.h headers Saves comitters' daily life by avoid #include-ing everything from internal.h to make each file do so instead. This would significantly speed up incremental builds. We take the following inclusion order in this changeset: 1. "ruby/config.h", where _GNU_SOURCE is defined (must be the very first thing among everything). 2. RUBY_EXTCONF_H if any. 3. Standard C headers, sorted alphabetically. 4. Other system headers, maybe guarded by #ifdef 5. Everything else, sorted alphabetically. Exceptions are those win32-related headers, which tend not be self- containing (headers have inclusion order dependencies).	2019-12-26 20:45:12 +09:00
Nobuyoshi Nakada	992aa2cda5	enc/x_emoji.h: fixed dead-links [ci skip] English version pages seem no longer provided.	2019-12-24 10:33:32 +09:00
Nobuyoshi Nakada	cc87037f1c	Fixed misspellings Fixed misspellings reported at [Bug #16437], missed and a new typo.	2019-12-22 22:49:17 +09:00
Nobuyoshi Nakada	e1b2341488	Update dependencies	2019-11-18 23:16:22 +09:00
Nobuyoshi Nakada	162cf2879a	Init function is need to link statically	2019-08-10 01:41:50 +09:00
Nobuyoshi Nakada	cecae8593a	Removed unnecessary headers	2019-08-10 01:05:09 +09:00
Nobuyoshi Nakada	88db6fa479	Use ENC_REPLICATE to copy an encoding	2019-08-10 01:04:39 +09:00
Yusuke Endoh	a8ba22cd32	Revert "Removed unused includes" This reverts commit `c9eb8f82e9`. The change caused "implicit declaration" warning and actual segfault. ``` /tmp/ruby/v2/src/trunk-gc-asserts/enc/gb2312.c: In function ‘Init_gb2312’: /tmp/ruby/v2/src/trunk-gc-asserts/enc/gb2312.c:6:31: warning: implicit declaration of function ‘rb_enc_find’ [-Wimplicit-function-declaration] rb_enc_register("GB2312", rb_enc_find("EUC-KR")); ^~~~~~~~~~~ /tmp/ruby/v2/src/trunk-gc-asserts/enc/gb2312.c:6:31: warning: passing argument 2 of ‘rb_enc_register’ makes pointer from integer without a cast [-Wint-conversion] <command-line>:0:19: note: expected ‘OnigEncoding {aka const struct OnigEncodingTypeST }’ but argument is of type ‘int’ /tmp/ruby/v2/src/trunk-gc-asserts/regenc.h:231:12: note: in expansion of macro ‘ONIG_ENC_REGISTER’ extern int ONIG_ENC_REGISTER(const char , OnigEncoding); ^~~~~~~~~~~~~~~~~ ```	2019-08-10 00:01:36 +09:00
Nobuyoshi Nakada	c9eb8f82e9	Removed unused includes	2019-08-09 23:08:30 +09:00
Nobuyoshi Nakada	715955ff27	Include ruby/assert.h in ruby/ruby.h so that assertions can be there	2019-07-14 17:58:03 +09:00
Takashi Kokubun	18603e9046	Update dependencies for `369ff79394` Just copy-pasting diff from https://travis-ci.org/ruby/ruby/jobs/558407687	2019-07-14 12:55:58 +09:00
Martin Dürst	369ff79394	add encoding conversion from/to CESU-8 Add encoding conversion (transcoding) from UTF-8 to CESU-8 and back. CESU-8 is an encoding similar to UTF-8, but encodes codepoints above U+FFFF as two surrogates, these surrogates again being encoded as if they were UTF-8 codepoints. This preserves the same binary sorting order as in UTF-16. It is also somewhat similar (although not exactly identical) to an encoding used internally by Java. This completes issue #15995. enc/trans/cesu_8.trans: Add encoding conversion from/to CESU-8 test/ruby/test_transcode.rb: Add tests for above	2019-07-14 10:58:50 +09:00
Nobuyoshi Nakada	d905ff61e6	Update dependencies	2019-07-09 13:47:07 +09:00
NARUSE, Yui	4275f09015	remove UNREACHABLE	2019-06-24 16:01:46 +09:00
NARUSE, Yui	7f64a0b4db	Add new encoding CESU-8 [Feature #15931 ]	2019-06-24 12:58:33 +09:00
duerst	50eea44a27	remove Unicode 12.0.0 related directory and generated files This completes issue #15195. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@67453 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-04-05 23:52:15 +00:00
duerst	7fe64d17d3	update to Unicode Version 12.1.0 (beta) Unicode Version 12.1.0 adds one single character, U+32FF SQUARE ERA NAME REIWA, for the new Japanese era starting on May 1st. 12.1.0 will be finalized only on May 7th, so we go with the beta version because further changes in the data we need are highly unlikely, and we want to make sure Ruby is ready for the new era. * common.mk: change UNICODE_VERSION to 12.1.0, UNICODE_BETA to YES * enc/unicode/12.1.0, enc/unicode/12.1.0/casefold.h, enc/unicode/12.1.0/name2ctype.h: add directory and generated data files for new version * lib/unicode_normalize/tables.rb: update for new character * test/ruby/test_regexp.rb: add test for character property age=12.1 * test/test_unicode_normalize.rb: add test for NFKC decomposition of new character This (mostly) completes issue #15195. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@67441 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-04-05 00:58:51 +00:00
duerst	f831ca6764	delete directory and files related to Unicode version 11.0.0 this completes and closes feature #15321 git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@67174 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-03-06 03:19:10 +00:00
duerst	cff7eefa07	update Unicode version (and Emoji version) to 12.0.0 - common.mk: set UNICODE_VERSION and UNICODE_EMOJI_VERSION to 12.0.0 - lib/unicode_normalize/tables.rb: update table data to Unicode version 12.0.0 - enc/unicode/12.0.0/casefold.h, enc/unicode/12.0.0/name2ctype.h: add generated files for Unicode version 12.0.0 This is the main commit for #15321. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@67169 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-03-06 01:55:19 +00:00
nobu	4fc656a2f3	Removed duplicate dependents git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@67072 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-02-14 05:40:08 +00:00
nobu	b2f4241509	Update dependencies, internal.h includes ruby.h git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@67057 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-02-12 12:31:55 +00:00
duerst	3628eae2e7	implement special behavior for Georgian for String#capitalize The modern Georgian script is special in that it has an 'uppercase' variant called MTAVRULI which can be used for emphasis of whole words, for screamy headlines, and so on. However, in contrast to all other bicameral scripts, there is no usage of capitalizing the first letter in a word or a sentence. Words with mixed capitalization are not used at all. We therefore implement special behavior for String#capitalize. Formally, we define String#capitalize as first applying String#downcase for the whole string, then using titlecase on the first letter. Because Georgian defines titlecase as the identity function both for MTAVRULI ('uppercase') and Mkhedruli (lowercase), this results in String#capitalize being equivalent to String#downcase for Georgian. This avoids undesirable mixed case. * enc/unicode.c: Actual implementation * string.c: Add mention of this special case for documentation * test/ruby/enc/test_case_mapping.rb: Add two tests, a general one that uses String#capitalize on some (including nonsensical) combinations of MTAVRULI and Mkhedruli, and a canary test to detect the potential assignment of characters to the currently open slots (holes) at U+1CBB and U+1CBC. * test/ruby/enc/test_case_comprehensive.rb: Tweak generation of expectation data. Together with r65933, this closes issue #14839. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66300 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-12-09 23:14:29 +00:00
duerst	c2d8078e3d	delete Unicode 10.0.0 related files, no longer needed [#14802 ] This line, and those below, will be ignored-- D enc/unicode/10.0.0 git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66295 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-12-09 02:02:45 +00:00
duerst	e824e21beb	remove obsolete data from unicode.c * unicode.c: Remove the arrays onigenc_unicode_GCB_ranges_GAZ, onigenc_unicode_GCB_ranges_E_Base, and onigenc_unicode_GCB_ranges_Emoji, because they are not needed anymore for Unicode 11.0.0. * regparse.c: Remove external declarations for above arrays. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66232 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-12-06 00:05:08 +00:00
duerst	66a6073859	update to Unicode 11.0.0 (main step, not complete yet) - common.mk: Change Unicode version to 11.0.0, and Emoji version to 11.0 - test/ruby/enc/test_emoji_breaks.rb: update hard-coded Emoji version - enc/unicode/11.0.0, enc/unicode/11.0.0/casefold.h, enc/unicode/name2ctype.h: Add generated files. Files for Unicode 10.0.0 will be removed once we are sure 11.0.0 works. - lib/unicode_normalize/tables.rb: Updated table. - regparse.c: Almost completely reimplement grapheme cluster detection in function node_extended_grapheme_cluster(). git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66213 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-12-05 08:10:24 +00:00
duerst	a96a594f99	solve the genie/zombie/wrestlers bug enc/unicode.c: - Add U+1F93C (WRESTLERS), U+1F9DE (GENIE), and U+1F9DF to onigenc_unicode_GCB_ranges_E_Base. - Add comments with character names. test/ruby/enc/test_emoji_breaks.rb: Activate tests for genie/zombie/wrestlers. This closes issue #15343. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66133 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-12-02 10:07:42 +00:00
nobu	26771cadc0	Added words in the comment at r65088 [ci skip] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66103 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-30 07:19:49 +00:00
nobu	7aaf5b2878	Embed the Emoji version git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66023 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-27 06:44:02 +00:00
duerst	fc6243a6a6	deal with ONIGENC_CASE_IS_TITLECASE flag on lowercase characters In the function onigenc_unicode_case_map() in enc/unicode.c, deal with the case that the ONIGENC_CASE_IS_TITLECASE flag is set on lowercase characters. This is in preparation for Georgian Mtavruli, which are uppercase but not titlecase, in Unicode 11.0.0. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65971 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-25 10:12:45 +00:00
duerst	2d5b57d63c	prepare for Unicode 11.0.0 update - enc/unicode/case-folding.rb: - Convert unpredicted case to actual flag setting - Eliminate an unused variable - Change a variable name to avoid a warning git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65933 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-23 06:45:26 +00:00
nobu	34cc6fef83	Make some internal functions static git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65764 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-16 06:52:00 +00:00
shyouhei	6732423b5e	enc/unicode.c: 'a' is bigger than 'A' In ASCII, 'a' is bigger than 'A'. Which means 'A' - 'a' is a negative number (-32, to be precise). In C, the type of 'a' and 'A' are signed int (cf: ISO/IEC 9899:1990 section 6.1.3.4). So 'A' - 'a' is also a signed int. It is `(signed int)-32`. The problem is, OnigCodePoint is unsigned int. Adding a negative number to a variable of OnigCodepoint (`code` here) introduces an unintentional cast of `(unsigned)(signed)-32`, which is 4,294,967,264. Adding this value to code then overflows, and the result eventually becomes normal codepoint. The series of operations are not a serious problem but because `code >= 'a'` holds, we can `(code - 'a') + 'A'` to reroute this. See also: https://github.com/k-takata/Onigmo/pull/107 git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65752 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-16 02:34:00 +00:00
duerst	a5818630f8	revert r65091, r65090 because ci fails git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65093 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-10-16 07:53:37 +00:00
duerst	33b5c610a6	update to Unicode 11.0.0 (basic step, not complete yet) - common.mk: Change Unicode version to 11.0.0 - enc/unicode/case-folding.rb, enc/unicode.c: Initial changes to deal with Gregorian Mtavruli. This should bring us up to the same level as e.g. Python 3.7, by following the Unicode tables exactly. But it will produce undesirable (mixed-case) results for String#capitalize. This will be addressed in a later commit. - enc/unicode/11.0.0, enc/unicode/11.0.0/casefold.h, enc/unicode/name2ctype.h: Add generated files. - lib/unicode_normalize/tables.rb: Updated table. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65091 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-10-16 07:01:55 +00:00
duerst	7223582866	add some comments to enc/unicode/case-folding.rb [ci skip] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65090 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-10-16 06:41:47 +00:00
nobu	0814870fed	Removed data for old Unicode [ci skip] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65088 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-10-16 05:14:59 +00:00

1 2 3 4 5 ...

648 Коммитов