github/ruby - ruby

Граф коммитов

Автор	SHA1	Сообщение	Дата
Aaron Patterson	6fce8c7980	Don't try compacting ivars on Classes that are "too complex" Too complex classes use a hash table to store ivs, and should always pin their IVs. We shouldn't touch those classes in compaction.	2023-11-20 16:09:48 -08:00
Jean Boussier	94c9f16663	Refactor rb_obj_evacuate_ivs_to_hash_table That function is a bit too low level to called from multiple places. It's always used in tandem with `rb_shape_set_too_complex` and both have to know how the object is laid out to update the `iv_ptr`. So instead we can provide two higher level function: - `rb_obj_copy_ivs_to_hash_table` to prepare a `st_table` from an arbitrary oject. - `rb_obj_convert_to_too_complex` to assign the new `st_table` to the old object, and safely free the old `iv_ptr`. Unfortunately both can't be combined into one, because `rb_obj_copy_ivar` need `rb_obj_copy_ivs_to_hash_table` to copy from one object to another.	2023-11-17 09:19:21 +01:00
Jean Boussier	81b35fe729	rb_evict_ivars_to_hash: get rid of the sahpe paramater It's only used to allocate the table with the right size, but in some case we were passing `rb_shape_get_shape_by_id(SHAPE_OBJ_TOO_COMPLEX)` which `next_iv_index` is a bit undefined. So overall we're better to just allocate a table the size of the existing object, it should be close enough in the vast majority of cases, and that's already a de-optimizaton path anyway.	2023-11-16 17:49:59 +01:00
Jean Boussier	d898e8d6f8	Refactor rb_shape_transition_shape_capa out Right now the `rb_shape_get_next` shape caller need to first check if there is capacity left, and if not call `rb_shape_transition_shape_capa` before it can call `rb_shape_get_next`. And on each of these it needs to checks if we got a TOO_COMPLEX back. All this logic is duplicated in the interpreter, YJIT and RJIT. Instead we can have `rb_shape_get_next` do the capacity transition when needed. The caller can compare the old and new shapes capacity to know if resizing is needed. It also can check for TOO_COMPLEX only once.	2023-11-08 11:02:55 +01:00
Nobuyoshi Nakada	4da6333615	Export functions used for builtins	2023-11-08 13:02:55 +09:00
Nobuyoshi Nakada	8becc889db	Suppress array-bounds warnings from gcc 13	2023-11-07 23:19:51 +09:00
Peter Zhu	1321df773b	Use shape capacity transitions for generic ivars This commit changes generic ivars to respect the capacity transition in shapes rather than growing the capacity independently.	2023-11-03 10:15:32 -04:00
Aaron Patterson	a3f66e09f6	geniv objects can become too complex	2023-10-24 10:52:06 -07:00
Jean Boussier	e5364ea496	rb_shape_transition_shape_capa: use optimal sizes transitions Previously the growth was 3(embed), 6, 12, 24, ... With this change it's now 3(embed), 8, 16, 32, 64, ... by default. However, since power of two isn't the best size for all allocators, if `malloc_usable_size` is vailable, we use it to discover the best offset. On Linux/glibc 2.35 for instance, the growth will be 3(embed), 7, 15, 31 to avoid wasting 8B per object. Test program: ```c size_t test(size_t slots) { size_t allocated = slots * VALUE_SIZE; void test_ptr = malloc(allocated); size_t wasted = malloc_usable_size(test_ptr) - allocated; free(test_ptr); fprintf(stderr, "slots = %lu, wasted_bytes = %lu\n", slots, wasted); return wasted; } int main(int argc, char argv[]) { size_t best_padding = 0; size_t padding = 0; for (padding = 0; padding <= 2; padding++) { size_t wasted = test(8 - padding); if (wasted == 0) { best_padding = padding; break; } } size_t index = 0; fprintf(stderr, "=============== naive ================\n"); size_t list_size = 4; for (index = 0; index < 10; index++) { test(list_size); list_size = 2; } fprintf(stderr, "=============== auto-padded (-%lu) ================\n", best_padding); list_size = 4; for (index = 0; index < 10; index ++) { test(list_size - best_padding); list_size = 2; } fprintf(stderr, "\n\n"); return 0; } ``` ``` ===== glibc ====== slots = 8, wasted_bytes = 8 slots = 7, wasted_bytes = 0 =============== naive ================ slots = 4, wasted_bytes = 8 slots = 8, wasted_bytes = 8 slots = 16, wasted_bytes = 8 slots = 32, wasted_bytes = 8 slots = 64, wasted_bytes = 8 slots = 128, wasted_bytes = 8 slots = 256, wasted_bytes = 8 slots = 512, wasted_bytes = 8 slots = 1024, wasted_bytes = 8 slots = 2048, wasted_bytes = 8 =============== auto-padded (-1) ================ slots = 3, wasted_bytes = 0 slots = 7, wasted_bytes = 0 slots = 15, wasted_bytes = 0 slots = 31, wasted_bytes = 0 slots = 63, wasted_bytes = 0 slots = 127, wasted_bytes = 0 slots = 255, wasted_bytes = 0 slots = 511, wasted_bytes = 0 slots = 1023, wasted_bytes = 0 slots = 2047, wasted_bytes = 0 ``` ``` ========== jemalloc ======= slots = 8, wasted_bytes = 0 =============== naive ================ slots = 4, wasted_bytes = 0 slots = 8, wasted_bytes = 0 slots = 16, wasted_bytes = 0 slots = 32, wasted_bytes = 0 slots = 64, wasted_bytes = 0 slots = 128, wasted_bytes = 0 slots = 256, wasted_bytes = 0 slots = 512, wasted_bytes = 0 slots = 1024, wasted_bytes = 0 slots = 2048, wasted_bytes = 0 =============== auto-padded (-0) ================ slots = 4, wasted_bytes = 0 slots = 8, wasted_bytes = 0 slots = 16, wasted_bytes = 0 slots = 32, wasted_bytes = 0 slots = 64, wasted_bytes = 0 slots = 128, wasted_bytes = 0 slots = 256, wasted_bytes = 0 slots = 512, wasted_bytes = 0 slots = 1024, wasted_bytes = 0 slots = 2048, wasted_bytes = 0 ```	2023-10-23 09:33:15 +02:00
Yusuke Endoh	591336a0f2	Avoid the pointer hack in RCLASS_EXT ... because GCC 13 warns it. ``` In file included from class.c:24: In function ‘RCLASS_SET_ALLOCATOR’, inlined from ‘class_alloc’ at class.c:251:5, inlined from ‘rb_module_s_alloc’ at class.c:1045:17: internal/class.h:159:43: warning: array subscript 0 is outside array bounds of ‘rb_classext_t[0]’ {aka ‘struct rb_classext_struct[]’} [-Warray-bounds=] 159 \| RCLASS_EXT(klass)->as.class.allocator = allocator; \| ^ ``` https://rubyci.s3.amazonaws.com/arch/ruby-master/log/20231015T030003Z.log.html.gz	2023-10-15 15:35:45 +09:00
Nobuyoshi Nakada	5fc9810bf3	Shorten `rb_strterm_literal_t` members	2023-10-14 11:08:43 +09:00
Nobuyoshi Nakada	a075c55d0c	Manage `rb_strterm_t` without imemo	2023-10-14 11:08:43 +09:00
Nobuyoshi Nakada	cb06b6632a	Remove unions in `rb_strterm` structs for alignment	2023-10-14 11:08:43 +09:00
Koichi Sasada	be1bbd5b7d	M:N thread scheduler for Ractors This patch introduce M:N thread scheduler for Ractor system. In general, M:N thread scheduler employs N native threads (OS threads) to manage M user-level threads (Ruby threads in this case). On the Ruby interpreter, 1 native thread is provided for 1 Ractor and all Ruby threads are managed by the native thread. From Ruby 1.9, the interpreter uses 1:1 thread scheduler which means 1 Ruby thread has 1 native thread. M:N scheduler change this strategy. Because of compatibility issue (and stableness issue of the implementation) main Ractor doesn't use M:N scheduler on default. On the other words, threads on the main Ractor will be managed with 1:1 thread scheduler. There are additional settings by environment variables: `RUBY_MN_THREADS=1` enables M:N thread scheduler on the main ractor. Note that non-main ractors use the M:N scheduler without this configuration. With this configuration, single ractor applications run threads on M:1 thread scheduler (green threads, user-level threads). `RUBY_MAX_CPU=n` specifies maximum number of native threads for M:N scheduler (default: 8). This patch will be reverted soon if non-easy issues are found. [Bug #19842]	2023-10-12 14:47:01 +09:00
Nobuyoshi Nakada	54f1d398d9	Make popcount bit-masks stricter Each bit run is upto the right shift count, so the each mask does not need more upper bits.	2023-10-05 20:03:54 +09:00
Aaron Patterson	d3574c117a	Move IO#readline to Ruby This commit moves IO#readline to Ruby. In order to call C functions, keyword arguments must be converted to hashes. Prior to this commit, code like `io.readline(chomp: true)` would allocate a hash. This commits moves the keyword "denaturing" to Ruby, allowing us to send positional arguments to the C API and avoiding the hash allocation. Here is an allocation benchmark for the method: ``` x = GC.stat(:total_allocated_objects) File.open("/usr/share/dict/words") do \|f\| f.readline(chomp: true) until f.eof? end p ALLOCATIONS: GC.stat(:total_allocated_objects) - x ``` Before this commit, the output was this: ``` $ make run ./miniruby -I./lib -I. -I.ext/common -r./arm64-darwin22-fake ./test.rb {:ALLOCATIONS=>707939} ``` Now it is this: ``` $ make run ./miniruby -I./lib -I. -I.ext/common -r./arm64-darwin22-fake ./test.rb {:ALLOCATIONS=>471962} ``` [Bug #19890] [ruby-core:114803]	2023-09-28 10:43:45 -07:00
Nobuyoshi Nakada	bab01d284c	[Feature #19790 ] Rename BUGREPORT_PATH as CRASH_REPORT	2023-09-25 22:57:28 +09:00
Nobuyoshi Nakada	70e8a08295	Add `--bugreport-path` option It has precedence over the environment variable `RUBY_BUGREPORT_PATH`.	2023-09-25 22:57:28 +09:00
Nobuyoshi Nakada	ac244938e8	Dump backtraces to an arbitrary stream	2023-09-25 22:57:28 +09:00
Peter Zhu	f43dac0df2	Add rb_hash_free for the GC to use	2023-09-24 09:07:52 -04:00
Nobuyoshi Nakada	4634405f7c	Stop exposing FrozenCore in headers Revert commit "Directly allocate FrozenCore as an ICLASS", `813a5f4fc4`.	2023-09-19 14:08:05 +09:00
Peter Zhu	12102d101a	Fix crash in WeakMap during compaction WeakMap can crash during compaction because the st_insert could allocate memory.	2023-09-06 14:20:23 -04:00
Peter Zhu	9a8398a18f	Introduce rb_gc_remove_weak If we're during incremental marking, then Ruby code can execute that deallocates certain memory buffers that have been called with rb_gc_mark_weak, which can cause use-after-free bugs.	2023-09-05 14:32:15 -04:00
John Hawthorn	d89b15cdce	Use end of char boundary in start_with? Previously we used the next character following the found prefix to determine if the match ended on a broken character. This had caused surprising behaviour when a valid character was followed by a UTF-8 continuation byte. This commit changes the behaviour to instead look for the end of the last character in the prefix. [Bug #19784] Co-authored-by: ywenc <ywenc@github.com> Co-authored-by: Nobuyoshi Nakada <nobu@ruby-lang.org>	2023-09-01 16:23:28 -07:00
Matt Valentine-House	322548180d	Prevent rb_gc_mark_values from pinning objects This is an internal only function not exposed to the C extension API. It's only use so far is from rb_vm_mark, where it's used to mark the values in the vm->trap_list.cmd array. There shouldn't be any reason why these cannot move. This commit allows them to move by updating their references during the reference updating step of compaction. To do this we've introduced another internal function rb_gc_update_values as a partner to rb_gc_mark_values. This allows us to refactor rb_gc_mark_values to not pin	2023-08-31 19:31:18 +01:00
Nobuyoshi Nakada	00ac3a64ba	Introduce `at_char_boundary` function	2023-08-26 08:58:02 +09:00
Peter Zhu	bfb395c620	Implement weak references in the GC [Feature #19783] This commit adds support for weak references in the GC through the function `rb_gc_mark_weak`. Unlike strong references, weak references does not mark the object, but rather lets the GC know that an object refers to another one. If the child object is freed, the pointer from the parent object is overwritten with `Qundef`. Co-Authored-By: Jean Boussier <byroot@ruby-lang.org>	2023-08-25 09:01:21 -04:00
Nobuyoshi Nakada	6aa16f9ec1	Move SCRIPT_LINES__ away from parse.y	2023-08-25 18:23:05 +09:00
卜部昌平	097b766b41	do not redefine a typedef duplicated typedef declaration was not allowed in C99.	2023-08-25 17:27:53 +09:00
卜部昌平	c914382db2	do not redefine a typedef duplicated typedef declaration was not allowed in C99.	2023-08-25 17:27:53 +09:00
Jeremy Evans	b635388a90	Check that __builtin_mul_overflow can handle long long Fixes [Bug #17646] Patch from xtkoba (Tee KOBAYASHI)	2023-08-24 20:47:23 -07:00
Takashi Kokubun	e210b899dc	Move the PC regardless of the leaf flag (#8232 ) Co-authored-by: Alan Wu <alansi.xingwu@shopify.com>	2023-08-16 20:28:33 -07:00
Nobuyoshi Nakada	cc0fca2729	Fix range of `--backtrace-limit` Also an option command line should have precedence over `RUBYOPT`.	2023-08-11 01:56:50 +09:00
Nobuyoshi Nakada	1c4a523006	Move `posix_signal` declaration internal with prefix `ruby_`	2023-07-17 21:31:59 +09:00
Jean Boussier	fa30b99c34	Implement Process.warmup [Feature #18885] For now, the optimizations performed are: - Run a major GC - Compact the heap - Promote all surviving objects to oldgen Other optimizations may follow.	2023-07-17 11:20:15 +02:00
Takashi Kokubun	d814722fb8	YJIT: Make ratio_in_yjit always available (#8064 )	2023-07-13 18:14:43 -04:00
Peter Zhu	87e1486d31	Remove unused references to the transient heap	2023-07-13 14:48:14 -04:00
Peter Zhu	3223181284	Remove RARRAY_CONST_PTR_TRANSIENT RARRAY_CONST_PTR now does the same things as RARRAY_CONST_PTR_TRANSIENT.	2023-07-13 14:48:14 -04:00
Nobuyoshi Nakada	db3b8f84f5	Set backtrace length limit at last Command line options should have higher precedence than the same options in shebang and `RUBYOPT`.	2023-07-13 22:59:26 +09:00
Nobuyoshi Nakada	1c2a4d9682	Shrink `ruby_cmdline_options_t` a bit	2023-07-13 22:59:26 +09:00
Peter Zhu	1e7b67f733	[Feature #19730 ] Remove transient heap	2023-07-13 09:27:33 -04:00
Nobuyoshi Nakada	913e01e80e	Stop allocating unused backref strings at `defined?`	2023-06-27 23:14:10 +09:00
Nobuyoshi Nakada	df5ae0a550	Use `rb_reg_nth_defined` instead of `rb_match_nth_defined`	2023-06-27 22:39:15 +09:00
Nobuyoshi Nakada	73529a12bb	Declare `RHASH_AR_TABLE` and `RHASH_ST_TABLE` return non-null	2023-06-23 14:54:50 +09:00
Nobuyoshi Nakada	b934976024	Prefer `0` over `NULL` as function pointers SunC warns use of `NULL`, pointer to data as function pointers.	2023-06-23 03:15:55 +09:00
Samuel Williams	a87bce86bb	Allow setting the name of a class or module. (#7483 ) Introduce `Module#set_temporary_name` for setting identifiers for otherwise anonymous modules/classes.	2023-06-21 16:49:51 +09:00
Peter Zhu	813a5f4fc4	Directly allocate FrozenCore as an ICLASS It's a bad idea to overwrite the flags as the garbage collector may have set other flags.	2023-06-14 10:42:40 -04:00
Nobuyoshi Nakada	81836c6cb9	Fix duplicate symbol errors when statically linking ripper	2023-06-12 20:22:01 +09:00
yui-knk	b481b673d7	[Feature #19719 ] Universal Parser Introduce Universal Parser mode for the parser. This commit includes these changes: * Introduce `UNIVERSAL_PARSER` macro. All of CRuby related functions are passed via `struct rb_parser_config_struct` when this macro is enabled. * Add CI task with 'cppflags=-DUNIVERSAL_PARSER' for ubuntu.	2023-06-12 18:23:48 +09:00
Peter Zhu	441302be1a	Remove RHASH_TRANSIENT_FLAG Hashes are no longer allocated on the transient heap.	2023-06-08 10:42:59 -04:00

1 2 3 4 5 ...

393 Коммитов