github/ruby - ruby

Commit graph

Author	SHA1	Message	Date
Jemma Issroff	d594a5a8bd	This commit implements the Object Shapes technique in CRuby. Object Shapes is used for accessing instance variables and representing the "frozenness" of objects. Object instances have a "shape" and the shape represents some attributes of the object (currently which instance variables are set and the "frozenness"). Shapes form a tree data structure, and when a new instance variable is set on an object, that object "transitions" to a new shape in the shape tree. Each shape has an ID that is used for caching. The shape structure is independent of class, so objects of different types can have the same shape. For example: ```ruby class Foo def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end class Bar def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end foo = Foo.new # `foo` has shape id 2 bar = Bar.new # `bar` has shape id 2 ``` Both `foo` and `bar` instances have the same shape because they both set instance variables of the same name in the same order. This technique can help to improve inline cache hits as well as generate more efficient machine code in JIT compilers. This commit also adds some methods for debugging shapes on objects. See `RubyVM::Shape` for more details. For more context on Object Shapes, see [Feature: #18776] Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org> Co-Authored-By: Eileen M. Uchitelle <eileencodes@gmail.com> Co-Authored-By: John Hawthorn <john@hawthorn.email>	2022-09-28 08:26:21 -07:00
Aaron Patterson	06abfa5be6	Revert this until we can figure out WB issues or remove shapes from GC Revert "* expand tabs. [ci skip]" This reverts commit `830b5b5c35`. Revert "This commit implements the Object Shapes technique in CRuby." This reverts commit `9ddfd2ca00`.	2022-09-26 16:10:11 -07:00
Jemma Issroff	9ddfd2ca00	This commit implements the Object Shapes technique in CRuby. Object Shapes is used for accessing instance variables and representing the "frozenness" of objects. Object instances have a "shape" and the shape represents some attributes of the object (currently which instance variables are set and the "frozenness"). Shapes form a tree data structure, and when a new instance variable is set on an object, that object "transitions" to a new shape in the shape tree. Each shape has an ID that is used for caching. The shape structure is independent of class, so objects of different types can have the same shape. For example: ```ruby class Foo def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end class Bar def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end foo = Foo.new # `foo` has shape id 2 bar = Bar.new # `bar` has shape id 2 ``` Both `foo` and `bar` instances have the same shape because they both set instance variables of the same name in the same order. This technique can help to improve inline cache hits as well as generate more efficient machine code in JIT compilers. This commit also adds some methods for debugging shapes on objects. See `RubyVM::Shape` for more details. For more context on Object Shapes, see [Feature: #18776] Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org> Co-Authored-By: Eileen M. Uchitelle <eileencodes@gmail.com> Co-Authored-By: John Hawthorn <john@hawthorn.email>	2022-09-26 09:21:30 -07:00
Takashi Kokubun	f6925fab85	Do not fork the process on --mjit-wait fork is for parallel compilation, but --mjit-wait cancels it. It's more useful to not fork it for binding.irb, debugging, etc.	2022-09-06 15:42:02 +09:00
S.H	78748a5de2	Introduce `usage_analysis_clear`	2022-08-29 14:42:19 +09:00
Takashi Kokubun	485019c2bd	Rename mjit_exec to jit_exec (#6262) * Rename mjit_exec to jit_exec * Rename mjit_exec_slowpath to mjit_check_iseq * Remove mjit_exec references from comments	2022-08-19 23:57:17 -07:00
Nobuyoshi Nakada	81fbc8d553	Move `mjit_exec` to vm.c	2022-08-17 15:23:01 +09:00
Nobuyoshi Nakada	ee864beb7c	Simplify around `USE_YJIT` macro (#6240) * Simplify around `USE_YJIT` macro - Use `USE_YJIT` macro only instead of `YJIT_BUILD`. - An intermediate macro `YJIT_SUPPORTED_P` is no longer used. * Bail out if YJIT is enabled on unsupported platforms	2022-08-15 13:05:12 -04:00
Peter Zhu	efb91ff19b	Rename rb_ary_tmp_new to rb_ary_hidden_new rb_ary_tmp_new suggests that the array is temporary in some way, but that's not true, it just creates an array that's hidden and not on the transient heap. This commit renames it to rb_ary_hidden_new.	2022-07-26 09:12:09 -04:00
Yusuke Endoh	8f7e188822	Add "rb_" prefixes to toplevel enum definitions ... as per ko1's request.	2022-07-22 23:10:24 +09:00
Takashi Kokubun	5b21e94beb	Expand tabs [ci skip] [Misc #18891]	2022-07-21 09:42:04 -07:00
Aaron Patterson	1341dea771	Prevent the stack from being marked twice This commit prevents the stack from being marked twice: once via the Fiber, and once via the Thread. It introduces an assertion to assert that the ec on the thread is the same as the ec on the Fiber being marked via the thread.	2022-07-20 13:45:55 -07:00
Alan Wu	744d17ff6c	Fix infinite loop when b_return TracePoint throws Previously, we didn't pop the frame that runs the TracePoint hook for b_return events for blocks running as methods (bmethods). In case the hook raises, that formed an infinite loop during stack unwinding in hook_before_rewind(). [Bug #18060]	2022-06-22 08:30:39 -04:00
Takashi Kokubun	23459e4dbb	Move RubyVM::MJIT to builtin Ruby just less C code to maintain	2022-06-15 10:52:37 -07:00
Koichi Sasada	d9984f39d3	remove `NON_SCALAR_THREAD_ID` support `NON_SCALAR_THREAD_ID` shows `pthread_t` is non-scalar (non-pointer) and only s390x is known platform. However, the supporting code is very complex and it is only used for debug print information. So this patch removes the support of `NON_SCALAR_THREAD_ID` and make the code simple.	2022-05-24 10:06:51 +09:00
Koichi Sasada	37ea07d59b	setup vm->main_ractor before `Init_native_thread()`	2022-05-20 17:37:59 +09:00
Koichi Sasada	eab99b1d4b	`rb_thread_t::serial` for debug `rb_thread_t::serial` is auto-incremented serial number for threads and it can overflow, it means the serial is not a ID for each thread, it is only for debug print. `RUBY_DEBUG_LOG` shows this information. Also skip EC related information if EC is NULL. This patch enable to use `RUBY_DEBUG_LOG` without setup EC.	2022-05-20 17:37:46 +09:00
Samuel Williams	f626998c4f	Delete autoload data from global features after autoload has completed. (#5910) * Update naming of critical section assertions macros. * Improved locking for autoload.	2022-05-17 00:50:02 +12:00
Samuel Williams	32de6097b2	Fix various autoload race conditions. (#5898) * Add RUBY_VM_CRITICAL_SECTION for detecting unexpected context switch. * Prevent race between GC mark and autoload setup. * Protect race on autoload state. * Avoid potential race condition when allocating `autoload_featuremap`. * Add NEWS entry for autoload fixes.	2022-05-15 16:07:12 +12:00
Alan Wu	f90549cd38	Rust YJIT In December 2021, we opened an [issue] to solicit feedback regarding the porting of the YJIT codebase from C99 to Rust. There were some reservations, but this project was given the go ahead by Ruby core developers and Matz. Since then, we have successfully completed the port of YJIT to Rust. The new Rust version of YJIT has reached parity with the C version, in that it passes all the CRuby tests, is able to run all of the YJIT benchmarks, and performs similarly to the C version (because it works the same way and largely generates the same machine code). We've even incorporated some design improvements, such as a more fine-grained constant invalidation mechanism which we expect will make a big difference in Ruby on Rails applications. Because we want to be careful, YJIT is guarded behind a configure option: ```shell ./configure --enable-yjit # Build YJIT in release mode ./configure --enable-yjit=dev # Build YJIT in dev/debug mode ``` By default, YJIT does not get compiled and cargo/rustc is not required. If YJIT is built in dev mode, then `cargo` is used to fetch development dependencies, but when building in release, `cargo` is not required, only `rustc`. At the moment YJIT requires Rust 1.60.0 or newer. The YJIT command-line options remain mostly unchanged, and more details about the build process are documented in `doc/yjit/yjit.md`. The CI tests have been updated and do not take any more resources than before. The development history of the Rust port is available at the following commit for interested parties: `1fd9573d8b` Our hope is that Rust YJIT will be compiled and included as a part of system packages and compiled binaries of the Ruby 3.2 release. 
We do not anticipate any major problems as Rust is well supported on every platform which YJIT supports, but to make sure that this process works smoothly, we would like to reach out to those who take care of building systems packages before the 3.2 release is shipped and resolve any issues that may come up. [issue]: https://bugs.ruby-lang.org/issues/18481 Co-authored-by: Maxime Chevalier-Boisvert <maximechevalierb@gmail.com> Co-authored-by: Noah Gibbs <the.codefolio.guy@gmail.com> Co-authored-by: Kevin Newton <kddnewton@gmail.com>	2022-04-27 11:00:22 -04:00
Koichi Sasada	03d21a4fb0	introduce struct `rb_native_thread` `rb_thread_t` contained `native_thread_data_t` to represent thread implementation dependent data. This patch separates them and rename it `rb_native_thread` and point it from `rb_thread_t`. Now, 1 Ruby thread (`rb_thread_t`) has 1 native thread (`rb_native_thread`).	2022-04-23 03:08:27 +09:00
Koichi Sasada	702dc116c4	refactoring thread inits in vm.c * `th_init` accepts vm and ractor. * remove `ruby_thread_init` because it is duplicated with `th_init`. * add some comments.	2022-04-23 00:50:13 +09:00
Jeremy Evans	4b14b2902a	Uncomment code to raise LocalJumpError for yield across thread through enum Not sure if this is the correct fix. It does raise LocalJumpError in the yielding thread as you would expect, but the value yielded to the calling thread is still yielded without an exception. Fixes [Bug #18649]	2022-04-21 20:48:27 -07:00
Jeremy Evans	0b091fdac6	Raise RuntimeError if Kernel#binding is called from a non-Ruby frame Check whether the current or previous frame is a Ruby frame in call_trace_func and rb_tracearg_binding before attempting to create a binding for the frame. Fixes [Bug #18487] Co-authored-by: Alan Wu <XrXr@users.noreply.github.com>	2022-04-06 19:14:03 -07:00
Kevin Newton	8ee4a82e8c	RubyVM.stat constant cache metrics (#5766 ) Before the new constant cache behavior, caches were invalidated by a single global variable. You could inspect the value of this variable with RubyVM.stat(:global_constant_state). This was mostly useful to verify the behavior of the VM or to test constant loading like in Rails. With the new constant cache behavior, we introduced RubyVM.stat(:constant_cache) which returned a hash with symbol keys and integer values that represented the number of live constant caches associated with the given symbol. Additionally, we removed the old RubyVM.stat(:global_constant_state). This was proven to be not very useful, so it doesn't help you diagnose constant loading issues. So, instead we added the global constant state back into the RubyVM output. However, that number can be misleading as now when you invalidate something like `Foo::Bar::Baz` you're actually invalidating 3 different lists of inline caches. This commit attempts to get the best of both worlds. We remove RubyVM.stat(:global_constant_state) like we did originally, as it doesn't have the same semantic meaning and it could be confusing going forward. Instead we add RubyVM.stat(:constant_cache_invalidations) and RubyVM.stat(:constant_cache_misses). These two metrics should provide enough information to diagnose any constant loading issues, as well as provide a replacement for the old global constant state.	2022-04-05 16:37:00 -04:00
Kevin Newton	42000664be	Bring back RubyVM.stat(:global_constant_state) This was removed as part of [Feature #18589]. But some applications were relying on this behavior. So bringing this back to make it better for backward compatibility going forward.	2022-04-04 20:41:05 +02:00
Kevin Newton	6068da8937	Finer-grained constant cache invalidation (take 2) This commit reintroduces finer-grained constant cache invalidation. After `8008fb7` got merged, it was causing issues on token-threaded builds (such as on Windows). The issue was that when you're iterating through instruction sequences and using the translator functions to get back the instruction structs, you're either using `rb_vm_insn_null_translator` or `rb_vm_insn_addr2insn2` depending if it's a direct-threading build. `rb_vm_insn_addr2insn2` does some normalization to always return to you the non-trace version of whatever instruction you're looking at. `rb_vm_insn_null_translator` does not do that normalization. This means that when you're looping through the instructions if you're trying to do an opcode comparison, it can change depending on the type of threading that you're using. This can be very confusing. So, this commit creates a new translator function `rb_vm_insn_normalizing_translator` to always return the non-trace version so that opcode comparisons don't have to worry about different configurations. [Feature #18589]	2022-04-01 14:48:22 -04:00
Jeremy Evans	d1d48cb690	Revert "Raise RuntimeError if Kernel#binding is called from a non-Ruby frame" This reverts commit `343ea9967e`. This causes an assertion failure with -DRUBY_DEBUG=1 -DRGENGC_CHECK_MODE=2	2022-04-01 07:22:49 -07:00
Nobuyoshi Nakada	42a0bed351	Prefix ccan headers (#4568) * Prefixed ccan headers * Remove unprefixed names in ccan/build_assert * Remove unprefixed names in ccan/check_type * Remove unprefixed names in ccan/container_of * Remove unprefixed names in ccan/list Co-authored-by: Samuel Williams <samuel.williams@oriontransfer.co.nz>	2022-03-30 20:36:31 +13:00
Nobuyoshi Nakada	69967ee64e	Revert "Finer-grained inline constant cache invalidation" This reverts commits for [Feature #18589]: * `8008fb7352` "Update formatting per feedback" * `8f6eaca2e1` "Delete ID from constant cache table if it becomes empty on ISEQ free" * `629908586b` "Finer-grained inline constant cache invalidation" MSWin builds on AppVeyor have been crashing since the merger.	2022-03-25 20:29:09 +09:00
Jeremy Evans	343ea9967e	Raise RuntimeError if Kernel#binding is called from a non-Ruby frame Check whether the current or previous frame is a Ruby frame in call_trace_func before attempting to create a binding for the frame. Fixes [Bug #18487] Co-authored-by: Alan Wu <XrXr@users.noreply.github.com>	2022-03-24 12:31:07 -07:00
Kevin Newton	629908586b	Finer-grained inline constant cache invalidation Current behavior - caches depend on a global counter. All constant mutations cause caches to be invalidated. ```ruby class A B = 1 end def foo A::B # inline cache depends on global counter end foo # populate inline cache foo # hit inline cache C = 1 # global counter increments, all caches are invalidated foo # misses inline cache due to `C = 1` ``` Proposed behavior - caches depend on name components. Only constant mutations with corresponding names will invalidate the cache. ```ruby class A B = 1 end def foo A::B # inline cache depends constants named "A" and "B" end foo # populate inline cache foo # hit inline cache C = 1 # caches that depend on the name "C" are invalidated foo # hits inline cache because IC only depends on "A" and "B" ``` Examples of breaking the new cache: ```ruby module C # Breaks `foo` cache because "A" constant is set and the cache in foo depends # on "A" and "B" class A; end end B = 1 ``` We expect the new cache scheme to be invalidated less often because names aren't frequently reused. With the cache being invalidated less, we can rely on its stability more to keep our constant references fast and reduce the need to throw away generated code in YJIT.	2022-03-24 09:14:38 -07:00
Peter Zhu	5f10bd634f	Add ISEQ_BODY macro Use ISEQ_BODY macro to get the rb_iseq_constant_body of the ISeq. Using this macro will make it easier for us to change the allocation strategy of rb_iseq_constant_body when using Variable Width Allocation.	2022-03-24 10:03:51 -04:00
Yuta Saito	dff70b50d0	[wasm] vm.c: stop unwinding to main for every vm_exec call by setjmp the original rb_wasm_setjmp implementation always unwinds to the root call frame to have setjmp compatible interface, and simulate sjlj's undefined behavior. Therefore, every vm_exec call unwinds to main, and a deep call stack makes setjmp call very expensive. The following snippet from optcarrot takes 5s even though it takes less than 0.3s on native. ``` [0x0, 0x4, 0x8, 0xc].map do \|attr\| (0..7).map do \|j\| (0...0x10000).map do \|i\| clr = i[15 - j] * 2 + i[7 - j] clr != 0 ? attr \| clr : 0 end end end ``` This patch adds a WASI specialized vm_exec which uses lightweight try-catch API without unwinding to the root frame. After this patch, the above snippet takes only 0.5s.	2022-02-18 18:28:18 +09:00
Kazuhiro NISHIYAMA	c00e360bc0	Fix error: old-style function definition https://rubyci.s3.amazonaws.com/debian-riscv64/ruby-master/log/20220122T050018Z.log.html.gz#miniruby ``` compiling vm_trace.c vm_trace.c: In function 'rb_vm_memsize_postponed_job_buffer': vm_trace.c:1599:1: error: old-style function definition [-Werror=old-style-definition] 1599 \| rb_vm_memsize_postponed_job_buffer() \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ```	2022-01-22 15:48:05 +09:00
Kevin Newton	fc6fd4c31e	Accurately report VM memsize Currently the calculation only counts the size of the struct. This commit adds the size of the associated st tables, id tables, and linked lists. Still missing is the size of the ractors and (potentially) the size of the object space.	2022-01-21 14:34:53 -08:00
Koichi Sasada	7e21b77dc6	T#dup (T < Proc) should return T's object T#dup (T < Proc) returns Proc object (not T) from Ruby 1.9. [Bug #17545]	2022-01-13 17:43:14 +09:00
Sutou Kouhei	c9396a3520	Show the target Proc on Ractor::IsolationError It's useful for debug.	2021-12-22 11:10:20 +09:00
Koichi Sasada	ad450c9fe5	make `overloaded_cme_table` truly weak key map `overloaded_cme_table` keeps cme -> monly_cme pairs to manage corresponding `monly_cme` for `cme`. The lifetime of the `monly_cme` should be longer than `monly_cme`, but the previous patch losts the reference to the living `monly_cme`. Now `overloaded_cme_table` values are always root (keys are only weak reference), it means `monly_cme` does not freed until corresponding `cme` is invalidated. To make managing easy, move `overloaded_cme_table` to `rb_vm_t`.	2021-12-21 15:21:30 +09:00
Takashi Kokubun	02ba0bda7e	Remove RubyVM::JIT (#5275) [Feature #18349] reverts [Feature #17490]	2021-12-15 20:02:30 -08:00
John Hawthorn	e307627b6c	Don't invalidate BOPs when aliases redefined Previously when redefining an alias of a BOP, we would unnecessarily invalidate the bop. For example: class String alias len length private :len end This commit avoids this by checking that the called_id on the method entry matches the original_id on the definition.	2021-12-14 15:22:51 -08:00
John Hawthorn	b7ae08992f	YJIT: Avoid unnecessary BOP invalidation Previously we would invalidate BOPs in YJIT when the method registered as a BOP was redefined on a subclass.	2021-12-14 15:22:51 -08:00
Takashi Kokubun	1a63468831	Prepare for removing RubyVM::JIT (#5262)	2021-12-13 23:07:46 -08:00
Koichi Sasada	cce331272b	`Ractor.make_shareable` checks proc's self `Ractor.make_shareable(proc_obj)` raises an `IsolationError` if the self of `proc_obj` is not a shareable object. [Bug #18243]	2021-12-09 16:20:04 +09:00
John Hawthorn	733500e9d0	Lazily create singletons on instance_{exec,eval} (#5146 ) * Lazily create singletons on instance_{exec,eval} Previously when instance_exec or instance_eval was called on an object, that object would be given a singleton class so that method definitions inside the block would be added to the object rather than its class. This commit aims to improve performance by delaying the creation of the singleton class unless/until one is needed for method definition. Most of the time instance_eval is used without any method definition. This was implemented by adding a flag to the cref indicating that it represents a singleton of the object rather than a class itself. In this case CREF_CLASS returns the object's existing class, but in cases that we are defining a method (either via definemethod or VM_SPECIAL_OBJECT_CBASE which is used for undef and alias). This also happens to fix what I believe is a bug. Previously instance_eval behaved differently with regards to constant access for true/false/nil than for all other objects. I don't think this was intentional. String::Foo = "foo" "".instance_eval("Foo") # => "foo" Integer::Foo = "foo" 123.instance_eval("Foo") # => "foo" TrueClass::Foo = "foo" true.instance_eval("Foo") # NameError: uninitialized constant Foo This also slightly changes the error message when trying to define a method through instance_eval on an object which can't have a singleton class. 
Before: $ ruby -e '123.instance_eval { def foo; end }' -e:1:in `block in <main>': no class/module to add method (TypeError) After: $ ./ruby -e '123.instance_eval { def foo; end }' -e:1:in `block in <main>': can't define singleton (TypeError) IMO this error is a small improvement on the original and better matches the (both old and new) message when definging a method using `def self.` $ ruby -e '123.instance_eval{ def self.foo; end }' -e:1:in `block in <main>': can't define singleton (TypeError) Co-authored-by: Matthew Draper <matthew@trebex.net> * Remove "under" argument from yield_under * Move CREF_SINGLETON_SET into vm_cref_new * Simplify vm_get_const_base * Fix leaf VM_SPECIAL_OBJECT_CONST_BASE Co-authored-by: Matthew Draper <matthew@trebex.net>	2021-12-02 15:53:39 -08:00
Alan Wu	9121e57a5f	Rework tracing for blocks running as methods The main impetus for this change is to fix [Bug #13392]. Previously, we fired the "return" TracePoint event after popping the stack frame for the block running as method (BMETHOD). This gave undesirable source location outputs as the return event normally fires right before the frame going away. The iseq for each block can run both as a block and as a method. To accommodate that, this commit makes vm_trace() fire call/return events for instructions that have b_call/b_return events attached when the iseq is running as a BMETHOD. The logic for rewriting to "trace_*" instruction is tweaked so that when the user listens to call/return events, instructions with b_call/b_return become trace variants. To continue to provide the return value for non-local returns done using the "return" or "break" keyword inside BMETHODs, the stack unwinding code is tweaked. b_return events now provide the same return value as return events for these non-local cases. A pre-existing test deemed not providing a return value for these b_return events as a limitation. This commit removes the checks for call/return TracePoint events that happen when calling into BMETHODs when no TracePoints are active. Technically, migrating just the return event is enough to fix the bug, but migrating both call and return removes our reliance on `VM_FRAME_FLAG_FINISH` and re-entering the interpreter when the caller is already in the interpreter.	2021-12-01 17:42:33 -05:00
Yusuke Endoh	feda058531	Refactor hacky ID tables to struct rb_ast_id_table_t The implementation of a local variable tables was represented as `ID*`, but it was very hacky: the first element is not an ID but the size of the table, and, the last element is (sometimes) a link to the next local table only when the id tables are a linked list. This change converts the hacky implementation to a normal struct.	2021-11-21 08:59:24 +09:00
Koichi Sasada	be71c95b88	`rb_method_optimized_t` for further extension Now `rb_method_optimized_t optimized` field is added to represent optimized method type.	2021-11-19 08:32:39 +09:00
Koichi Sasada	b2255153cf	`vm_empty_cc_for_super` Same as `vm_empty_cc`, introduce a global variable which has `.call_ = vm_call_super_method`. Use it if the `cme == NULL` on `vm_search_super_method`.	2021-11-17 22:21:42 +09:00
Nobuyoshi Nakada	833c69ee82	Convert IDs to Integers As the ID serial is 32bit value and internal IDs created in the parser are assigned from its maximum value, Symbol converted from it will exceed 32bit and overflow on 32bit platforms.	2021-11-08 15:14:34 +09:00


1039 commits
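
The Object Shapes commit message in this log (`d594a5a8bd`) describes shapes as a tree in which setting a new instance variable moves an object to a child shape, independent of the object's class. The following is only a toy model of that idea in plain Ruby. `ToyShape`, `ToyObject`, `ROOT_SHAPE`, `set_ivar` and `get_ivar` are hypothetical names invented for this sketch, and nothing here reflects how CRuby actually stores shapes or assigns IDs internally.

```ruby
# Toy model of a shape tree. All names are hypothetical; this is not CRuby's API.
class ToyShape
  attr_reader :id, :parent, :ivar_name

  @next_id = 0 # class-level counter used to hand out shape ids

  class << self
    # Hand out the next unused shape id.
    def allocate_id
      id = @next_id
      @next_id += 1
      id
    end
  end

  def initialize(parent = nil, ivar_name = nil)
    @id = ToyShape.allocate_id
    @parent = parent
    @ivar_name = ivar_name
    @children = {}
  end

  # Return the child shape reached by setting `name`, creating it on first use.
  def transition(name)
    @children[name] ||= ToyShape.new(self, name)
  end

  # Instance variable names on the path from the root to this shape, in order.
  def ivar_names
    names = []
    shape = self
    while shape.parent
      names.unshift(shape.ivar_name)
      shape = shape.parent
    end
    names
  end
end

ROOT_SHAPE = ToyShape.new # shape id 0: no instance variables set

class ToyObject
  attr_reader :shape

  def initialize
    @shape = ROOT_SHAPE
    @values = []
  end

  # Setting a new variable transitions to a child shape; re-setting keeps the shape.
  def set_ivar(name, value)
    index = @shape.ivar_names.index(name)
    if index
      @values[index] = value
    else
      @shape = @shape.transition(name)
      @values << value
    end
  end

  def get_ivar(name)
    index = @shape.ivar_names.index(name)
    index && @values[index]
  end
end

foo = ToyObject.new
foo.set_ivar(:@a, 1)
foo.set_ivar(:@b, 2)

bar = ToyObject.new
bar.set_ivar(:@a, 10)
bar.set_ivar(:@b, 20)

baz = ToyObject.new
baz.set_ivar(:@b, 1) # different order, so a different path through the tree
baz.set_ivar(:@a, 2)
```

In this sketch `foo` and `bar` end up on the same `ToyShape` because they set the same names in the same order, while `baz` lands on a different one. A cache keyed on the shape id could then reuse the slot index for `@b` across `foo` and `bar`, which is the kind of inline-cache benefit the commit message mentions.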