github/ruby - ruby

Commit Graph

Author	SHA1	Message	Date
Takashi Kokubun	beeee54880	YJIT: Replace perf_fn! with a simpler macro (#9971 )	2024-02-14 15:57:24 -08:00
Alan Wu	6f9ca7f453	YJIT: Use i32 over isize for ctx.sp_opnd() (#9968 ) It eventually casts it to i32 anyways, and a lot of callers already have an i32, so using isize was just adding unnecessary casts.	2024-02-14 21:50:58 +00:00
Alan Wu	72f8883a2f	YJIT: Extract keyword handling to make gen_send_iseq() shorter This should make it easier to grok the order of operations.	2024-02-14 16:09:11 -05:00
Aaron Patterson	c9ed59c2e2	Update yjit/src/codegen.rs Co-authored-by: Takashi Kokubun <takashikkbn@gmail.com>	2024-02-14 12:49:36 -08:00
Aaron Patterson	7943cb22f6	Consider rb_str_getbyte as leaf sometimes If YJIT knows the parameter to rb_str_getbyte is a fixnum, then I think we can consider the function to be a leaf	2024-02-14 12:49:36 -08:00
Peter Zhu	1d3b306753	Move rb_class_allocate_instance from gc.c to object.c	2024-02-14 13:43:02 -05:00
Takashi Kokubun	7177731282	YJIT: Add --yjit-perf=codegen option (#9957 )	2024-02-14 09:09:14 -08:00
Alan Wu	ee3b4bec0e	YJIT: Simplify Kernel#send guards and admit more cases (#9956 ) Previously, our compile time check rejected dynamic symbols (e.g. what String#to_sym could return) even though we could handle them just fine. The runtime guards for the type of method name were also overly restrictive and didn't accept dynamic symbols. Fold the type check into the rb_get_symbol_id() and take advantage of the guard already checking for 0. This also avoids generating the same call twice in case the same method name is presented as different types.	2024-02-14 11:19:04 -05:00
Aaron Patterson	a93f4e3d1a	byteslice can return nil so we have to push an "unknown" value	2024-02-13 10:10:19 -08:00
Alan Wu	fdaef7a72e	YJIT: Fixup kwrest stack base I was a little rushed and didn't notice that it was still using the final stack size even though we don't grow the stack before kwrest handling anymore. Oh well, we got a new test out of it. Fix: `cbdabd5890`	2024-02-13 12:16:36 -05:00
Aaron Patterson	c35fea8509	Specialize String#byteslice(a, b) (#9939 ) * Specialize String#byteslice(a, b) This adds a specialization for String#byteslice when there are two parameters. This makes our protobuf parser go from 5.84x slower to 5.33x slower ``` Comparison: decode upstream (53738 bytes): 7228.5 i/s decode protobuff (53738 bytes): 1236.8 i/s - 5.84x slower Comparison: decode upstream (53738 bytes): 7024.8 i/s decode protobuff (53738 bytes): 1318.5 i/s - 5.33x slower ``` * Update yjit/src/codegen.rs --------- Co-authored-by: Maxime Chevalier-Boisvert <maximechevalierb@gmail.com>	2024-02-13 16:20:27 +00:00
Alan Wu	cbdabd5890	YJIT: Fix kwrest calls setting SP with uninit values We did stack_push() and then saved the SP without writing to the slots of the new values first, which caused the GC to mark uninitialized values. Should fix crashes like https://github.com/ruby/ruby/actions/runs/7877298133/job/21493179294	2024-02-12 16:39:11 -05:00
Alan Wu	2131d04f43	YJIT: Add support for `kwrest` parameters Now that `...` uses `kwrest` instead of regular splat and ruby2keywords, we need to support these types of methods to support `...` well.	2024-02-12 13:57:24 -05:00
Alan Wu	e878bbd641	Allow `foo(**nil, &block_arg)` Previously, `**nil` by itself worked, but if you add a block argument, it raised a conversion error. The presence of the block argument shouldn't change how keyword splat works. See: <https://bugs.ruby-lang.org/issues/20064>	2024-02-12 13:02:50 -05:00
Takashi Kokubun	e4d3e652ff	YJIT: Prefer an overloaded cme if available (#9913 ) YJIT: Prefer an overloaded cme if applicable	2024-02-12 12:56:44 -05:00
Takashi Kokubun	b9f25b100f	YJIT: Fix insufficient type guards (#9911 )	2024-02-12 11:55:52 -05:00
Takashi Kokubun	e7b0a01002	YJIT: Add top ISEQ call counts to --yjit-stats (#9906 )	2024-02-09 22:12:24 +00:00
Takashi Kokubun	717adb564b	YJIT: Fallback megamorphic opt_case_dispatch (#9894 )	2024-02-09 12:12:45 -05:00
Takashi Kokubun	5d32e328d9	YJIT: Refactor recv_known_class to Option (#9895 )	2024-02-09 00:53:01 +00:00
Takashi Kokubun	e2aa00ca66	YJIT: Remove unnecessary casts for chain_depth (#9893 )	2024-02-08 15:52:45 -08:00
Takashi Kokubun	5cbca9110c	YJIT: Allow tracing a counted exit (#9890 ) * YJIT: Allow tracing a counted exit * Avoid clobbering caller-saved registers	2024-02-08 15:47:02 -08:00
Takashi Kokubun	1936278461	YJIT: Maintain MapToLocal that is just upgraded (#9876 )	2024-02-08 12:03:47 -05:00
John Hawthorn	3397449846	YJIT: Report invalidation counts in non-stats mode (#9878 ) The `invalidation_count` and `invalidate_*` counters are all incremented using `incr_counter!` without a guard on stats mode, so they can be made always available. This could be helpful in investigating where, how often, and what types of invalidations are occurring in a production system.	2024-02-08 11:03:59 -05:00
Takashi Kokubun	50b4ef29b2	YJIT: Use jit_prepare_call_with_gc as much as possible (#9874 ) * YJIT: Use jit_prepare_call_with_gc as much as possible * Stop assuming vm_defined doesn't make a call	2024-02-08 11:03:07 -05:00
Takashi Kokubun	b74c8abd11	YJIT: Skip pushing a frame for Hash#empty? (#9875 )	2024-02-08 10:22:07 -05:00
Alan Wu	44f0dc622a	YJIT: Allow popping before falling back Popping but not generating any code before returning `None` was allowed before fallbacks were introduced so this is restoring that support in the same way. The included test used to trip an assert due to popping too much.	2024-02-06 16:17:54 -05:00
Alan Wu	bae3e5b29a	YJIT: No need to reject splat+zsuper There is nothing special about argument handling when it comes to zsuper if you look around in the VM. Everything passes removing these fallback reasons. It was ~16% on `railsbench`.	2024-02-05 16:13:22 -05:00
Maxime Chevalier-Boisvert	5a87e9e2b5	YJIT: add missing jge comparison instruction (#9819 ) I ran into this while trying to implement setbyte, was surprised to find out we hadn't implemented it yet.	2024-02-02 17:09:31 -05:00
Takashi Kokubun	21031f0a84	YJIT: Float arithmetics are actually leaf with these guards in YJIT. The previous commit was to fix "conflict" between two PRs, but I actually wanted to use it here, which is why I filed the other one.	2024-01-31 10:13:27 -08:00
Takashi Kokubun	2220c4cf42	YJIT: s/jit_prepare_routine_call/jit_prepare_non_leaf_call/	2024-01-31 10:10:03 -08:00
Takashi Kokubun	09427f51a2	YJIT: Add codegen for Float arithmetics (#9774 ) * YJIT: Add codegen for Float arithmetics * Add Flonum and Fixnum tests	2024-01-31 17:58:47 +00:00
Takashi Kokubun	cc9bbbdd80	YJIT: Add jit_prepare_for_gc function (#9775 ) * YJIT: Add jit_prepare_for_gc function * s/jit_prepare_routine_call/jit_prepare_non_leaf_call/ * s/jit_prepare_for_gc/jit_prepare_call_with_gc/ * Use jit_prepare_call_with_gc for leaf builtin	2024-01-31 17:54:10 +00:00
Takashi Kokubun	c1f8d974a8	YJIT: Specialize splatkw on T_HASH (#9764 ) * YJIT: Specialize splatkw on T_HASH * Fix a typo Co-authored-by: Alan Wu <XrXr@users.noreply.github.com> * Fix a few more comments --------- Co-authored-by: Alan Wu <XrXr@users.noreply.github.com>	2024-01-30 14:59:53 -05:00
Maxime Chevalier-Boisvert	fe5590e464	YJIT: add specialized codegen for fixnum XOR (#9763 )	2024-01-30 14:57:13 -05:00
Maxime Chevalier-Boisvert	bc10b958d2	YJIT: print warning when disasm options used without a dev build (#9744 ) * YJIT: print warning when disasm options used without a dev build I was confused for a few minutes the other day when --yjit-dump-disasm printed nothing, so I figured this would be useful for end-users (and future me). * Fix lone extraneous space	2024-01-29 16:17:12 -05:00
Maxime Chevalier-Boisvert	adf29c9a98	YJIT: add asm comment when we clear local types (#9713 ) Small PR to add a comment when we clear local variable types, so we can be aware that it's happening when looking at the disasm.	2024-01-29 15:36:34 +00:00
Takashi Kokubun	7567e4e1e1	YJIT: Fix exits on splatkw instruction (#9711 )	2024-01-26 00:22:27 +00:00
Takashi Kokubun	2034e6ad5a	YJIT: Support concattoarray and pushtoarray (#9708 )	2024-01-25 21:45:58 +00:00
Alan Wu	f0224adf2f	YJIT: Assert lea source operand type	2024-01-25 16:17:03 -05:00
Takashi Kokubun	d4cc77e7b6	YJIT: Add a counter for invokebuiltin exits (#9696 )	2024-01-25 11:23:26 -05:00
Jeremy Evans	b8516d6d01	Add pushtoarray VM instruction This instruction is similar to concattoarray, but it takes the number of arguments to push to the array, removes that number of arguments from the stack, and adds them to the array now at the top of the stack. This allows `f(*a, 1)` to allocate only a single array on the caller side (which can be reused on the callee side in the case of `def f(*a)`). Prior to this commit, `f(*a, 1)` would generate 3 arrays: * a dupped by splatarray true * 1 wrapped in array by newarray * a dupped again by concatarray Instructions Before for `a = []; f(*a, 1)`: ``` 0000 newarray 0 ( 1)[Li] 0002 setlocal_WC_0 a@0 0004 putself 0005 getlocal_WC_0 a@0 0007 splatarray true 0009 putobject_INT2FIX_1_ 0010 newarray 1 0012 concatarray 0013 opt_send_without_block <calldata!mid:f, argc:1, ARGS_SPLAT|FCALL> 0015 leave ``` Instructions After for `a = []; f(*a, 1)`: ``` 0000 newarray 0 ( 1)[Li] 0002 setlocal_WC_0 a@0 0004 putself 0005 getlocal_WC_0 a@0 0007 splatarray true 0009 putobject_INT2FIX_1_ 0010 pushtoarray 1 0012 opt_send_without_block <calldata!mid:f, argc:1, ARGS_SPLAT|ARGS_SPLAT_MUT|FCALL> 0014 leave ``` With these changes, method calls to Ruby methods should implicitly allocate at most one array. Ignore typeprof bundled gem failure due to unrecognized instruction.	2024-01-24 18:25:55 -08:00
Jeremy Evans	22e488464a	Add VM_CALL_ARGS_SPLAT_MUT callinfo flag This flag is set when the caller has already created a new array to handle a splat, such as for `f(*a, b)` and `f(*a, *b)`. Previously, if `f` was defined as `def f(*a)`, these calls would create an extra array on the callee side, instead of using the new array created by the caller. This modifies `setup_args_core` to set the flag whenever it would add a `splatarray true` instruction. However, when `splatarray true` is changed to `splatarray false` in the peephole optimizer, to avoid unnecessary allocations on the caller side, the flag must be removed. Add `optimize_args_splat_no_copy` and have the peephole optimizer call that. This significantly simplifies the related peephole optimizer code. On the callee side, in `setup_parameters_complex`, set `args->rest_dupped` to true if the flag is set. This takes a similar approach for optimizing regular splats that was previously used for keyword splats in `d2c41b1bff` (via VM_CALL_KW_SPLAT_MUT).	2024-01-24 18:25:55 -08:00
Alan Wu	2cc7a56ec7	YJIT: Avoid leaks by skipping objects with a singleton class For receiver with a singleton class, there are multiple vectors YJIT can end up retaining the object. There is a path in jit_guard_known_klass() that bakes the receiver into the code, and the object could also be kept alive indirectly through a path starting at the CME object baked into the code. To avoid these leaks, avoid compiling calls on objects with a singleton class. See: https://github.com/Shopify/ruby/issues/552 [Bug #20209]	2024-01-24 18:06:58 -05:00
Maxime Chevalier-Boisvert	1702528258	YJIT: reduce default exec mem size to 48MiB (#9685 ) YJIT: reduce default exec mem size to 48MiB Based on user feedback from @jhawthorn and others. Better for small and memory-constrained deployments. NOTE: This commit should be included in the next Ruby 3.3.x point release. @xrxr should we tag someone specific?	2024-01-24 18:14:36 +00:00
Maxime Chevalier-Boisvert	23d4682926	YJIT: fix small bug causing jit_rb_int_rshift to side-exit (#9684 ) YJIT: fix bug causing jit_rb_int_rshift to side-exit The nqueens benchmark was showing zero performance improvement because we immediately side-exited as soon as the shift amount changed. If the amount changes, we want to fall back to the C function call, not side-exit.	2024-01-24 13:09:08 -05:00
Alan Wu	974b141ffe	YJIT: Reuse get_array_{ptr,len}	2024-01-23 19:47:26 -05:00
Alan Wu	ac1e9e443a	YJIT: Fix ruby2_keywords splat+rest and drop bogus checks YJIT didn't guard for ruby2_keywords hash in case of splat calls that land in methods with a rest parameter, creating incorrect results. The compile-time checks didn't correspond to any actual effects of ruby2_keywords, so it was masking this bug and YJIT was needlessly refusing to compile some code. About 16% of fallback reasons in `lobsters` was due to the ISeq check. We already handle the tagging part with exit_if_supplying_kw_and_has_no_kw() and should now have a dynamic guard for all splat cases. Note for backporting: You also need `7f51959ff1`. [Bug #20195]	2024-01-23 19:22:57 -05:00
Takashi Kokubun	27c1dd8634	YJIT: Allow inlining ISEQ calls with a block (#9622 ) * YJIT: Allow inlining ISEQ calls with a block * Leave a TODO comment about u16 inline_block	2024-01-23 19:36:23 +00:00
Alan Wu	7f51959ff1	YJIT: Move guard up for a case of splat+rest Previously, YJIT put the guard for having enough items to extract from splat array at a place where the side exit is invalid, so if the guard fails, YJIT could raise something other than ArgumentError. Move the guard up to a place before any stack manipulation. [Bug #20204]	2024-01-23 11:04:16 -05:00
dependabot[bot]	a64e26c5d5	Bump capstone from 0.10.0 to 0.11.0 in /yjit (#9653 ) Bumps [capstone](https://github.com/capstone-rust/capstone-rs) from 0.10.0 to 0.11.0. - [Release notes](https://github.com/capstone-rust/capstone-rs/releases) - [Changelog](https://github.com/capstone-rust/capstone-rs/blob/master/CHANGELOG.md) - [Commits](https://github.com/capstone-rust/capstone-rs/compare/capstone-v0.10.0...capstone-v0.11.0) --- updated-dependencies: - dependency-name: capstone dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-01-22 14:44:00 -08:00
dependabot[bot]	bd1895990c	Bump shlex from 1.1.0 to 1.3.0 in /yjit/bindgen (#9652 ) Bumps [shlex](https://github.com/comex/rust-shlex) from 1.1.0 to 1.3.0. - [Changelog](https://github.com/comex/rust-shlex/blob/master/CHANGELOG.md) - [Commits](https://github.com/comex/rust-shlex/commits) --- updated-dependencies: - dependency-name: shlex dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-01-22 14:11:07 -08:00
Alan Wu	703eee7745	YJIT: Drop extra arguments passed by yield (#9596 ) Support dropping extra arguments passed by `yield` in blocks. For example `10.times { work }` drops the count argument. This is common enough that it's about 3% of fallback reasons in `lobsters`. Only support simple cases where the surplus arguments are at the top of the stack, that way they just need to be popped, which takes no work.	2024-01-22 11:55:44 -05:00
Takashi Kokubun	e0f7cee8c5	YJIT: Avoid doubly splitting Opnd::Value on CSel (#9617 ) YJIT: Avoid doubly splitting Opnd::Value	2024-01-19 11:51:35 -08:00
Takashi Kokubun	3c9290173a	YJIT: Optimize defined?(yield) (#9599 ) * YJIT: Optimize defined?(yield) * Remove an irrelevant comment * s/get/gen/	2024-01-19 11:00:46 -05:00
Alan Wu	bbd249e351	YJIT: Properly reject keyword splat with `yield` We don't have support for keyword splat anywhere, but we tried to compile these anyways in case of `invokeblock`. This led to bad things happening such as passing the wrong value and passing a hash into rb_yjit_array_len(), which raised in the middle of compilation. [Bug #20192]	2024-01-18 20:22:40 -05:00
Takashi Kokubun	33306a08d1	YJIT: Stop incrementing chain_depth on defer_compilation (#9597 )	2024-01-18 11:40:11 -08:00
Alan Wu	03645d1eef	YJIT: Support empty splat and some block_arg calls to ivar getters (#9567 ) These seem odd at first glance, but they're used with `...` calls with `Module#delegate` from Active Support. These account for ~3% of fallback reasons in the `lobsters` benchmark.	2024-01-17 10:52:15 -05:00
Maxime Chevalier-Boisvert	afba09d30f	YJIT: specialized codegen for integer right shift (#9564 ) * YJIT: specialized codegen for integer right shift Used in optcarrot. May also be used to write pure-Ruby gems. No overflow check or fixnum untagging required. * Update yjit/src/codegen.rs Co-authored-by: Takashi Kokubun <takashikkbn@gmail.com> --------- Co-authored-by: Takashi Kokubun <takashikkbn@gmail.com>	2024-01-17 10:35:48 -05:00
Takashi Kokubun	8642a573e6	Rename BUILTIN_ATTR_SINGLE_NOARG_INLINE to BUILTIN_ATTR_SINGLE_NOARG_LEAF The attribute was created when the other attribute was called BUILTIN_ATTR_INLINE. Now that the original attribute is renamed to BUILTIN_ATTR_LEAF, it's only confusing that we call it "_INLINE".	2024-01-16 17:31:27 -08:00
Takashi Kokubun	e37a37e696	Drop obsoleted BUILTIN_ATTR_NO_GC attribute The thing that has used this in the past was very buggy, and we've never revisited it. Let's remove it until we need it again.	2024-01-16 17:27:53 -08:00
Alan Wu	2bf9970101	YJIT: Finer breakdown for block_arg counters For example, on `lobsters` the block_arg reason was at ~10%. With this change it shows that 6% of that is `send_cfunc_block_arg`.	2024-01-16 16:54:34 -05:00
Maxime Chevalier-Boisvert	6a1bf4cf1b	YJIT: optimized codegen for String#length (#9543 ) Minor optimization for programs doing string processing.	2024-01-15 17:35:29 -05:00
Takashi Kokubun	5e61cc26c9	YJIT: Optimize Integer#succ (#9519 )	2024-01-15 10:32:48 -05:00
ywenc	16624ef463	YJIT: expandarray for non-arrays (#9495 ) * YJIT: expandarray for non-arrays Co-authored-by: John Hawthorn <john@hawthorn.email> * Skip the new test on RJIT * Increment counter for to_ary exit --------- Co-authored-by: John Hawthorn <john@hawthorn.email> Co-authored-by: Takashi Kokubun <takashikkbn@gmail.com>	2024-01-12 15:41:58 +00:00
Alan Wu	015b0e2e1d	YJIT: Fix unused warnings ``` warning: unused import: `condition::Condition` --> src/asm/arm64/arg/mod.rs:13:9 | 13 | pub use condition::Condition; | ^^^^^^^^^^^^^^^^^^^^ | = note: `#[warn(unused_imports)]` on by default warning: unused import: `rb_yjit_fix_mul_fix as rb_fix_mul_fix` --> src/cruby.rs:188:9 | 188 | pub use rb_yjit_fix_mul_fix as rb_fix_mul_fix; | ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ warning: unused import: `rb_insn_len as raw_insn_len` --> src/cruby.rs:142:9 | 142 | pub use rb_insn_len as raw_insn_len; | ^^^^^^^^^^^^^^^^^^^^^^^^^^^ | = note: `#[warn(unused_imports)]` on by default ``` Make asm public so it stops warning about unused public stuff in there.	2024-01-10 13:19:15 -05:00
Takashi Kokubun	a0eecfb5ba	YJIT: Fallback Integer#<< if a shift amount varies (#9426 ) * YJIT: Fallback Integer#<< if a shift amount varies * YJIT: Do not fallback lshift in the first chain	2024-01-08 17:34:57 +00:00
Takashi Kokubun	7f9c174102	YJIT: Let RubyVM::YJIT.enable respect --yjit-stats (#9415 )	2024-01-05 14:08:57 -05:00
Hiroshi SHIBATA	863ded45a1	Typofix under bootstraptest, spec and yjit directories	2023-12-25 13:50:23 +09:00
Takashi Kokubun	bd91c5127f	YJIT: Add stats option to RubyVM::YJIT.enable (#9297 )	2023-12-19 11:47:27 -08:00
Takashi Kokubun	b266890dab	YJIT: Add --yjit-disable to help and reorder it (#9230 )	2023-12-13 13:29:37 -08:00
Alan Wu	d4bbee7475	YJIT: Fix off-by-one in Kernel#send type handling (#9212 ) Previously, if the method ID argument happened to be one below the top of the stack, we didn't overwrite the type of the stack slot, which left an incorrect type for the stack slot. The included script tripped asserts both with and without --yjit-verify-ctx.	2023-12-12 22:10:14 +00:00
Alan Wu	9cb0ad863c	YJIT: Fix missing arity check for splat calls to methods with optionals Previously, for splat callsites that land in methods with optional parameters, we didn't reject the case where the caller supplies too many arguments. Accepting those calls previously caused YJIT to construct corrupted control frames, which leads to crashes if the callee uses certain stack walking methods such as Kernel#raise and String#gsub (for setting up the frame-local `$~`). Example crash in a debug build: Assertion Failed: ../vm_core.h:1375:VM_ENV_FLAGS:FIXNUM_P(flags)	2023-12-11 19:21:08 -05:00
Alan Wu	4755309474	YJIT: Rename helper function and correct counter name Counter::guard_send_iseq_has_rest_and_splat_not_equal was using jump-if-lesser-than, so wasn't checking for equality. Rename function because moving is destructive in Rust, which is confusing for this function which doesn't modify the array.	2023-12-11 19:21:08 -05:00
Jeremy Evans	a950f23078	Ensure f(**kw, &block) calls kw.to_hash before block.to_proc Previously, block.to_proc was called first, by vm_caller_setup_arg_block. kw.to_hash was called later inside CALLER_SETUP_ARG or setup_parameters_complex. This adds a splatkw instruction that is inserted before sends with ARGS_BLOCKARG and KW_SPLAT and without KW_SPLAT_MUT. This is not needed in the KW_SPLAT_MUT case, because then you know the value is a hash, and you don't need to call to_hash on it. The splatkw instruction checks whether the second to top block is a hash, and if not, replaces it with the value of calling to_hash on it (using rb_to_hash_type). As it is always before a send with ARGS_BLOCKARG and KW_SPLAT, second to top is the keyword splat, and top is the passed block.	2023-12-09 13:15:47 -08:00
Takashi Kokubun	ac5fd58700	YJIT: Fix on-stack ISEQ comparison for auto_compact (#9164 )	2023-12-07 22:53:05 +00:00
Alan Wu	9d9865d9bc	YJIT: Add some object validity assertions We've seen quite a few compaction bugs lately, and these assertions should give clearer symptoms. We only call class_of() on objects that the Ruby code can see.	2023-12-06 16:42:53 -05:00
Takashi Kokubun	a439fc2f17	YJIT: Avoid register allocation conflict with a higher stack_idx (#9143 ) YJIT: Avoid register allocation conflict with a higher stack_idx	2023-12-06 15:19:43 -05:00
Alan Wu	a063969ec1	YJIT: Assert code pages are not partially in-bounds Helps understand page switching	2023-12-05 13:20:06 -05:00
Alan Wu	695e5c179e	YJIT: Simplify code page switching logic, remove an assert We have received a report of `assert!( !cb.has_dropped_bytes())` in set_page() failing. The only explanation for this seems to be memory allocation failing in write_byte(). The if condition implies that `current_write_pos < dst_pos < mem_size`, which rules out failing to encode the relative jump. The has_capacity() assert above not tripping implies that we were in a place in the page where write_byte() did attempt to write the byte and potentially made a syscall in the process. Remove the assert, since memory allocation could fail. Also, return failure if the destination is outside of the code region to detect that out-of-memory situation quicker.	2023-12-05 13:20:06 -05:00
Alan Wu	b5a62eb9ab	YJIT: Mark and update stubs in invalidated blocks (#9104 ) Like in the example given in delayed_deallocation(), stubs can be hit even if the block housing it is invalidated. Mark them so we don't work with invalidated ISeqs when hitting these stubs.	2023-12-04 10:13:40 -05:00
Takashi Kokubun	ba1cdadfc8	YJIT: Cancel on-stack jit_return on invalidation (#9086 ) * YJIT: Cancel on-stack jit_return on invalidation Co-authored-by: Alan Wu <alansi.xingwu@shopify.com> * Use RUBY_VM_CONTROL_FRAME_STACK_OVERFLOW_P --------- Co-authored-by: Alan Wu <alansi.xingwu@shopify.com>	2023-11-30 21:35:55 -05:00
Maxime Chevalier-Boisvert	73e6d8a0d0	YJIT: optimized codegen for `rb_ary_length()` (#9085 ) YJIT: optimized codegen for rb_ary_length()	2023-11-30 17:14:53 -05:00
Takashi Kokubun	d048bae96b	YJIT: Bump ec->cfp after setting cfp->jit_return (#9072 )	2023-11-30 11:23:25 -05:00
Maxime Chevalier-Boisvert	8d1138c1cf	YJIT: edit `yjit.md` and bring it up to date (#9068 ) Also make various minor edits to improve readability.	2023-11-29 18:09:55 -05:00
Maxime Chevalier-Boisvert	ea3e17e430	YJIT: fix bug in top cfunc logging in `--yjit-stats` (#9056 ) YJIT: correctly handle case where there are no cfunc calls Fix bug in top cfunc logging in `--yjit-stats`	2023-11-28 22:27:11 +00:00
Maxime Chevalier-Boisvert	6310522a9a	YJIT: reduce default exec-mem-size to 64MiB (#9054 )	2023-11-28 20:04:33 +00:00
Takashi Kokubun	476a231e7e	YJIT: Assert no patch overlap on pos_marker (#9048 )	2023-11-28 10:41:14 -05:00
Maxime Chevalier-Boisvert	7f50c70574	YJIT: add top C function call counts to `--yjit-stats` (#9047 ) * YJIT: gather call counts for individual cfuncs Co-authored by Takashi Kokubun	2023-11-27 22:49:53 +00:00
Maxime Chevalier-Boisvert	f05d586cc9	YJIT: record `num_send_cfunc` stat (#9022 ) * YJIT: record num_send_cfunc stat Also report num_send_known_cfunc as percentage of num_send_cfunc * Rename num_send_known_cfunc => num_send_cfunc_inline Name seems more descriptive of what we do with our custom codegen	2023-11-23 15:33:43 -05:00
Takashi Kokubun	440b59db10	YJIT: Apply patches ignoring page_end_reserve (#9015 )	2023-11-23 10:53:12 -05:00
Takashi Kokubun	95369ac0a3	YJIT: Fix jmp_ptr_bytes on x86_64 (#9016 )	2023-11-23 10:50:42 -05:00
Takashi Kokubun	926bfc3bc0	YJIT: Avoid a register spill on arm64 (#9014 )	2023-11-22 15:13:32 -08:00
Takashi Kokubun	95d4a52b3a	YJIT: Skip dump-disasm if it fails to create a file (#8968 )	2023-11-21 10:28:45 -05:00
Aaron Patterson	6fce8c7980	Don't try compacting ivars on Classes that are "too complex" Too complex classes use a hash table to store ivs, and should always pin their IVs. We shouldn't touch those classes in compaction.	2023-11-20 16:09:48 -08:00
Takashi Kokubun	fa547cd702	YJIT: Print a disasm path to stderr (#8967 ) YJIT: Print a perf map path to stderr	2023-11-20 14:51:54 -08:00
Maxime Chevalier-Boisvert	f9628fb4be	YJIT: make --yjit-max-versions=N option undocumented (#8962 ) Not useful for the vast majority of end users to change this option.	2023-11-20 21:26:18 +00:00
Maxime Chevalier-Boisvert	c552a5f7b0	YJIT: shrink `Context` down to 15 bytes (#8911 ) * WIP context refactoring * Refactor to remove Context.temp_mapping	2023-11-13 17:14:24 -05:00
Alan Wu	f5fa90fe0b	YJIT: Fix `clippy::useless_vec` in a test	2023-11-10 16:55:56 -05:00
Alan Wu	8dfbfa15f6	YJIT: Take cargo --fix for unnecessary calls to into()	2023-11-10 16:55:56 -05:00
Alan Wu	408d5886cf	YJIT: Auto fix for clippy::unnecessary_cast	2023-11-10 16:55:56 -05:00
Alan Wu	0a93ea4808	YJIT: Auto fix for clippy::clone_on_copy	2023-11-10 16:55:56 -05:00
Alan Wu	b8eb4bd4ce	YJIT: Panic with more info when global invalidation patching fails	2023-11-10 11:51:05 -05:00
Alan Wu	38fe710e08	YJIT: Invoke PosMarker callbacks only with solid positions Previously, PosMarker callbacks ran even when the assembler failed to assemble its contents due to insufficient space. This was problematic because when Assembler::compile() failed, the callbacks were given positions that have no valid code, contrary to general expectation. For example, we use a PosMarker callback to record VM instruction boundaries and patch in jumps to exits in case the guest program starts tracing, however, previously, we could record a location near the end of the code block, where there is no space to patch in jumps. I suspect this is the cause of the recent occurrences of rare random failures on GitHub Actions with the invariants.rs:529 "can rewrite existing code" message. `--yjit-perf` also uses PosMarker and had a similar issue. Buffer the list of callbacks to fire, and only fire them when all code in the assembler is written out successfully. It's more intuitive this way.	2023-11-10 11:51:05 -05:00
Edwin Garcia	1ee6968855	YJIT: Fix comment typos [ci skip]	2023-11-10 09:47:56 -05:00
Takashi Kokubun	50402db5a7	YJIT: Disable code GC (#8865 ) Co-authored-by: Alan Wu <alansi.xingwu@shopify.com> Co-authored-by: Maxime Chevalier-Boisvert <maxime.chevalierboisvert@shopify.com>	2023-11-08 10:21:04 -05:00
Jean Boussier	d898e8d6f8	Refactor rb_shape_transition_shape_capa out Right now the `rb_shape_get_next` shape caller needs to first check if there is capacity left, and if not call `rb_shape_transition_shape_capa` before it can call `rb_shape_get_next`. And on each of these it needs to check if we got a TOO_COMPLEX back. All this logic is duplicated in the interpreter, YJIT and RJIT. Instead we can have `rb_shape_get_next` do the capacity transition when needed. The caller can compare the old and new shapes' capacity to know if resizing is needed. It also can check for TOO_COMPLEX only once.	2023-11-08 11:02:55 +01:00
Alan Wu	7367336c4e	YJIT: Fix assert in OOM scenario We still need to do `jit.record_boundary_patch_point = false` when gen_outlined_exit() returns `None` and we return with `?`. Previously, we tripped the assert at codegen.rs:1042. Found with `--yjit-exec-mem-size=3` on the lobsters benchmark. Co-authored-by: Takashi Kokubun <takashikkbn@gmail.com> Co-authored-by: Maxime Chevalier-Boisvert <maxime.chevalierboisvert@shopify.com>	2023-11-07 18:50:10 -05:00
Alan Wu	a1c61f0ae5	YJIT: Use u32 for CodePtr to save 4 bytes each We've long had a size restriction on the code memory region such that a u32 could refer to everything. This commit capitalizes on this restriction by shrinking the size of `CodePtr` to be 4 bytes from 8. To derive a full raw pointer from a `CodePtr`, one needs a base pointer. Both `CodeBlock` and `VirtualMemory` can be used for this purpose. The base pointer is readily available everywhere, except for in the case of the `jit_return` "branch". Generalize lea_label() to lea_jump_target() in the IR to delay deriving the `jit_return` address until `compile()`, when the base pointer is available. On railsbench, this yields roughly a 1% reduction to `yjit_alloc_size` (58,397,765 to 57,742,248).	2023-11-07 17:43:43 -05:00
Takashi Kokubun	9877f3ada8	YJIT: Inline basic Ruby methods (#8855 ) * YJIT: Inline basic Ruby methods * YJIT: Fix "InsnOut operand made it past register allocation" checktype should not generate a useless instruction.	2023-11-07 10:54:33 -05:00
Jean byroot Boussier	a294bb844c	YJIT: handle out of shape situation in gen_setinstancevariable (#8857 ) If the VM ran out of shape, `rb_shape_transition_shape_capa` might return `OBJ_TOO_COMPLEX_SHAPE`. Co-authored-by: Jean Boussier <byroot@ruby-lang.org>	2023-11-07 10:49:36 -05:00
Maxime Chevalier-Boisvert	6e38076b16	YJIT: implement two-step call threshold (#8839 ) * YJIT: implement two-step call threshold Automatically switch call threshold to a larger value for larger, production-sized apps, while still allowing smaller apps and command-line programs to start with a lower threshold. * Update yjit/src/options.rs Co-authored-by: Alan Wu <XrXr@users.noreply.github.com> * Make the new variables constants * Check that a custom call threshold was not specified --------- Co-authored-by: Alan Wu <XrXr@users.noreply.github.com>	2023-11-03 21:07:13 +00:00
Alan Wu	38bdb9d0da	YJIT: Delete some dead code and enable lints	2023-11-03 18:47:41 +00:00
Takashi Kokubun	ad4f973ecd	YJIT: Always define method codegen table at boot (#8807 )	2023-11-02 09:15:48 -07:00
Aaron Patterson	dc81432482	updating bindgen	2023-10-24 10:52:06 -07:00
Takashi Kokubun	8a88172fd4	YJIT: Skip printing stats at exit if --yjit-disable (#8727 )	2023-10-20 13:12:38 -07:00
Maxime Chevalier-Boisvert	3e65115cef	YJIT: remove unused `--yjit-greedy-versioning` command-line option (#8713 )	2023-10-19 19:29:31 +00:00
Alan Wu	cdc2a18541	YJIT: Return Option from asm.compile() for has_dropped_bytes() So that we get a reminder to check CodeBlock::has_dropped_bytes(). Internally, asm.compile() already checks it, and this patch just propagates it out to the caller with a `#[must_use]`. Code GC logic moved out one level in entry_stub_hit(), so the body can freely use `?`	2023-10-19 14:56:35 -04:00
Takashi Kokubun	6beb09c2c9	YJIT: Add RubyVM::YJIT.enable (#8705 )	2023-10-19 10:54:35 -07:00
Alan Wu	9d9aa63e82	YJIT: Enable the dead_code lint and delete some dead code	2023-10-19 11:50:36 -04:00
Alan Wu	d2b0c9da2e	YJIT: Add a live ISeq counter It's an estimator for application size and could be used as a compilation heuristic later. Co-authored-by: Maxime Chevalier-Boisvert <maxime.chevalierboisvert@shopify.com> Co-authored-by: Takashi Kokubun <takashikkbn@gmail.com>	2023-10-18 23:46:35 +00:00
Katherine Oelsner	a7032b80af	Revert "shape.h: Make attr_index_t uint8_t" This reverts commit `e3afc212ec`.	2023-10-18 15:01:13 -07:00
Takashi Kokubun	f51b92fe23	YJIT: Add --yjit-perf (#8697 ) Co-authored-by: Alan Wu <alansi.xingwu@shopify.com>	2023-10-18 21:07:03 +00:00
dependabot[bot]	58ee088bb3	Bump rustix from 0.38.2 to 0.38.19 in /yjit/bindgen Bumps [rustix](https://github.com/bytecodealliance/rustix) from 0.38.2 to 0.38.19. - [Release notes](https://github.com/bytecodealliance/rustix/releases) - [Commits](https://github.com/bytecodealliance/rustix/compare/v0.38.2...v0.38.19) --- updated-dependencies: - dependency-name: rustix dependency-type: indirect ... Signed-off-by: dependabot[bot] <support@github.com>	2023-10-18 12:34:10 -07:00
Alan Wu	36ee5d8ca8	YJIT: Fix clippy::redundant_locals > note: `#[deny(clippy::redundant_locals)]` on by default On Rust 1.73.0.	2023-10-17 18:36:23 -04:00
Alan Wu	2abd061e8b	YJIT: Remove call to compile() on empty Assembler	2023-10-17 15:24:55 -04:00
Takashi Kokubun	d458b4127f	YJIT: Add a few missing counters for send fallback (#8681 )	2023-10-17 11:36:13 -04:00
Alan Wu	1f7234c015	YJIT: Lookup IDs on boot instead of binding to them Previously, the version-controlled `cruby_bindings.inc.rs` file contained the build-time artifact `id.h`, which nobu mentioned hinders the goal of having fewer magic numbers in the repository. Lookup the IDs YJIT needs on boot. It costs cycles, but it's fine since YJIT only uses a handful of IDs at the moment. No perceptible degradation to boot time found in my testing.	2023-10-17 11:30:35 -04:00
Takashi Kokubun	48c6cea330	YJIT: Count cold_iseq_entry by default (#8654 )	2023-10-14 21:00:50 -07:00
Takashi Kokubun	5808999d30	YJIT: Fallback opt_getconstant_path for const_missing (#8623 ) * YJIT: Fallback opt_getconstant_path for const_missing * Fix a comment [ci skip] * Remove a wrapper function	2023-10-13 08:52:23 -07:00
Alan Wu	0bf1749e9f	YJIT: Fix argument clobbering in some block_arg+rest_param calls (#8647 ) Previously, for block argument callsites with some specific argument count and callee local variable count combinations, YJIT ended up writing over arguments that are supposed to be collected into a rest parameter array unmodified. Detect when clobbering would happen and avoid it. Also, place the block handler after the stack overflow check, since it writes to new stack space. Reported-by: Takashi Kokubun <takashikkbn@gmail.com>	2023-10-13 10:41:53 -04:00
Maxime Chevalier-Boisvert	b2e1ddffa5	YJIT: port call threshold logic from Rust to C for performance (#8628 ) * Port call threshold logic from Rust to C for performance * Prefix global/field names with yjit_ * Fix linker error * Fix preprocessor condition for rb_yjit_threshold_hit * Fix third linker issue * Exclude yjit_calls_at_interv from RJIT bindgen --------- Co-authored-by: Takashi Kokubun <takashikkbn@gmail.com>	2023-10-12 10:05:34 -04:00
Jean Boussier	e3afc212ec	shape.h: Make attr_index_t uint8_t Given `SHAPE_MAX_NUM_IVS 80`, we transition to TOO_COMPLEX way before we could overflow an 8-bit counter. This reduces the size of `rb_shape_t` from 32B to 24B. If we decide to raise `SHAPE_MAX_NUM_IVS` we can always increase that type again.	2023-10-11 08:33:09 +02:00
Takashi Kokubun	4d0a619f71	YJIT: Allow --yjit-trace-exits on release builds (#8619 )	2023-10-10 14:54:19 -07:00
Jean Boussier	5cc44f48c5	Refactor rb_shape_transition_shape_capa to not accept capacity This way the growth factor is encapsulated, which allows rb_shape_transition_shape_capa to be smarter about ideal sizes.	2023-10-10 14:47:54 +02:00
Alan Wu	8cb906d706	YJIT: Simplify linker flavor check By not allowlisting every OS that could have a GNU flavor linker, it's also a bigger tent	2023-10-05 17:31:54 -04:00
Maxime Chevalier-Boisvert	13bde94a9f	YJIT: shrink local types from 8 to 4 bytes (#8597 ) * Shrink local types from 8 to 4 bytes, context from 21 to 17 bytes Use repr(packed) * Add comment about Type being limited to 4 bits	2023-10-05 17:17:33 -04:00
Alan Wu	7d1abd5d31	YJIT: Run bindgen	2023-10-05 17:09:18 -04:00
Alan Wu	07a7c4bdaf	YJIT: Remove duplicate cfp->iseq accessor	2023-10-05 16:40:27 -04:00
Alan Wu	41a6e4bdf9	YJIT: Avoid writing return value to memory in `leave` Previously, at the end of `leave` we did `*caller_cfp->sp = return_value`, like the interpreter. With future changes that leaves the SP field uninitialized for C frames, this will become problematic. For cases like returning from `rb_funcall()`, the return value was written above the stack and never read anyway (callers use the copy in the return register). Leave the return value in a register at the end of `leave` and have the code at `cfp->jit_return` decide what to do with it. This avoids the unnecessary memory write mentioned above. For JIT-to-JIT returns, it goes through `asm.stack_push()` and benefits from register allocation for stack temporaries. Mostly flat on benchmarks, with maybe some marginal speed improvements. Co-authored-by: Takashi Kokubun <takashikkbn@gmail.com>	2023-10-05 15:53:05 -04:00
Nobuyoshi Nakada	cf1223348a	Remove unmatched parenthesis ``` yjit/yjit.mk:48: Extraneous text after `ifneq' directive ```	2023-10-06 03:39:33 +09:00
Thomas Hurst	fd22d5debb	YJIT: Build on BSD platforms with GNU make (#8591 )	2023-10-05 13:23:51 -04:00
Takashi Kokubun	f834b1a40d	YJIT: Add a comment for YJIT_OPTIONS [ci skip]	2023-10-04 10:28:32 -07:00
Takashi Kokubun	7db6f448ed	Use a better variable name for w	2023-10-04 10:25:46 -07:00
Takashi Kokubun	577ff858bc	Make the function names consistent	2023-10-04 10:25:46 -07:00
Takashi Kokubun	01c462ce6a	YJIT: Move help descriptions to options.rs	2023-10-04 10:25:46 -07:00
Takashi Kokubun	465bc682a2	YJIT: Call mprotect after entry stub failure (#8582 ) Co-authored-by: Alan Wu <XrXr@users.noreply.github.com>	2023-10-03 22:39:45 -07:00
Maxime Chevalier-Boisvert	ea491802fa	YJIT: add heuristic to avoid compiling cold ISEQs (#8522 ) * YJIT: Add counter to measure how often we compile "cold" ISEQs (#535) Fix counter name in DEFAULT_COUNTERS YJIT: add --yjit-cold-threshold, don't compile cold ISEQs YJIT: increase default cold threshold to 200_000 Remove rb_yjit_call_threshold() Remove conflict markers Fix compilation errors Threshold 1 should compile immediately Debug deadlock issue with test_ractor Fix call threshold issue with tests * Revert exception threshold logic. Document option in yjid.md * (void) for 0 parameter functions in C99 * Rename iseq_entry_cold => cold_iseq_entry * Document --yjit-cold-threshold in ruby.c * Update doc/yjit/yjit.md Co-authored-by: Jean byroot Boussier <jean.boussier+github@shopify.com> * Shorten help string to appease test * Address bug found by Kokubun. Reorder logic. --------- Co-authored-by: Alan Wu <XrXr@users.noreply.github.com> Co-authored-by: Jean byroot Boussier <jean.boussier+github@shopify.com>	2023-10-03 17:45:46 -04:00
Takashi Kokubun	1992aef722	YJIT: Stop spilling temps on jit_prepare_routine_call (#8581 ) YJIT: Remove spill_temps from jit_prepare_routine_call	2023-10-03 12:54:24 -04:00
Takashi Kokubun	0b67e3fd3e	YJIT: Chain-guard opt_mult overflow (#8554 ) * YJIT: Chain-guard opt_mult overflow * YJIT: Support regenerating Jo after Mul	2023-09-29 21:55:48 -04:00
Takashi Kokubun	0d4c4b6573	YJIT: Use registers for passing C method arguments (#8538 )	2023-09-29 10:08:15 -04:00


975 commits