github/ruby - ruby

Граф коммитов

Автор	SHA1	Сообщение	Дата
Jean byroot Boussier	c19defd026	Revert "Only emit circular dependency warning for owned thread shields" This reverts commit `fa49651e05`.	2023-02-06 23:30:35 +01:00
Jean Boussier	fa49651e05	Only emit circular dependency warning for owned thread shields [Bug #19415] If multiple threads attemps to load the same file concurrently it's not a circular dependency issue. So we check that the existing ThreadShield is owner by the current fiber before warning about circular dependencies.	2023-02-06 19:35:38 +01:00
Koichi Sasada	0a82bfe5e1	use correct svar (#7225 ) * use correct svar Without this patch, svar location is used "nearest Ruby frame". It is almost correct but it doesn't correct when the `each` method is written in Ruby. ```ruby class C include Enumerable def each %w(bar baz).each{\|e\| yield e} end end C.new.grep(/(b.)/){\|e\| p [$1, e]} ``` This patch fix this issue by traversing ifunc's cfp. Note that if cfp doesn't specify this Thread's cfp stack, reserved svar location (`ec->root_svar`) is used. * make yjit-bindgen --------- Co-authored-by: Takashi Kokubun <takashikkbn@gmail.com>	2023-02-01 16:13:19 -08:00
Peter Zhu	2296b877d8	Remove rb_hash_st_table It's a duplicate of RHASH_ST_TABLE.	2023-01-31 15:48:09 -05:00
Peter Zhu	2866f951c5	Remove rb_hash_ar_table It's dead code and duplicate of RHASH_AR_TABLE.	2023-01-31 15:48:09 -05:00
Kunshan Wang	de724487f0	Copying GC support for EXIVAR Instance variables held in gen_ivtbl are marked with rb_gc_mark. It prevents the referenced objects from moving, which is bad for copying garbage collectors. This commit allows those instance variables to be updated during gc_update_object_references.	2023-01-31 09:24:26 -05:00
Peter Zhu	4e037075ae	Fix typo in RHASH_ST_CLEAR We should be setting as.st and not as.ar.	2023-01-20 15:58:17 -05:00
Nobuyoshi Nakada	15328302dc	In `UNALIGNED_MEMBER_PTR` cast through `void` pointer Suppress warnings shown even with `-Waddress-of-packed-member` disabled in gcc 11.	2023-01-21 00:30:34 +09:00
Peter Zhu	41bf2354e3	Add rb_gc_mark_and_move and implement on iseq This commit adds rb_gc_mark_and_move which takes a pointer to an object and marks it during marking phase and updates references during compaction. This allows for marking and reference updating to be combined into a single function, which reduces code duplication and prevents bugs if marking and reference updating goes out of sync. This commit also implements rb_gc_mark_and_move on iseq as an example.	2023-01-19 11:23:35 -05:00
Peter Zhu	4fa7d38324	Don't redefine RB_OBJ_WRITE RB_OBJ_WRITE already exists in rgengc.h, so we shouldn't redefine it in gc.h.	2023-01-18 08:49:32 -05:00
Peter Zhu	abff5f6203	Move classpath to rb_classext_t This commit moves the classpath (and tmp_classpath) from instance variables to the rb_classext_t. This improves performance as we no longer need to set an instance variable when assigning a classpath to a class. I benchmarked with the following script: ```ruby name = :MyClass puts(Benchmark.measure do 10_000_000.times do \|i\| Object.const_set(name, Class.new) Object.send(:remove_const, name) end end) ``` Before this patch: ``` 5.440119 0.025264 5.465383 ( 5.467105) ``` After this patch: ``` 4.889646 0.028325 4.917971 ( 4.942678) ```	2023-01-11 11:06:58 -05:00
Peter Zhu	d86833e717	Make variation_count an unsigned char Since SHAPE_MAX_VARIATIONS is 8, it can easily fit inside an unsigned char.	2023-01-11 11:06:58 -05:00
John Hawthorn	fbaa5db44a	Use a BOP for Hash#default On a hash miss we need to call default if it is redefined in order to return the default value to be used. Previously we checked this with rb_method_basic_definition_p, which avoids the method call but requires a method lookup. This commit replaces the previous check with BASIC_OP_UNREDEFINED_P and a new BOP_DEFAULT. We still need to fall back to rb_method_basic_definition_p when called on a subclasss of hash. \| \|compare-ruby\|built-ruby\| \|:---------------\|-----------:\|---------:\| \|hash_aref_miss \| 2.692\| 3.531\| \| \| -\| 1.31x\| Co-authored-by: Daniel Colson <danieljamescolson@gmail.com> Co-authored-by: "Ian C. Anderson" <ian@iancanderson.com> Co-authored-by: Jack McCracken <me@jackmc.xyz>	2022-12-17 14:51:49 -08:00
Peter Zhu	c505448cdb	Move definition of SIZE_POOL_COUNT back to gc.h SIZE_POOL_COUNT is a GC macro, it should belong in gc.h and not shape.h. SIZE_POOL_COUNT doesn't depend on shape.h so we can have shape.h depend on gc.h. Co-Authored-By: Matt Valentine-House <matt@eightbitraptor.com>	2022-12-15 16:33:46 -05:00
Jemma Issroff	c1ab6ddc9a	Transition complex objects to "too complex" shape When an object becomes "too complex" (in other words it has too many variations in the shape tree), we transition it to use a "too complex" shape and use a hash for storing instance variables. Without this patch, there were rare cases where shape tree growth could "explode" and cause performance degradation on what would otherwise have been cached fast paths. This patch puts a limit on shape tree growth, and gracefully degrades in the rare case where there could be a factorial growth in the shape tree. For example: ```ruby class NG; end HUGE_NUMBER.times do NG.new.instance_variable_set(:"@unique_ivar_#{_1}", 1) end ``` We consider objects to be "too complex" when the object's class has more than SHAPE_MAX_VARIATIONS (currently 8) leaf nodes in the shape tree and the object introduces a new variation (a new leaf node) associated with that class. For example, new variations on instances of the following class would be considered "too complex" because those instances create more than 8 leaves in the shape tree: ```ruby class Foo; end 9.times { Foo.new.instance_variable_set(":@uniq_#{_1}", 1) } ``` However, the following class is not too complex because it only has one leaf in the shape tree: ```ruby class Foo def initialize @a = @b = @c = @d = @e = @f = @g = @h = @i = nil end end 9.times { Foo.new } `` This case is rare, so we don't expect this change to impact performance of most applications, but it needs to be handled. Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org>	2022-12-15 10:06:04 -08:00
Jemma Issroff	a3d552aedd	Add variation_count on classes Count how many "variations" each class creates. A "variation" is a a unique ordering of instance variables on a particular class. This can also be thought of as a branch in the shape tree. For example, the following Foo class will have 2 variations: ```ruby class Foo ; end Foo.new.instance_variable_set(:@a, 1) # case 1: creates one variation Foo.new.instance_variable_set(:@b, 1) # case 2: creates another variation foo = Foo.new foo.instance_variable_set(:@a, 1) # does not create a new variation foo.instance_variable_set(:@b, 1) # does not create a new variation (a continuation of the variation in case 1) ``` We will use this number to limit the amount of shapes that a class can create and fallback to using a hash iv lookup. Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org>	2022-12-15 10:06:04 -08:00
John Hawthorn	43f9351177	Fix parens on LIKELY in basic operators We want to hint to the compiler that it's likely that the BOP is unredefined (the bit is 0). Previously we were accidentally hinting to the compiler that it was non-zero due to a misplaced parenthesis.	2022-12-12 14:05:43 -08:00
Daniel Colson	e69b91fae4	Introduce BOP_CMP for optimized comparison Prior to this commit the `OPTIMIZED_CMP` macro relied on a method lookup to determine whether `<=>` was overridden. The result of the lookup was cached, but only for the duration of the specific method that initialized the cmp_opt_data cache structure. With this method lookup, `[x,y].max` is slower than doing `x > y ? x : y` even though there's an optimized instruction for "new array max". (John noticed somebody a proposed micro-optimization based on this fact in https://github.com/mastodon/mastodon/pull/19903.) ```rb a, b = 1, 2 Benchmark.ips do \|bm\| bm.report('conditional') { a > b ? a : b } bm.report('method') { [a, b].max } bm.compare! end ``` Before: ``` Comparison: conditional: 22603733.2 i/s method: 19820412.7 i/s - 1.14x (± 0.00) slower ``` This commit replaces the method lookup with a new CMP basic op, which gives the examples above equivalent performance. After: ``` Comparison: method: 24022466.5 i/s conditional: 23851094.2 i/s - same-ish: difference falls within error ``` Relevant benchmarks show an improvement to Array#max and Array#min when not using the optimized newarray_max instruction as well. They are noticeably faster for small arrays with the relevant types, and the same or maybe a touch faster on larger arrays. ``` $ make benchmark COMPARE_RUBY=<master@5958c305> ITEM=array_min $ make benchmark COMPARE_RUBY=<master@5958c305> ITEM=array_max ``` The benchmarks added in this commit also look generally improved. Co-authored-by: John Hawthorn <jhawthorn@github.com>	2022-12-06 12:37:23 -08:00
Daniel Colson	c43951e60e	Move BOP macros to separate file This commit moves ruby_basic_operators and the unredefined macros out of vm_core.h and into basic_operators.h so that we can use them more broadly in places where we currently use a method look up via `rb_method_basic_definition_p` (e.g. object.c, numeric.c, complex.c, enum.c, but also in internal/compar.h after introducing BOP_CMP and elsewhere if we introduce more BOPs) The most controversial part of this change is probably moving redefined_flag out of rb_vm_t. [vm_opt_method_def_table and vm_opt_mid_table](`9da2a5204f/vm.c`) are not part of rb_vm_t either, and I think this fits well with those. But more significantly it seems to result in one fewer instruction. For example: Before: ``` (lldb) disassemble -n vm_opt_str_freeze miniruby`vm_exec_core: miniruby[0x10028233e] <+14558>: movq 0x11a86b(%rip), %rax ; ruby_current_vm_ptr miniruby[0x100282345] <+14565>: testb $0x4, 0x242c(%rax) ``` After: ``` (lldb) disassemble -n vm_opt_str_freeze ruby`vm_exec_core: ruby[0x100280ebe] <+14510>: testb $0x4, 0x120147(%rip) ; ruby_vm_redefined_flag + 43 ``` Co-authored-by: John Hawthorn <jhawthorn@github.com>	2022-12-06 12:37:23 -08:00
S-H-GAMELINKS	914cf26d6e	parenthesize to macro	2022-12-02 01:31:27 +09:00
S-H-GAMELINKS	1a64d45c67	Introduce encoding check macro	2022-12-02 01:31:27 +09:00
Samuel Williams	0436f1e15a	Introduce `Fiber#storage` for inheritable fiber-scoped variables. (#6612 )	2022-12-01 23:00:33 +13:00
Peter Zhu	648927d71b	Refactor obj_ivar_set and vm_setivar obj_ivar_set and vm_setivar_slowpath is essentially doing the same thing, but the code is duplicated and not quite implemented in the same way, which could cause bugs. This commit refactors vm_setivar_slowpath to use obj_ivar_set.	2022-11-21 09:58:53 -05:00
yui-knk	d8601621ed	Enhance keep_tokens option for RubyVM::AbstractSyntaxTree parsing methods Implementation for Language Server Protocol (LSP) sometimes needs token information. For example both `m(1)` and `m(1, )` has same AST structure other than node locations then it's impossible to check the existence of `,` from AST. However in later case, it might be better to suggest variables list for the second argument. Token information is important for such case. This commit adds these methods. * Add `keep_tokens` option for `RubyVM::AbstractSyntaxTree.parse`, `.parse_file` and `.of` * Add `RubyVM::AbstractSyntaxTree::Node#tokens` which returns tokens for the node including tokens for descendants nodes. * Add `RubyVM::AbstractSyntaxTree::Node#all_tokens` which returns all tokens for the input script regardless the receiver node. [Feature #19070] Impacts on memory usage and performance are below: Memory usage: ``` $ cat test.rb root = RubyVM::AbstractSyntaxTree.parse_file(File.expand_path('../test/ruby/test_keyword.rb', __FILE__), keep_tokens: true) $ /usr/bin/time -f %Mkb /usr/local/bin/ruby -v ruby 3.2.0dev (2022-11-19T09:41:54Z 19070-keep_tokens d3af1b8057) [x86_64-linux] 11408kb # keep_tokens :false $ /usr/bin/time -f %Mkb /usr/local/bin/ruby test.rb 17508kb # keep_tokens :true $ /usr/bin/time -f %Mkb /usr/local/bin/ruby test.rb 30960kb ``` Performance: ``` $ cat ../ast_keep_tokens.yml prelude: \| src = <<~SRC module M class C def m1(a, b) 1 + a + b end end end SRC benchmark: without_keep_tokens: \| RubyVM::AbstractSyntaxTree.parse(src, keep_tokens: false) with_keep_tokens: \| RubyVM::AbstractSyntaxTree.parse(src, keep_tokens: true) $ make benchmark COMPARE_RUBY="./ruby" ARGS=../ast_keep_tokens.yml /home/kaneko.y/.rbenv/shims/ruby --disable=gems -rrubygems -I../benchmark/lib ../benchmark/benchmark-driver/exe/benchmark-driver \ --executables="compare-ruby::./ruby -I.ext/common --disable-gem" \ --executables="built-ruby::./miniruby -I../lib -I. -I.ext/common ../tool/runruby.rb --extout=.ext -- --disable-gems --disable-gem" \ --output=markdown --output-compare -v ../ast_keep_tokens.yml compare-ruby: ruby 3.2.0dev (2022-11-19T09:41:54Z 19070-keep_tokens d3af1b8057) [x86_64-linux] built-ruby: ruby 3.2.0dev (2022-11-19T09:41:54Z 19070-keep_tokens d3af1b8057) [x86_64-linux] warming up.. \| \|compare-ruby\|built-ruby\| \|:--------------------\|-----------:\|---------:\| \|without_keep_tokens \| 21.659k\| 21.303k\| \| \| 1.02x\| -\| \|with_keep_tokens \| 6.220k\| 5.691k\| \| \| 1.09x\| -\| ```	2022-11-21 09:01:34 +09:00
Jemma Issroff	5246f4027e	Transition shape when object's capacity changes This commit adds a `capacity` field to shapes, and adds shape transitions whenever an object's capacity changes. Objects which are allocated out of a bigger size pool will also make a transition from the root shape to the shape with the correct capacity for their size pool when they are allocated. This commit will allow us to remove numiv from objects completely, and will also mean we can guarantee that if two objects share shapes, their IVs are in the same positions (an embedded and extended object cannot share shapes). This will enable us to implement ivar sets in YJIT using object shapes. Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org>	2022-11-10 10:11:34 -05:00
Peter Zhu	4a8cd9e8bc	Use shared flags of the type The ELTS_SHARED flag is generic, so we should prefer to use the flags specific of the type (STR_SHARED for strings and RARRAY_SHARED_FLAG for arrays).	2022-11-02 11:03:21 -04:00
John Hawthorn	02f1554224	Implement object shapes for T_CLASS and T_MODULE (#6637 ) * Avoid RCLASS_IV_TBL in marshal.c * Avoid RCLASS_IV_TBL for class names * Avoid RCLASS_IV_TBL for autoload * Avoid RCLASS_IV_TBL for class variables * Avoid copying RCLASS_IV_TBL onto ICLASSes * Use object shapes for Class and Module IVs	2022-10-31 14:05:37 -07:00
John Hawthorn	b652dbf63b	Remove iv_index_tbl_entry	2022-10-24 10:54:01 -07:00
Takashi Kokubun	120b747b7d	YJIT: Lazily enable YJIT after prelude (#6597 ) * YJIT: Lazily enable YJIT after prelude * Update dependencies * Use a bit field for opt->yjit	2022-10-24 12:20:44 -04:00
Jemma Issroff	13bd617ea6	Remove unused class serial Before object shapes, we were using class serial to invalidate inline caches. Now that we use shape_id for inline cache keys, the class serial is unnecessary. Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org>	2022-10-21 14:56:48 -07:00
Koichi Sasada	e35c528d72	push dummy frame for loading process This patch pushes dummy frames when loading code for the profiling purpose. The following methods push a dummy frame: * `Kernel#require` * `Kernel#load` * `RubyVM::InstructionSequence.compile_file` * `RubyVM::InstructionSequence.load_from_binary` https://bugs.ruby-lang.org/issues/18559	2022-10-20 17:38:28 +09:00
Takashi Kokubun	d9d9005a3a	MJIT: Stop using the VM barrier for jit_cont This solves multiple problems. First, RB_VM_LOCK_ENTER/LEAVE is a barrier. We could at least use the _NO_BARRIER variant. Second, this doesn't need to interfere with GC or other GVL users when multiple Ractors are used. This needs to be used in very few places, so the benefit of fine-grained locking would outweigh its small maintenance cost. Third, it fixes a crash for YJIT. Because YJIT is never disabled until a process exits unlike MJIT that finishes earlier, we could call jit_cont_free when EC no longer exists, which crashes RB_VM_LOCK_ENTER.	2022-10-19 17:20:48 -07:00
Takashi Kokubun	e7166c9bb7	Allow passing a Rust closure to rb_iseq_callback (#6575 )	2022-10-18 09:07:11 -07:00
Takashi Kokubun	e7c71c6c92	Make mjit_cont sharable with YJIT (#6556 ) * Make mjit_cont sharable with YJIT * Update dependencies * Update YJIT binding	2022-10-17 09:27:59 -07:00
Aaron Patterson	1acc1a5c6d	YJIT doesn't need rb_obj_ensure_iv_index_mapping We should make this function static and remove it from YJIT bindings.	2022-10-14 17:14:41 -07:00
Nobuyoshi Nakada	5ccb625fbb	Use `roomof` macro for rounding up divisions	2022-10-14 19:23:25 +09:00
Jemma Issroff	ad63b668e2	Revert "Revert "This commit implements the Object Shapes technique in CRuby."" This reverts commit `9a6803c90b`.	2022-10-11 08:40:56 -07:00
yui-knk	fbbdbdd891	Add error_tolerant option to RubyVM::AST If this option is enabled, SyntaxError is not raised and Node is returned even if passed script is broken. [Feature #19013]	2022-10-08 17:59:11 +09:00
Aaron Patterson	9a6803c90b	Revert "This commit implements the Object Shapes technique in CRuby." This reverts commit 68bc9e2e97d12f80df0d113e284864e225f771c2.	2022-09-30 16:01:50 -07:00
Victor Shepelev	ad651925e3	Add Data class implementation: Simple immutable value object	2022-09-30 18:23:19 +09:00
Samuel Williams	9dd902b831	Add `eval: true/false` flag to `Coverage.setup`.	2022-09-29 09:44:14 +13:00
Jemma Issroff	d594a5a8bd	This commit implements the Object Shapes technique in CRuby. Object Shapes is used for accessing instance variables and representing the "frozenness" of objects. Object instances have a "shape" and the shape represents some attributes of the object (currently which instance variables are set and the "frozenness"). Shapes form a tree data structure, and when a new instance variable is set on an object, that object "transitions" to a new shape in the shape tree. Each shape has an ID that is used for caching. The shape structure is independent of class, so objects of different types can have the same shape. For example: ```ruby class Foo def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end class Bar def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end foo = Foo.new # `foo` has shape id 2 bar = Bar.new # `bar` has shape id 2 ``` Both `foo` and `bar` instances have the same shape because they both set instance variables of the same name in the same order. This technique can help to improve inline cache hits as well as generate more efficient machine code in JIT compilers. This commit also adds some methods for debugging shapes on objects. See `RubyVM::Shape` for more details. For more context on Object Shapes, see [Feature: #18776] Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org> Co-Authored-By: Eileen M. Uchitelle <eileencodes@gmail.com> Co-Authored-By: John Hawthorn <john@hawthorn.email>	2022-09-28 08:26:21 -07:00
Aaron Patterson	06abfa5be6	Revert this until we can figure out WB issues or remove shapes from GC Revert "* expand tabs. [ci skip]" This reverts commit `830b5b5c35`. Revert "This commit implements the Object Shapes technique in CRuby." This reverts commit `9ddfd2ca00`.	2022-09-26 16:10:11 -07:00
Jemma Issroff	9ddfd2ca00	This commit implements the Object Shapes technique in CRuby. Object Shapes is used for accessing instance variables and representing the "frozenness" of objects. Object instances have a "shape" and the shape represents some attributes of the object (currently which instance variables are set and the "frozenness"). Shapes form a tree data structure, and when a new instance variable is set on an object, that object "transitions" to a new shape in the shape tree. Each shape has an ID that is used for caching. The shape structure is independent of class, so objects of different types can have the same shape. For example: ```ruby class Foo def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end class Bar def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end foo = Foo.new # `foo` has shape id 2 bar = Bar.new # `bar` has shape id 2 ``` Both `foo` and `bar` instances have the same shape because they both set instance variables of the same name in the same order. This technique can help to improve inline cache hits as well as generate more efficient machine code in JIT compilers. This commit also adds some methods for debugging shapes on objects. See `RubyVM::Shape` for more details. For more context on Object Shapes, see [Feature: #18776] Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org> Co-Authored-By: Eileen M. Uchitelle <eileencodes@gmail.com> Co-Authored-By: John Hawthorn <john@hawthorn.email>	2022-09-26 09:21:30 -07:00
Yusuke Endoh	a78c733cc3	Revert "Revert "error.c: Let Exception#inspect inspect its message"" This reverts commit `b9f030954a`. [Bug #18170]	2022-09-23 16:40:59 +09:00
Benoit Daloze	6525b6f760	Remove get_actual_encoding() and the dynamic endian detection for dummy UTF-16/UTF-32 * And simplify callers of get_actual_encoding(). * See [Feature #18949]. * See https://github.com/ruby/ruby/pull/6322#issuecomment-1242758474	2022-09-12 14:02:34 +02:00
Nobuyoshi Nakada	576bdec03f	[Bug #18973 ] Promote US-ASCII to ASCII-8BIT when adding 8-bit char	2022-08-31 17:27:59 +09:00
Peter Zhu	efb91ff19b	Rename rb_ary_tmp_new to rb_ary_hidden_new rb_ary_tmp_new suggests that the array is temporary in some way, but that's not true, it just creates an array that's hidden and not on the transient heap. This commit renames it to rb_ary_hidden_new.	2022-07-26 09:12:09 -04:00
Jemma Issroff	63330ae4ac	Change ROBJECT_TRANSIENT_FLAG to use FL_USER2	2022-07-25 12:08:58 -07:00
Peter Zhu	e199ae3edc	Remove reference counting for all frozen arrays The RARRAY_LITERAL_FLAG was added in commit `5871ecf956` to improve CoW performance for array literals by not keeping track of reference counts. This commit reverts that commit and has an alternate implementation that is more generic for all frozen arrays. Since frozen arrays cannot be modified, we don't need to set the RARRAY_SHARED_ROOT_FLAG and we don't need to do reference counting.	2022-07-22 13:29:21 -04:00

1 2 3 4 5 ...

288 Коммитов