github/ruby - ruby

Граф коммитов

Автор	SHA1	Сообщение	Дата
Jean Boussier	d4f3dcf4df	Refactor VM root modules This `st_table` is used to both mark and pin classes defined from the C API. But `vm->mark_object_ary` already does both much more efficiently. Currently a Ruby process starts with 252 rooted classes, which uses `7224B` in an `st_table` or `2016B` in an `RArray`. So a baseline of 5kB saved, but since `mark_object_ary` is preallocated with `1024` slots but only use `405` of them, it's a net `7kB` save. `vm->mark_object_ary` is also being refactored. Prior to this changes, `mark_object_ary` was a regular `RArray`, but since this allows for references to be moved, it was marked a second time from `rb_vm_mark()` to pin these objects. This has the detrimental effect of marking these references on every minors even though it's a mostly append only list. But using a custom TypedData we can save from having to mark all the references on minor GC runs. Addtionally, immediate values are now ignored and not appended to `vm->mark_object_ary` as it's just wasted space.	2024-03-06 15:33:43 -05:00
Jean Boussier	b4a69351ec	Move FL_SINGLETON to FL_USER1 This frees FL_USER0 on both T_MODULE and T_CLASS. Note: prior to this, FL_SINGLETON was never set on T_MODULE, so checking for `FL_SINGLETON` without first checking that `FL_TYPE` was `T_CLASS` was valid. That's no longer the case.	2024-03-06 13:11:41 -05:00
Jean Boussier	e626da82ea	Don't pin named structs defined in Ruby [Bug #20311] `rb_define_class_under` assumes it's called from C and that the reference might be held in a C global variable, so it adds the class to the VM root. In the case of `Struct.new('Name')` it's wasteful and make the struct immortal.	2024-03-01 08:23:38 +01:00
John Hawthorn	1c97abaaba	De-dup identical callinfo objects Previously every call to vm_ci_new (when the CI was not packable) would result in a different callinfo being returned this meant that every kwarg callsite had its own CI. When calling, different CIs result in different CCs. These CIs and CCs both end up persisted on the T_CLASS inside cc_tbl. So in an eval loop this resulted in a memory leak of both types of object. This also likely resulted in extra memory used, and extra time searching, in non-eval cases. For simplicity in this commit I always allocate a CI object inside rb_vm_ci_lookup, but ideally we would lazily allocate it only when needed. I hope to do that as a follow up in the future.	2024-02-20 18:55:00 -08:00
Peter Zhu	28a6e4ea9d	Set m_tbl right after allocation We should set the m_tbl right after allocation before anything that can trigger GC to avoid clone_p from becoming old and needing to fire write barriers. Co-authored-by: Aaron Patterson <tenderlove@ruby-lang.org>	2023-12-19 13:09:36 -08:00
Aaron Patterson	6fce8c7980	Don't try compacting ivars on Classes that are "too complex" Too complex classes use a hash table to store ivs, and should always pin their IVs. We shouldn't touch those classes in compaction.	2023-11-20 16:09:48 -08:00
Yusuke Endoh	591336a0f2	Avoid the pointer hack in RCLASS_EXT ... because GCC 13 warns it. ``` In file included from class.c:24: In function ‘RCLASS_SET_ALLOCATOR’, inlined from ‘class_alloc’ at class.c:251:5, inlined from ‘rb_module_s_alloc’ at class.c:1045:17: internal/class.h:159:43: warning: array subscript 0 is outside array bounds of ‘rb_classext_t[0]’ {aka ‘struct rb_classext_struct[]’} [-Warray-bounds=] 159 \| RCLASS_EXT(klass)->as.class.allocator = allocator; \| ^ ``` https://rubyci.s3.amazonaws.com/arch/ruby-master/log/20231015T030003Z.log.html.gz	2023-10-15 15:35:45 +09:00
Nobuyoshi Nakada	4634405f7c	Stop exposing FrozenCore in headers Revert commit "Directly allocate FrozenCore as an ICLASS", `813a5f4fc4`.	2023-09-19 14:08:05 +09:00
Nobuyoshi Nakada	b934976024	Prefer `0` over `NULL` as function pointers SunC warns use of `NULL`, pointer to data as function pointers.	2023-06-23 03:15:55 +09:00
Peter Zhu	813a5f4fc4	Directly allocate FrozenCore as an ICLASS It's a bad idea to overwrite the flags as the garbage collector may have set other flags.	2023-06-14 10:42:40 -04:00
eileencodes	40f090f433	Revert "Revert "Fix cvar caching when class is cloned"" This reverts commit `10621f7cb9`. This was reverted because the gc integrity build started failing. We have figured out a fix so I'm reopening the PR. Original commit message: Fix cvar caching when class is cloned The class variable cache that was added in ruby#4544 changed the behavior of class variables on cloned classes. As reported when a class is cloned AND a class variable was set, and the class variable was read from the original class, reading a class variable from the cloned class would return the value from the original class. This was happening because the IC (inline cache) is stored on the ISEQ which is shared between the original and cloned class, therefore they share the cache too. To fix this we are now storing the `cref` in the cache so that we can check if it's equal to the current `cref`. If it's different we don't want to read from the cache. If it's the same we do. Cloned classes don't share the same cref with their original class. This will need to be backported to 3.1 in addition to 3.2 since the bug exists in both versions. We also added a marking function which was missing. Fixes [Bug #19379] Co-authored-by: Aaron Patterson <tenderlove@ruby-lang.org>	2023-06-05 11:11:12 -07:00
Aaron Patterson	10621f7cb9	Revert "Fix cvar caching when class is cloned" This reverts commit `77d1b08247`.	2023-06-01 14:55:36 -07:00
eileencodes	77d1b08247	Fix cvar caching when class is cloned The class variable cache that was added in https://github.com/ruby/ruby/pull/4544 changed the behavior of class variables on cloned classes. As reported when a class is cloned AND a class variable was set, and the class variable was read from the original class, reading a class variable from the cloned class would return the value from the original class. This was happening because the IC (inline cache) is stored on the ISEQ which is shared between the original and cloned class, therefore they share the cache too. To fix this we are now storing the `cref` in the cache so that we can check if it's equal to the current `cref`. If it's different we don't want to read from the cache. If it's the same we do. Cloned classes don't share the same cref with their original class. This will need to be backported to 3.1 in addition to 3.2 since the bug exists in both versions. We also added a marking function which was missing. Fixes [Bug #19379] Co-authored-by: Aaron Patterson <tenderlove@ruby-lang.org>	2023-06-01 08:52:48 -07:00
Peter Zhu	a0d1069e03	Make classes embedded on 32 bit Classes are now exactly 80 bytes when embedded, which perfectly fits the 3rd size pool on 32 bit systems.	2023-04-16 11:06:31 -04:00
Peter Zhu	24b137336b	Move shape ID to flags for classes on 32 bit Moves shape ID to FL_USER4 to FL_USER19 for the shape ID on 32 bit systems. This makes the rb_classext_struct smaller so that it can be embedded.	2023-04-16 11:06:31 -04:00
Peter Zhu	ad3d4e87d7	Move RCLASS_CLONED to rb_classext_struct This commit moves RCLASS_CLONED from the flags to the rb_classext_struct. This frees the FL_USER1 bit.	2023-04-16 11:06:31 -04:00
Peter Zhu	91dcce5ed1	Change max_iv_count to type attr_index_t max_iv_count is calculated from next_iv_index of the shape, which is of type attr_index_t, so we can also make max_iv_count of type attr_index_t.	2023-04-11 15:02:44 -04:00
Aaron Patterson	365fed6369	Revert "Allow classes and modules to become too complex" This reverts commit `69465df424`.	2023-03-10 08:50:43 -08:00
HParker	69465df424	Allow classes and modules to become too complex This makes the behavior of classes and modules when there are too many instance variables match the behavior of objects with too many instance variables.	2023-03-09 15:34:49 -08:00
Takashi Kokubun	233ddfac54	Stop exporting symbols for MJIT	2023-03-06 21:59:23 -08:00
Jean Boussier	1a4b4cd7f8	Move `attached_object` into `rb_classext_struct` Given that signleton classes don't have an allocator, we can re-use these bytes to store the attached object in `rb_classext_struct` without making it larger.	2023-02-16 08:14:44 +01:00
Jean Boussier	7413079dae	Encapsulate RCLASS_ATTACHED_OBJECT Right now the attached object is stored as an instance variable and all the call sites that either get or set it have to know how it's stored. It's preferable to hide this implementation detail behind accessors so that it is easier to change how it's stored.	2023-02-15 15:24:22 +01:00
Jean Boussier	bac4d2eefa	Check !RCLASS_EXT_EMBEDDED instead of SIZE_POOL_COUNT == 1 It's much more self documenting and consistent	2023-02-15 10:47:22 +01:00
Peter Zhu	4fa7d38324	Don't redefine RB_OBJ_WRITE RB_OBJ_WRITE already exists in rgengc.h, so we shouldn't redefine it in gc.h.	2023-01-18 08:49:32 -05:00
Peter Zhu	abff5f6203	Move classpath to rb_classext_t This commit moves the classpath (and tmp_classpath) from instance variables to the rb_classext_t. This improves performance as we no longer need to set an instance variable when assigning a classpath to a class. I benchmarked with the following script: ```ruby name = :MyClass puts(Benchmark.measure do 10_000_000.times do \|i\| Object.const_set(name, Class.new) Object.send(:remove_const, name) end end) ``` Before this patch: ``` 5.440119 0.025264 5.465383 ( 5.467105) ``` After this patch: ``` 4.889646 0.028325 4.917971 ( 4.942678) ```	2023-01-11 11:06:58 -05:00
Peter Zhu	d86833e717	Make variation_count an unsigned char Since SHAPE_MAX_VARIATIONS is 8, it can easily fit inside an unsigned char.	2023-01-11 11:06:58 -05:00
Jemma Issroff	a3d552aedd	Add variation_count on classes Count how many "variations" each class creates. A "variation" is a a unique ordering of instance variables on a particular class. This can also be thought of as a branch in the shape tree. For example, the following Foo class will have 2 variations: ```ruby class Foo ; end Foo.new.instance_variable_set(:@a, 1) # case 1: creates one variation Foo.new.instance_variable_set(:@b, 1) # case 2: creates another variation foo = Foo.new foo.instance_variable_set(:@a, 1) # does not create a new variation foo.instance_variable_set(:@b, 1) # does not create a new variation (a continuation of the variation in case 1) ``` We will use this number to limit the amount of shapes that a class can create and fallback to using a hash iv lookup. Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org>	2022-12-15 10:06:04 -08:00
Jemma Issroff	5246f4027e	Transition shape when object's capacity changes This commit adds a `capacity` field to shapes, and adds shape transitions whenever an object's capacity changes. Objects which are allocated out of a bigger size pool will also make a transition from the root shape to the shape with the correct capacity for their size pool when they are allocated. This commit will allow us to remove numiv from objects completely, and will also mean we can guarantee that if two objects share shapes, their IVs are in the same positions (an embedded and extended object cannot share shapes). This will enable us to implement ivar sets in YJIT using object shapes. Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org>	2022-11-10 10:11:34 -05:00
John Hawthorn	02f1554224	Implement object shapes for T_CLASS and T_MODULE (#6637 ) * Avoid RCLASS_IV_TBL in marshal.c * Avoid RCLASS_IV_TBL for class names * Avoid RCLASS_IV_TBL for autoload * Avoid RCLASS_IV_TBL for class variables * Avoid copying RCLASS_IV_TBL onto ICLASSes * Use object shapes for Class and Module IVs	2022-10-31 14:05:37 -07:00
John Hawthorn	b652dbf63b	Remove iv_index_tbl_entry	2022-10-24 10:54:01 -07:00
Jemma Issroff	13bd617ea6	Remove unused class serial Before object shapes, we were using class serial to invalidate inline caches. Now that we use shape_id for inline cache keys, the class serial is unnecessary. Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org>	2022-10-21 14:56:48 -07:00
Jemma Issroff	ad63b668e2	Revert "Revert "This commit implements the Object Shapes technique in CRuby."" This reverts commit `9a6803c90b`.	2022-10-11 08:40:56 -07:00
Aaron Patterson	9a6803c90b	Revert "This commit implements the Object Shapes technique in CRuby." This reverts commit 68bc9e2e97d12f80df0d113e284864e225f771c2.	2022-09-30 16:01:50 -07:00
Jemma Issroff	d594a5a8bd	This commit implements the Object Shapes technique in CRuby. Object Shapes is used for accessing instance variables and representing the "frozenness" of objects. Object instances have a "shape" and the shape represents some attributes of the object (currently which instance variables are set and the "frozenness"). Shapes form a tree data structure, and when a new instance variable is set on an object, that object "transitions" to a new shape in the shape tree. Each shape has an ID that is used for caching. The shape structure is independent of class, so objects of different types can have the same shape. For example: ```ruby class Foo def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end class Bar def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end foo = Foo.new # `foo` has shape id 2 bar = Bar.new # `bar` has shape id 2 ``` Both `foo` and `bar` instances have the same shape because they both set instance variables of the same name in the same order. This technique can help to improve inline cache hits as well as generate more efficient machine code in JIT compilers. This commit also adds some methods for debugging shapes on objects. See `RubyVM::Shape` for more details. For more context on Object Shapes, see [Feature: #18776] Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org> Co-Authored-By: Eileen M. Uchitelle <eileencodes@gmail.com> Co-Authored-By: John Hawthorn <john@hawthorn.email>	2022-09-28 08:26:21 -07:00
Aaron Patterson	06abfa5be6	Revert this until we can figure out WB issues or remove shapes from GC Revert "* expand tabs. [ci skip]" This reverts commit `830b5b5c35`. Revert "This commit implements the Object Shapes technique in CRuby." This reverts commit `9ddfd2ca00`.	2022-09-26 16:10:11 -07:00
Jemma Issroff	9ddfd2ca00	This commit implements the Object Shapes technique in CRuby. Object Shapes is used for accessing instance variables and representing the "frozenness" of objects. Object instances have a "shape" and the shape represents some attributes of the object (currently which instance variables are set and the "frozenness"). Shapes form a tree data structure, and when a new instance variable is set on an object, that object "transitions" to a new shape in the shape tree. Each shape has an ID that is used for caching. The shape structure is independent of class, so objects of different types can have the same shape. For example: ```ruby class Foo def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end class Bar def initialize # Starts with shape id 0 @a = 1 # transitions to shape id 1 @b = 1 # transitions to shape id 2 end end foo = Foo.new # `foo` has shape id 2 bar = Bar.new # `bar` has shape id 2 ``` Both `foo` and `bar` instances have the same shape because they both set instance variables of the same name in the same order. This technique can help to improve inline cache hits as well as generate more efficient machine code in JIT compilers. This commit also adds some methods for debugging shapes on objects. See `RubyVM::Shape` for more details. For more context on Object Shapes, see [Feature: #18776] Co-Authored-By: Aaron Patterson <tenderlove@ruby-lang.org> Co-Authored-By: Eileen M. Uchitelle <eileencodes@gmail.com> Co-Authored-By: John Hawthorn <john@hawthorn.email>	2022-09-26 09:21:30 -07:00
Jeremy Evans	7cda7fbbdc	Add Module#undefined_instance_methods Implements [Feature #12655] Co-authored-by: Nobuyoshi Nakada <nobu@ruby-lang.org>	2022-06-06 09:57:32 -07:00
Jemma Issroff	8a13a2e8d1	RCLASS uses FLUSER bits 0 through 3	2022-05-27 13:45:47 -07:00
John Hawthorn	19f331f588	Dedup superclass array in leaf sibling classes Previously, we would build a new `superclasses` array for each class, even though for all immediate subclasses of a class, the array is identical. This avoids duplicating the arrays on leaf classes (those without subclasses) by calculating and storing a "superclasses including self" array on a class when it's first inherited and sharing that among all superclasses. An additional trick used is that the "superclass array including self" is valid as "self"'s superclass array. It just has it's own class at the end. We can use this to avoid an extra pointer of storage and can use one bit of a flag to track that we've "upgraded" the array.	2022-03-03 11:23:27 -08:00
John Hawthorn	b13a7c8e36	Constant time class to class ancestor lookup Previously when checking ancestors, we would walk all the way up the ancestry chain checking each parent for a matching class or module. I believe this was especially unfriendly to CPU cache since for each step we need to check two cache lines (the class and class ext). This check is used quite often in: * case statements * rescue statements * Calling protected methods * Class#is_a? * Module#=== * Module#<=> I believe it's most common to check a class against a parent class, to this commit aims to improve that (unfortunately does not help checking for an included Module). This is done by storing on each class the number and an array of all parent classes, in order (BasicObject is at index 0). Using this we can check whether a class is a subclass of another in constant time since we know the location to expect it in the hierarchy.	2022-02-23 19:57:42 -08:00
Nobuyoshi Nakada	ab11cafe0b	Parenthesize the macro argument	2022-01-17 01:56:04 +09:00
Peter Zhu	6b7eff9086	Separately allocate class_serial on 32-bit systems On 32-bit systems, VWA causes class_serial to not be aligned (it only guarantees 4 byte alignment but class_serial is 8 bytes and requires 8 byte alignment). This commit uses a hack to allocate class_serial through malloc. Once VWA allocates with 8 byte alignment in the future, we will revert this commit.	2022-01-14 14:36:33 -05:00
Matt Valentine-House	b680b632e5	Make RCLASS_EXT(c)->subclasses a doubly linked list Updating RCLASS_PARENT_SUBCLASSES and RCLASS_MODULE_SUBCLASSES while compacting can trigger the read barrier. This commit makes RCLASS_SUBCLASSES a doubly linked list with a dedicated head object so that we can add and remove entries from the list without having to touch an object in the Ruby heap	2021-11-22 09:11:04 -05:00
Matt Valentine-House	a9a94540d6	Remove RCLASS(obj)->ptr when RVARGC is enabled With RVARGC we always store the rb_classext_t in the same slot as the RClass struct that refers to it. So we don't need to store the pointer or access through the pointer anymore and can switch the RCLASS_EXT macro to use an offset	2021-11-11 13:47:45 -05:00
Nobuyoshi Nakada	65285bf673	Consider modified modules initialized [Bug #18185 ]	2021-09-24 08:29:00 +09:00
Nobuyoshi Nakada	c0a892a7f0	Fix a typo [Bug #17048 ]	2021-09-19 22:39:18 +09:00
Nobuyoshi Nakada	178ee1e801	Already initialized modules cannot be replaced [Bug #17048 ]	2021-09-17 11:14:04 +09:00
卜部昌平	daf0c04a47	internal/*.h: skip doxygen These contents are purely implementation details, not worth appearing in CAPI documents. [ci skip]	2021-09-10 20:00:06 +09:00
Nobuyoshi Nakada	6963f8f743	Remove old warning aged nearly 8 years	2021-08-19 17:44:48 +09:00
eileencodes	b91b3bc771	Add a cache for class variables Redo of 34a2acdac788602c14bf05fb616215187badd504 and 931138b00696419945dc03e10f033b1f53cd50f3 which were reverted. GitHub PR #4340. This change implements a cache for class variables. Previously there was no cache for cvars. Cvar access is slow due to needing to travel all the way up th ancestor tree before returning the cvar value. The deeper the ancestor tree the slower cvar access will be. The benefits of the cache are more visible with a higher number of included modules due to the way Ruby looks up class variables. The benchmark here includes 26 modules and shows with the cache, this branch is 6.5x faster when accessing class variables. ``` compare-ruby: ruby 3.1.0dev (2021-03-15T06:22:34Z master `9e5105c`) [x86_64-darwin19] built-ruby: ruby 3.1.0dev (2021-03-15T12:12:44Z add-cache-for-clas.. c6be009) [x86_64-darwin19] \| \|compare-ruby\|built-ruby\| \|:--------\|-----------:\|---------:\| \|vm_cvar \| 5.681M\| 36.980M\| \| \| -\| 6.51x\| ``` Benchmark.ips calling `ActiveRecord::Base.logger` from within a Rails application. ActiveRecord::Base.logger has 71 ancestors. The more ancestors a tree has, the more clear the speed increase. IE if Base had only one ancestor we'd see no improvement. This benchmark is run on a vanilla Rails application. Benchmark code: ```ruby require "benchmark/ips" require_relative "config/environment" Benchmark.ips do \|x\| x.report "logger" do ActiveRecord::Base.logger end end ``` Ruby 3.0 master / Rails 6.1: ``` Warming up -------------------------------------- logger 155.251k i/100ms Calculating ------------------------------------- ``` Ruby 3.0 with cvar cache / Rails 6.1: ``` Warming up -------------------------------------- logger 1.546M i/100ms Calculating ------------------------------------- logger 14.857M (± 4.8%) i/s - 74.198M in 5.006202s ``` Lastly we ran a benchmark to demonstate the difference between master and our cache when the number of modules increases. This benchmark measures 1 ancestor, 30 ancestors, and 100 ancestors. Ruby 3.0 master: ``` Warming up -------------------------------------- 1 module 1.231M i/100ms 30 modules 432.020k i/100ms 100 modules 145.399k i/100ms Calculating ------------------------------------- 1 module 12.210M (± 2.1%) i/s - 61.553M in 5.043400s 30 modules 4.354M (± 2.7%) i/s - 22.033M in 5.063839s 100 modules 1.434M (± 2.9%) i/s - 7.270M in 5.072531s Comparison: 1 module: 12209958.3 i/s 30 modules: 4354217.8 i/s - 2.80x (± 0.00) slower 100 modules: 1434447.3 i/s - 8.51x (± 0.00) slower ``` Ruby 3.0 with cvar cache: ``` Warming up -------------------------------------- 1 module 1.641M i/100ms 30 modules 1.655M i/100ms 100 modules 1.620M i/100ms Calculating ------------------------------------- 1 module 16.279M (± 3.8%) i/s - 82.038M in 5.046923s 30 modules 15.891M (± 3.9%) i/s - 79.459M in 5.007958s 100 modules 16.087M (± 3.6%) i/s - 81.005M in 5.041931s Comparison: 1 module: 16279458.0 i/s 100 modules: 16087484.6 i/s - same-ish: difference falls within error 30 modules: 15891406.2 i/s - same-ish: difference falls within error ``` Co-authored-by: Aaron Patterson <tenderlove@ruby-lang.org>	2021-06-18 10:02:44 -07:00

1 2

73 Коммитов