Before this commit, iclasses were "shady", or not protected by write
barriers. Because of that, the GC had to spend more time marking these
objects than it otherwise would.
Applications that make heavy use of modules should see a reduction in
GC time, as they have a significant number of live iclasses on the heap.
- Put logic for iclass method table ownership into a function
- Remove calls to WB_UNPROTECT and insert write barriers for iclasses
This commit relies on the following invariant: for any non-origin
iclass `I`, `RCLASS_M_TBL(I) == RCLASS_M_TBL(RBasic(I)->klass)`. This
invariant did not hold prior to 98286e9 for classes and modules that
have prepended modules.
[Feature #16984]
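A minimal sketch of how that invariant can be asserted (the helper name is hypothetical, and the origin-flag test is my reading of "non-origin"):
```c
/* Hypothetical assertion; RCLASS_M_TBL, RBASIC, BUILTIN_TYPE, FL_TEST,
 * and RICLASS_IS_ORIGIN are existing internals. */
static void
assert_iclass_shares_mtbl(VALUE iclass)
{
    if (BUILTIN_TYPE(iclass) == T_ICLASS && !FL_TEST(iclass, RICLASS_IS_ORIGIN)) {
        /* a non-origin iclass shares its method table with the module
         * (or class) it was generated from */
        GC_ASSERT(RCLASS_M_TBL(iclass) == RCLASS_M_TBL(RBASIC(iclass)->klass));
    }
}
```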
To optimize the sweep phase, there is a bit operation that sets mark
bits for the out-of-range bits in the last `bits_t`.
However, if there are no out-of-range bits, it sets the entire last
`bits_t` as mark bits, which breaks the assumption that unmarked
objects will be swept.
GC_DEBUG=1 makes sizeof(RVALUE)=64 on my machine and this condition
happens.
It took me one Saturday to debug this.
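A sketch of the guard, using gc.c-style names (treat the exact expressions as illustrative):
```c
/* Mark the out-of-range bits of the last bits_t so sweep skips them,
 * but only when such bits exist; otherwise the whole bits_t would look
 * marked and dead objects in it would never be swept. */
bits_t *bits = page->mark_bits;
int out_of_range_bits = (NUM_IN_PAGE(page->start) + page->total_slots) % BITS_BITLENGTH;
if (out_of_range_bits != 0) {  /* zero when the slots exactly fill the bitmap */
    bits[BITMAP_INDEX(page->start) + page->total_slots / BITS_BITLENGTH] |=
        ~(((bits_t)1 << out_of_range_bits) - 1);
}
```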
imemo_callcache and imemo_callinfo were not handled by the `objspace`
module and were showing up as "unknown" in the dump. Extract the code for
naming imemos and use that in both the GC and the `objspace` module.
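The extracted helper could look roughly like this (switch abbreviated; the exact function name is an assumption):
```c
/* Shared naming helper, callable from both gc.c and ext/objspace, so
 * the two never disagree about imemo names again. */
const char *
rb_imemo_name(enum imemo_type type)
{
    switch (type) {
#define IMEMO_NAME(x) case imemo_##x: return #x;
        IMEMO_NAME(env);
        IMEMO_NAME(cref);
        IMEMO_NAME(svar);
        IMEMO_NAME(callinfo);   /* previously dumped as "unknown" */
        IMEMO_NAME(callcache);  /* previously dumped as "unknown" */
#undef IMEMO_NAME
      default:
        return "unknown";
    }
}
```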
This commit expands heap pages to be exactly 16KiB and eliminates the
`REQUIRED_SIZE_BY_MALLOC` constant.
I believe the goal of `REQUIRED_SIZE_BY_MALLOC` was to make the heap
pages consume some multiple of OS page size. 16KiB is convenient because
OS page size is typically 4KiB, so one Ruby page is four OS pages.
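A minimal sketch of the resulting allocation (function name hypothetical):
```c
#include <stdlib.h>

#define HEAP_PAGE_ALIGN_LOG 14                  /* 2^14 = 16KiB */
#define HEAP_PAGE_ALIGN ((size_t)1 << HEAP_PAGE_ALIGN_LOG)
#define HEAP_PAGE_SIZE  HEAP_PAGE_ALIGN         /* request and use all of it */

static void *
heap_page_body_alloc(void)
{
    void *ptr = NULL;
    /* page-aligned, so the page starts and ends on OS page boundaries;
     * no REQUIRED_SIZE_BY_MALLOC slack is carved out for the allocator */
    if (posix_memalign(&ptr, HEAP_PAGE_ALIGN, HEAP_PAGE_SIZE) != 0) {
        return NULL;
    }
    return ptr;
}
```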
Do not guess how malloc works
=============================
We should not try to guess how `malloc` works and instead request (and
use) four OS pages.
Here is my reasoning:
1. Not all mallocs will store metadata in the same region as user requested
memory. jemalloc specifically states[1]:
> Information about the states of the runs is stored as a page map at the beginning of each chunk.
2. We're using `posix_memalign` to request memory. This means that the
first address must be divisible by the alignment. Our allocation is
page aligned, so if malloc is storing metadata *before* the page,
then we've already crossed page boundaries.
3. Some allocators like glibc will use the memory at the end of the
page. I am able to demonstrate that glibc will return pointers
within the page boundary that contains `heap_page_body`[2]. We
*expected* the allocation to look like this:
![Expected alignment](https://user-images.githubusercontent.com/3124/85803661-8a81d600-b6fc-11ea-8cb6-7dbdb434a43b.png)
But since `heap_page` is allocated immediately after
`heap_page_body`[3], instead the layout looks like this:
![Actual alignment](https://user-images.githubusercontent.com/3124/85803714-a1c0c380-b6fc-11ea-8c17-8b37369e17ee.png)
This is not optimal because `heap_page` gets allocated immediately
after `heap_page_body`. We frequently write to `heap_page`, so the
bottom OS page of `heap_page_body` is very likely to be copied.
One more object per page
========================
In jemalloc, allocation requests are rounded to the nearest boundary,
which in this case is 16KiB[4], so `REQUIRED_SIZE_BY_MALLOC` space is
just wasted on jemalloc.
On glibc, the space is not wasted, but instead it is very likely to
cause page faults.
Instead of wasting space or causing page faults, let's just use the
space to store one more Ruby object. Doing so prevents page faults,
stops wasting space, decreases memory usage, decreases GC time, etc.
1. https://people.freebsd.org/~jasone/jemalloc/bsdcan2006/jemalloc.pdf
2. 33390d15e7
3. 289a28e68f/gc.c (L1757-L1763)
4. https://people.freebsd.org/~jasone/jemalloc/bsdcan2006/jemalloc.pdf, page 4
Co-authored-by: John Hawthorn <john@hawthorn.email>
This commit converts RMoved slots to a doubly linked list. I want to
convert this to a doubly linked list because the read barrier (currently
in development) must remove nodes from the moved list sometimes.
Removing nodes from the list is much easier if the list is doubly
linked. In addition, we can reuse the list manipulation routines.
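Illustratively (field and function names are hypothetical; the real code reuses existing list routines):
```c
/* With a prev link, the read barrier can unlink a moved slot in O(1)
 * instead of walking a singly linked list to find its predecessor. */
struct moved_node {
    struct moved_node *next;
    struct moved_node *prev;
};

static void
moved_list_remove(struct moved_node *node)
{
    node->prev->next = node->next;  /* assumes a sentinel head node */
    if (node->next) node->next->prev = node->prev;
}
```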
Ensure that the argument is an Integer, or implicitly convert it to
one, before dereferencing it as a Bignum. This addresses a regression
introduced in b99833baec.
Reported by u75615 at https://hackerone.com/reports/898614
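A sketch of the defensive pattern (surrounding context is hypothetical; `rb_to_int` is the real conversion API):
```c
/* rb_to_int() raises TypeError unless the value is an Integer or
 * implicitly convertible via #to_int, so the Bignum branch below can
 * never dereference a foreign object. */
VALUE num = rb_to_int(arg);
if (FIXNUM_P(num)) {
    /* Fixnum path: no Bignum fields to touch */
}
else {
    /* num is guaranteed to be a T_BIGNUM here */
}
```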
This reverts commit 02b216e5a7.
This reverts commit 9b8825b6f9.
I found that combining sweep and move is not safe. I don't think that
we can do compaction concurrently with _anything_ unless there is a read
barrier installed.
Here is a simple example. A class object is freed, and during its free
step, it tries to remove itself from its parent's subclass list.
However, during the sweep step, the parent class was moved and the
"currently being freed" class didn't have references updated yet. So we
get a segv like this:
```
(lldb) bt
* thread #1, name = 'ruby', stop reason = signal SIGSEGV
* frame #0: 0x0000560763e344cb ruby`rb_st_lookup at st.c:320:43
frame #1: 0x0000560763e344cb ruby`rb_st_lookup(tab=0x2f7469672f6e6f72, key=3809, value=0x0000560765bf2270) at st.c:1010
frame #2: 0x0000560763e8f16a ruby`rb_search_class_path at variable.c:99:9
frame #3: 0x0000560763e8f141 ruby`rb_search_class_path at variable.c:145
frame #4: 0x0000560763e8f141 ruby`rb_search_class_path(klass=94589785585880) at variable.c:191
frame #5: 0x0000560763ec744e ruby`rb_vm_bugreport at vm_dump.c:996:17
frame #6: 0x0000560763f5b958 ruby`rb_bug_for_fatal_signal at error.c:675:5
frame #7: 0x0000560763e27dad ruby`sigsegv(sig=<unavailable>, info=<unavailable>, ctx=<unavailable>) at signal.c:955:5
frame #8: 0x00007f8b891d33c0 libpthread.so.0`___lldb_unnamed_symbol1$$libpthread.so.0 + 1
frame #9: 0x0000560763efa8bb ruby`rb_class_remove_from_super_subclasses(klass=94589790314280) at class.c:93:56
frame #10: 0x0000560763d10cb7 ruby`gc_sweep_step at gc.c:2674:2
frame #11: 0x0000560763d1187b ruby`gc_sweep at gc.c:4540:2
frame #12: 0x0000560763d101f0 ruby`gc_start at gc.c:6797:6
frame #13: 0x0000560763d15153 ruby`rb_gc_compact at gc.c:7479:12
frame #14: 0x0000560763eb4eb8 ruby`vm_exec_core at vm_insnhelper.c:5183:13
frame #15: 0x0000560763ea9bae ruby`rb_vm_exec at vm.c:1953:22
frame #16: 0x0000560763eac08d ruby`rb_yield at vm.c:1132:9
frame #17: 0x0000560763edb4f2 ruby`rb_ary_collect at array.c:3186:9
frame #18: 0x0000560763e9ee15 ruby`vm_call_cfunc_with_frame at vm_insnhelper.c:2575:12
frame #19: 0x0000560763eb2e66 ruby`vm_exec_core at vm_insnhelper.c:4177:11
frame #20: 0x0000560763ea9bae ruby`rb_vm_exec at vm.c:1953:22
frame #21: 0x0000560763eac08d ruby`rb_yield at vm.c:1132:9
frame #22: 0x0000560763edb4f2 ruby`rb_ary_collect at array.c:3186:9
frame #23: 0x0000560763e9ee15 ruby`vm_call_cfunc_with_frame at vm_insnhelper.c:2575:12
frame #24: 0x0000560763eb2e66 ruby`vm_exec_core at vm_insnhelper.c:4177:11
frame #25: 0x0000560763ea9bae ruby`rb_vm_exec at vm.c:1953:22
frame #26: 0x0000560763ceee01 ruby`rb_ec_exec_node(ec=0x0000560765afa530, n=0x0000560765b088e0) at eval.c:296:2
frame #27: 0x0000560763cf3b7b ruby`ruby_run_node(n=0x0000560765b088e0) at eval.c:354:12
frame #28: 0x0000560763cee4a3 ruby`main(argc=<unavailable>, argv=<unavailable>) at main.c:50:9
frame #29: 0x00007f8b88e560b3 libc.so.6`__libc_start_main + 243
frame #30: 0x0000560763cee4ee ruby`_start + 46
(lldb) f 9
frame #9: 0x0000560763efa8bb ruby`rb_class_remove_from_super_subclasses(klass=94589790314280) at class.c:93:56
90
91 *RCLASS_EXT(klass)->parent_subclasses = entry->next;
92 if (entry->next) {
-> 93 RCLASS_EXT(entry->next->klass)->parent_subclasses = RCLASS_EXT(klass)->parent_subclasses;
94 }
95 xfree(entry);
96 }
(lldb) command script import -r misc/lldb_cruby.py
lldb scripts for ruby has been installed.
(lldb) rp entry->next->klass
(struct RMoved) $1 = (flags = 30, destination = 94589792806680, next = 94589784369160)
(lldb)
```
We don't need to resolve symbols when freeing cc tables, so this commit
just changes the id table iterator to look at values rather than keys
and values.
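Roughly (callback and wrapper names are hypothetical; `rb_id_table_foreach_values` is the real iterator):
```c
/* Visit only the values: no ID keys are resolved to symbols while the
 * table is being torn down. */
static enum rb_id_table_iterator_result
cc_table_free_i(VALUE ccs_ptr, void *data)
{
    /* free the call caches reachable from ccs_ptr */
    return ID_TABLE_CONTINUE;
}

static void
cc_tbl_free(struct rb_id_table *cc_tbl)
{
    rb_id_table_foreach_values(cc_tbl, cc_table_free_i, NULL);
    rb_id_table_free(cc_tbl);
}
```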
This commit combines the sweep step with moving objects. With this
commit, we can do:
```ruby
GC.start(compact: true)
```
This code will do the following 3 steps:
1. Fully mark the heap
2. Sweep + Move objects
3. Update references
By default, this will compact in the order that heap pages were
allocated. In other words, objects will be packed towards older heap
pages (rather than towards heap pages with more pinned objects, as
`GC.compact` does).
If a module has an origin, and that module is included in another
module or class, previously the iclass created for the module had
an origin pointer to the module's origin instead of the iclass's
origin.
Setting the origin pointer correctly requires using a stack, since
the origin iclass is not created until after the iclass itself.
Use a hidden ruby array to implement that stack.
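A sketch of that bookkeeping, not the actual include logic (variable names are illustrative):
```c
/* A hidden array (no class, so invisible to Ruby code, but still
 * GC-marked) serves as the stack of iclasses awaiting their origin. */
VALUE stack = rb_ary_tmp_new(0);
rb_ary_push(stack, iclass);      /* its origin iclass doesn't exist yet */
/* ... later, once the origin iclass has been created ... */
VALUE pending = rb_ary_pop(stack);
/* point pending's origin at the freshly created origin iclass here */
```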
Correctly assigning the origin pointers in the iclass caused a
use-after-free in GC. If a module with an origin is included
in a class, the iclass shares a method table with the module
and the iclass origin shares a method table with the module's origin.
Mark the iclass origin with a flag that notes that even though the
iclass is an origin, it shares a method table, so the method table
should not be garbage collected. The shared method table will be
garbage collected when the module origin is garbage collected.
I've tested that this does not introduce a memory leak.
This change caused a VM assertion failure, which was traced to callable
method entries using the incorrect defined_class. Update
rb_vm_check_redefinition_opt_method and find_defined_class_by_owner
to treat iclass origins differently than class origins to avoid this
issue.
This also includes a fix for Module#included_modules to skip
iclasses with origins.
Fixes [Bug #16736]
Ruby's GC is incremental, meaning that during the mark phase (and also
the sweep phase) programs are allowed to run. This means that programs
can allocate objects before the mark or sweep phase has actually
completed. Those objects may not have had a chance to be marked, so we
can't know if they are movable or not. Something that references the
newly created object might have called the pinning function during the
mark phase, but since the mark phase hasn't run we can't know if there
is a "pinning" relationship.
To be conservative, we must only allow objects to move if they are
both marked and not pinned.
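As a sketch, the rule boils down to this predicate (helper name hypothetical; the marked/pinned accessors mirror gc.c):
```c
/* An object may move only if this cycle both marked it and never
 * pinned it; everything else conservatively stays put. */
static int
gc_object_moveable_p(VALUE obj)
{
    return RVALUE_MARKED(obj) && !RVALUE_PINNED(obj);
}
```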
This patch allows global variables that have been assigned in Ruby to
move. I added a new function for the GC to call that will update
global references and introduced a new callback in the global variable
struct for updating references.
Only pure Ruby global variables are supported right now, other
references will be pinned.
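A sketch of the shape of that callback (struct layout hypothetical; `rb_gc_location` is the real relocation API):
```c
struct global_entry_callbacks {
    void (*marker)(VALUE *var);   /* existing: mark the referenced object */
    void (*updater)(VALUE *var);  /* new: repoint the slot after compaction */
};

static void
val_update(VALUE *var)
{
    /* rb_gc_location() returns the object's new address, or the same
     * VALUE if it did not move */
    *var = rb_gc_location(*var);
}
```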
No objects should ever reference a `T_MOVED` slot. If they do, it's
absolutely a bug. If we kill the process when `T_MOVED` is pushed on
the mark stack it will make it easier to identify which object holds a
reference that hasn't been updated.
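The check itself is tiny (placement illustrative):
```c
/* A T_MOVED slot reaching the mark stack always means some object
 * still holds a stale, un-updated reference. */
if (BUILTIN_TYPE(obj) == T_MOVED) {
    rb_bug("rb_gc_mark(): %p is T_MOVED; a reference was not updated",
           (void *)obj);
}
```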
We only need to run the loop of `T_MASK` iterations once. Also, not
every value between 0 and `T_MASK` is an actual Ruby type. Before this change, some
integers were being added to the result hash even though they aren't
actual types. This patch omits considered / moved entries that total 0,
cleaning up the result hash and eliminating these "fake types".
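A sketch of the reporting loop (the counter table and `type_sym` names follow gc.c but are best treated as illustrative):
```c
/* One pass over 0..T_MASK; entries whose counters are all zero are
 * skipped, so integers that aren't real types never reach the hash. */
for (size_t i = 0; i < T_MASK; i++) {
    size_t considered = objspace->rcompactor.considered_count_table[i];
    size_t moved      = objspace->rcompactor.moved_count_table[i];
    if (considered || moved) {
        rb_hash_aset(considered_hash, type_sym(i), SIZET2NUM(considered));
        rb_hash_aset(moved_hash,      type_sym(i), SIZET2NUM(moved));
    }
}
```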
This compile-time option has been broken for years (at least since
commit 49369ef173, according to git
bisect). Let's delete code that no longer works.