github/ruby - ruby

Граф коммитов

Автор	SHA1	Сообщение	Дата
Samuel Williams	0402193723	Fix `Thread#join(timeout)` when running inside the fiber scheduler. (#7903 )	2023-06-03 12:41:36 +09:00
KJ Tsanaktsidis	edee9b6a12	Use a real Ruby mutex in rb_io_close_wait_list (#7884 ) Because a thread calling IO#close now blocks in a native condvar wait, it's possible for there to be _no_ threads left to actually handle incoming signals/ubf calls/etc. This manifested as failing tests on Solaris 10 (SPARC), because: * One thread called IO#close, which sent a SIGVTALRM to the other thread to interrupt it, and then waited on the condvar to be notified that the reading thread was done. * One thread was calling IO#read, but it hadn't yet reached the actual call to select(2) when the SIGVTALRM arrived, so it never unblocked itself. This results in a deadlock. The fix is to use a real Ruby mutex for the close lock; that way, the closing thread goes into sigwait-sleep and can keep trying to interrupt the select(2) thread. See the discussion in: https://github.com/ruby/ruby/pull/7865/	2023-06-01 17:37:18 +09:00
git	cc698c6cc2	* expand tabs. [ci skip] Please consider using misc/expand_tabs.rb as a pre-commit hook.	2023-05-26 05:51:35 +00:00
KJ Tsanaktsidis	66871c5a06	Fix busy-loop when waiting for file descriptors to close When one thread is closing a file descriptor whilst another thread is concurrently reading it, we need to wait for the reading thread to be done with it to prevent a potential EBADF (or, worse, file descriptor reuse). At the moment, that is done by keeping a list of threads still using the file descriptor in io_close_fptr. It then continually calls rb_thread_schedule() in fptr_finalize_flush until said list is empty. That busy-looping seems to behave rather poorly on some OS's, particulary FreeBSD. It can cause the TestIO#test_race_gets_and_close test to fail (even with its very long 200 second timeout) because the closing thread starves out the using thread. To fix that, I introduce the concept of struct rb_io_close_wait_list; a list of threads still using a file descriptor that we want to close. We call `rb_notify_fd_close` to let the thread scheduler know we're closing a FD, which fills the list with threads. Then, we call rb_notify_fd_close_wait which will block the thread until all of the still-using threads are done. This is implemented with a condition variable sleep, so no busy-looping is required.	2023-05-26 14:51:23 +09:00
KJ Tsanaktsidis	8e1abef469	Fix a potential busy-loop in the thread scheduler (esp. on FreeBSD) This patch fixes a potential busy-loop in the thread scheduler. If there are two threads, the main thread (where Ruby signal handlers must run) and a sleeping thread, it is possible for the following sequence of events to occur: * The sleeping thread is in native_sleep -> sigwait_sleep A signal * arives, kicking this thread out of rb_sigwait_sleep The sleeping * thread calls THREAD_BLOCKING_END and eventually thread_sched_to_running_common * the sleeping thread writes into the sigwait_fd pipe by calling rb_thread_wakeup_timer_thread * the sleeping thread re-loops around in native_sleep() because the desired sleep time has not actually yet expired * that calls rb_sigwait_sleep again the ppoll() in rb_sigwait_sleep * immediately returns because of the byte written into the sigwait_fd by rb_thread_wakeup_timer_thread * that wakes the thread up again and kicks the whole cycle off again. Such a loop can only be broken by the main thread waking up and handling the signal, such that ubf_threads_empty() below becomes true again; however this loop can actually keep things so busy (and cause so much contention on the main thread's interrupt_lock) that the main thread doesn't deal with the signal for many seconds. This seems particuarly likely on FreeBSD 13. (the cycle can also be broken by the sleeping thread finally elapsing its desired sleep time). The fix for _this_ loop is to only wakeup the timer thrad in thread_sched_to_running_common if the current thread is not itself the sigwait thread. An almost identical loop also happens in the same circumstances because the call to check_signals_nogvl (through sigwait_timeout) in rb_sigwait_sleep returns true if there is any pending signal for the main thread to handle. That then causes rb_sigwait_sleep to skip over sleeping entirely. This is unnescessary and counterproductive, I believe; if the main thread needs to be woken up that is done inline in check_signals_nogvl anyway. See https://bugs.ruby-lang.org/issues/19680	2023-05-26 14:48:08 +09:00
Samuel Williams	2df5a697e2	Add Fiber#kill, similar to Thread#kill. (#7823 )	2023-05-18 23:33:42 +09:00
Samuel Williams	ab7bb38aca	Remove explicit SIGCHLD handling. (#7816 ) * Remove unused SIGCHLD handling. * Remove unused `init_sigchld`. * Remove unnecessary `#define RUBY_SIGCHLD (0)`. * Remove unused `SIGCHLD_LOSSY`.	2023-05-15 23:14:51 +09:00
Koichi Sasada	b2e848193a	fix deadlock on `Thread#join` because of `9720f5ac89` http://rubyci.s3.amazonaws.com/solaris11-sunc/ruby-master/log/20230403T130011Z.fail.html.gz ``` 1) Failure: TestThread#test_signal_at_join [/export/home/chkbuild/chkbuild-sunc/tmp/build/20230403T130011Z/ruby/test/ruby/test_thread.rb:1488]: Exception raised: <#<fatal:"No live threads left. Deadlock?\n1 threads, 1 sleeps current:0x00891288 main thread:0x00891288\n* #<Thread:0xfef89a18 sleep_forever>\n rb_thread_t:0x00891288 native:0x00000001 int:0\n \n">> Backtrace: -:30:in `join' -:30:in `block (3 levels) in <main>' -:21:in `times' -:21:in `block (2 levels) in <main>'. ``` The mechanism: * Main thread (M) calls `Thread#join` * M: calls `sleep_forever()` * M: set `th->status = THREAD_STOPPED_FOREVER` * M: do `checkints` * M: handle a trap handler with `th->status = THREAD_RUNNABLE` * M: thread switch at the end of the trap handler * Another thread (T) will process `Thread#kill` by M. * T: `rb_threadptr_join_list_wakeup()` at the end of T tris to wakeup M, but M's state is runnable because M is handling trap handler and just ignore the waking up and terminate T$a * T: switch to M. * M: after the trap handler, reset `th->status = THREAD_STOPPED_FOREVER` and check deadlock -> Deadlock because only M is living. To avoid such situation, add new sleep flags `SLEEP_ALLOW_SPURIOUS` and `SLEEP_NO_CHECKINTS` to skip any check ints. BTW this is instentional to leave second `vm_check_ints_blocking()` without checking `SLEEP_NO_CHECKINTS` because `SLEEP_ALLOW_SPURIOUS` should be specified with `SLEEP_NO_CHECKINTS` and skipping this checkints can skip any interrupts.	2023-04-04 07:57:51 +09:00
Koichi Sasada	9720f5ac89	use `sleep_forever()` on `thread_join_sleep()` because it does same thing.	2023-04-01 09:48:37 +09:00
Koichi Sasada	1d19776c7f	cosmetic change reorder `sleep_forever()` and so on.	2023-03-31 19:26:47 +09:00
Koichi Sasada	f803bcfc87	pass `th` to `thread_sched_to_waiting()` for future extension	2023-03-31 18:50:10 +09:00
Koichi Sasada	4c0f82eb5b	remove "\n" for `RUBY_DEBUG_LOG()` because `RUBY_DEBUG_LOG()` add "\n" at the end of message.	2023-03-31 18:15:04 +09:00
Koichi Sasada	30b43f4f1a	`rb_ractor_thread_list()` only for current ractor so that no need to lock the ractor.	2023-03-30 14:56:37 +09:00
Koichi Sasada	ba72849a3f	cosmetic change	2023-03-30 14:56:10 +09:00
Matt Valentine-House	60b8c7d9fd	Rename RB_GC_SAVE_MACHINE_CONTEXT -> RB_VM_SAVE_MACHINE_CONTEXT	2023-03-15 21:26:26 +00:00
Samuel Williams	7fd53eeb46	Remove SIGCHLD `waidpid`. (#7527 ) * Remove `waitpid_lock` and related code. * Remove un-necessary test. * Remove `rb_thread_sleep_interruptible` dead code.	2023-03-15 19:48:27 +13:00
Samuel Williams	ac65ce16e9	Revert SIGCHLD changes to diagnose CI failures. (#7517 ) * Revert "Remove special handling of `SIGCHLD`. (#7482)" This reverts commit `44a0711eab`. * Revert "Remove prototypes for functions that are no longer used. (#7497)" This reverts commit `4dce12bead`. * Revert "Remove SIGCHLD `waidpid`. (#7476)" This reverts commit `1658e7d966`. * Fix change to rjit variable name.	2023-03-14 20:07:59 +13:00
Samuel Williams	1658e7d966	Remove SIGCHLD `waidpid`. (#7476 ) * Remove `waitpid_lock` and related code. * Remove un-necessary test. * Remove `rb_thread_sleep_interruptible` dead code.	2023-03-09 16:05:47 +13:00
Takashi Kokubun	9ad19069f9	Remove obsoleted functions in rjit.c	2023-03-07 23:59:50 -08:00
Takashi Kokubun	b67f07fa2c	Get rid of MJIT's special fork	2023-03-07 23:08:57 -08:00
Takashi Kokubun	23ec248e48	s/mjit/rjit/	2023-03-06 23:44:01 -08:00
Takashi Kokubun	2e875549a9	s/MJIT/RJIT/	2023-03-06 23:44:01 -08:00
Samuel Williams	2c4b2053ca	Correctly clean up `keeping_mutexes` before resuming any other threads. (#7460 ) It's possible (but very rare) to have a race condition between setting `mutex->fiber = NULL` and `thread_mutex_remove(th, mutex)` which results in the following bug: ``` [BUG] invalid keeping_mutexes: Attempt to unlock a mutex which is not locked ``` Fixes <https://bugs.ruby-lang.org/issues/19480>.	2023-03-07 20:23:00 +13:00
Takashi Kokubun	233ddfac54	Stop exporting symbols for MJIT	2023-03-06 21:59:23 -08:00
Jean Boussier	704dd25812	TestThreadInstrumentation: emit the EXIT event sooner ``` 1) Failure: TestThreadInstrumentation#test_thread_instrumentation [/tmp/ruby/src/trunk-repeat20-asserts/test/-ext-/thread/test_instrumentation_api.rb:33]: Call counters[4]: [3, 4, 4, 4, 0]. Expected 0 to be > 0. ``` We fire the EXIT hook after the call to `thread_sched_to_dead` which mean another thread might be running before the `EXIT` hook have been executed.	2023-03-06 13:10:42 +01:00
Matt Valentine-House	72aba64fff	Merge gc.h and internal/gc.h [Feature #19425]	2023-02-09 10:32:29 -05:00
Jean byroot Boussier	8ce2fb9bbb	Only emit circular dependency warning for owned thread shields [Bug #19415] If multiple threads attemps to load the same file concurrently it's not a circular dependency issue. So we check that the existing ThreadShield is owner by the current fiber before warning about circular dependencies.	2023-02-08 09:50:00 +01:00
Jean byroot Boussier	c19defd026	Revert "Only emit circular dependency warning for owned thread shields" This reverts commit `fa49651e05`.	2023-02-06 23:30:35 +01:00
Jean Boussier	fa49651e05	Only emit circular dependency warning for owned thread shields [Bug #19415] If multiple threads attemps to load the same file concurrently it's not a circular dependency issue. So we check that the existing ThreadShield is owner by the current fiber before warning about circular dependencies.	2023-02-06 19:35:38 +01:00
Matt Valentine-House	0457e43423	Remove unused struct member thgroup->group	2023-01-20 11:01:03 -05:00
Samuel Williams	0436f1e15a	Introduce `Fiber#storage` for inheritable fiber-scoped variables. (#6612 )	2022-12-01 23:00:33 +13:00
S-H-GAMELINKS	1f4f6c9832	Using UNDEF_P macro	2022-11-16 18:58:33 +09:00
Yusuke Endoh	c3de7a3c58	Make pending_interrupt?(Exception) work A patch from katsu (Katsuhiro Ueno) [Bug #19110]	2022-11-09 12:25:35 +09:00
Samuel Williams	7f175e5648	Avoid missed wakeup with fiber scheduler and Fiber.blocking. (#6588 ) * Ensure that blocked fibers don't prevent valid wakeups.	2022-10-20 13:38:52 +13:00
Samuel Williams	e4f91bbdba	Add IO#timeout attribute and use it for blocking IO operations. (#5653 )	2022-10-07 21:48:38 +13:00
Takashi Kokubun	aa8a3b2358	MJIT: Do not hang after forking with threads First, rb_mjit_fork should call rb_thread_atfork to stop threads after fork in the child process. Unfortunately, we cannot use rb_fork_ruby to prevent this kind of mistakes because MJIT needs special handling of waiting_pid and mjit_pause/resume. Second, mjit_waitpid_finished should be checked regardless of trap_interrupt. It doesn't seem like the flag is not set when SIGCHLD is handled for an MJIT child process.	2022-09-11 09:09:05 +09:00
Nobuyoshi Nakada	51022b8e06	Exit status macros need sys/wait.h on FreeBSD	2022-09-07 00:04:28 +09:00
Takashi Kokubun	f6925fab85	Do not fork the process on --mjit-wait fork is for parallel compilation, but --mjit-wait cancels it. It's more useful to not fork it for binding.irb, debugging, etc.	2022-09-06 15:42:02 +09:00
Nobuyoshi Nakada	27173e3735	Allow `RUBY_DEBUG_LOG` format to be empty GCC warns of empty format strings, perhaps because they have no effects in printf() and there are better ways than sprintf(). However, ruby_debug_log() adds informations other than the format, this warning is not the case.	2022-08-06 10:52:00 +09:00
Jean Boussier	e3aabe93aa	Implement Queue#pop(timeout: sec) [Feature #18774] As well as `SizedQueue#pop(timeout: sec)` If both `non_block=true` and `timeout:` are supplied, ArgumentError is raised.	2022-08-02 11:04:28 +02:00
Peter Zhu	efb91ff19b	Rename rb_ary_tmp_new to rb_ary_hidden_new rb_ary_tmp_new suggests that the array is temporary in some way, but that's not true, it just creates an array that's hidden and not on the transient heap. This commit renames it to rb_ary_hidden_new.	2022-07-26 09:12:09 -04:00
Takashi Kokubun	5b21e94beb	Expand tabs [ci skip] [Misc #18891]	2022-07-21 09:42:04 -07:00
Jean Boussier	664c23db79	GVL Instrumentation: remove the EXITED count assertion It's very flaky for some unknown reason. Something we have an extra EXITED event. I suspect some other test is causing this.	2022-07-13 19:39:31 +02:00
Aaron Patterson	de51bbcb54	Use VM Lock when mutating waiting threads list `rb_thread_wait_for_single_fd` needs to mutate the `waiting_fds` list that is stored on the VM. We need to delete the FD from the list before returning, and deleting from the list requires a VM lock (because the list is a global). [Bug #18816] [ruby-core:108771] Co-Authored-By: Alan Wu <alanwu@ruby-lang.org>	2022-07-12 16:07:41 -07:00
Jean Boussier	65ae2bb2e0	Thread#value: handle threads killed by a fork [Bug #18902] When a thread is killed because we forked, the `value` if left to `Qundef`. Returning it woudl crash the VM.	2022-07-07 19:49:25 +02:00
Jean Boussier	b6c1e1158d	GVL Instrumentation API: add STARTED and EXITED events [Feature #18339] After experimenting with the initial version of the API I figured there is a need for an exit event to cleanup instrumentation data. e.g. if you record data in a {thread_id -> data} table, you need to free associated data when a thread goes away.	2022-06-17 09:08:26 +02:00
John Hawthorn	17d260a87f	Restore rb_exec_recursive_outer This was a public method, so we should probably keep it.	2022-06-15 16:07:29 -07:00
Takashi Kokubun	1162523bae	Remove MJIT worker thread (#6006 ) [Misc #18830]	2022-06-15 09:40:54 -07:00
John Hawthorn	52da90acee	Make method id explicit in rb_exec_recursive_outer Previously, because opt_aref and opt_aset don't push a frame, when they would call rb_hash to determine the hash value of the key, the initial level of recursion would incorrectly use the method id at the top of the stack instead of "hash". This commit replaces rb_exec_recursive_outer with rb_exec_recursive_outer_mid, which takes an explicit method id, so that we can make the hash calculation behave consistently. rb_exec_recursive_outer was documented as being internal, so I believe this should be okay to change.	2022-06-10 14:48:21 -07:00
John Hawthorn	f4747958e5	Add mid argument to exec_recursive	2022-06-10 14:48:21 -07:00

1 2 3 4 5 ...

1092 Коммитов