github/ruby - ruby

Граф коммитов

Автор	SHA1	Сообщение	Дата
Peter Zhu	10574857ce	Fix memory leak in Regexp capture group when timeout [Bug #20650] The capture group allocates memory that is leaked when it times out. For example: re = Regexp.new("^#{"(a)" 10_000}x$", timeout: 0.000001) str = "a" * 1000000 + "x" 10.times do 100.times do re =~ str rescue Regexp::TimeoutError end puts `ps -o rss= -p #{$$}` end Before: 34688 56416 78288 100368 120784 140704 161904 183568 204320 224800 After: 16288 16288 16880 16896 16912 16928 16944 17184 17184 17200	2024-07-25 09:23:49 -04:00
Alan Wu	8cf708d7b4	Make rb_check_frozen_inline() static inline again Since `730e3b2ce0` ("Stop exposing `rb_str_chilled_p`"), we noticed a speed loss on a few benchmarks that are string operations heavy. This is partially due to routines no longer having the options to inline rb_check_frozen_inline() in non-LTO builds. Make it an inlining candidate again to recover speed. Testing this patch on my machine, the fannkuchredux benchmark gets a 1.15 speed-up with YJIT and 1.03 without YJIT.	2024-07-19 17:47:12 -04:00
Alan Wu	99825a539f	[DOC] Note that rb_obj_freeze_inline() can raise NoMemoryError And move it back to a public header because Doxygen might not be scanning the .c files. [Feature #18776]	2024-07-17 10:25:20 -04:00
Alan Wu	cd428b490d	[DOC] No more is rb_ary_freeze() an alias of rb_obj_freeze() [Feature #20589]	2024-07-17 10:25:20 -04:00
卜部昌平	fa6bf1da57	give up USE_GC_MALLOC_OBJ_INFO_DETAILS This feature is no longer possible under current design; now that our GC is pluggable, we cannot assume what was achieved by this compiler flag is always possble by the dynamically-loaded GC implementation.	2024-07-12 10:21:07 +09:00
Yusuke Endoh	114e32b357	Add rb_block_call2, a flexible variant of rb_block_call This function accepts flags: RB_NO_KEYWORDS, RB_PASS_KEYWORDS, RB_PASS_CALLED_KEYWORDS: Works as the same as rb_block_call_kw. RB_BLOCK_NO_USE_PACKED_ARGS: The given block ("bl_proc") does not use "yielded_arg" of rb_block_call_func_t. Instead, the block accesses the yielded arguments via "argc" and "argv". This flag allows the called method to yield arguments without allocating an Array.	2024-07-10 13:00:47 +09:00
Nobuyoshi Nakada	249a1fb0eb	Show more in `RBIMPL_ASSERT_TYPE`	2024-06-21 09:25:00 +09:00
Nobuyoshi Nakada	b834c62efd	Delegate from `RBIMPL_ASSERT_OR_ASSUME` to `RUBY_ASSERT_ALWAYS` Get rid of expansion of the argument which often contains complicated macros, and simplify the failure message.	2024-06-21 09:24:59 +09:00
Alan Wu	2699e230e4	Crash instead of raising with Check_Type() in RBIMPL_ASSERT_TYPE() in debug builds Previously, RBIMPL_ASSERT_TYPE() used Check_Type() only in RUBY_DEBUG builds. It raised TypeError, but only in debug builds. For people testing type mismatch using debug builds looking for a Ruby exception, this can be misleading -- the code could be missing a type check in non-debug builds if it is relying on for example RSTRING_LEN() to raise. Also, Check_Type() can obscure the true cause of error in debug mode. When type check fails because the object is corrupt, instead of crashing with a clear type assertion message, it can crash while trying to construct an exception object to raise. You can see this for example in <https://github.com/ruby/ruby/actions/runs/9489999591/job/26152506434?pr=10985>, where RB_ENCODING_GET() is used on a corrupt object, but the crash happens later and says "Assertion Failed: ../src/vm_method.c:1477:callable_method_entry_or_negative". RBIMPL_ASSERT_TYPE() should assert right away. RBIMPL_ASSERT_OR_ASSUME() asserts when RUBY_DEBUG and assumes in release builds, as desired. This should help investigate flaky CI failures that show up as TypeError from `Kernel#require`, e.g. "'Kernel#require': wrong argument type false (expected String) (TypeError)". Same CI failure examples: - https://github.com/ruby/ruby/actions/runs/9034787861/job/24828147431 - https://github.com/ruby/ruby/actions/runs/9418303667/job/25945492440 - https://github.com/ruby/ruby/actions/runs/9505650952/job/26201031314 The failure occurs with and without use of YJIT.	2024-06-13 21:43:41 +00:00
Nobuyoshi Nakada	0396050f5a	Cast `RUBY_ATOMIC_PTR_CAS` arguments As well as `RUBY_ATOMIC_PTR_EXCHANGE` and `RUBY_ATOMIC_PTR_LOAD`.	2024-06-07 10:42:41 +09:00
Jean Boussier	fbb61a26e7	Mark old Data API as deprecated [Feature #19998]	2024-06-06 11:44:27 +02:00
Jean Boussier	730e3b2ce0	Stop exposing `rb_str_chilled_p` [Feature #20205] Now that chilled strings no longer appear as frozen, there is no need to offer an API to check for chilled strings. We however need to change `rb_check_frozen_internal` to no longer be a macro, as it needs to check for chilled strings.	2024-06-02 13:53:35 +02:00
Jean Boussier	ceeb9957c3	Make value_type.h compatible with -Wconversion [Feature #20507] This was missed from the initial commit. ``` ../../.././include/ruby/internal/value_type.h:446:27: error: implicit conversion changes signedness: 'enum ruby_value_type' to 'int' [-Werror,-Wsign-conversion] rb_unexpected_type(v, t); ~~~~~~~~~~~~~~~~~~ ^ ```	2024-05-28 08:43:43 +02:00
Mike Dalessio	1b8ba1551b	Allow compilation of C extensions with `-Wconversion` C extension maintainers can now compile with this warning option and the Ruby header files will generate no warnings. [Feature #20507]	2024-05-28 07:33:07 +02:00
Étienne Barrié	1376881e9a	Stop marking chilled strings as frozen They were initially made frozen to avoid false positives for cases such as: str = str.dup if str.frozen? But this may cause bugs and is generally confusing for users. [Feature #20205] Co-authored-by: Jean Boussier <byroot@ruby-lang.org>	2024-05-28 07:32:33 +02:00
卜部昌平	b7bd55cdc7	suppress -Wold-style-cast warnings	2024-04-29 03:09:15 +02:00
卜部昌平	17a0e2ac04	workaround C++ compile error We observe compiler error on FreeBSD. Their stdckdint.h does not understand C++. This shall be addressed on their side. Unti then we resport to our own version. https://rubyci.s3.amazonaws.com/freebsd14/ruby-master/log/20240427T143002Z.log.html.gz	2024-04-29 03:03:50 +02:00
卜部昌平	bb5a538207	use of stdckdint.h C23 is going to have this header. The industry is already moving towards accepting it; OSes and compilers started to implement theirs. Why not detect its presence and if any, prefer over other ways. See also: - https://www.open-std.org/jtc1/sc22/wg14/www/docs/n2683.pdf - https://reviews.freebsd.org/D41734 - https://reviews.llvm.org/D157331 - https://gcc.gnu.org/git/?p=gcc.git;a=commit;h=8441841a1b985d68245954af1ff023db121b0635	2024-04-27 21:55:28 +09:00
Nobuyoshi Nakada	7f87ad9fc4	Refer autoconfigured endian macro (#10572 ) Remove the case `RB_IO_BUFFER_HOST_ENDIAN` is not defined.	2024-04-19 15:58:53 +12:00
Peter Zhu	ea7975c59b	Include coderange.h in encoding.h ruby_coderange_type is defined in ruby/internal/encoding/coderange.h so we need to include it.	2024-04-18 14:50:20 -04:00
Jean Boussier	7380e3d30f	RB_OBJ_FREEZE_RAW: Set the object shape	2024-04-16 17:20:35 +02:00
Koichi Sasada	5d9fd674c9	put empty `rb_gc_force_recycle()` and declare it will be removed soon. ddtrace is still referes the API and build was failed. See https://github.com/DataDog/dd-trace-rb/pull/3578 Maybe threre are only few users of this C-API now so we can remove it soon.	2024-04-11 12:00:33 +09:00
Nobuyoshi Nakada	4dd9e5cf74	Add builtin type assertion	2024-04-08 11:13:29 +09:00
Peter Zhu	dbe8886f4d	Remove deprecated function rb_gc_force_recycle This function has been deprecated since Ruby 3.1, so we should remove it for Ruby 3.4.	2024-04-05 11:39:54 -04:00
crazeteam	b2b665eba5	[DOC] remove repetitive words in comments Signed-off-by: crazeteam <lilujing@outlook.com>	2024-03-27 07:52:18 +09:00
Étienne Barrié	2b08406cd0	Expose rb_str_chilled_p Some extensions (like stringio) may need to differentiate between chilled strings and frozen strings. They can now use rb_str_chilled_p but must check for its presence since the function will be removed when chilled strings are removed. [Bug #20389] [Feature #20205] Co-authored-by: Jean Boussier <byroot@ruby-lang.org>	2024-03-26 12:54:54 +01:00
Xavier Noria	401251979b	[DOC] Small edits in rbasic.h	2024-03-23 08:19:30 +00:00
Samuel Williams	b4d73e9f80	Revert "Hide public implementation of `rb_io`. (#9568 )" (#10283 ) This reverts commit `9ab1fa3bf5`.	2024-03-22 14:56:02 +13:00
Étienne Barrié	12be40ae6b	Implement chilled strings [Feature #20205] As a path toward enabling frozen string literals by default in the future, this commit introduce "chilled strings". From a user perspective chilled strings pretend to be frozen, but on the first attempt to mutate them, they lose their frozen status and emit a warning rather than to raise a `FrozenError`. Implementation wise, `rb_compile_option_struct.frozen_string_literal` is no longer a boolean but a tri-state of `enabled/disabled/unset`. When code is compiled with frozen string literals neither explictly enabled or disabled, string literals are compiled with a new `putchilledstring` instruction. This instruction is identical to `putstring` except it marks the String with the `STR_CHILLED (FL_USER3)` and `FL_FREEZE` flags. Chilled strings have the `FL_FREEZE` flag as to minimize the need to check for chilled strings across the codebase, and to improve compatibility with C extensions. Notes: - `String#freeze`: clears the chilled flag. - `String#-@`: acts as if the string was mutable. - `String#+@`: acts as if the string was mutable. - `String#clone`: copies the chilled flag. Co-authored-by: Jean Boussier <byroot@ruby-lang.org>	2024-03-19 09:26:49 +01:00
Peter Zhu	ff51dc5654	[Feature #20265 ] Remove rb_newobj_of and RB_NEWOBJ_OF	2024-03-14 12:53:04 -04:00
Peter Zhu	8e1831406f	[Feature #20265 ] Remove rb_newobj and RB_NEWOBJ	2024-03-14 12:53:04 -04:00
Peter Zhu	83618f2cfa	[Feature #20306 ] Implement ruby_free_at_exit_p ruby_free_at_exit_p is a way for extensions to determine whether they should free all memory at shutdown.	2024-03-14 08:33:30 -04:00
Jean Boussier	b4a69351ec	Move FL_SINGLETON to FL_USER1 This frees FL_USER0 on both T_MODULE and T_CLASS. Note: prior to this, FL_SINGLETON was never set on T_MODULE, so checking for `FL_SINGLETON` without first checking that `FL_TYPE` was `T_CLASS` was valid. That's no longer the case.	2024-03-06 13:11:41 -05:00
Samuel Williams	9ab1fa3bf5	Hide public implementation of `rb_io`. (#9568 ) Remove `struct rb_io {...}`.	2024-03-06 19:47:38 +13:00
cui fliter	226a889dc7	[DOC] fix some comments Signed-off-by: cui fliter <imcusg@gmail.com>	2024-03-05 18:50:47 +09:00
Jean Boussier	c09e5ad17d	Clarify C API documentation about pinned classes They are not only pinned, but also immortal. Even if the constant referencing them is removed, they will remain alive. It's a precision worth noting.	2024-03-01 08:24:16 +01:00
Nobuyoshi Nakada	d4e24021d3	Revise `9ec342e07d`	2024-02-26 13:12:05 +09:00
Nobuyoshi Nakada	a0f7de814a	[Bug #20296 ] Fix the default assertion message	2024-02-26 12:29:23 +09:00
Misaki Shioi	9ec342e07d	Introduction of Happy Eyeballs Version 2 (RFC8305) in Socket.tcp (#9374 ) * Introduction of Happy Eyeballs Version 2 (RFC8305) in Socket.tcp This is an implementation of Happy Eyeballs version 2 (RFC 8305) in Socket.tcp. [Background] Currently, `Socket.tcp` synchronously resolves names and makes connection attempts with `Addrinfo::foreach.` This implementation has the following two problems. 1. In name resolution, the program stops until the DNS server responds to all DNS queries. 2. In a connection attempt, while an IP address is trying to connect to the destination host and is taking time, the program stops, and other resolved IP addresses cannot try to connect. [Proposal] "Happy Eyeballs" ([RFC 8305](https://datatracker.ietf.org/doc/html/rfc8305)) is an algorithm to solve this kind of problem. It avoids delays to the user whenever possible and also uses IPv6 preferentially. I implemented it into `Socket.tcp` by using `Addrinfo.getaddrinfo` in each thread spawned per address family to resolve the hostname asynchronously, and using `Socket::connect_nonblock` to try to connect with multiple addrinfo in parallel. [Outcome] This change eliminates a fatal defect in the following cases. Case 1. One of the A or AAAA DNS queries does not return --- require 'socket' class Addrinfo class << self # Current Socket.tcp depends on foreach def foreach(nodename, service, family=nil, socktype=nil, protocol=nil, flags=nil, timeout: nil, &block) getaddrinfo(nodename, service, Socket::AF_INET6, socktype, protocol, flags, timeout: timeout) .concat(getaddrinfo(nodename, service, Socket::AF_INET, socktype, protocol, flags, timeout: timeout)) .each(&block) end def getaddrinfo(_, _, family, _) case family when Socket::AF_INET6 then sleep when Socket::AF_INET then [Addrinfo.tcp("127.0.0.1", 4567)] end end end end Socket.tcp("localhost", 4567) --- Because the current `Socket.tcp` cannot resolve IPv6 names, the program stops in this case. It cannot start to connect with IPv4 address. Though `Socket.tcp` with HEv2 can promptly start a connection attempt with IPv4 address in this case. Case 2. Server does not promptly return ack for syn of either IPv4 / IPv6 address family --- require 'socket' fork do socket = Socket.new(Socket::AF_INET6, :STREAM) socket.setsockopt(:SOCKET, :REUSEADDR, true) socket.bind(Socket.pack_sockaddr_in(4567, '::1')) sleep socket.listen(1) connection, _ = socket.accept connection.close socket.close end fork do socket = Socket.new(Socket::AF_INET, :STREAM) socket.setsockopt(:SOCKET, :REUSEADDR, true) socket.bind(Socket.pack_sockaddr_in(4567, '127.0.0.1')) socket.listen(1) connection, _ = socket.accept connection.close socket.close end Socket.tcp("localhost", 4567) --- The current `Socket.tcp` tries to connect serially, so when its first name resolves an IPv6 address and initiates a connection to an IPv6 server, this server does not return an ACK, and the program stops. Though `Socket.tcp` with HEv2 starts to connect sequentially and in parallel so a connection can be established promptly at the socket that attempted to connect to the IPv4 server. In exchange, the performance of `Socket.tcp` with HEv2 will be degraded. --- 100.times { Socket.tcp("www.ruby-lang.org", 80) } --- This is due to the addition of the creation of IO objects, Thread objects, etc., and calls to `IO::select` in the implementation. Avoid NameError of Socket::EAI_ADDRFAMILY in MinGW * Support Windows with SO_CONNECT_TIME * Improve performance I have additionally implemented the following patterns: - If the host is single-stack, name resolution is performed in the main thread. This reduces the cost of creating threads. - If an IP address is specified, name resolution is performed in the main thread. This also reduces the cost of creating threads. - If only one IP address is resolved, connect is executed in blocking mode. This reduces the cost of calling IO::select. Also, I have added a fast_fallback option for users who wish not to use HE. Here are the results of each performance test. ```ruby require 'socket' require 'benchmark' HOSTNAME = "www.ruby-lang.org" PORT = 80 ai = Addrinfo.tcp(HOSTNAME, PORT) Benchmark.bmbm do \|x\| x.report("Domain name") do 30.times { Socket.tcp(HOSTNAME, PORT).close } end x.report("IP Address") do 30.times { Socket.tcp(ai.ip_address, PORT).close } end x.report("fast_fallback: false") do 30.times { Socket.tcp(HOSTNAME, PORT, fast_fallback: false).close } end end ``` ``` user system total real Domain name 0.015567 0.032511 0.048078 ( 0.325284) IP Address 0.004458 0.014219 0.018677 ( 0.284361) fast_fallback: false 0.005869 0.021511 0.027380 ( 0.321891) ```` And this is the measurement result when executed in a single stack environment. ``` user system total real Domain name 0.007062 0.019276 0.026338 ( 1.905775) IP Address 0.004527 0.012176 0.016703 ( 3.051192) fast_fallback: false 0.005546 0.019426 0.024972 ( 1.775798) ``` The following is the result of the run on Ruby 3.3.0. (on Dual stack environment) ``` user system total real Ruby 3.3.0 0.007271 0.027410 0.034681 ( 0.472510) ``` (on Single stack environment) ``` user system total real Ruby 3.3.0 0.005353 0.018898 0.024251 ( 1.774535) ``` * Do not cache `Socket.ip_address_list` As mentioned in the comment at https://github.com/ruby/ruby/pull/9374#discussion_r1482269186, caching Socket.ip_address_list does not follow changes in network configuration. But if we stop caching, it becomes necessary to check every time `Socket.tcp` is called whether it's a single stack or not, which could further degrade performance in the case of a dual stack. From this, I've changed the approach so that when a domain name is passed, it doesn't check whether it's a single stack or not and resolves names in parallel each time. The performance measurement results are as follows. require 'socket' require 'benchmark' HOSTNAME = "www.ruby-lang.org" PORT = 80 ai = Addrinfo.tcp(HOSTNAME, PORT) Benchmark.bmbm do \|x\| x.report("Domain name") do 30.times { Socket.tcp(HOSTNAME, PORT).close } end x.report("IP Address") do 30.times { Socket.tcp(ai.ip_address, PORT).close } end x.report("fast_fallback: false") do 30.times { Socket.tcp(HOSTNAME, PORT, fast_fallback: false).close } end end user system total real Domain name 0.004085 0.011873 0.015958 ( 0.330097) IP Address 0.000993 0.004400 0.005393 ( 0.257286) fast_fallback: false 0.001348 0.008266 0.009614 ( 0.298626) * Wait forever if fallback addresses are unresolved, unless resolv_timeout Changed from waiting only 3 seconds for name resolution when there is no fallback address available, to waiting as long as there is no resolv_timeout. This is in accordance with the current `Socket.tcp` specification. * Use exact pattern to match IPv6 address format for specify address family	2024-02-26 12:14:11 +09:00
Koichi Sasada	d578684989	`rb_thread_lock_native_thread()` Introduce `rb_thread_lock_native_thread()` to allocate dedicated native thread to the current Ruby thread for M:N threads. This C API is similar to Go's `runtime.LockOSThread()`. Accepted at https://github.com/ruby/dev-meeting-log/blob/master/2023/DevMeeting-2023-08-24.md (and missed to implement on Ruby 3.3.0)	2024-02-21 15:38:29 +09:00
Nobuyoshi Nakada	c77f736bc1	Win32: Fix pre-defined macros for platforms Use `_WIN64` for word-size, `_M_AMD64` for CPU-specific feature.	2024-02-11 19:43:06 +09:00
Nobuyoshi Nakada	34581410f2	Extract `RBIMPL_VA_OPT_ARGS` Similar to splat argument in Ruby, which be expanded to `__VA_ARGS__` with a leading comma if any arguments given, otherwise empty.	2024-02-08 18:08:42 +09:00
Nobuyoshi Nakada	d31a12a210	Optional detail info at assertion failure	2024-02-08 18:08:41 +09:00
Nobuyoshi Nakada	7b3e05c392	Do not define ABI version in statically linked objects It is for dynamically loading, useless for statically linked objects.	2024-02-04 20:33:45 +09:00
Nobuyoshi Nakada	8531ac3115	Suppress unused-local-typedef warnings	2024-02-01 21:17:37 +09:00
KJ Tsanaktsidis	807714447e	Pass down "stack start" variables from closer to the top of the stack This commit changes how stack extents are calculated for both the main thread and other threads. Ruby uses the address of a local variable as part of the calculation for machine stack extents: * pthreads uses it as a lower-bound on the start of the stack, because glibc (and maybe other libcs) can store its own data on the stack before calling into user code on thread creation. * win32 uses it as an argument to VirtualQuery, which gets the extent of the memory mapping which contains the variable However, the local being used for this is actually too low (too close to the leaf function call) in both the main thread case and the new thread case. In the main thread case, we have the `INIT_STACK` macro, which is used for pthreads to set the `native_main_thread->stack_start` value. This value is correctly captured at the very top level of the program (in main.c). However, this is _not_ what's used to set the execution context machine stack (`th->ec->machine_stack.stack_start`); that gets set as part of a call to `ruby_thread_init_stack` in `Init_BareVM`, using the address of a local variable allocated _inside_ `Init_BareVM`. This is too low; we need to use a local allocated closer to the top of the program. In the new thread case, the lolcal is allocated inside `native_thread_init_stack`, which is, again, too low. In both cases, this means that we might have VALUEs lying outside the bounds of `th->ec->machine.stack_{start,end}`, which won't be marked correctly by the GC machinery. To fix this, * In the main thread case: We already have `INIT_STACK` at the right level, so just pass that local var to `ruby_thread_init_stack`. * In the new thread case: Allocate the local one level above the call to `native_thread_init_stack` in `call_thread_start_func2`. [Bug #20001] fix	2024-01-19 09:55:12 +11:00
KJ Tsanaktsidis	396e94666b	Revert "Pass down "stack start" variables from closer to the top of the stack" This reverts commit `4ba8f0dc99`.	2024-01-12 17:58:54 +11:00
KJ Tsanaktsidis	4ba8f0dc99	Pass down "stack start" variables from closer to the top of the stack The implementation of `native_thread_init_stack` for the various threading models can use the address of a local variable as part of the calculation of the machine stack extents: * pthreads uses it as a lower-bound on the start of the stack, because glibc (and maybe other libcs) can store its own data on the stack before calling into user code on thread creation. * win32 uses it as an argument to VirtualQuery, which gets the extent of the memory mapping which contains the variable However, the local being used for this is actually allocated _inside_ the `native_thread_init_stack` frame; that means the caller might allocate a VALUE on the stack that actually lies outside the bounds stored in machine.stack_{start,end}. A local variable from one level above the topmost frame that stores VALUEs on the stack must be drilled down into the call to `native_thread_init_stack` to be used in the calculation. This probably doesn't _really_ matter for the win32 case (they'll be in the same memory mapping so VirtualQuery should return the same thing), but definitely could matter for the pthreads case. [Bug #20001]	2024-01-12 17:29:48 +11:00
Peter Zhu	824ff48adc	Move internal ST functions to internal/st.h st_replace and st_init_existing_table_with_size are functions used internally in Ruby and should not be publicly visible.	2023-12-25 10:41:12 -05:00
Yukihiro "Matz" Matsumoto	98eeadc932	Development of 3.4.0 started.	2023-12-25 18:13:40 +09:00

1 2 3 4 5 ...

2302 Коммитов