github/ruby - ruby

Граф коммитов

Автор	SHA1	Сообщение	Дата
Kenta Murata	b4e784434c	Optimize Array#min (#3324 ) The benchmark result is below: \| \|compare-ruby\|built-ruby\| \|:---------------\|-----------:\|---------:\| \|ary2.min \| 39.105M\| 39.442M\| \| \| -\| 1.01x\| \|ary10.min \| 23.995M\| 30.762M\| \| \| -\| 1.28x\| \|ary100.min \| 6.249M\| 10.783M\| \| \| -\| 1.73x\| \|ary500.min \| 1.408M\| 2.714M\| \| \| -\| 1.93x\| \|ary1000.min \| 828.397k\| 1.465M\| \| \| -\| 1.77x\| \|ary2000.min \| 332.256k\| 570.504k\| \| \| -\| 1.72x\| \|ary3000.min \| 338.079k\| 573.868k\| \| \| -\| 1.70x\| \|ary5000.min \| 168.217k\| 286.114k\| \| \| -\| 1.70x\| \|ary10000.min \| 85.512k\| 143.551k\| \| \| -\| 1.68x\| \|ary20000.min \| 43.264k\| 71.935k\| \| \| -\| 1.66x\| \|ary50000.min \| 17.317k\| 29.107k\| \| \| -\| 1.68x\| \|ary100000.min \| 9.072k\| 14.540k\| \| \| -\| 1.60x\| \|ary1000000.min \| 872.930\| 1.436k\| \| \| -\| 1.64x\| compare-ruby is `9f4b7fc82e`.	2020-07-18 23:45:25 +09:00
Kenta Murata	a63f520971	Optimize Array#max (#3325 ) The benchmark result is below: \| \|compare-ruby\|built-ruby\| \|:---------------\|-----------:\|---------:\| \|ary2.max \| 38.837M\| 40.830M\| \| \| -\| 1.05x\| \|ary10.max \| 23.035M\| 32.626M\| \| \| -\| 1.42x\| \|ary100.max \| 5.490M\| 11.020M\| \| \| -\| 2.01x\| \|ary500.max \| 1.324M\| 2.679M\| \| \| -\| 2.02x\| \|ary1000.max \| 699.167k\| 1.403M\| \| \| -\| 2.01x\| \|ary2000.max \| 284.321k\| 570.446k\| \| \| -\| 2.01x\| \|ary3000.max \| 282.613k\| 571.683k\| \| \| -\| 2.02x\| \|ary5000.max \| 145.120k\| 285.546k\| \| \| -\| 1.97x\| \|ary10000.max \| 72.102k\| 142.831k\| \| \| -\| 1.98x\| \|ary20000.max \| 36.065k\| 72.077k\| \| \| -\| 2.00x\| \|ary50000.max \| 14.343k\| 29.139k\| \| \| -\| 2.03x\| \|ary100000.max \| 7.586k\| 14.472k\| \| \| -\| 1.91x\| \|ary1000000.max \| 726.915\| 1.495k\| \| \| -\| 2.06x\|	2020-07-18 23:45:00 +09:00
Takashi Kokubun	167d139487	Inline builtin struct aref We don't do this for aset because it might raise a FrozenError. ``` $ benchmark-driver -v --rbenv 'before;after;before --jit;after --jit' benchmark/mjit_struct_aref.yml --repeat-count=4 before: ruby 2.8.0dev (2020-07-06T01:47:11Z master `d94ef7c6b6`) [x86_64-linux] after: ruby 2.8.0dev (2020-07-06T07:11:51Z master 85425168f4) [x86_64-linux] last_commit=Inline builtin struct aref before --jit: ruby 2.8.0dev (2020-07-06T01:47:11Z master `d94ef7c6b6`) +JIT [x86_64-linux] after --jit: ruby 2.8.0dev (2020-07-06T07:11:51Z master 85425168f4) +JIT [x86_64-linux] last_commit=Inline builtin struct aref Calculating ------------------------------------- before after before --jit after --jit mjit_struct_aref(struct) 34.783M 34.810M 48.321M 58.378M i/s - 40.000M times in 1.149996s 1.149097s 0.827794s 0.685192s Comparison: mjit_struct_aref(struct) after --jit: 58377836.7 i/s before --jit: 48321205.7 i/s - 1.21x slower after: 34809935.5 i/s - 1.68x slower before: 34782736.5 i/s - 1.68x slower ```	2020-07-06 00:14:00 -07:00
Takashi Kokubun	24fa37d87a	Make Kernel#then, #yield_self, #frozen? builtin (#3283 ) * Make Kernel#then, #yield_self, #frozen? builtin * Fix test_jit	2020-07-03 18:02:43 -07:00
Takashi Kokubun	f3a0d7a203	Rewrite Kernel#tap with Ruby (#3281 ) * Rewrite Kernel#tap with Ruby This was good for VM too, but of course my intention is to unblock JIT's inlining of a block over yield (inlining invokeyield has not been committed though). * Fix test_settracefunc About the :tap deletions, the :tap events are actually traced (we already have a TracePoint test for builtin methods), but it's filtered out by tp.path == "xyzzy" (it became "<internal:kernel>"). We could trace tp.path == "<internal:kernel>" cases too, but the lineno is impacted by kernel.rb changes and I didn't want to make it fragile for kernel.rb lineno changes.	2020-07-03 09:52:35 -07:00
Takashi Kokubun	0703e01471	Mark some Integer methods as inline (#3264 )	2020-06-27 10:07:47 -07:00
Vladimir Dementyev	6770d8f1b0	Add pattern matching with arrays benchmark	2020-06-27 13:51:03 +09:00
Takashi Kokubun	7982dc1dfd	Decide JIT-ed insn based on cached cfunc for opt_* insns. opt_eq handles rb_obj_equal inside opt_eq, and all other cfunc is handled by opt_send_without_block. Therefore we can't decide which insn should be generated by checking whether it's cfunc cc or not. ``` $ benchmark-driver -v --rbenv 'before --jit;after --jit' benchmark/mjit_opt_cc_insns.yml --repeat-count=4 before --jit: ruby 2.8.0dev (2020-06-26T05:21:43Z master `9dbc2294a6`) +JIT [x86_64-linux] after --jit: ruby 2.8.0dev (2020-06-26T06:30:18Z master 75cece1b0b) +JIT [x86_64-linux] last_commit=Decide JIT-ed insn based on cached cfunc Calculating ------------------------------------- before --jit after --jit mjit_nil?(1) 73.878M 74.021M i/s - 40.000M times in 0.541432s 0.540391s mjit_not(1) 72.635M 74.601M i/s - 40.000M times in 0.550702s 0.536187s mjit_eq(1, nil) 7.331M 7.445M i/s - 8.000M times in 1.091211s 1.074596s mjit_eq(nil, 1) 49.450M 64.711M i/s - 8.000M times in 0.161781s 0.123627s Comparison: mjit_nil?(1) after --jit: 74020528.4 i/s before --jit: 73878185.9 i/s - 1.00x slower mjit_not(1) after --jit: 74600882.0 i/s before --jit: 72634507.6 i/s - 1.03x slower mjit_eq(1, nil) after --jit: 7444657.4 i/s before --jit: 7331304.3 i/s - 1.02x slower mjit_eq(nil, 1) after --jit: 64710790.6 i/s before --jit: 49449507.4 i/s - 1.31x slower ```	2020-06-25 23:33:08 -07:00
Takashi Kokubun	946e5cc668	Annotate Kernel#class as inline (#3250 ) ``` $ benchmark-driver -v --rbenv 'before;after;before --jit;after --jit' benchmark/mjit_class.yml --repeat-count=4 before: ruby 2.8.0dev (2020-06-23T07:09:54Z master `37a2e48d76`) [x86_64-linux] after: ruby 2.8.0dev (2020-06-23T17:29:56Z inline-class 0ff147c007) [x86_64-linux] before --jit: ruby 2.8.0dev (2020-06-23T07:09:54Z master `37a2e48d76`) +JIT [x86_64-linux] after --jit: ruby 2.8.0dev (2020-06-23T17:29:56Z inline-class 0ff147c007) +JIT [x86_64-linux] Calculating ------------------------------------- before after before --jit after --jit mjit_class(self) 39.219M 40.060M 53.502M 69.202M i/s - 40.000M times in 1.019915s 0.998495s 0.747631s 0.578021s mjit_class(1) 39.567M 41.242M 52.100M 68.895M i/s - 40.000M times in 1.010935s 0.969885s 0.767749s 0.580591s Comparison: mjit_class(self) after --jit: 69201690.7 i/s before --jit: 53502336.4 i/s - 1.29x slower after: 40060289.1 i/s - 1.73x slower before: 39218939.2 i/s - 1.76x slower mjit_class(1) after --jit: 68895358.6 i/s before --jit: 52100353.0 i/s - 1.32x slower after: 41241993.6 i/s - 1.67x slower before: 39567314.0 i/s - 1.74x slower ```	2020-06-23 23:49:03 -07:00
Takashi Kokubun	78352fb52e	Compile opt_send for opt_* only when cc has ISeq because opt_nil/opt_not/opt_eq populates cc even when it doesn't fallback to opt_send_without_block because of vm_method_cfunc_is. ``` $ benchmark-driver -v --rbenv 'before --jit;after --jit' benchmark/mjit_opt_cc_insns.yml --repeat-count=4 before --jit: ruby 2.8.0dev (2020-06-22T08:11:24Z master `d231b8f95b`) +JIT [x86_64-linux] after --jit: ruby 2.8.0dev (2020-06-22T08:53:27Z master e1125879ed) +JIT [x86_64-linux] last_commit=Compile opt_send for opt_* only when cc has ISeq Calculating ------------------------------------- before --jit after --jit mjit_nil?(1) 54.106M 73.693M i/s - 40.000M times in 0.739288s 0.542795s mjit_not(1) 53.398M 74.477M i/s - 40.000M times in 0.749090s 0.537075s mjit_eq(1, nil) 7.427M 6.497M i/s - 8.000M times in 1.077136s 1.231326s Comparison: mjit_nil?(1) after --jit: 73692594.3 i/s before --jit: 54106108.4 i/s - 1.36x slower mjit_not(1) after --jit: 74477487.9 i/s before --jit: 53398125.0 i/s - 1.39x slower mjit_eq(1, nil) before --jit: 7427105.9 i/s after --jit: 6497063.0 i/s - 1.14x slower ``` Actually opt_eq becomes slower by this. Maybe it's indeed using opt_send_without_block, but I'll approach that one in another commit.	2020-06-22 02:08:21 -07:00
Takashi Kokubun	4c5780e51e	Share warmup logic across MJIT benchmarks	2020-06-22 00:54:27 -07:00
Takashi Kokubun	faf93e4545	The RUBYOPT= comment is no longer needed	2020-06-22 00:20:30 -07:00
Takashi Kokubun	8838600c1e	Stop relying on `make benchmark`'s `-I$(srcdir)/benchmark/lib` These days I don't use `make benchmark`. The YAML files should be executable with bare `benchmark-driver` CLI without passing `RUBYOPT=-Ibenchmark/lib`.	2020-06-22 00:17:10 -07:00
Takashi Kokubun	7561db8c00	Introduce Primitive.attr! to annotate 'inline' (#3242 ) [Feature #15589]	2020-06-20 17:13:03 -07:00
Takashi Kokubun	95b0fed371	Make Integer#zero? a separated method and builtin (#3226 ) A prerequisite to fix https://bugs.ruby-lang.org/issues/15589 with JIT. This commit alone doesn't make a significant difference yet, but I thought this commit should be committed independently. This method override was discussed in [Misc #16961].	2020-06-20 14:55:09 -07:00
Ryuta Kamizono	9d24ddbb53	Fix `make benchmark` example `make benchmark ARGS=../benchmark/erb_render.yml` does not work. ``` % make benchmark ARGS=../benchmark/erb_render.yml /Users/kamipo/.rbenv/shims/ruby --disable=gems -rrubygems -I./benchmark/lib ./benchmark/benchmark-driver/exe/benchmark-driver \ --executables="compare-ruby::/Users/kamipo/.rbenv/shims/ruby --disable=gems -I.ext/common --disable-gem" \ --executables="built-ruby::./miniruby -I./lib -I. -I.ext/common ./tool/runruby.rb --extout=.ext -- --disable-gems --disable-gem" \ ../benchmark/erb_render.yml Traceback (most recent call last): 6: from ./benchmark/benchmark-driver/exe/benchmark-driver:112:in `<main>' 5: from ./benchmark/benchmark-driver/exe/benchmark-driver:112:in `flat_map' 4: from ./benchmark/benchmark-driver/exe/benchmark-driver:112:in `each' 3: from ./benchmark/benchmark-driver/exe/benchmark-driver:122:in `block in <main>' 2: from /Users/kamipo/.rbenv/versions/2.6.6/lib/ruby/2.6.0/psych.rb:577:in `load_file' 1: from /Users/kamipo/.rbenv/versions/2.6.6/lib/ruby/2.6.0/psych.rb:577:in `open' /Users/kamipo/.rbenv/versions/2.6.6/lib/ruby/2.6.0/psych.rb:577:in `initialize': No such file or directory @ rb_sysopen - ../benchmark/erb_render.yml (Errno::ENOENT) make: *** [benchmark] Error 1 % make benchmark ARGS=benchmark/erb_render.yml /Users/kamipo/.rbenv/shims/ruby --disable=gems -rrubygems -I./benchmark/lib ./benchmark/benchmark-driver/exe/benchmark-driver \ --executables="compare-ruby::/Users/kamipo/.rbenv/shims/ruby --disable=gems -I.ext/common --disable-gem" \ --executables="built-ruby::./miniruby -I./lib -I. -I.ext/common ./tool/runruby.rb --extout=.ext -- --disable-gems --disable-gem" \ benchmark/erb_render.yml Calculating ------------------------------------- compare-ruby built-ruby erb_render 825.454k 783.664k i/s - 1.500M times in 1.817181s 1.914086s Comparison: erb_render compare-ruby: 825454.4 i/s built-ruby: 783663.8 i/s - 1.05x slower ```	2020-06-07 10:33:14 +09:00
卜部昌平	d4015cfee3	add benchmark for different block handlers	2020-06-03 16:13:47 +09:00
Nobuyoshi Nakada	02cb643ddb	Added String#split benchmark for empty regexp \| \|compare-ruby\|built-ruby\| \|:--------------\|-----------:\|---------:\| \|re_chars-1 \| 169.230k\| 973.855k\| \| \| -\| 5.75x\| \|re_chars-10 \| 25.536k\| 107.598k\| \| \| -\| 4.21x\| \|re_chars-100 \| 2.621k\| 11.207k\| \| \| -\| 4.28x\| \|re_chars-1000 \| 259.098\| 1.133k\| \| \| -\| 4.37x\|	2020-05-12 22:59:58 +09:00
Nobuyoshi Nakada	693f7ab315	Optimize String#split Optimized `String#split` with `/ /` (single space regexp) as simple string splitting. [ruby-core:98272] \| \|compare-ruby\|built-ruby\| \|:--------------\|-----------:\|---------:\| \|re_space-1 \| 432.786k\| 1.539M\| \| \| -\| 3.56x\| \|re_space-10 \| 76.231k\| 191.547k\| \| \| -\| 2.51x\| \|re_space-100 \| 8.152k\| 19.557k\| \| \| -\| 2.40x\| \|re_space-1000 \| 837.405\| 2.022k\| \| \| -\| 2.41x\| ruby-core:98272: https://bugs.ruby-lang.org/issues/15771#change-85511	2020-05-12 19:58:58 +09:00
S.H	17083011ee	support builtin for Kernel#Float # Iteration per second (i/s) \| \|compare-ruby\|built-ruby\| \|:------------\|-----------:\|---------:\| \|float \| 30.395M\| 38.314M\| \| \| -\| 1.26x\| \|float_true \| 3.833M\| 27.322M\| \| \| -\| 7.13x\| \|float_false \| 4.182M\| 24.938M\| \| \| -\| 5.96x\|	2020-04-22 09:49:13 +09:00
Takashi Kokubun	f883d6219e	Unify vm benchmark prefixes to vm_ (#3028 ) The vm1_ prefix and vm2_ had had special meaning until `820ad9cb1d` and `12068aa4e9`. AFAIK there's no special meaning in vm3_ prefix. As they have confused people (like "In `benchmark` what is difference between `vm1_`, `vm2_` and `vm3_`"), I'd like to remove the obsoleted prefix as we obviated that two years ago.	2020-04-13 21:37:42 -07:00
Takashi Kokubun	310ef9f40b	Make vm_call_cfunc_with_frame a fastpath (#3027 ) when there's no need to call CALLER_SETUP_ARG and CALLER_REMOVE_EMPTY_KW_SPLAT (i.e. !rb_splat_or_kwargs_p(ci) && !calling->kw_splat). Micro benchmark: ``` $ benchmark-driver -v --rbenv 'before;after' benchmark/vm_send_cfunc.yml --repeat-count=4 before: ruby 2.8.0dev (2020-04-13T23:45:05Z master `b9d3ceee8f`) [x86_64-linux] after: ruby 2.8.0dev (2020-04-14T00:48:52Z no-splat-fastpath 418d363722) [x86_64-linux] Calculating ------------------------------------- before after vm_send_cfunc 69.585M 88.724M i/s - 100.000M times in 1.437097s 1.127096s Comparison: vm_send_cfunc after: 88723605.2 i/s before: 69584737.1 i/s - 1.28x slower ``` Optcarrot: ``` $ benchmark-driver -v --rbenv 'before;after' benchmark.yml --repeat-count=12 --output=all before: ruby 2.8.0dev (2020-04-13T23:45:05Z master `b9d3ceee8f`) [x86_64-linux] after: ruby 2.8.0dev (2020-04-14T00:48:52Z no-splat-fastpath 418d363722) [x86_64-linux] Calculating ------------------------------------- before after Optcarrot Lan_Master.nes 50.76119601545175 42.73858236484051 fps 50.76388649761503 51.04211379912850 50.80930672252514 51.39455790755538 50.90236000778749 51.75656936556145 51.01744746340430 51.86875277356489 51.06495279015112 51.88692482485558 51.07785337168974 51.93429603190578 51.20163525187862 51.95768145071314 51.34671771913112 52.45577266040274 51.35918340835583 52.53163888762858 51.46641337418146 52.62172484121034 51.50835463462257 52.85064021113239 ```	2020-04-13 20:32:59 -07:00
Takashi Kokubun	b9d3ceee8f	Unwrap vm_call_cfunc indirection on JIT for VM_METHOD_TYPE_CFUNC. This has been known to decrease optcarrot fps: ``` $ benchmark-driver -v --rbenv 'before --jit;after --jit' benchmark.yml --repeat-count=24 --output=all before --jit: ruby 2.8.0dev (2020-04-13T16:25:13Z master `fb40495cd9`) +JIT [x86_64-linux] after --jit: ruby 2.8.0dev (2020-04-13T23:23:11Z mjit-inline-c bdcd06d159) +JIT [x86_64-linux] Calculating ------------------------------------- before --jit after --jit Optcarrot Lan_Master.nes 66.38132676191719 67.41369177299630 fps 69.42728743772243 68.90327567263054 72.16028300263211 69.62605130880686 72.46631319102777 70.48818243767207 73.37078877002490 70.79522887347566 73.69422431217367 70.99021920193194 74.01471487018695 74.69931965402584 75.48685183295630 74.86714575949016 75.54445264507932 75.97864419721677 77.28089738169756 76.48908637569581 78.04183397891302 76.54320932488021 78.36807984096562 76.59407262898067 78.92898762543574 77.31316743361343 78.93576483233765 77.97153484180480 79.13754917503078 77.98478782102325 79.62648945850653 78.02263322726446 79.86334213878064 78.26333724045934 80.05100635898518 78.60056756355614 80.26186843769584 78.91082645644468 80.34205717020330 79.01226659142263 80.62286066044338 79.32733939423721 80.95883033058557 79.63793060542024 80.97376819251613 79.73108936622778 81.23050939202896 80.18280109433088 ``` and I deleted this capability in an early stage of YARV-MJIT development: `0ab130feee` I suspect either of the following things could be the cause: * Directly calling vm_call_cfunc requires more optimization effort in GCC, resulting in 30ms-ish compilation time increase for such methods and decreasing the number of methods compiled in a benchmarked period. * Code size increase => icache miss hit These hypotheses could be verified by some methodologies. However, I'd like to introduce this regardless of the result because this blocks inlining C method's definition. I may revert this commit when I give up to implement inlining C method definition, which requires this change. Microbenchmark-wise, this gives slight performance improvement: ``` $ benchmark-driver -v --rbenv 'before --jit;after --jit' benchmark/mjit_send_cfunc.yml --repeat-count=4 before --jit: ruby 2.8.0dev (2020-04-13T16:25:13Z master `fb40495cd9`) +JIT [x86_64-linux] after --jit: ruby 2.8.0dev (2020-04-13T23:23:11Z mjit-inline-c bdcd06d159) +JIT [x86_64-linux] Calculating ------------------------------------- before --jit after --jit mjit_send_cfunc 41.961M 56.489M i/s - 100.000M times in 2.383143s 1.770244s Comparison: mjit_send_cfunc after --jit: 56489372.5 i/s before --jit: 41961388.1 i/s - 1.35x slower ```	2020-04-13 16:45:05 -07:00
Takashi Kokubun	151f8be40d	Make JIT-ed leave insn leaf to eliminate sp / pc moves by cancelling JIT execution on interrupts. $ benchmark-driver benchmark.yml -v --rbenv 'before --jit;after --jit' --repeat-count=12 --output=all before --jit: ruby 2.8.0dev (2020-04-01T03:48:56Z master `5a81562dfe`) +JIT [x86_64-linux] after --jit: ruby 2.8.0dev (2020-04-01T04:58:01Z master 39beb26a27) +JIT [x86_64-linux] Calculating ------------------------------------- before --jit after --jit Optcarrot Lan_Master.nes 75.06409603894944 76.06422026555558 fps 75.12025067279242 78.48161731616810 77.42020273492177 79.78958240950033 79.07253675128945 79.88645902325614 79.99179109732327 80.33743931749331 80.07633091008627 80.53790081529166 80.15450942667547 80.99048270668010 80.48372803283709 81.70497146081003 80.57410149187352 82.79494539467382 81.80449157081202 82.85797792223954 82.24629397834902 83.00603891515506 82.63708148686703 83.23221006969828 $ benchmark-driver -v --rbenv 'before;before --jit;after --jit' benchmark/mjit_leave.yml --repeat-count=4 before: ruby 2.8.0dev (2020-04-01T03:48:56Z master `5a81562dfe`) [x86_64-linux] before --jit: ruby 2.8.0dev (2020-04-01T03:48:56Z master `5a81562dfe`) +JIT [x86_64-linux] after --jit: ruby 2.8.0dev (2020-04-01T04:58:01Z master 39beb26a27) +JIT [x86_64-linux] Calculating ------------------------------------- before before --jit after --jit mjit_leave 106.656M 82.786M 91.635M i/s - 200.000M times in 1.875183s 2.415881s 2.182569s Comparison: mjit_leave before: 106656239.9 i/s after --jit: 91635143.7 i/s - 1.16x slower before --jit: 82785537.2 i/s - 1.29x slower	2020-03-31 22:10:16 -07:00
Takashi Kokubun	dad110d068	Remove an unused pragma It originally had a string literal, but it no longer has one.	2020-03-30 23:30:08 -07:00
Takashi Kokubun	b736ea63bd	Optimize exivar access on JIT-ed getivar JIT support of `dd723771c1`. $ benchmark-driver -v --rbenv 'before;before --jit;after --jit' benchmark/mjit_exivar.yml --repeat-count=4 before: ruby 2.8.0dev (2020-03-30T12:32:26Z master `e5db3da9d3`) [x86_64-linux] before --jit: ruby 2.8.0dev (2020-03-30T12:32:26Z master `e5db3da9d3`) +JIT [x86_64-linux] after --jit: ruby 2.8.0dev (2020-03-31T05:57:24Z mjit-exivar 128625baec) +JIT [x86_64-linux] Calculating ------------------------------------- before before --jit after --jit mjit_exivar 57.944M 53.579M 54.471M i/s - 200.000M times in 3.451588s 3.732772s 3.671687s Comparison: mjit_exivar before: 57944345.1 i/s after --jit: 54470876.7 i/s - 1.06x slower before --jit: 53579483.4 i/s - 1.08x slower	2020-03-30 23:16:35 -07:00
Jeremy Evans	d2c41b1bff	Reduce allocations for keyword argument hashes Previously, passing a keyword splat to a method always allocated a hash on the caller side, and accepting arbitrary keywords in a method allocated a separate hash on the callee side. Passing explicit keywords to a method that accepted a keyword splat did not allocate a hash on the caller side, but resulted in two hashes allocated on the callee side. This commit makes passing a single keyword splat to a method not allocate a hash on the caller side. Passing multiple keyword splats or a mix of explicit keywords and a keyword splat still generates a hash on the caller side. On the callee side, if arbitrary keywords are not accepted, it does not allocate a hash. If arbitrary keywords are accepted, it will allocate a hash, but this commit uses a callinfo flag to indicate whether the caller already allocated a hash, and if so, the callee can use the passed hash without duplicating it. So this commit should make it so that a maximum of a single hash is allocated during method calls. To set the callinfo flag appropriately, method call argument compilation checks if only a single keyword splat is given. If only one keyword splat is given, the VM_CALL_KW_SPLAT_MUT callinfo flag is not set, since in that case the keyword splat is passed directly and not mutable. If more than one splat is used, a new hash needs to be generated on the caller side, and in that case the callinfo flag is set, indicating the keyword splat is mutable by the callee. In compile_hash, used for both hash and keyword argument compilation, if compiling keyword arguments and only a single keyword splat is used, pass the argument directly. On the caller side, in vm_args.c, the callinfo flag needs to be recognized and handled. Because the keyword splat argument may not be a hash, it needs to be converted to a hash first if not. Then, unless the callinfo flag is set, the hash needs to be duplicated. The temporary copy of the callinfo flag, kw_flag, is updated if a hash was duplicated, to prevent the need to duplicate it again. If we are converting to a hash or duplicating a hash, we need to update the argument array, which can including duplicating the positional splat array if one was passed. CALLER_SETUP_ARG and a couple other places needs to be modified to handle similar issues for other types of calls. This includes fairly comprehensive tests for different ways keywords are handled internally, checking that you get equal results but that keyword splats on the caller side result in distinct objects for keyword rest parameters. Included are benchmarks for keyword argument calls. Brief results when compiled without optimization: def kw(a: 1) a end def kws(kw) kw end h = {a: 1} kw(a: 1) # about same kw(h) # 2.37x faster kws(a: 1) # 1.30x faster kws(h) # 2.19x faster kw(a: 1, h) # 1.03x slower kw(h, h) # about same kws(a: 1, h) # 1.16x faster kws(h, **h) # 1.14x faster	2020-03-17 12:09:43 -07:00
S.H	290d608637	support builtin for Kernel#clone	2020-03-17 19:37:07 +09:00
Nobuyoshi Nakada	5e897227ff	Added more benchmarks for String	2020-02-29 15:42:24 +09:00
Nobuyoshi Nakada	05229cef45	Improve `String#slice!` performance Instead of searching twice to extract and to delete, extract and delete the found position at the first search. This makes faster nearly twice, for regexps and strings. \| \|compare-ruby\|built-ruby\| \|:-------------\|-----------:\|---------:\| \|regexp-short \| 2.143M\| 3.918M\| \|regexp-long \| 105.162k\| 205.410k\| \|string-short \| 3.789M\| 7.964M\| \|string-long \| 1.301M\| 2.457M\|	2020-01-31 17:12:05 +09:00
Kazuhiro NISHIYAMA	c90fc55a1f	Drop executable bit of *.{yml,h,mk.tmpl}	2020-01-22 16:04:38 +09:00
Lourens Naudé	40c57ad4a1	Let execution context local storage be an ID table	2020-01-11 14:40:36 +13:00
Lourens Naudé	592d7ceeeb	Speeds up fallback to Hash#default_proc in rb_hash_aref by removing a method call	2020-01-08 18:09:52 +09:00
David Rodríguez	f48655d04d	Remove unneeded exec bits from some files I noticed that some files in rubygems were executable, and I could think of no reason why they should be. In general, I think ruby files should never have the executable bit set unless they include a shebang, so I run the following command over the whole repo: ```bash find . -name '*.rb' -type f -executable -exec bash -c 'grep -L "^#!" $1 \|\| chmod -x $1' _ {} \; ```	2019-11-09 21:36:30 +09:00
Nobuyoshi Nakada	8390057d1e	Benchmark for [Feature #16155 ]	2019-10-22 22:49:48 +09:00
Dylan Thacker-Smith	b970259044	Stop making a redundant hash copy in Hash#dup (#2489 ) * Stop making a redundant hash copy in Hash#dup It was making a copy of the hash without rehashing, then created an extra copy of the hash to do the rehashing. Since rehashing creates a new copy already, this change just uses that rehashing to make the copy. [Bug #16121] * Remove redundant Check_Type after to_hash * Fix freeing and clearing destination hash in Hash#initialize_copy The code was assuming the state of the destination hash based on the source hash for clearing any existing table on it. If these don't match, then that can cause the old table to be leaked. This can be seen by compiling hash.c with `#define HASH_DEBUG 1` and running the following script, which will crash from a debug assertion. ```ruby h = 9.times.map { \|i\| [i, i] }.to_h h.send(:initialize_copy, {}) ``` * Remove dead code paths in rb_hash_initialize_copy Given that `RHASH_ST_TABLE_P(h)` is defined as `(!RHASH_AR_TABLE_P(h))` it shouldn't be possible for a hash to be neither of these, so there is no need for the removed `else if` blocks. * Share implementation between Hash#replace and Hash#initialize_copy This also fixes key rehashing for small hashes backed by an array table for Hash#replace. This used to be done consistently in ruby 2.5.x, but stopped being done for small arrays in ruby 2.6.x. This also bring optimization improvements that were done for Hash#initialize_copy to Hash#replace. * Add the Hash#dup benchmark	2019-10-21 17:29:21 +09:00
Dylan Thacker-Smith	a1fda16b23	Optimize Array#flatten and flatten! for already flattened arrays (#2495 ) * Optimize Array#flatten and flatten! for already flattened arrays * Add benchmark for Array#flatten and Array#flatten! [Bug #16119]	2019-09-28 01:24:24 +09:00
Takashi Kokubun	41e3c204fd	Reduce ISeq size of mjit_exec benchmark to avoid unwanted memory pressure	2019-09-26 22:13:31 +09:00
Takashi Kokubun	4a4c502825	Add special runner to benchmark mjit_exec I wanted to dynamically generate benchmark cases to test various number of methods. Thus I added a dedicated runner of benchmark-driver.	2019-09-26 16:34:40 +09:00
Takashi Kokubun	b414999370	Add a benchmark for JIT-ed code dispatch	2019-09-21 16:09:52 +09:00
卜部昌平	d74fa8e55c	reuse cc->call I noticed that in case of cache misshit, re-calculated cc->me can be the same method entry than the pevious one. That is an okay situation but can't we partially reuse the cache, because cc->call should still be valid then? One thing that has to be special-cased is when the method entry gets amended by some refinements. That happens behind-the-scene of call cache mechanism. We have to check if cc->me->def points to the previously saved one. Calculating ------------------------------------- trunk ours vm2_poly_same_method 1.534M 2.025M i/s - 6.000M times in 3.910203s 2.962752s Comparison: vm2_poly_same_method ours: 2025143.9 i/s trunk: 1534447.2 i/s - 1.32x slower	2019-09-19 15:18:10 +09:00
Takashi Kokubun	ff462bc6c3	Add a benchmark for opt_regexpmatch2 vm2_regexp was for opt_regexpmatch1.	2019-09-02 13:46:33 +09:00
Nobuyoshi Nakada	07e42e88d9	Close created files [ci skip]	2019-08-10 11:26:23 +09:00
Masato Ohba	6902824729	Fix typo in comment [ci skip] s/Thtread/Thread	2019-08-10 09:35:28 +09:00
Yaw Boakye	6bb3618f28	n+1 to include n in range Python's range stop right before n, which means factL never returns the correct result. Closes: https://github.com/ruby/ruby/pull/1982	2019-08-05 09:04:32 +09:00
Yusuke Endoh	086ffe72c7	Revert "Revert "Add a specialized instruction for `.nil?` calls"" This reverts commit `a0980f2446`. Retry for macOS Mojave.	2019-08-02 23:25:38 +09:00
Yusuke Endoh	a0980f2446	Revert "Add a specialized instruction for `.nil?` calls" This reverts commit `9faef3113f`. It seemed to cause a failure on macOS Mojave, though I'm unsure how. https://rubyci.org/logs/rubyci.s3.amazonaws.com/osx1014/ruby-master/log/20190802T034503Z.fail.html.gz This tentative revert is to check if the issue is actually caused by the change or not.	2019-08-02 15:03:34 +09:00
Aaron Patterson	9faef3113f	Add a specialized instruction for `.nil?` calls This commit adds a specialized instruction for called to `.nil?`. It is about 27% faster than master in the case where the object is nil or not nil. In the case where an object implements `nil?`, I think it may be slightly slower. Here is a benchmark: ```ruby require "benchmark/ips" class Niller def nil?; true; end end not_nil = Object.new xnil = nil niller = Niller.new Benchmark.ips do \|x\| x.report("nil?") { xnil.nil? } x.report("not nil") { not_nil.nil? } x.report("niller") { niller.nil? } end ``` On Ruby master: ``` [aaron@TC ~/g/ruby (master)]$ ./ruby compil.rb Warming up -------------------------------------- nil? 429.195k i/100ms not nil 437.889k i/100ms niller 437.935k i/100ms Calculating ------------------------------------- nil? 20.166M (± 8.1%) i/s - 100.002M in 5.002794s not nil 20.046M (± 7.6%) i/s - 99.839M in 5.020086s niller 22.467M (± 6.1%) i/s - 112.111M in 5.013817s [aaron@TC ~/g/ruby (master)]$ ./ruby compil.rb Warming up -------------------------------------- nil? 449.660k i/100ms not nil 433.836k i/100ms niller 443.073k i/100ms Calculating ------------------------------------- nil? 19.997M (± 8.8%) i/s - 99.375M in 5.020458s not nil 20.529M (± 7.0%) i/s - 102.385M in 5.020689s niller 21.796M (± 8.0%) i/s - 108.110M in 5.002300s [aaron@TC ~/g/ruby (master)]$ ./ruby compil.rb Warming up -------------------------------------- nil? 402.119k i/100ms not nil 438.968k i/100ms niller 398.226k i/100ms Calculating ------------------------------------- nil? 20.050M (±12.2%) i/s - 98.519M in 5.008817s not nil 20.614M (± 8.0%) i/s - 102.280M in 5.004531s niller 22.223M (± 8.8%) i/s - 110.309M in 5.013106s ``` On this branch: ``` [aaron@TC ~/g/ruby (specialized-nilp)]$ ./ruby compil.rb Warming up -------------------------------------- nil? 468.371k i/100ms not nil 456.517k i/100ms niller 454.981k i/100ms Calculating ------------------------------------- nil? 27.849M (± 7.8%) i/s - 138.169M in 5.001730s not nil 26.417M (± 8.7%) i/s - 131.020M in 5.011674s niller 21.561M (± 7.5%) i/s - 107.376M in 5.018113s [aaron@TC ~/g/ruby (specialized-nilp)]$ ./ruby compil.rb Warming up -------------------------------------- nil? 477.259k i/100ms not nil 428.712k i/100ms niller 446.109k i/100ms Calculating ------------------------------------- nil? 28.071M (± 7.3%) i/s - 139.837M in 5.016590s not nil 25.789M (±12.9%) i/s - 126.470M in 5.011144s niller 20.002M (±12.2%) i/s - 98.144M in 5.001737s [aaron@TC ~/g/ruby (specialized-nilp)]$ ./ruby compil.rb Warming up -------------------------------------- nil? 467.676k i/100ms not nil 445.791k i/100ms niller 415.024k i/100ms Calculating ------------------------------------- nil? 26.907M (± 8.0%) i/s - 133.755M in 5.013915s not nil 25.319M (± 7.9%) i/s - 125.713M in 5.007758s niller 19.569M (±11.8%) i/s - 96.286M in 5.008533s ``` Co-Authored-By: Ashe Connor <kivikakk@github.com>	2019-07-31 16:21:25 -07:00
Takashi Kokubun	1392b821b9	Explain what's benchmark/lib/load.rb [ci skip] I'm actually not using this, but ko1 is.	2019-07-20 15:33:55 +09:00
Samuel Williams	56fcf98849	Add note about setting `vm.max_map_count` for Linux.	2019-07-18 20:54:55 +12:00
Samuel Williams	9b28eefeb2	Add benchmark to help diagnose performance regression. See https://bugs.ruby-lang.org/issues/16009 for more details.	2019-07-18 11:13:49 +12:00
Nobuyoshi Nakada	ae599db22f	* remove trailing spaces.	2019-07-12 17:57:28 +09:00
Samuel Williams	012e954b47	Improved fiber benchmarks. Increase number of iterations.	2019-07-12 11:56:51 +12:00
Hiroshi SHIBATA	e8a2521abe	Adjust memory_status.rb under the tool directory.	2019-07-02 21:39:22 +09:00
Jeremy Evans	11c311e36f	Use realpath(3) instead of custom realpath implementation if available This approach is simpler than the previous approach which tries to emulate realpath(3). It also performs much better on both Linux and OpenBSD on the included benchmarks. By using realpath(3), we can better integrate with system security features such as OpenBSD's unveil(2) system call. This does not use realpath(3) on Windows even if it exists, as the approach for checking for absolute paths does not work for drive letters. This can be fixed without too much difficultly, though until Windows defines realpath(3), there is no need to do so. For File.realdirpath, where the last element of the path is not required to exist, fallback to the previous approach, as realpath(3) on most operating systems requires the whole path be valid (per POSIX), and the operating systems where this isn't true either plan to conform to POSIX or may change to conform to POSIX in the future. glibc realpath(3) does not handle /path/to/file.rb/../other_file.rb paths, returning ENOTDIR in that case. Fallback to the previous code if realpath(3) returns ENOTDIR. glibc doesn't like realpath(3) usage for paths like /dev/fd/5, returning ENOENT even though the path may appear to exist in the filesystem. If ENOENT is returned and the path exists, then fall back to the default approach.	2019-07-01 11:46:30 -07:00
Samuel Williams	3fd83cb6fc	Improve benchmarks and tests for threads.	2019-06-19 20:39:10 +12:00
Takashi Kokubun	caa90202c9	Make sure to suppress .irbrc on benchmark By the way, this is already improved by nobu: ``` $ benchmark-driver benchmark/irb_exec.yml --rbenv '2.6.3;2.7.0-preview1;before;after' -v 2.6.3: ruby 2.6.3p62 (2019-04-16 revision 67580) [x86_64-linux] 2.7.0-preview1: ruby 2.7.0preview1 (2019-05-31 trunk `c55db6aa27`) [x86_64-linux] before: ruby 2.7.0dev (2019-06-10T21:13:14+09:00 master `973fd18f11`) [x86_64-linux] after: ruby 2.7.0dev (2019-06-10T21:18:56+09:00 master `976c689ad4`) [x86_64-linux] Calculating ------------------------------------- 2.6.3 2.7.0-preview1 before after irb_exec 11.868 5.872 6.297 10.278 i/s - 30.000 times in 2.527776s 5.108997s 4.764167s 2.918821s Comparison: irb_exec 2.6.3: 11.9 i/s after: 10.3 i/s - 1.15x slower before: 6.3 i/s - 1.88x slower 2.7.0-preview1: 5.9 i/s - 2.02x slower ```	2019-06-10 22:04:52 +09:00
Takashi Kokubun	973fd18f11	Add a benchmark of irb boot time ``` $ benchmark-driver benchmark/irb_exec.yml --rbenv '2.6.3;2.7.0-preview1' Calculating ------------------------------------- 2.6.3 2.7.0-preview1 irb_exec 11.844 5.171 i/s - 30.000 times in 2.532887s 5.801960s Comparison: irb_exec 2.6.3: 11.8 i/s 2.7.0-preview1: 5.2 i/s - 2.29x slower ```	2019-06-10 21:13:14 +09:00
Takashi Kokubun	0a29dc87e6	Optimize CGI.escapeHTML by reducing buffer extension and switch-case branches. Buffer allocation optimization using `ALLOCA_N` would be the main benefit of patch. It eliminates the O(N) buffer extensions. It also reduces the number of branches using escape table like https://mattn.kaoriya.net/software/lang/c/20160817011915.htm. Closes: https://github.com/ruby/ruby/pull/2226 Co-authored-by: Nobuyoshi Nakada <nobu@ruby-lang.org> Co-authored-by: Yasuhiro MATSUMOTO <mattn.jp@gmail.com>	2019-06-05 21:07:04 +09:00
Takashi Kokubun	71b14affc6	Revert "Optimize CGI.escapeHTML by reducing buffer extension" This reverts commit `8d81e59aa7`. `ALLOCA_N` does not check stack overflow unlike ALLOCV. I'll fix it and re-commit it again.	2019-06-05 11:00:54 +09:00
Takashi Kokubun	8d81e59aa7	Optimize CGI.escapeHTML by reducing buffer extension and switch-case branches. Buffer allocation optimization using `ALLOCA_N` would be the main benefit of patch. It eliminates the O(N) buffer extensions. It also reduces the number of branches using escape table like https://mattn.kaoriya.net/software/lang/c/20160817011915.htm. Closes: https://github.com/ruby/ruby/pull/2226 Co-authored-by: Nobuyoshi Nakada <nobu@ruby-lang.org> Co-authored-by: Yasuhiro MATSUMOTO <mattn.jp@gmail.com>	2019-06-05 10:08:55 +09:00
Takashi Kokubun	f630359d9b	Add a benchmark using IRB::Color I heard actually this part would not be a bottleneck for rendering because writing anything to terminal takes way longer time anyway, but I thought this benchmark script might be useful for benchmarking Ruby itself.	2019-06-01 20:07:50 +09:00
Lourens Naudé	a47f598d77	Reduce ONIG_NREGION from 10 to 4: power of 2 and testing revealed most pattern matches are less than or equal to 4 results Closes: https://github.com/ruby/ruby/pull/2135	2019-05-07 21:58:55 +09:00
Nobuyoshi Nakada	77440e949b	Improve performance of case-conversion methods	2019-05-03 23:59:18 +09:00
nobu	e1eb54b99d	string.c: improve splitting into chars * string.c (rb_str_split_m): improve splitting into chars by an empty string, without a regexp. Comparison: to_chars-1 built-ruby: 1273527.6 i/s compare-ruby: 189423.3 i/s - 6.72x slower to_chars-10 built-ruby: 120993.5 i/s compare-ruby: 37075.8 i/s - 3.26x slower to_chars-100 built-ruby: 15646.4 i/s compare-ruby: 4012.1 i/s - 3.90x slower to_chars-1000 built-ruby: 1295.1 i/s compare-ruby: 408.5 i/s - 3.17x slower git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@67582 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-04-17 05:34:46 +00:00
eregon	f2d6338574	benchmark/app_aobench.rb: complete commented code to write the image to a file git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66900 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-01-21 12:31:29 +00:00
eregon	32e259f097	benchmark/app_aobench.rb: remove extra printf arguments git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66899 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-01-21 12:31:20 +00:00
eregon	dfe6918604	benchmark/app_aobench.rb: move `srand(0)` at the top git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66898 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-01-21 12:31:10 +00:00
mame	8a294e0f20	benchmark/app_aobench.rb: add `srand(0)` To prevent noise for benchmark result. Just for the case. [Bug #15552] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66893 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-01-21 07:29:00 +00:00
mame	be3a0c7042	benchmark/app_aobench.rb: add a magic comment To support the change of default encoding. It had not worked correctly since 2.0 :-) [Bug #15552] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66892 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-01-21 07:13:32 +00:00
nobu	72ad092960	Time.strptime benchmarks git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66743 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-01-07 02:05:21 +00:00
mrkn	d41fb6ce0b	benchmark/range_last.yml: remove needless prelude [ci skip] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66740 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-01-06 15:46:21 +00:00
mrkn	bf6584b5d0	benchmark/range_last.yml: add benchmark cases git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66733 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-01-06 00:46:35 +00:00
mrkn	56a6dd66d4	benchmark/range_last.yml: add a benchmark of Range#last [ci skip] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66721 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-01-05 10:07:40 +00:00
mrkn	8d473272db	benchmark/complex_float_*.yml: New benchmarks Add new benchmark scripts for binary operations of Complex with float components. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66680 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2019-01-01 12:20:04 +00:00
shyouhei	bc64df876e	delete emacs mode lines [ci skip] These settings are now covered by .dir-locals.el. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66584 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-12-27 06:12:09 +00:00
k0kubun	17ca870a4d	benchmark/app_erb.yml: remove unused variable `3efcb74036 (r31762996)` git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66490 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-12-22 01:43:45 +00:00
tenderlove	bf41340ebb	Add benchmark for hash small literals Co-Authored-By: Krzysztof Rybka <krzysztof.rybka@gmail.com> git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@66256 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-12-06 18:20:11 +00:00
samuel	5547c65bdf	Rename fiber chain benchmark. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65859 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-20 10:17:08 +00:00
svn	b58dc42abb	* remove trailing spaces. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65852 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-20 10:16:35 +00:00
samuel	20f1c6c185	Remove `Benchmark` times. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65851 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-20 10:16:34 +00:00
samuel	9a324cf9cc	Better benchmark name. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65845 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-20 10:13:51 +00:00
svn	3f0ae69186	* remove trailing spaces. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65844 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-20 10:09:55 +00:00
samuel	12ae8c0813	Fix broken benchmark git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65843 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-20 10:09:53 +00:00
svn	635dd3408f	* remove trailing spaces. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65842 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-20 10:06:59 +00:00
samuel	995f6ee102	Fix code climate cognitive complexity issue git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65841 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-20 10:06:58 +00:00
svn	acd84d3208	* remove trailing spaces. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65837 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-20 09:59:19 +00:00
samuel	748c497a6d	Rewrite fiber benchmark git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@65836 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-11-20 09:59:18 +00:00
ko1	f251ce9f48	don't use GC::Profiler on default. * benchmark/gc/gcbench.rb: disable GC::Profiler on default. add -p option instead. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@64872 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-09-28 05:28:12 +00:00
ko1	cebd2c4aba	add small utility for benchmarking. * benchmark/lib/load.rb: add small utility which requires benchmark-driver.rb. You can load this file and can use benchmark-driver. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@64870 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-09-28 03:32:49 +00:00
shyouhei	02b52b2733	make opt_str_freeze leaf Simply use DISPATCH_ORIGINAL_INSN instead of rb_funcall. This is, when possible, overall performant because method dispatch results are cached inside of CALL_CACHE. Should also be good for JIT. ---- trunk: ruby 2.6.0dev (2018-09-12 trunk 64689) [x86_64-darwin15] ours: ruby 2.6.0dev (2018-09-12 leaf-insn 64688) [x86_64-darwin15] last_commit=make opt_str_freeze leaf Calculating ------------------------------------- trunk ours vm2_freezestring 5.440M 31.411M i/s - 6.000M times in 1.102968s 0.191017s Comparison: vm2_freezestring ours: 31410864.5 i/s trunk: 5439865.4 i/s - 5.77x slower git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@64690 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-09-12 03:39:36 +00:00
nobu	7d6a26f2a1	Adding Enumerator::Lazy#uniq and Enumerator::Lazy#grep_v to proc chaining [Feature #14994] [Fix GH-1930] From: Anmol Chopra <chopraanmol1@gmail.com> git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@64385 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-08-16 00:58:21 +00:00
k0kubun	706d266496	benchmark/README.md: skip showing --timeout option [ci skip] because it's only available for limited platforms for now. I'll make it portable and show it later. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63954 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-11 22:49:48 +00:00
k0kubun	3e93bf3b46	common.mk: upgrade benchmark_driver benchmark/README.md: fix help output, which is changed on v0.14.6. Especially `e1::path1,arg1,...; e2::path2,arg2` part was wrong since `,` can't be used to split arguments anymore. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63945 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-11 00:34:32 +00:00
k0kubun	1053c3d4bf	benchmark/README.md: some are defined with YAML [ci skip] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63944 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-10 23:56:57 +00:00
k0kubun	fdd7f57b97	benchmark/README.md: fix typo [ci skip] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63942 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-10 16:03:51 +00:00
k0kubun	3efcb74036	benchmark: revise ERB benchmarks using YAML syntax to improve the accuracy of measurement by stop using block. benchmark/app_erb.rb -> benchmark/app_erb.yml: renamed and revised benchmark/erb_render.rb -> benchmark/erb_render.yml: ditto benchmark/README.md: follow renames git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63941 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-10 15:58:52 +00:00
k0kubun	498487ec2a	benchmark/README.md: erb_render is no longer yml but rb [ci skip] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63940 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-10 15:51:29 +00:00
k0kubun	301cb9ea7c	benchmark/README.md: wording consistency in one sentence [ci skip] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63939 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-10 15:49:42 +00:00
k0kubun	5d35664ad6	output/driver.rb: add output plugin to reproduce the original behavior of benchmark/driver.rb. Probably I won't use this but this is requested by ko1. Use this with: make benchmark OPTS="-o driver" git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63937 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-10 15:44:45 +00:00
k0kubun	020bb75808	revert r63899 this is no longer needed due to r63926 git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63936 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-10 15:06:59 +00:00
k0kubun	f328fe40b3	benchmark_driver/runner: add runners for metrics supported by legacy benchmark/driver.rb. benchmark/README.md: document them common.mk: update benchmark_driver to correct 0.0 output and to fix spacing format of `-o simple` and `-o markdown`. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63933 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-10 15:01:27 +00:00
k0kubun	79a8c10b0b	runner/size.rb: fix metric passed to output runner/peak.rb: ditto This is needed to make commands like `make -C .ruby-svn benchmark ITEM=erb OPTS="-r size -o simple"` succeed. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63932 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-10 14:16:57 +00:00
k0kubun	34eed0cd47	benchmark/README.md: add help output [ci skip] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63931 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-10 13:41:13 +00:00
k0kubun	59ec265e33	benchmark/README.md: don't recommend alias installation [ci skip] benchmark_driver is the official way to install benchmark_driver. benchmark-driver is just an alias for it. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63930 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-10 13:36:44 +00:00
k0kubun	433af16167	benchmark: drop all bm_ prefix for legacy driver.rb benchmark/*.rb is only benchmarks now. We don't need prefixes. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63928 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-10 13:08:40 +00:00
k0kubun	8068d34cfe	Revert "benchmark/.yml: convert from benchmark/bm_.rb" This reverts r63900. Having single-execution benchmark as a normal Ruby script is preferred by ko1. I'm not a big fan of having inconsistent benchmark formats, but I can understand some benefits of it. common.mk: remove obsolsted benchmark-each PHONY declaration, support running Ruby scripts added by this commit. README.md: follow ARGS change git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63926 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-10 13:05:29 +00:00
k0kubun	12a38cbac0	benchmark: resurrect peak / size metrics by adding runner plugins for them. benchmark/lib/benchmark_driver/runner/peak.rb: added peak runner plugin benchmark/lib/benchmark_driver/runner/size.rb: added size runner plugin common.mk: allow using them benchmark/memory_wrapper.rb: deleted in favor of those runner plugins benchmark/README.md: document them git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63924 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-10 12:14:04 +00:00
k0kubun	443773836e	benchmark/README.md: fix -e interface for v0.14 [ci skip] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63919 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-09 16:43:42 +00:00
k0kubun	7bed459a26	benchmark/driver.rb: fully obsolete this in favor of just using benchmark_driver.gem. common.mk: The new `make benchmark` covers the both usages for old `make benchmark` and old `make benchmark-each`. So `make benchmark-each` is dropped now. benchmark/README.md: Explain its details git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63918 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-09 16:31:35 +00:00
k0kubun	5b9ff71077	common.mk: upgrade benchmark_driver to v0.14 benchmark/driver.rb: deal with breaking changes which are actually introduced for this driver. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63916 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-09 15:45:02 +00:00
k0kubun	6c4e05d2d8	file_rename.yml: allow running this on mswin/mingw Suggested by MSP-Grep: http://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-core/87883 git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63914 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-09 13:20:25 +00:00
k0kubun	14153a9bab	app_pentominio.yml: drop unnecessary prelude git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63913 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-09 12:50:50 +00:00
k0kubun	2a12b82dcd	benchmark/driver.rb: make this more similar to original benchmark-driver command. I'm going to add some runner plugins to resurrect metrics which were originally supported by benchmark/driver.rb... git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63906 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 18:33:43 +00:00
k0kubun	2cab2fc8b1	benchmark/README.md: [ci skip] change syntax highlight git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63905 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 18:10:14 +00:00
k0kubun	8b0158671b	benchmark/README.md: [ci skip] write documentation about this directory benchmark/driver.rb: ditto git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63903 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 18:08:26 +00:00
k0kubun	dc8327d79a	benchmark/driver.rb: drop legacy Ruby script support Now all benchmarks are converted to YAMLs. common.mk: Drop obsoleted bm_* pattern git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63902 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 17:48:58 +00:00
k0kubun	a7681376e0	benchmark/app_pentomino.yml: remove original scripts after __END__ mistakenly left by conversion script... git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63901 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 17:47:32 +00:00
k0kubun	929982ecea	benchmark/.yml: convert from benchmark/bm_.rb git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63900 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 17:36:26 +00:00
k0kubun	d933fe9b85	benchmark/gc: prevent GC bench from depending on normal benchmark scripts. This is needed to finish converting Ruby scripts to YAMLs. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63899 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 17:31:06 +00:00
k0kubun	2079fe863c	benchmark/vm2_struct_small_aset.yml: unwrap loop_count since `i` is involved in this script as well... git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63898 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 17:22:58 +00:00
k0kubun	71f8406f6c	benchmark/vm2_*.yml: fix ugly conversion errors git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63897 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 17:19:57 +00:00
k0kubun	12068aa4e9	benchmark/vm2_*.yml: abstract away the while loop git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63896 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 17:03:23 +00:00
k0kubun	1aedeaa476	benchmark/vm1_ensure.yml: rollback loop_count abstraction for now. When measured script is really too fast, while loop substituion may return a negative benchmark result. Probably benchmark_driver.gem has rooms to be improved about this. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63895 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 16:46:49 +00:00
k0kubun	94e55bbd37	benchmark/vm1_*.yml: manual fixes for ugly conversions git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63894 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 16:35:44 +00:00
k0kubun	820ad9cb1d	benchmark/vm1_.yml: abstract away the while loop benchmark/driver.rb had removed the cost for while loop in benchmark/bm_vm1_.rb, and benchmark_driver.gem can achieve the same thing with `loop_count`. But unfortunately current benchmark_driver.gem can't solve it only for vm1_yield.yml... git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63893 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 16:28:34 +00:00
k0kubun	9e62c93d0d	benchmark: convert bm_vm2_.rb to vm2_.yml This YAML transformation is needed to support whileloop2 time substituion by benchmark_driver.gem later. This commmit changes no benchmark behavior. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63892 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 15:35:22 +00:00
k0kubun	95a187e040	benchmark: convert bm_vm1_.rb to vm1_.yml This YAML transformation is needed to support whileloop time substituion by benchmark_driver.gem later. This commmit changes no benchmark behavior. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63891 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 15:12:24 +00:00
k0kubun	893b2d97a0	benchmark/driver.rb: fix wrong multiline regexp git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63889 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 15:05:18 +00:00
k0kubun	3293322a39	benchmark: introduce benchmark_driver.gem Makefile.in: Clone benchmark-driver repository in benchmark/benchmark-driver `make update-benchmark-driver`, like simplecov. win32/Makefile.sub: Roughly do the same thing. .gitignore: Ignore the cloned repository. common.mk: Trigger `make update-benchmark-driver` to run `make benchmark` and adjust arguments for benchmark_driver.gem. benchmark/require.yml: renamed from benchmark/bm_require.rb, benchmark/prepare_require.rb benchmark/require_thread.yml: renamed from benchmark/bm_require_thread.rb, benchmark/prepare_require_thread.rb benchmark/so_count_words.yml: renamed from benchmark/bm_so_count_words.rb, benchmark/prepare_so_count_words.rb, benchmark/wc.input.base benchmark/so_k_nucleotide.yml: renamed from benchmark/bm_so_k_nucleotide.rb, benchmark/prepare_so_k_nucleotide.rb, benchmark/make_fasta_output.rb benchmark/so_reverse_complement.yml: renamed from benchmark/bm_so_reverse_complement.rb, benchmark/prepare_so_reverse_complement.rb, benchmark/make_fasta_output.rb I'm sorry but I made some duplications between benchmark/require.yml and benchmark/require_thread.yml, and between benchmark/so_k_nucleotide.yml and benchmark/so_reverse_complement.yml. If you're not comfortable with it, please combine these YAMLs to share the same prelude. One YAML file can have multiple benchmark definitions sharing prelude. benchmark/driver.rb: Replace its core feature with benchmark_driver.gem. Some old features are gone for now, but I'll add them again later. [Misc #14902] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63888 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 14:38:05 +00:00
k0kubun	9fd839c0b2	benchmark: drop legacy benchmark drivers It seems like they are all benchmark drivers but "benchmark/driver.rb" is the latest and others are no longer used. It's confusing to have multiple drivers (and actually I used benchmark/run.rb since I didn't know I should use benchmark/driver.rb). As I'm going to support only benchmark/driver.rb features in Misc#14902, let me delete them. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63886 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 13:03:01 +00:00
k0kubun	c2ae11e308	benchmark/driver.rb: simplify LoadError handling git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63882 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-07-08 04:12:04 +00:00
normal	c27a8e87a9	benchmark/driver.rb: add CPU time measurement targets "Real" time is too unstable on my systems, hopefully counting only CPU time can gain more reliable benchmark results. [ruby-core:87362] [Feature #14815] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63564 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-06-03 21:06:26 +00:00
normal	8a2aa4eddc	benchmark/bm_erb: fix for --measure=size/peak in driver.rb benchmark/memory_wrapper.rb will Kernel#load these scripts, preventing DATA from being initialized, so use heredoc instead. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@63497 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-05-23 20:58:46 +00:00
ko1	19a1899470	use `--version`. * benchmark/driver.rb: use `--version` instead of `-v` to get version information. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@62515 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-02-21 08:51:49 +00:00
nobu	9a9a4e874d	Benchmarks for Array#values_at git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@62412 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2018-02-15 06:14:42 +00:00
ko1	5ee9513a71	Lazy Proc allocation for block parameters [Feature #14045] * insns.def (getblockparam, setblockparam): add special access instructions for block parameters. getblockparam checks VM_FRAME_FLAG_MODIFIED_BLOCK_PARAM and if it is not set this instruction creates a Proc object from a given blcok and set VM_FRAME_FLAG_MODIFIED_BLOCK_PARAM. setblockparam is similar to setlocal, but set VM_FRAME_FLAG_MODIFIED_BLOCK_PARAM. * compile.c: use get/setblockparm instead get/setlocal instructions. Note that they are used for method local block parameters (def m(&b)), not for block local method parameters (iter{\|&b\|). * proc.c (get_local_variable_ptr): creates Proc object for Binding#local_variable_get/set. * safe.c (safe_setter): we need to create Proc objects for postponed block parameters when $SAFE is changed. * vm_args.c (args_setup_block_parameter): used only for block local blcok parameters. * vm_args.c (vm_caller_setup_arg_block): if called with VM_CALL_ARGS_BLOCKARG_BLOCKPARAM flag then passed block values should be a block handler. * test/ruby/test_optimization.rb: add tests. * benchmark/bm_vm1_blockparam*: added. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@60397 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-10-24 11:13:49 +00:00
normal	cb9c849af6	file.c: apply2files releases GVL This means File.chmod, File.lchmod, File.chown, File.lchown, File.unlink, and File.utime operations on slow filesystems no longer hold up other threads. The platform-specific utime_failed changes is compile-tested using a new UTIME_EINVAL macro This hurts performance on fast filesystem, but these methods are unlikely to be performance bottlenecks and (IMHO) avoiding pathological slowdowns and stalls are more important. benchmark results: minimum results in each 3 measurements. Execution time (sec) name trunk built file_chmod 0.591 0.801 Speedup ratio: compare with the result of `trunk' (greater is better) name built file_chmod 0.737 * file.c (UTIME_EINVAL): new macro to ease compile-testing * file.c (struct apply_arg): new struct * file.c (no_gvl_apply2files): new function * file.c (apply2files): release GVL * file.c (chmod_internal): adjust for apply2files changes * file.c (lchmod_internal): ditto * file.c (chown_internal): ditto * file.c (lchown_internal): ditto * file.c (utime_failed): ditto * file.c (utime_internal): ditto * file.c (unlink_internal): ditto [ruby-core:83200] [Feature #13996] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@60386 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-10-24 01:20:04 +00:00
k0kubun	dafb230c0b	driver.rb: allow executing driver.rb directly git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@60301 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-10-21 15:45:32 +00:00
k0kubun	8949479e1a	driver.rb: add option to specify target with rbenv [close GH-1724] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@60299 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-10-21 15:39:35 +00:00
normal	1e14126b4f	Dir.empty? releases GVL This converts all slow syscalls in the Dir.empty? implementation to release GVL. We avoid unnecessarily GVL release and reacquire for each slow call (opendir, readdir, closedir) and instead only release and acquire the GVL once in the common case. Benchmark results show a small degradation in single-threaded performance: Execution time (sec) name trunk built dir_empty_p 0.689 0.758 Speedup ratio: compare with the result of `trunk' (greater is better) name built dir_empty_p 0.909 * dir.c (rb_gc_for_fd_with_gvl): new function (nogvl_dir_empty_p): ditto (dir_s_empty_p): use new functions to release GVL * benchmark/bm_dir_empty_p.rb: new benchmark [ruby-core:83071] [Feature #13958] git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@60111 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-10-04 00:04:51 +00:00
normal	e552afc30e	File#rename releases GVL rename(2) requires two pathname resolution operations which can take considerable time on slow filesystems, release the GVL so operations on other threads may proceed. On fast, local filesystems, this change results in some slowdown as shown by the new benchmark. I consider the performance trade off acceptable as cases are avoided. benchmark results: minimum results in each 3 measurements. Execution time (sec) name trunk built file_rename 2.648 2.804 Speedup ratio: compare with the result of `trunk' (greater is better) name built file_rename 0.944 * file.c (no_gvl_rename): new function (rb_file_s_rename): release GVL for renames * benchmark/bm_file_rename.rb: new benchmark git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@60088 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-10-01 21:19:24 +00:00
glass	8320be1007	string.c: avoid unnecessary call of str_strlen() * string.c (rb_strseq_index): refactor and avoid call of str_strlen() when offset == 0. it will improve performance of String#index and #include? * benchmark/bm_string_index.rb: benchmark for this change git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@60086 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-10-01 13:44:49 +00:00
nobu	9bc73cd81f	array.c: improve operations on small arrays [Feature #13884] Reduce number of memory allocations for "and", "or" and "diff" operations on small arrays Very often, arrays are used to filter parameters and to select interesting items from 2 collections and very often these collections are small enough, for example: ```ruby SAFE_COLUMNS = [:id, :title, :created_at] def columns @all_columns & SAFE_COLUMNS end ``` In this patch, I got rid of unnecessary memory allocations for small arrays when "and", "or" and "diff" operations are performed. name \| HEAD \| PATCH -----------------+------:+------: array_small_and \| 0.615 \| 0.263 array_small_diff \| 0.676 \| 0.282 array_small_or \| 0.953 \| 0.463 name \| PATCH -----------------+------: array_small_and \| 2.343 array_small_diff \| 2.392 array_small_or \| 2.056 name \| HEAD \| PATCH -----------------+------:+------: array_small_and \| 1.429 \| 1.005 array_small_diff \| 1.493 \| 0.878 array_small_or \| 1.672 \| 1.152 name \| PATCH -----------------+------: array_small_and \| 1.422 array_small_diff \| 1.700 array_small_or \| 1.452 Author: Dmitry Bochkarev <dimabochkarev@gmail.com> git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@60057 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-09-29 07:43:22 +00:00
nobu	c751bad51d	benchmarks for String#scan git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@59494 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-08-04 04:25:12 +00:00
k0kubun	697c82b691	benchmark: Rename to bm_erb_render.rb from bm_app_erb_render.rb. I'm told from ko1 that bm_app_* is namespace for Ruby applications, not for ERB and we should use bm_erb_* for ERB benchmark instead. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@58915 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-05-27 02:47:11 +00:00
k0kubun	6f685e106c	erb.rb: Generate static string with opt_str_uminus to skip object allocation for static string. We can't always enable frozen_string_literal pragma because we can't freeze string literals embedded by user for backward compatibility. So we need to use fstring for each static string. Since adding ".freeze" to string literals in #content_dump is slow on compiling, I used unary "-" operator instead. benchmark/bm_app_erb_render.rb: Added rendering-only benchmark to test rendering performance on production environment. This benchmark is created to reproduce the behavior on Sinatra (Tilt). Thus it doesn't use ERB#result to skip parsing compiled code. It doesn't use ERB#def_method too to regard `title` and `content` as local variables. If we use #def_method, `title` and `content` needs to be method call. I wanted to avoid it. This patch's benchmark results is: * Before app_erb_render 1.250 app_erb 0.704 * After app_erb_render 1.066 app_erb 0.686 This patch optimizes rendering performance (app_erb_render) without spoiling (total of rendering +) compiling performance (app_erb). git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@58905 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-05-26 13:49:35 +00:00
normal	60473fe0e3	benchmark/bm_vm2_fiber_switch.rb: check for fiber performance There are currently no benchmarks for Fiber performance, I should've committed this years ago when [Feature #10341] was implemented. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@58606 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-05-08 01:59:12 +00:00
normal	1267dad8da	benchmark/bm_vm_thread_pass_flood.rb: add comment I was about to write off this benchmark while working on GVL improvements on multi-core systems. However I noticed it exposes a weakness in my work-in-progress code when I tested on an old single CPU system. Further testing reveals setting CPU affinity ("schedtool -a 0x1" on Linux) on a modern multi-core system is enough to reproduce the problem exposed by this benchmark. git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@58571 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-05-04 16:47:17 +00:00
normal	9f9d9e5000	benchmark/bm_vm1_gc_wb_.rb: fix GC.start args for invalid keyword "lazy_sweep" does not appear to have ever been a valid kwarg for GC.start, however the opposite of "lazy_sweep" appears to be "immediate_sweep". So use immediate_sweep, and flip the boolean value of each arg. I guess this only started failing with r56981 in Dec 2016 ("class.c: missing unknown_keyword_error", commit e3f0cca2f26ba44c810ac980cdff7dda129ae533) benchmark/bm_vm1_gc_wb_ary.rb: "lazy_sweep: false" => "immediate_sweep: true" * benchmark/bm_vm1_gc_wb_ary_promoted.rb: ditto * benchmark/bm_vm1_gc_wb_obj.rb: ditto * benchmark/bm_vm1_gc_wb_obj_promoted.rb: ditto git-svn-id: svn+ssh://ci.ruby-lang.org/ruby/trunk@58565 b2dd03c8-39d4-4d8f-98ff-823fe69b080e	2017-05-04 08:55:54 +00:00

1 2 3 4 5 ...

424 Коммитов