development fork of ruby/ruby
yui-knk d8601621ed Enhance keep_tokens option for RubyVM::AbstractSyntaxTree parsing methods
Language Server Protocol (LSP) implementations sometimes need token information.
For example, `m(1)` and `m(1, )` have the same AST structure apart from node locations,
so it is impossible to detect the `,` from the AST alone. In the latter case, however,
it might be better to suggest a list of variables for the second argument.
Token information is important in such cases.

This commit adds the following option and methods.

* Add `keep_tokens` option for `RubyVM::AbstractSyntaxTree.parse`, `.parse_file` and `.of`
* Add `RubyVM::AbstractSyntaxTree::Node#tokens`, which returns the tokens for the node, including the tokens for its descendant nodes.
* Add `RubyVM::AbstractSyntaxTree::Node#all_tokens`, which returns all tokens for the input script regardless of the receiver node.
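
A minimal sketch of how the trailing-comma case described above might be detected with these methods. It assumes CRuby 3.2 or later and that each token is an array of the form `[id, type, text, location]`. The helper name `trailing_comma?` is hypothetical and not part of this commit, and comparing token text alone is a simplification.

```ruby
# Hypothetical helper (not part of the commit): reports whether the source
# ends with a trailing comma right before the closing parenthesis.
# Assumes CRuby 3.2+, where the keep_tokens option is available.
def trailing_comma?(src)
  root = RubyVM::AbstractSyntaxTree.parse(src, keep_tokens: true)
  # Assumed token layout: [id, type, text, [first_lineno, first_column, last_lineno, last_column]]
  texts = root.all_tokens.map { |_id, _type, text, _loc| text }
  # Ignore whitespace-only and empty tokens so only significant tokens remain.
  significant = texts.reject { |text| text.strip.empty? }
  significant.last(2) == [",", ")"]
end

puts trailing_comma?("m(1)")
puts trailing_comma?("m(1, )")

# Joining the text of every token should reproduce the original source.
src = "x = 1 + 2"
root = RubyVM::AbstractSyntaxTree.parse(src, keep_tokens: true)
puts root.all_tokens.map { |token| token[2] }.join == src
```

Both calls yield the same AST shape, so the difference is only visible in the token list. Without `keep_tokens: true`, `tokens` and `all_tokens` return `nil`, so a caller would need to opt in explicitly.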

[Feature #19070]

Impacts on memory usage and performance are below:

Memory usage:

```
$ cat test.rb
root = RubyVM::AbstractSyntaxTree.parse_file(File.expand_path('../test/ruby/test_keyword.rb', __FILE__), keep_tokens: true)

$ /usr/bin/time -f %Mkb /usr/local/bin/ruby -v
ruby 3.2.0dev (2022-11-19T09:41:54Z 19070-keep_tokens d3af1b8057) [x86_64-linux]
11408kb

# keep_tokens :false
$ /usr/bin/time -f %Mkb /usr/local/bin/ruby test.rb
17508kb

# keep_tokens :true
$ /usr/bin/time -f %Mkb /usr/local/bin/ruby test.rb
30960kb
```

Performance:

```
$ cat ../ast_keep_tokens.yml
prelude: |
  src = <<~SRC
    module M
      class C
        def m1(a, b)
          1 + a + b
        end
      end
    end
  SRC
benchmark:
  without_keep_tokens: |
    RubyVM::AbstractSyntaxTree.parse(src, keep_tokens: false)
  with_keep_tokens: |
    RubyVM::AbstractSyntaxTree.parse(src, keep_tokens: true)

$ make benchmark COMPARE_RUBY="./ruby" ARGS=../ast_keep_tokens.yml
/home/kaneko.y/.rbenv/shims/ruby --disable=gems -rrubygems -I../benchmark/lib ../benchmark/benchmark-driver/exe/benchmark-driver \
            --executables="compare-ruby::./ruby -I.ext/common --disable-gem" \
            --executables="built-ruby::./miniruby -I../lib -I. -I.ext/common  ../tool/runruby.rb --extout=.ext  -- --disable-gems --disable-gem" \
            --output=markdown --output-compare -v ../ast_keep_tokens.yml
compare-ruby: ruby 3.2.0dev (2022-11-19T09:41:54Z 19070-keep_tokens d3af1b8057) [x86_64-linux]
built-ruby: ruby 3.2.0dev (2022-11-19T09:41:54Z 19070-keep_tokens d3af1b8057) [x86_64-linux]
warming up..

|                     |compare-ruby|built-ruby|
|:--------------------|-----------:|---------:|
|without_keep_tokens  |     21.659k|   21.303k|
|                     |       1.02x|         -|
|with_keep_tokens     |      6.220k|    5.691k|
|                     |       1.09x|         -|
```
2022-11-21 09:01:34 +09:00
.github Let mjit-bindgen use BASERUBY and bundle/inline (#6740) 2022-11-15 23:42:41 -08:00
basictest
benchmark Rename --mjit-min-calls to --mjit-call-threshold (#6731) 2022-11-14 23:38:52 -08:00
bin
bootstraptest Fix bug involving .send and overwritten methods. (#6752) 2022-11-17 23:17:40 -05:00
ccan Fix -Wundef warnings 2022-10-26 18:57:26 +09:00
coroutine Fix and improve coroutines for Darwin (macOS) ppc/ppc64. (#5975) 2022-10-19 23:49:45 +13:00
coverage
cygwin
defs Control non-parallel parts with `.WAIT` if available 2022-11-13 23:54:43 +09:00
doc [ruby/net-http] About the Examples moved to separate file 2022-11-19 15:33:28 +00:00
enc
ext Enhance keep_tokens option for RubyVM::AbstractSyntaxTree parsing methods 2022-11-21 09:01:34 +09:00
gems Update bundled gems list at 2022-10-30 2022-10-30 07:04:08 +00:00
include Refactor RB_SPECIAL_CONST_P (#6759) 2022-11-17 17:55:24 -08:00
internal Enhance keep_tokens option for RubyVM::AbstractSyntaxTree parsing methods 2022-11-21 09:01:34 +09:00
lib [ruby/irb] Add edit command (https://github.com/ruby/irb/pull/453) 2022-11-20 04:47:54 +00:00
libexec
man [ruby/irb] Add description of IRB_LANG, IRBRC, and XDG_CONFIG_HOME to man 2022-10-05 19:20:22 +09:00
misc Ivar copy needs to happen _before_ setting the shape 2022-11-01 15:38:44 -07:00
missing
sample Sync TRICK 2018 (02-mame) 2022-11-05 23:18:32 +09:00
spec Update RSpec gems 2022-11-15 14:45:51 +09:00
template Control non-parallel parts with `.WAIT` if available 2022-11-13 23:54:43 +09:00
test Enhance keep_tokens option for RubyVM::AbstractSyntaxTree parsing methods 2022-11-21 09:01:34 +09:00
tool sync_default_gems.rb: Fix substitution [ci skip] 2022-11-20 18:51:41 +09:00
wasm wasm/README.md: Add a note about the Ruby built for wasm. [ci skip] 2022-11-11 07:57:25 +09:00
win32 Avoid warnings on MINGW: 2022-11-20 11:06:28 +09:00
yjit YJIT: Improve the failure message on enlarging a branch (#6769) 2022-11-18 17:27:07 -08:00
.appveyor.yml Ignore manual files only commits [ci skip] 2022-10-20 21:58:06 +09:00
.cirrus.yml YJIT: Test --yjit-verify-ctx on GitHub Actions as well (#6639) 2022-10-26 18:20:33 -04:00
.dir-locals.el
.document Rewrite Symbol#to_sym and #intern in Ruby (#6683) 2022-11-15 21:34:30 -08:00
.editorconfig
.gdbinit
.git-blame-ignore-revs
.gitattributes
.gitignore
.indent.pro
.rdoc_options
.rspec_parallel
.travis.yml
BSDL
CONTRIBUTING.md
COPYING
COPYING.ja
GPL
KNOWNBUGS.rb
LEGAL
NEWS.md Enhance keep_tokens option for RubyVM::AbstractSyntaxTree parsing methods 2022-11-21 09:01:34 +09:00
README.EXT
README.EXT.ja
README.ja.md
README.md Revert wrong sync in 5958c305e5 [ci skip] 2022-11-20 14:22:41 -08:00
aclocal.m4
addr2line.c Fix and improve coroutines for Darwin (macOS) ppc/ppc64. (#5975) 2022-10-19 23:49:45 +13:00
addr2line.h
array.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
array.rb
ast.c Enhance keep_tokens option for RubyVM::AbstractSyntaxTree parsing methods 2022-11-21 09:01:34 +09:00
ast.rb Enhance keep_tokens option for RubyVM::AbstractSyntaxTree parsing methods 2022-11-21 09:01:34 +09:00
autogen.sh
bignum.c Use `roomof` macro for rounding up divisions 2022-10-14 19:23:25 +09:00
builtin.c
builtin.h
class.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
common.mk Update fake.rb for test-spec 2022-11-20 01:10:40 +09:00
compar.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
compile.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
complex.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
configure.ac Add support for `sockaddr_un` on Windows. (#6513) 2022-11-17 14:50:25 -08:00
constant.h
cont.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
darray.h
debug.c
debug_counter.c
debug_counter.h Remove unused debug counters 2022-11-13 14:00:30 -08:00
dir.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
dir.rb
dln.c Fix and improve coroutines for Darwin (macOS) ppc/ppc64. (#5975) 2022-10-19 23:49:45 +13:00
dln.h
dln_find.c
dmydln.c
dmyenc.c
dmyext.c
encindex.h
encoding.c
enum.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
enumerator.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
error.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
eval.c [Bug #19016] Handle syntax error in main script like other errors 2022-11-20 20:00:40 +09:00
eval_error.c Use `enum ruby_tag_type` over `int` 2022-11-20 20:00:40 +09:00
eval_intern.h
eval_jump.c
file.c Add support for `sockaddr_un` on Windows. (#6513) 2022-11-17 14:50:25 -08:00
gc.c Differentiate T_OBJECT shapes from other objects 2022-11-18 08:31:56 -08:00
gc.h Transition shape when object's capacity changes 2022-11-10 10:11:34 -05:00
gc.rb Transition shape when object's capacity changes 2022-11-10 10:11:34 -05:00
gem_prelude.rb
golf_prelude.rb
goruby.c
hash.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
hrtime.h Fix per-instance Regexp timeout (#6621) 2022-10-24 18:03:26 +09:00
id_table.c
id_table.h
inits.c Rewrite Symbol#to_sym and #intern in Ruby (#6683) 2022-11-15 21:34:30 -08:00
insns.def Adjust indents [ci skip] 2022-11-10 10:52:16 +09:00
internal.h Define `UNDEF_P` and `NIL_OR_UNDEF_P` [EXPERIMENTAL] 2022-10-20 22:05:27 +09:00
io.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
io.rb
io_buffer.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
iseq.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
iseq.h Allow passing a Rust closure to rb_iseq_callback (#6575) 2022-10-18 09:07:11 -07:00
kernel.rb
lex.c.blt
load.c Rename misleading label 2022-11-18 19:42:24 -05:00
loadpath.c
localeinit.c
main.c
marshal.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
marshal.rb
math.c
memory_view.c
method.h
mini_builtin.c Rework `first_lineno` to be `int`. 2022-09-26 00:41:16 +13:00
miniinit.c
mjit.c Rename --mjit-min-calls to --mjit-call-threshold (#6731) 2022-11-14 23:38:52 -08:00
mjit.h Rename --mjit-min-calls to --mjit-call-threshold (#6731) 2022-11-14 23:38:52 -08:00
mjit.rb
mjit_c.rb rename SHAPE_BITS to SHAPE_ID_NUM_BITS 2022-11-18 12:04:10 -08:00
mjit_compiler.c Avoid type limits (#6435) 2022-09-26 09:21:05 +09:00
mjit_compiler.h Revert "Revert "This commit implements the Object Shapes technique in CRuby."" 2022-10-11 08:40:56 -07:00
mjit_compiler.rb
mjit_unit.h
nilclass.rb
node.c Enhance keep_tokens option for RubyVM::AbstractSyntaxTree parsing methods 2022-11-21 09:01:34 +09:00
node.h Enhance keep_tokens option for RubyVM::AbstractSyntaxTree parsing methods 2022-11-21 09:01:34 +09:00
numeric.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
numeric.rb Improve performance some `Integer` and `Float` methods [Feature #19085] (#6638) 2022-10-27 09:13:16 -07:00
object.c Update assertion 2022-11-18 13:58:13 -08:00
pack.c Fix bug in array pack with shared strings 2022-11-10 09:26:37 -05:00
pack.rb [DOC] Link to packed data doc (#6567) 2022-10-18 10:16:22 -05:00
parse.y Enhance keep_tokens option for RubyVM::AbstractSyntaxTree parsing methods 2022-11-21 09:01:34 +09:00
prelude.rb
probes.d
probes_helper.h
proc.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
process.c [DOC] Change formatting in the exec docs 2022-11-19 11:38:16 +09:00
ractor.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
ractor.rb [Bug #19081] Show the caller location in warning for Ractor 2022-10-26 19:43:14 +09:00
ractor_core.h rename SHAPE_BITS to SHAPE_ID_NUM_BITS 2022-11-18 12:04:10 -08:00
random.c [Bug #19100] Add `init_int32` function to `rb_random_interface_t` 2022-11-10 12:06:13 +09:00
range.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
rational.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
re.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
regcomp.c Use `roomof` macro for rounding up divisions 2022-10-14 19:23:25 +09:00
regenc.c Prevent potential buffer overrun in onigmo 2022-10-25 17:02:43 +09:00
regenc.h Use `roomof` macro for rounding up divisions 2022-10-14 19:23:25 +09:00
regerror.c
regexec.c Add default cases for cache point finding function 2022-11-17 23:19:17 +09:00
regint.h Use long instead of int 2022-11-09 23:21:26 +09:00
regparse.c Prevent potential buffer overrun in onigmo 2022-10-25 17:02:43 +09:00
regparse.h
regsyntax.c
ruby-runner.c
ruby.c YJIT: Lazily enable YJIT after prelude (#6597) 2022-10-24 12:20:44 -04:00
ruby_assert.h
ruby_atomic.h
rubystub.c
scheduler.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
shape.c 32 bit comparison on shape id 2022-11-18 12:04:10 -08:00
shape.h 32 bit comparison on shape id 2022-11-18 12:04:10 -08:00
signal.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
siphash.c Fix and improve coroutines for Darwin (macOS) ppc/ppc64. (#5975) 2022-10-19 23:49:45 +13:00
siphash.h
sparc.c
sprintf.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
st.c Fix and improve coroutines for Darwin (macOS) ppc/ppc64. (#5975) 2022-10-19 23:49:45 +13:00
strftime.c
string.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
string.rb
struct.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
symbol.c Rewrite Symbol#to_sym and #intern in Ruby (#6683) 2022-11-15 21:34:30 -08:00
symbol.h
symbol.rb Rewrite Symbol#to_sym and #intern in Ruby (#6683) 2022-11-15 21:34:30 -08:00
thread.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
thread_none.c [wasm] Scan machine stack based on `ec->machine.stack_{start,end}` 2022-11-06 05:03:21 +09:00
thread_none.h
thread_pthread.c Fix possible use of undefined macros on very old macOS [ci skip] 2022-10-17 18:36:08 +09:00
thread_pthread.h
thread_sync.c mutex: Raise a ThreadError when detecting a fiber deadlock (#6680) 2022-11-09 00:43:16 +13:00
thread_sync.rb thread_sync.c: Clarify and document the behavior of timeout == 0 2022-10-17 16:56:00 +02:00
thread_win32.c
thread_win32.h
time.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
timev.h
timev.rb [DOC] Update about `sec` argument of `Time.new` 2022-11-17 21:52:50 +09:00
trace_point.rb
transcode.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
transcode_data.h
transient_heap.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
transient_heap.h
util.c
variable.c Differentiate T_OBJECT shapes from other objects 2022-11-18 08:31:56 -08:00
variable.h Revert "Revert "This commit implements the Object Shapes technique in CRuby."" 2022-10-11 08:40:56 -07:00
version.c YJIT: Show YJIT build option in RUBY_DESCRIPTION (#6738) 2022-11-16 10:08:52 -08:00
version.h
vm.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
vm_args.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
vm_backtrace.c push dummy frame for loading process 2022-10-20 17:38:28 +09:00
vm_callinfo.h Prevent wrong integer expansion 2022-10-13 08:14:04 -07:00
vm_core.h Remove numiv from RObject 2022-11-10 10:11:34 -05:00
vm_debug.h
vm_dump.c push dummy frame for loading process 2022-10-20 17:38:28 +09:00
vm_eval.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
vm_exec.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
vm_exec.h
vm_insnhelper.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
vm_insnhelper.h Remove unused class serial 2022-10-21 14:56:48 -07:00
vm_method.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
vm_opts.h
vm_sync.c
vm_sync.h
vm_trace.c Using UNDEF_P macro 2022-11-16 18:58:33 +09:00
vsnprintf.c
warning.rb
yjit.c YJIT: Add object shape count to stats (#6754) 2022-11-17 12:59:59 -08:00
yjit.h YJIT: Invalidate redefined methods only through cme (#6734) 2022-11-15 12:57:43 -08:00
yjit.rb YJIT: Add object shape count to stats (#6754) 2022-11-17 12:59:59 -08:00

README.md


What is Ruby?

Ruby is an interpreted object-oriented programming language often used for web development. It also offers many scripting features to process plain text and serialized files, or manage system tasks. It is simple, straightforward, and extensible.

Features of Ruby

  • Simple Syntax
  • Normal Object-oriented Features (e.g. class, method calls)
  • Advanced Object-oriented Features (e.g. mix-in, singleton-method)
  • Operator Overloading
  • Exception Handling
  • Iterators and Closures
  • Garbage Collection
  • Dynamic Loading of Object Files (on some architectures)
  • Highly Portable (works on many Unix-like/POSIX compatible platforms as well as Windows, macOS, etc.) cf. https://github.com/ruby/ruby/blob/master/doc/maintainers.rdoc#label-Platform+Maintainers

How to get Ruby with Git

For a complete list of ways to install Ruby, including using third-party tools like rvm, see:

https://www.ruby-lang.org/en/downloads/

The mirror of the Ruby source tree can be checked out with the following command:

$ git clone https://github.com/ruby/ruby.git

There are some other branches under development. Try the following command to see the list of branches:

$ git ls-remote https://github.com/ruby/ruby.git

You may also want to use https://git.ruby-lang.org/ruby.git (actual master of Ruby source) if you are a committer.

How to build

See Building Ruby.

Ruby home page

https://www.ruby-lang.org/

Documentation

Mailing list

There is a mailing list to discuss Ruby. To subscribe to this list, please send the following phrase:

subscribe

in the mail body (not subject) to the address ruby-talk-request@ruby-lang.org.

Copying

See the file COPYING.

Feedback

Questions about the Ruby language can be asked on the Ruby-Talk mailing list or on websites like https://stackoverflow.com.

Bugs should be reported at https://bugs.ruby-lang.org. Read "Reporting Issues" for more information.

Contributing

See "Contributing to Ruby", which includes setup and build instructions.

The Author

Ruby was originally designed and developed by Yukihiro Matsumoto (Matz) in 1995.

matz@ruby-lang.org