MemoryView

MemoryView provides a way to share multidimensional homogeneous arrays of fixed-size elements in memory among extension libraries.

Disclaimer

  • This feature is still experimental. The specification described here may change in the future.

  • This document is under construction. Please refer to the master branch of ruby for the latest version of this document.

Overview

We sometimes deal with objects whose internal representation is an array of fixed-size elements of the same type stored in a contiguous memory area. Numo::NArray in numo-narray and Magick::Image in rmagick are typical examples of such objects. MemoryView acts as a hub that lets such libraries share the internal data of these objects without copying.

Copy-free sharing of data is very important in fields such as data analysis, machine learning, and image processing. In these fields, people need to handle large amounts of in-memory data with several libraries. If we are forced to copy large data to exchange it among libraries, a large share of the data processing time is spent copying. MemoryView lets you avoid this wasted time.

MemoryView has two categories of APIs:

  1. Producer API

    Classes can register their own MemoryView entry, which allows objects of those classes to expose their MemoryView.

  2. Consumer API

    The consumer API allows us to obtain and manage the MemoryView of an object.

MemoryView structure

A MemoryView structure, rb_memory_view_t, is used for exporting an object's MemoryView. This structure contains a reference to the object that owns the MemoryView, a pointer to the head of the exported memory, and metadata that describes the structure of the memory. The metadata can describe multidimensional arrays with strides.

The members of the MemoryView structure

The MemoryView structure consists of the following members.

  • VALUE obj

    The reference to the original object that owns the memory exported via the MemoryView.

    The Ruby VM manages the reference count of MemoryView-exported objects to protect them from garbage collection. Consumers do not need to protect this object from GC themselves.

  • void *data

    The pointer to the head of the exported memory.

  • ssize_t byte_size

    The number of bytes in the memory pointed to by data.

  • bool readonly

    true for readonly memory, false for writable memory.

  • const char *format

    A string to describe the format of an element, or NULL for unsigned byte.

  • ssize_t item_size

    The number of bytes in each element.

  • const rb_memory_view_item_component_t *item_desc.components

    An array of metadata describing the components of an element.

  • size_t item_desc.length

    The number of items in item_desc.components.

  • ssize_t ndim

    The number of dimensions.

  • const ssize_t *shape

    An array of ndim elements indicating the number of elements in each dimension. This can be NULL when ndim is 1.

  • const ssize_t *strides

    An array of ndim elements indicating the number of bytes to skip to reach the next element in each dimension. This can be NULL when ndim is 1.

  • const ssize_t *sub_offsets

    An array of ndim elements consisting of the offsets in each dimension when the MemoryView exposes a nested array. This can be NULL when the MemoryView exposes a flat array.

  • void *private_data

    The private data that the MemoryView provider uses internally. This can be NULL when no private data is needed.
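
To make the roles of data, ndim, shape, and strides concrete, the following is a minimal standalone sketch of how an element address can be derived from them. The type sketch_view_t and the function sketch_item_pointer are hypothetical names invented for this illustration; they are not part of Ruby's API, and the code is not Ruby's implementation. The sketch assumes that shape and strides are non-NULL and that sub_offsets is not used, and it uses ptrdiff_t in place of ssize_t so that it builds without POSIX headers.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for the subset of rb_memory_view_t members used here.
 * ptrdiff_t replaces ssize_t so the sketch builds without POSIX headers. */
typedef struct {
    void *data;               /* head of the exported memory */
    ptrdiff_t item_size;      /* bytes per element */
    ptrdiff_t ndim;           /* number of dimensions */
    const ptrdiff_t *shape;   /* number of elements in each dimension */
    const ptrdiff_t *strides; /* bytes to skip in each dimension */
} sketch_view_t;

/* Return the address of the element at `indices`, which must hold ndim values.
 * Assumes shape and strides are non-NULL and sub_offsets is not used. */
static void *
sketch_item_pointer(const sketch_view_t *view, const ptrdiff_t *indices)
{
    unsigned char *ptr = view->data;
    for (ptrdiff_t i = 0; i < view->ndim; ++i) {
        /* each index must stay inside its dimension */
        assert(indices[i] >= 0 && indices[i] < view->shape[i]);
        /* advance by this dimension's byte stride, once per index step */
        ptr += indices[i] * view->strides[i];
    }
    return ptr;
}
```

For a 2x3 array of int stored row by row, the strides would be 3 * sizeof(int) and sizeof(int), so the indices {1, 2} address the last element of the second row.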

MemoryView APIs

For consumers

  • bool rb_memory_view_available_p(VALUE obj)

    Return true if obj supports exporting a MemoryView. Return false otherwise.

    Even if this function returns true, rb_memory_view_get is not guaranteed to succeed.

  • bool rb_memory_view_get(VALUE obj, rb_memory_view_t *view, int flags)

    If the given obj supports exporting a MemoryView that conforms to the given flags, this function fills view with the information of the MemoryView and returns true. In this case, the reference count of obj is increased.

    If the given combination of obj and flags cannot export a MemoryView, this function returns false. The content of view is not touched in this case.

    The exported MemoryView must be released by rb_memory_view_release when the MemoryView is no longer needed.

  • bool rb_memory_view_release(rb_memory_view_t *view)

    Release the given MemoryView view and decrement the reference count of view->obj.

    Consumers must call this function when the MemoryView is no longer needed. Failing to call this function leads to a memory leak.

  • ssize_t rb_memory_view_item_size_from_format(const char *format, const char **err)

    Calculate the number of bytes occupied by an element.

    When the calculation fails, this function stores the location in format where it failed into err and returns -1.

  • void *rb_memory_view_get_item_pointer(rb_memory_view_t *view, const ssize_t *indices)

    Calculate the location of the item indicated by the given indices. The length of indices must equal view->ndim. This function initializes view->item_desc if needed.

  • VALUE rb_memory_view_get_item(rb_memory_view_t *view, const ssize_t *indices)

    Return the Ruby object representation of the item indicated by the given indices. The length of indices must equal view->ndim. This function uses rb_memory_view_get_item_pointer.

  • bool rb_memory_view_init_as_byte_array(rb_memory_view_t *view, VALUE obj, void *data, const ssize_t len, const bool readonly)

    Fill the members of view as a 1-dimensional byte array.

  • void rb_memory_view_fill_contiguous_strides(const ssize_t ndim, const ssize_t item_size, const ssize_t *const shape, const bool row_major_p, ssize_t *const strides)

    Fill the strides array with the byte strides of a contiguous array of the given shape with the given element size.

  • void rb_memory_view_prepare_item_desc(rb_memory_view_t *view)

    Fill the item_desc member of view.

  • bool rb_memory_view_is_contiguous(const rb_memory_view_t *view)

    Return true if the data in the MemoryView view is row-major or column-major contiguous.

    Return false otherwise.

  • bool rb_memory_view_is_row_major_contiguous(const rb_memory_view_t *view)

    Return true if the data in the MemoryView view is row-major contiguous.

    Return false otherwise.

  • bool rb_memory_view_is_column_major_contiguous(const rb_memory_view_t *view)

    Return true if the data in the MemoryView view is column-major contiguous.

    Return false otherwise.
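
The stride filling and contiguity checks described above can be illustrated with a small standalone sketch. The functions sketch_fill_contiguous_strides and sketch_is_row_major_contiguous are hypothetical names that only mirror the documented parameters of rb_memory_view_fill_contiguous_strides and rb_memory_view_is_row_major_contiguous; they are not Ruby's source code, and the real functions may differ in detail. As an assumption for portability, ptrdiff_t stands in for ssize_t.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Fill `strides` with the byte strides of a contiguous array.
 * Hypothetical helper: mirrors the documented parameters, not Ruby's source. */
static void
sketch_fill_contiguous_strides(ptrdiff_t ndim, ptrdiff_t item_size,
                               const ptrdiff_t *shape, bool row_major_p,
                               ptrdiff_t *strides)
{
    ptrdiff_t n = item_size;
    assert(ndim > 0 && item_size > 0);
    if (row_major_p) {
        /* row-major: the last dimension varies fastest */
        for (ptrdiff_t i = ndim - 1; i >= 0; --i) {
            strides[i] = n;
            n *= shape[i];
        }
    }
    else {
        /* column-major: the first dimension varies fastest */
        for (ptrdiff_t i = 0; i < ndim; ++i) {
            strides[i] = n;
            n *= shape[i];
        }
    }
}

/* Return true if `strides` matches the row-major contiguous layout. */
static bool
sketch_is_row_major_contiguous(ptrdiff_t ndim, ptrdiff_t item_size,
                               const ptrdiff_t *shape, const ptrdiff_t *strides)
{
    ptrdiff_t n = item_size;
    for (ptrdiff_t i = ndim - 1; i >= 0; --i) {
        if (strides[i] != n) return false;
        n *= shape[i];
    }
    return true;
}
```

For example, with a shape of {2, 3, 4} and 8-byte elements, the row-major strides come out as {96, 32, 8} and the column-major strides as {8, 16, 48}.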