当前位置:网站首页>In V8 how arrays (with source code, picture and text easier to understand)

In V8 how arrays (with source code, picture and text easier to understand)

2022-08-03 23:34:00 all-smile

Continued from the previous Nuggets V8 中的快慢属性,本篇分析V8 How in the array,Knowing whether an array is fully populated or with holes、fast and slow array、Fast and slow conversion、动态扩缩容等等.Many of these language underlying a similar approach,比如:Golangmedium slicedappendThe operation involves expansion processing.

D8A debugging tool using the nuggets, please D8调试工具——jsvu的使用细则

1、全填充 or 带孔

by a small plum,Take a look at what a fully populated array is(Paked-Array),What is an array with holes(Holey-Array)

It was written before稀疏数组,Sparse arrays are more business-applicable,Cleaned is meaningless data,You can compare the array with holes to analyze,If you are interested, please see Nuggets 稀疏数组——Realize the function of backgammon save and reload

const o = ['a', 'b', 'c']
console.log(o[1])          // 'b'

delete o[1]
console.log(o[1])          // undefined
o.__proto__ = { 1: 'B' }
console.log(o[0])          // 'a'
console.log(o[1])          // 'B'   但如何确定要访问原型链??
console.log(o[2])          // 'c'
console.log(o[3])          // undefined

如果一个数组中所有位置均有值,我们称之为全填充Packed)数组;

若某些位置在初始化时未定义(如 const arr = [1, , 3] 中的 arr[1]),或定义后被删除(delete,如上述例子),称之为带孔Holey)数组.

该例子在 V8 的访问可以通过下图解释:

image.png

一开始数组 o 是 packed 的,所以访问 o[1] 时可以直接获取值,而不需要访问原型.

而行 4:delete o[1] 为数组引入了一个孔洞(the_hole),用于标记不存在的属性,同时又行 6 为 o 定义了原型上的 1 属性,当再次获取 o[1] 时会穿孔Then continue to query on the prototype chain.原型链上的查询是昂贵的,可以根据是否有 the_hole 来降低这部分查询开销.

image.png

2、fast and slow array

const arr = [1, 2, 3]
arr[1999] = 1999
// arr 会如何存储?

这个例子中,在行 1 声明完毕后 arr 是一个全填充的数组,但在行 2 马上又定义索引 1999 处值为 1999,此时如果为 arr 创建一个长度为 2000 的完整数组来存储这样的稀疏数据将会非常占用内存,为了应对这种情况,V8 会将数组降级为慢数组,创建一个字典来存储「键、值、描述符」key、value、descriptor) 三元组.这就是 Object.defineProperty(object, key, descriptor) API 同样会做的事情.

  1. 鉴于我们没有办法在 JavaScript 的 API 层面让 V8 找到 HiddenClass 并存储对应的 descriptor 信息,所以当使用 Object.defineProperty 自定义 key、value、descriptor 时,V8 都会使用慢属性,对应到数组中就是慢数组.

  2. Object.defineProperty 是 Vue 2 的核心 API,当对象或数组很庞大时,不可避免地导致访问速度下降,这是底层原理决定的.

So what exactly are fast arrays and slow arrays??我们看下V8The underlying definition of an array: 源代码:v8/src/objects/js-array.h

image.png

  • 快模式:Arrays implement V8 里一个叫 FixedArray 的类,它在内存中是连续的空间,Read and write values ​​directly by index,非常快.如果有 push 或 pop 操作,It will expand or contract dynamically.

  • 慢模式:如前文所介绍,V8 创建了一个字典(HashTable)To record the mapping relationship,The integer value of the index is the key of the dictionary.

Why array is also the object types?

在 V8 The source code clearly states,JSArray 继承自 JSObject,i.e. an array is a special object,而 JS All non-primitive types in are instances of objects,所以 JS Arrays can store multiple types of values.

Also used inside the arraykey-value的存储形式

const testArr = [1, "hello", true, function () {
  return 1;
}];

image.png

2.1、When is a fast array converted to a slow array

(1)、Take a look at the source code first

  1. path:v8/src/objects/js-objects-inl.h

    Fast and slow mode conversion: ShouldConvertToSlowElements

// path:v8/src/objects/js-objects-inl.h

// If the fast-case backing storage takes up much more memory than a dictionary
// backing storage would, the object should have slow elements.
// static
static inline bool ShouldConvertToSlowElements(uint32_t used_elements,
                                               uint32_t new_capacity) {
  uint32_t size_threshold = NumberDictionary::kPreferFastElementsSizeFactor *
                            NumberDictionary::ComputeCapacity(used_elements) *
                            NumberDictionary::kEntrySize;
  return size_threshold <= new_capacity;
}

static inline bool ShouldConvertToSlowElements(JSObject object,
                                               uint32_t capacity,
                                               uint32_t index,
                                               uint32_t* new_capacity) {
  STATIC_ASSERT(JSObject::kMaxUncheckedOldFastElementsLength <=
                JSObject::kMaxUncheckedFastElementsLength);
  if (index < capacity) {
    *new_capacity = capacity;
    return false;
  }
  if (index - capacity >= JSObject::kMaxGap) return true;
  *new_capacity = JSObject::NewElementsCapacity(index + 1);
  DCHECK_LT(index, *new_capacity);
  if (*new_capacity <= JSObject::kMaxUncheckedOldFastElementsLength ||
      (*new_capacity <= JSObject::kMaxUncheckedFastElementsLength &&
       ObjectInYoungGeneration(object))) {
    return false;
  }
  return ShouldConvertToSlowElements(object.GetFastElementsUsage(),
                                     *new_capacity);
}

(2)、分析

  • 如果快数组扩容后的容量是原来的 3 倍以上,意味着它比 HashTable 形式存储占用更大的内存,快数组会转换为慢数组

  • 如果快数组新增的索引与原来最大索引的差值大于 1024,快数组会被转换会慢数组

所以,前面的例子:

const arr = [1, 2, 3];
arr[1999] = 1999;
%DebugPrint(arr);

1999 - 2 > 1024,arr 从快数组转换为哈希形式存储的慢数组.

Look at the detailed operation information below

  • 修改arr之前:

image.png

  • 修改arr之后:
    9c1a1af43947b4da39b9c554d56c312.png

2.2、When is a slow array converted to a fast array

(1)、Take a look at the source code first

  1. path:v8/src/objects/js-objects.cc
// path:v8/src/objects/js-objects.cc

// line:4932
static bool ShouldConvertToFastElements(JSObject object,
                                        NumberDictionary dictionary,
                                        uint32_t index,
                                        uint32_t* new_capacity) {
  // If properties with non-standard attributes or accessors were added, we
  // cannot go back to fast elements.
  if (dictionary.requires_slow_elements()) return false;

  // Adding a property with this index will require slow elements.
  if (index >= static_cast<uint32_t>(Smi::kMaxValue)) return false;

  if (object.IsJSArray()) {
    Object length = JSArray::cast(object).length();
    if (!length.IsSmi()) return false;
    *new_capacity = static_cast<uint32_t>(Smi::ToInt(length));
  } else if (object.IsJSArgumentsObject()) {
    return false;
  } else {
    *new_capacity = dictionary.max_number_key() + 1;
  }
  *new_capacity = std::max(index + 1, *new_capacity);

  uint32_t dictionary_size = static_cast<uint32_t>(dictionary.Capacity()) *
                             NumberDictionary::kEntrySize;

  // 看这里, 当慢数组转换成快数组能节省 不少于 50% 的空间时,才会将其转换
  // Turn fast if the dictionary only saves 50% space.
  return 2 * dictionary_size >= *new_capacity;
}

(2)、分析

Element can be stored in fast array and length is notsmi之间(64位-231到232-1),And the current slow array space saving value is less than or equal to the fast array50%,is transformed into a fast array.

Fast and slow conversion summary

  • 快数组就是以空间换时间的方式,申请了大块连续内存,提高了执行效率.

  • 慢数组以时间换空间,不必申请连续的空间,节省了内存,但需要付出效率变差的代价.

3、Dynamic expansion and contraction

3.1、扩容

看下源码

  1. path:v8/src/objects/js-array.h

    Empty array preallocated size: 4

// path:v8/src/objects/js-array.h

// Dispatched behavior.
DECL_PRINTER(JSArray)
DECL_VERIFIER(JSArray)

// Number of element slots to pre-allocate for an empty array.
// An empty array for pre-allocated size4
static const int kPreallocatedArrayElements = 4;

static const int kLengthDescriptorIndex = 0;

上面代码表明,When the statement an empty array,pre-allocated 4 个字节的存储空间.

所以 [] 与 [1, 2, 3, 4] Take as much memory. 前面说过,JSArray 继承自 JSObject,我们可以在 js-objects.h 中找到如下代码:

  1. path:v8/src/objects/js-objects.h

    扩容公式

// path:v8/src/objects/js-objects.h

// line:551
static const uint32_t kMinAddedElementsCapacity = 16;

// Computes the new capacity when expanding the elements of a JSObject.
static uint32_t NewElementsCapacity(uint32_t old_capacity) {
  // (old_capacity + 50%) + kMinAddedElementsCapacity
  // 扩容公式:The original memory capacity(1.5倍)+ 16
  return old_capacity + (old_capacity >> 1) + kMinAddedElementsCapacity;
}

这是对 JSObject elements expansion and JSArray General approach to scaling.The calculation logic of the capacity after expansion is:in the original space old_capacity increased by half(old_capacity >> 1 右移 1 位表示除 2,Add the original space 1.5 倍),再加上 16.

举例:

const arr = [1, 2, 3, 4];
arr.push(5);
%DebugPrint(arr);
  • arr.push 之前:
    image.png

  • arr.push 后:

image.png

具体分析如下:

  1. 向数组 [1, 2, 3, 4] push 5 时,First determine that the current capacity is full,New capacity needs to be calculated.

  2. old_capacity = 4,new_capacity = 4 + 4 >> 1 + 16 = 22,得出 [1, 2, 3, 4, 5] 的容量为 22 个字节,

  3. V8 Apply to the operating system for a block with a continuous size of 22 字节的内存空间,Then the old data one by one copy,Write the new element again.

3.2 缩容

紧接着,我们在 src/objects/elements.cc 中找到 SetLengthImpl 方法中的如下代码:

// path:src/objects/elements.cc

// line:750
if (2 * length + JSObject::kMinAddedElementsCapacity <= capacity) {
  // If more than half the elements won't be used, trim the array.
  // Do not trim from short arrays to prevent frequent trimming on
  // repeated pop operations.
  // Leave some space to allow for subsequent push operations.
  int elements_to_trim = length + 1 == old_length
                             ? (capacity - length) / 2
                             : capacity - length;
  isolate->heap()->RightTrimFixedArray(*backing_store, elements_to_trim);
  // Fill the non-trimmed elements with holes.
  BackingStore::cast(*backing_store)
      .FillWithHoles(length,
                     std::min(old_length, capacity - elements_to_trim));
} else {
  // Otherwise, fill the unused tail with holes.
  BackingStore::cast(*backing_store).FillWithHoles(length, old_length);
}

When the array elements(如 pop)后,如果数组容量大于等于 length 的 2 倍,则进行容量调整,使用 RightTrimFixedArray 函数,计算出需要释放的空间大小,做好标记,等待 GC 回收;如果数组容量小于 length 的 2 倍,则用 holes 对象填充.

总结:

  1. When there are few elements in the array, it is stored in a linear structure(FixedArray)的,内存地址连续,查找速度快,Can be dynamically expanded and contracted;

  2. Convert to slow array when there are many elements in the array,By creating a dictionary to record the mapping relationship,内存不连续,通过大名鼎鼎的Object.defineProperty(object, key, descriptor)创建

jsThe arrays look different,其实只是V8 在底层实现上做了一层封装,使用两种数据结构实现数组,and through time and space2latitude choice,Optimized the performance of arrays.

参考学习博客


关注我,你会发现一个踏实努力的宝藏前端,让我们一起学习,共同成长吧.

喜欢的小伙伴记得点赞关注收藏哟,回看不迷路

原网站

版权声明
本文为[all-smile]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/215/202208032231590314.html