perf: reduce large pre-allocations for `JavascriptParser::new` #7286

h-a-n-a · 2024-07-23T12:38:24Z

Summary

AsyncDependenciesBlock is of the size of 208 bytes. It was pre-allocated for the capacity of 256:

size_of::<AsyncDependenciesBlock>() = 208

Taking a project with 10000 modules. It will at least allocate (and destroy) the memory of 256 * 208 * 10000 bytes (507Mbs), causing a memory increment and decrement back and forth.

Since the layout AsyncDependenciesBlock is quite compat, two approaches remain:

Change Vec<AsyncDependenciesBlock> to Vec<Box<AsyncDependenciesBlock>>
Change initial capacity of the Vec to a smaller value

The first solution is to reduce the memory usage from 256 * 208 to 256 * 24, giving us a significant memory reduce, however with some potential overhead for sending the value to the heap.
With the second capacity change from 256 to 64.

I also changed the initial capacity of other vecs, although it does not(and should not) have much memory cost difference.

Performance for big projects

The performance difference between these two changes of our internal project of 38134 modules remain unnoticeable. It's around 30 seconds for each build on my computer (Apple M2 Max, 64 GB).

Memory cost

The memory cost of this is a constant value for each module. With the difference number of modules, it results in different amount of allocations created and destroyed. Taking 10000 modules case as an example, we can notice the difference of memory being created and destroyed during each build:

Before:

After:

It's 12.5% improvement of the total memory being allocated.

Clippy complaints of `Vec<Box<T>> where T: Sized`

Clippy is complaining that T of Vec<T> was already allocated on the heap. Since it's a huge struct with that much of pre-allocation, we can workaround this as it was said in the detail page: rust-lang/rust-clippy#3530

Checklist

Tests updated (or not required).
Documentation updated (or not required).

h-a-n-a · 2024-07-23T12:40:07Z

!bench

rspack-bot · 2024-07-23T12:40:29Z

📝 Benchmark detail: Open

Name	Base (2024-07-23 26d1cb6)	Current	Change
10000_development-mode exec	2.25 s ± 22 ms	2.24 s ± 13 ms	-0.25 %
10000_development-mode_hmr exec	699 ms ± 10 ms	697 ms ± 7.3 ms	-0.36 %
10000_production-mode exec	2.84 s ± 38 ms	2.84 s ± 25 ms	-0.20 %
arco-pro_development-mode exec	1.91 s ± 73 ms	1.91 s ± 67 ms	0.00 %
arco-pro_development-mode_hmr exec	434 ms ± 0.95 ms	434 ms ± 0.9 ms	-0.08 %
arco-pro_production-mode exec	3.47 s ± 109 ms	3.51 s ± 87 ms	1.14 %
threejs_development-mode_10x exec	1.77 s ± 20 ms	1.79 s ± 21 ms	1.44 %
threejs_development-mode_10x_hmr exec	865 ms ± 6.6 ms	884 ms ± 6 ms	2.21 %
threejs_production-mode_10x exec	5.74 s ± 25 ms	5.8 s ± 40 ms	1.00 %

h-a-n-a · 2024-07-23T13:15:15Z

!bench

rspack-bot · 2024-07-23T13:15:38Z

📝 Benchmark detail: Open

Name	Base (2024-07-23 26d1cb6)	Current	Change
10000_development-mode exec	2.25 s ± 22 ms	2.24 s ± 20 ms	-0.03 %
10000_development-mode_hmr exec	699 ms ± 10 ms	693 ms ± 6.8 ms	-0.84 %
10000_production-mode exec	2.84 s ± 38 ms	2.83 s ± 23 ms	-0.47 %
arco-pro_development-mode exec	1.91 s ± 73 ms	1.94 s ± 39 ms	1.70 %
arco-pro_development-mode_hmr exec	434 ms ± 0.95 ms	435 ms ± 1.3 ms	0.11 %
arco-pro_production-mode exec	3.47 s ± 109 ms	3.48 s ± 68 ms	0.23 %
threejs_development-mode_10x exec	1.77 s ± 20 ms	1.79 s ± 25 ms	1.31 %
threejs_development-mode_10x_hmr exec	865 ms ± 6.6 ms	882 ms ± 7.1 ms	1.99 %
threejs_production-mode_10x exec	5.74 s ± 25 ms	5.79 s ± 45 ms	0.71 %

github-actions bot added release: performance release: performance related release(mr only) team The issue/pr is created by the member of Rspack. labels Jul 23, 2024

h-a-n-a force-pushed the perf-javascript-parser branch from 9c40c17 to ad225cc Compare July 23, 2024 13:11

perf: init

64e6c98

h-a-n-a force-pushed the perf-javascript-parser branch from ad225cc to 64e6c98 Compare July 23, 2024 14:23

h-a-n-a changed the title ~~perf: reduce allocation for JavascriptParser::new~~ perf: reduce large pre-allocations for JavascriptParser::new Jul 23, 2024

h-a-n-a marked this pull request as ready for review July 23, 2024 15:10

h-a-n-a requested a review from jerrykingxyz as a code owner July 23, 2024 15:10

h-a-n-a enabled auto-merge (squash) July 23, 2024 15:10

github-actions bot assigned h-a-n-a Jul 23, 2024

h-a-n-a requested review from JSerFeng and ahabhgk July 23, 2024 15:10

ahabhgk approved these changes Jul 23, 2024

View reviewed changes

h-a-n-a merged commit f6c8932 into main Jul 23, 2024
46 checks passed

h-a-n-a deleted the perf-javascript-parser branch July 23, 2024 15:12

chenjiahan mentioned this pull request Jul 26, 2024

release: 1.0.0-beta.0 #7316

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: reduce large pre-allocations for `JavascriptParser::new` #7286

perf: reduce large pre-allocations for `JavascriptParser::new` #7286

h-a-n-a commented Jul 23, 2024 •

edited

Loading

h-a-n-a commented Jul 23, 2024

rspack-bot commented Jul 23, 2024 •

edited

Loading

h-a-n-a commented Jul 23, 2024

rspack-bot commented Jul 23, 2024 •

edited

Loading

perf: reduce large pre-allocations for JavascriptParser::new #7286

perf: reduce large pre-allocations for JavascriptParser::new #7286

Conversation

h-a-n-a commented Jul 23, 2024 • edited Loading

Summary

Performance for big projects

Memory cost

Clippy complaints of Vec<Box<T>> where T: Sized

Checklist

h-a-n-a commented Jul 23, 2024

rspack-bot commented Jul 23, 2024 • edited Loading

h-a-n-a commented Jul 23, 2024

rspack-bot commented Jul 23, 2024 • edited Loading

perf: reduce large pre-allocations for `JavascriptParser::new` #7286

perf: reduce large pre-allocations for `JavascriptParser::new` #7286

h-a-n-a commented Jul 23, 2024 •

edited

Loading

Clippy complaints of `Vec<Box<T>> where T: Sized`

rspack-bot commented Jul 23, 2024 •

edited

Loading

rspack-bot commented Jul 23, 2024 •

edited

Loading