Fix slow compilation of large zero-initialized arrays; warn on large non-zero fills by 39ali · Pull Request #586 · Rust-GPU/rust-gpu

39ali · 2026-04-22T18:43:32Z

let arr = [0; 32 * 1024] compiled very slowly. The cause: Rvalue::Repeat with a zero element routes through memset, while will create a huge list of repeated store instructions, the fix: use OpConstantNull instead when fill_byte == 0, valid for any SPIR-V composite type, single instruction regardless of size.

SPIR-V has no equivalent of OpConstantNull for non-zero values , so the best we can do is tell the user early by emitting a warning

…e with N operands

Firestar99

Fix slow compilation

First, could you explain what kind of slowdown you're seeing?

use OpConstantNull instead when fill_byte == 0

I like that idea, although I'm unsure if spirt supports that command @eddyb?

will create a huge list of repeated store instructions

Isn't the bug then that the emitted OpConstantComposite was broken up into a bunch of individual OpStore instructions / not being reassembled into an OpCompositeConstruct / OpConstantComposite? (Our destructure_composites post-link pass may be related to this?)

39ali · 2026-04-24T22:43:11Z

i needed to clarify that for let arr = [0; 32 * 1024] it'll translate to OpConstantComposite
but for arr = [1; 32 * 1024] it'll translate to OpInBoundsAccessChain + OpStore

First, could you explain what kind of slowdown you're seeing?

compile time is slow, i believe its because of the long OpConstantComposite instruction

i'll attach the .spv file for this shader :

#![no_std]
use spirv_std::glam::Vec4;
use spirv_std::spirv;

#[inline(never)]
#[spirv(compute(threads(1)))]
pub fn self_assignment(
    #[spirv(storage_buffer, descriptor_set = 0, binding = 1)] y: &mut [f32],
) {
    let arr = [0f32; 32 * 1024];
    for i in 0..arr.len() {
        y[i] = arr[i];
    }
}

spirv.txt

39ali requested review from Firestar99, LegNeato and eddyb as code owners April 22, 2026 18:43

39ali force-pushed the array-memset branch from 29260f6 to d0267ca Compare April 22, 2026 18:44

[0; N] emit a single OpConstantNull rather than an OpConstantComposit…

9fb34da

…e with N operands

39ali force-pushed the array-memset branch from 81da8b8 to 9fb34da Compare April 22, 2026 19:03

Firestar99 requested changes Apr 24, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix slow compilation of large zero-initialized arrays; warn on large non-zero fills#586

Fix slow compilation of large zero-initialized arrays; warn on large non-zero fills#586
39ali wants to merge 1 commit intoRust-GPU:mainfrom
39ali:array-memset

39ali commented Apr 22, 2026

Uh oh!

Firestar99 left a comment

Uh oh!

39ali commented Apr 24, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

39ali commented Apr 22, 2026

Uh oh!

Firestar99 left a comment

Choose a reason for hiding this comment

Uh oh!

39ali commented Apr 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

First, could you explain what kind of slowdown you're seeing?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

39ali commented Apr 24, 2026 •

edited

Loading