[persist] refactor Blob impl for Azure for higher performance #31127

ParkMyCar · 2025-01-21T15:07:53Z

This refactors the impl of Blob for Azure in a way that should be faster. The BlobClient we use from the azure_storage_blob crate returns a Stream that when await-ed sends a ranged GET request for a chunk of a blob. This PR refactors our impl so we await each ranged request in a tokio::task which increases the concurrency at which we fetch chunks of a Part.

It also refactors how we handle the case when the content-length header is missing, and adds metrics so we can track how often this occurs.

Motivation

Maybe progress against https://github.com/MaterializeInc/database-issues/issues/8892

Checklist

This PR has adequate test coverage / QA involvement has been duly considered. (trigger-ci for additional test/nightly runs)
This PR has an associated up-to-date design doc, is a design doc (template), or is sufficiently small to not require a design.
If this PR evolves an existing $T ⇔ Proto$T mapping (possibly in a backwards-incompatible way), then it is tagged with a T-proto label.
If this PR will require changes to cloud orchestration or tests, there is a companion cloud PR to account for those changes that is tagged with the release-blocker label (example).
If this PR includes major user-facing behavior changes, I have pinged the relevant PM to schedule a changelog post.

* move fetching each chunk of a Part into a tokio::task * reduce copying in the case we get an invalid content-length header * add metrics for tracking the number of responses missing content-length

bkirwi · 2025-01-21T20:42:27Z

src/persist/src/azure.rs

+        // valuable.
+        let mut stream = blob.get().into_stream();
+
+        while let Some(value) = stream.next().await {


Could we map/buffered here instead of spinning up individual tasks? A bit closer to the S3 impl (which does not fork off individual tasks) and makes it easier to cap the concurrency per fetch...

good call! refactored to use FuturesOrdered like the S3 blob impl

bkirwi · 2025-01-21T20:47:14Z

src/persist/src/azure.rs

                        .lgbytes
                        .persist_azure
                        .new_region(usize::cast_from(content_length));
                    PreSizedBuffer::Sized(region)
                }
-                0 => PreSizedBuffer::Unknown(Vec::new()),
+                0 => {
+                    metrics.get_invalid_resp.inc();


The S3 metrics say that a "content-length of 0 isn't necessarily invalid", which makes sense to me. Could we inc this only if the size turns out to not match the header?

(Generally I'm not convinced of the need to have this defensive coding here, so it'd be handy if this metric fired only in the cases that it was actually load-bearing!)

Ahhh yeah you're totally right, updated!

* remove metrics counting * use FuturesOrdered instead of tokio::task

…es not match content-length

ParkMyCar changed the title ~~[persist] slightly different impl for Azure blob store~~ [persist] less mem-copying in impl for Azure blob store Jan 21, 2025

ParkMyCar changed the title ~~[persist] less mem-copying in impl for Azure blob store~~ [persist] less mem-copying in impl for Azure blob store, in the worst case Jan 21, 2025

start, refactor Blob impl for Azure

8cc2778

* move fetching each chunk of a Part into a tokio::task * reduce copying in the case we get an invalid content-length header * add metrics for tracking the number of responses missing content-length

ParkMyCar force-pushed the persist/azure-blob branch from 0f4186d to 8cc2778 Compare January 21, 2025 18:00

ParkMyCar changed the title ~~[persist] less mem-copying in impl for Azure blob store, in the worst case~~ [persist] refactor Blob impl for Azure for higher performance Jan 21, 2025

ParkMyCar marked this pull request as ready for review January 21, 2025 18:04

ParkMyCar requested a review from a team as a code owner January 21, 2025 18:04

ParkMyCar requested review from pH14 and bkirwi January 21, 2025 18:08

bkirwi reviewed Jan 21, 2025

View reviewed changes

ParkMyCar added 2 commits January 21, 2025 22:58

GitHub feedback

7d2b67f

* remove metrics counting * use FuturesOrdered instead of tokio::task

report invalid header if the number of bytes read from the network do…

8bf647e

…es not match content-length

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[persist] refactor Blob impl for Azure for higher performance #31127

[persist] refactor Blob impl for Azure for higher performance #31127

ParkMyCar commented Jan 21, 2025 •

edited

Loading

bkirwi Jan 21, 2025 •

edited

Loading

ParkMyCar Jan 22, 2025

bkirwi Jan 21, 2025

ParkMyCar Jan 22, 2025

[persist] refactor Blob impl for Azure for higher performance #31127

Are you sure you want to change the base?

[persist] refactor Blob impl for Azure for higher performance #31127

Conversation

ParkMyCar commented Jan 21, 2025 • edited Loading

Motivation

Checklist

bkirwi Jan 21, 2025 • edited Loading

Choose a reason for hiding this comment

ParkMyCar Jan 22, 2025

Choose a reason for hiding this comment

bkirwi Jan 21, 2025

Choose a reason for hiding this comment

ParkMyCar Jan 22, 2025

Choose a reason for hiding this comment

ParkMyCar commented Jan 21, 2025 •

edited

Loading

bkirwi Jan 21, 2025 •

edited

Loading