Fetch the entire payload within the pipeline #2073

heaths · 2025-02-05T22:03:09Z

Based on our discussion (see OP history for context), we decided on a pattern that would allow most client methods to work like so:

let secret = client.get_secret("name", "version", None).await?.into_body()?;

The entire response is buffered into memory by default. Deserialization happens on that buffer and is therefore not async. This should also give us an opportunity to attach the raw response to ErrorKind::HttpResponse, on which we could provide deserialization helpers but would not deserialize by default.

To support downloading large payloads - or for any case where a customer might otherwise want to stream the response - all Response<T> would support something like:

let response = client.download_blob("blob", None).await?; // get at least the headers
let mut stream = response.into_stream();
while let Some(buf) = stream.try_next().await? {
    // e.g., write buf to file
}

While we'll still have helpers to deserialize into custom model types attached to Response<T> (see #1831 (comment)), this would still allow customers to do something like this if, say, a blob were a structured model or for any model response:

let content: Vec<u8> = stream.try_collect().await?;
let m: Model = serde_json::from_slice(&content)?;

This does mean that into_body() et al. are implemented only for something like Response<T> where T: Deserialize, so pure streaming methods need to return a type that would never implement Deserialize but can stream, like our own ResponseBody or something. Or maybe we return a ResponseBody in lieu of Response<T>.
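A minimal sketch of that split, using only std so it stands alone. Everything here is an assumption for illustration: FromBytes is a hypothetical stand-in for serde's Deserialize, the Response<T> and ResponseBody shapes are simplified, and an Iterator of chunks stands in for an async stream. It is not the azure_core implementation.

```rust
use std::marker::PhantomData;

/// Hypothetical stand-in for `serde::de::DeserializeOwned`, so the sketch needs only std.
pub trait FromBytes: Sized {
    fn from_bytes(bytes: &[u8]) -> Result<Self, String>;
}

/// Marker for streaming-only methods. It deliberately does NOT implement `FromBytes`.
pub struct ResponseBody;

/// Simplified response whose body has already been buffered.
pub struct Response<T> {
    body: Vec<u8>,
    _model: PhantomData<T>,
}

impl<T> Response<T> {
    pub fn new(body: Vec<u8>) -> Self {
        Self { body, _model: PhantomData }
    }

    /// Available for every `T`. An iterator of chunks stands in for an async stream.
    pub fn into_stream(self) -> impl Iterator<Item = Vec<u8>> {
        std::iter::once(self.body)
    }
}

impl<T: FromBytes> Response<T> {
    /// Only exists when `T` can be built from bytes; synchronous because the
    /// body is already in memory.
    pub fn into_body(self) -> Result<T, String> {
        T::from_bytes(&self.body)
    }
}

/// Hypothetical model type used only for this example.
#[derive(Debug, PartialEq)]
pub struct Secret {
    pub value: String,
}

impl FromBytes for Secret {
    fn from_bytes(bytes: &[u8]) -> Result<Self, String> {
        let value = std::str::from_utf8(bytes).map_err(|e| e.to_string())?;
        Ok(Secret { value: value.to_owned() })
    }
}
```

With these definitions, calling into_body() on a Response<ResponseBody> is a compile error, while into_stream() works on both.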

To clarify, the into_stream method does not change anything in the pipeline itself (because, in fact, it's called too late for it to do so), so if you called it on get_secret's response, you'd end up with a stream that yields all of its bytes synchronously. Separately from adding that into_stream method, we'll be changing the pipeline to eagerly read the entire body unless a special flag is provided in the Context.
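To make "yields all of its bytes synchronously" concrete, here is a rough std-only sketch. BufferedResponse, BufferedStream, NoopWake, and poll_once are hypothetical names invented for this example, and the async next() method is an assumption rather than the real stream API. The point is only that the future is ready on its first poll because no I/O remains.

```rust
use std::future::Future;
use std::sync::Arc;
use std::task::{Context as TaskContext, Poll, Wake, Waker};

/// Waker that does nothing; enough to poll a future by hand.
struct NoopWake;

impl Wake for NoopWake {
    fn wake(self: Arc<Self>) {}
}

/// Hypothetical response whose body the pipeline already read eagerly.
pub struct BufferedResponse {
    body: Vec<u8>,
}

impl BufferedResponse {
    pub fn new(body: Vec<u8>) -> Self {
        Self { body }
    }

    /// Called after the pipeline has finished, so it cannot change how the body was read.
    pub fn into_stream(self) -> BufferedStream {
        BufferedStream { chunk: Some(self.body) }
    }
}

/// Hypothetical stream over an in-memory body: one chunk, then the end.
pub struct BufferedStream {
    chunk: Option<Vec<u8>>,
}

impl BufferedStream {
    /// Async in signature only; it never waits on I/O.
    pub async fn next(&mut self) -> Option<Vec<u8>> {
        self.chunk.take()
    }
}

/// Polls a future exactly once. `Some` means it was ready without waiting.
pub fn poll_once<F: Future>(fut: F) -> Option<F::Output> {
    let waker = Waker::from(Arc::new(NoopWake));
    let mut cx = TaskContext::from_waker(&waker);
    let mut fut = Box::pin(fut);
    match fut.as_mut().poll(&mut cx) {
        Poll::Ready(value) => Some(value),
        Poll::Pending => None,
    }
}
```

A stream backed by a live socket would typically return Pending from that first poll instead, which is the behavior the Context flag would opt into.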

Note: if the HTTP status code is not an acceptable success code (see #1733), we should always buffer the entire error response in the first await call so it's available on ErrorKind::HttpResponse (see #2495).
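As a rough illustration of that note, the sketch below keeps the fully buffered error body on the error value and interprets it only on request. SketchErrorKind, its raw_body field, body_as_str, and check_status are all hypothetical and do not reflect the actual shape of ErrorKind::HttpResponse; the list of success codes is passed in because it varies per operation.

```rust
/// Hypothetical error kind; the real `ErrorKind::HttpResponse` may look different.
#[derive(Debug)]
pub enum SketchErrorKind {
    /// Status plus the fully buffered raw error body.
    HttpResponse { status: u16, raw_body: Vec<u8> },
    Other,
}

impl SketchErrorKind {
    /// Helper that interprets the buffered body only when the caller asks for it.
    pub fn body_as_str(&self) -> Option<&str> {
        match self {
            SketchErrorKind::HttpResponse { raw_body, .. } => std::str::from_utf8(raw_body).ok(),
            SketchErrorKind::Other => None,
        }
    }
}

/// Returns the body on an acceptable status; otherwise moves the already
/// buffered body into the error so nothing more needs to be awaited.
pub fn check_status(
    status: u16,
    body: Vec<u8>,
    success_codes: &[u16],
) -> Result<Vec<u8>, SketchErrorKind> {
    if success_codes.contains(&status) {
        Ok(body)
    } else {
        Err(SketchErrorKind::HttpResponse { status, raw_body: body })
    }
}
```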


heaths · 2025-02-05T22:16:38Z

One question that arises is whether we should advertise a way for customers to opt into this same behavior of not buffering the entire payload, e.g., a field on azure_core::ClientMethodOptions or a type to pass through the Context (the former is idiomatic). However we do it, it would have to be pub anyway.
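One possible shape for the "field on the options" variant, as a sketch only. MethodOptions, its stream_response field, BodyMode, and body_mode are hypothetical names and are not part of azure_core::ClientMethodOptions. Treating 2xx as success is a simplifying assumption, since acceptable codes differ per operation.

```rust
/// Hypothetical per-call options; `stream_response` is an invented opt-in flag.
#[derive(Debug, Default, Clone)]
pub struct MethodOptions {
    pub stream_response: bool,
}

/// How the pipeline would hand the body back to the caller.
#[derive(Debug, PartialEq)]
pub enum BodyMode {
    Buffered,
    Streamed,
}

/// Buffers by default, streams only when the caller opted in, and always
/// buffers error responses so the body can travel with the error.
pub fn body_mode(status: u16, options: &MethodOptions) -> BodyMode {
    // Assumption: any 2xx counts as success for this sketch.
    let success = (200..300).contains(&status);
    if success && options.stream_response {
        BodyMode::Streamed
    } else {
        BodyMode::Buffered
    }
}
```

Because the default of a bool is false, MethodOptions::default() keeps the buffered behavior and customers only pay attention to the flag when they want streaming.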

That said, I should clarify that we shouldn't try to deserialize in the pipeline. Deserialization is meant to be late but does not need to be async. This lets customers still grab the buffered response and do whatever - save it, deserialize their own types, whatever - without awaiting again. Besides, if deserialization fails with the entire response buffered already, it's not going to be something a retry will fix.
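A small sketch of that separation, written synchronously with std only so it stands alone. TransportError, fetch_with_retry, and parse_u32 are hypothetical, and the closure stands in for one pipeline attempt. Only the fetch is retried; parsing happens afterwards on the buffer, so a parse failure never triggers another request.

```rust
/// Hypothetical transport failure for a single attempt.
#[derive(Debug, PartialEq)]
pub enum TransportError {
    Timeout,
}

/// Retries only the fetch. Returns the buffered body and the attempt that succeeded.
pub fn fetch_with_retry<F>(
    mut attempt: F,
    max_attempts: u32,
) -> Result<(Vec<u8>, u32), TransportError>
where
    F: FnMut(u32) -> Result<Vec<u8>, TransportError>,
{
    let mut last = TransportError::Timeout;
    for n in 1..=max_attempts {
        match attempt(n) {
            Ok(body) => return Ok((body, n)),
            Err(e) => last = e,
        }
    }
    Err(last)
}

/// Late, synchronous parse of the buffered body. A failure here is final:
/// fetching the same bytes again would not help.
pub fn parse_u32(body: &[u8]) -> Result<u32, String> {
    std::str::from_utf8(body)
        .map_err(|e| e.to_string())?
        .trim()
        .parse::<u32>()
        .map_err(|e| e.to_string())
}
```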

I'll update the OP.

heaths · 2025-04-17T00:44:41Z

We discussed this today and decided on a pattern that would allow most client methods to work like so:

let secret = client.get_secret("name", "version", None).await?.into_body()?;

The entire response is buffered into memory by default. Deserialization happens on that buffer and is therefore not async. This should also give us an opportunity to attach the raw response to ErrorKind::HttpResponse, on which we could provide deserialization helpers but would not deserialize by default.

To support downloading large payloads - or for any case where a customer might otherwise want to stream the response - all Response<T> would support something like:

let response = client.download_blob("blob", None).await?; // get at least the headers
let mut stream = response.into_stream();
while let Some(buf) = stream.try_next().await? {
    // e.g., write buf to file
}

While we'll still have helpers to deserialize into custom model types attached to Response<T> (see #1831 (comment)), this would still allow customers to do something like this if, say, a blob were a structured model or for any model response:

let content: Vec<u8> = stream.try_collect().await?;
let m: Model = serde_json::from_slice(&content)?;

This does mean that into_body() et al. are implemented only for something like Response<T> where T: Deserialize, so pure streaming methods need to return a type that would never implement Deserialize but can stream, like our own ResponseBody or something. Or maybe we return a ResponseBody in lieu of Response<T>. @analogrelay, since you did some refactoring with those types, any initial thoughts?

Note: if the HTTP status code is not an acceptable success code (see #1733), we should always buffer the entire error response in the first await call so it's available on ErrorKind::HttpResponse (see #2495).

analogrelay · 2025-04-17T16:19:24Z

> To support downloading large payloads - or for any case where a customer might otherwise want to stream the response - all Response<T> would support something like:

To clarify, the into_stream method does not change anything in the pipeline itself (because, in fact, it's called too late for it to do so), so if you called it on get_secret's response, you'd end up with a stream that yields all of its bytes synchronously. Separately from adding that into_stream method, we'll be changing the pipeline to eagerly read the entire body unless a special flag is provided in the Context.

Aside from that clarification (which we did cover in the meeting, just wanted to get it here in writing), this sounds good to me!

heaths · 2025-04-17T19:59:48Z

Thank you, @analogrelay! Good catch on that. Yes, and we already have at least partial support with our ResponseBody type.

heaths added the Azure.Core, Client, and design-discussion labels Feb 5, 2025

heaths self-assigned this Feb 5, 2025

github-project-automation bot added this to Azure SDK Rust Feb 5, 2025

github-project-automation bot moved this to Untriaged in Azure SDK Rust Feb 5, 2025

RickWinter moved this from Untriaged to Not Started in Azure SDK Rust Apr 15, 2025

RickWinter added this to the 2025-07 milestone Apr 15, 2025

heaths moved this from Not Started to In Progress in Azure SDK Rust Apr 17, 2025

heaths removed the design-discussion label May 2, 2025

heaths mentioned this issue May 7, 2025

Replace Model trait with Format trait (applied to Response instead) #2559

Open
