Change proc_macro::Span::byte_range -> byte_offset. #139901

m-ou-se · 2025-04-16T08:26:15Z

This is about #54725's open question on byte offsets:

Next to span.line() and span.column(), we also want a way to get the byte offset into the file.

We have basically two competing options to expose this:

Option 1 (merge this PR)

impl Span {
    pub fn byte_offset(&self) -> usize;
}

This is consistent with the line and column methods on Span.
To get the offset of the end of the span, you use span.end().byte_offset(), just like how you use span.end().column() for the end column.

Option 2 (close this PR)

impl Span {
    pub fn byte_range(&self) -> Range<usize>;
}

This gives you both ends of the range at once, and uses the Range type (which is usable to index a slice), but is arguably less consistent with the line and column methods.

Curious to hear what y'all think.

👍 for option 1 (merge this PR)
👎 for option 2 (close this PR)

(The decision is still up to the libs-api team. Votes are just useful as input, not as the final decision.)

This is more consistent with Span::line and Span::column.

m-ou-se · 2025-04-16T08:26:20Z

I slightly prefer option 1, as it fits well in our earlier design choice to get rid of LineColumn (or Location) type and make Span itself usable as a way to store a location. E.g.

let location = range.end();

let _ = location.line();
let _ = location.column();
let _ = location.byte_offset();

If one is using an (empty) Span to store a specific location, it might be awkward to have to write .byte_range().start to get its offset.

m-ou-se · 2025-04-16T08:27:03Z

@rust-lang/libs-api If you have an opinion on this, please speak up!

BurntSushi · 2025-04-16T12:13:23Z

I'm fine with option (1) personally.

I don't think option (1) precludes option (2) though, and I could see it being a nice convenience routine to have. But I don't feel strongly about it. Following the API pattern created with line and column numbers SGTM.

dtolnay · 2025-04-16T15:48:25Z

I prefer to keep byte_range and not adding byte_offset.

The use case for byte offset is significantly different than line/column so the consistency objective is weak. For line/column, the overwhelming majority use cases is rendering a line/column in an error message or log message or similar, where you would practically never also want the upper end. Exposing directly on Span the line/column you almost definitely want, and putting the end line/column behind span.end(), prioritizes the majority use cases being obvious and concise (span.line()) and adding the minimal API that facilitates niche other use cases.

A Range-based API for line/column would not fit. For a span that goes from line 2 column 18 to line 4 column 5 (e.g. the braces of a for) returning Range { start: 18, end: 5 } for the column is bizarre and not coherent with a typical meaning of that type. A more coherent API would be something like Range { start: LineColumn { line: 2, column: 18 }, end: LineColumn { line: 4, column: 5 } } but span.line_column_range().start.line is just a worse API for all use cases than span.line().

Byte range is different than this. In my experience the majority use case for byte offset is slicing. Code like https://github.com/dtolnay/cxx/blob/f9d547b60324bc02d9983622159973a75d06ea10/gen/src/error.rs#L115-L142. The PR summary writes off Range for being an iterator: "This gives you both ends of the range at once, but uses the Range type (which is an Iterator)", but misses that Range is more importantly a RangeBounds. People will not be using iteration on a byte range but they will be using it as RangeBounds for slicing.

I don't find that pushing this to be consistent with line/column improves the API. Unlike line/column, a single byte offset (span.byte_offset()) is generally not going to be useful. In the unusual case that someone needs a single one, span.byte_range().start is not worse for them than span.byte_offset(). Meanwhile span.byte_range() is better than span.byte_offset()..span.end().byte_offset().

m-ou-se · 2025-04-16T21:11:34Z

The PR summary writes off Range for being an iterator

I've reworded it a bit, instead focussing on how it can be used to index a slice. Thanks for pointing that out.

m-ou-se · 2025-04-16T21:18:40Z

Your argument sounds convincing to me.

If anyone still prefers option 1 over 2, please speak up.

Amanieu · 2025-04-20T21:26:43Z

I also prefer option 2.

However I have a different concern about stabilization: how will this interact with the new range types tracked in #123741?

Change proc_macro::Span::byte_range -> byte_offset.

c6161c5

This is more consistent with Span::line and Span::column.

m-ou-se added T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. A-proc-macros Area: Procedural macros labels Apr 16, 2025

m-ou-se self-assigned this Apr 16, 2025

rustbot added the S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. label Apr 16, 2025

This comment was marked as off-topic.

Sign in to view

m-ou-se mentioned this pull request Apr 16, 2025

Tracking issue for proc_macro::Span inspection APIs #54725

Open

m-ou-se added the I-libs-api-nominated Nominated for discussion during a libs-api team meeting. label Apr 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change proc_macro::Span::byte_range -> byte_offset. #139901

Change proc_macro::Span::byte_range -> byte_offset. #139901

m-ou-se commented Apr 16, 2025 •

edited

Loading

This comment was marked as off-topic.

m-ou-se commented Apr 16, 2025 •

edited

Loading

m-ou-se commented Apr 16, 2025

BurntSushi commented Apr 16, 2025

dtolnay commented Apr 16, 2025

m-ou-se commented Apr 16, 2025

m-ou-se commented Apr 16, 2025

Amanieu commented Apr 20, 2025

Change proc_macro::Span::byte_range -> byte_offset. #139901

Are you sure you want to change the base?

Change proc_macro::Span::byte_range -> byte_offset. #139901

Conversation

m-ou-se commented Apr 16, 2025 • edited Loading

Option 1 (merge this PR)

Option 2 (close this PR)

This comment was marked as off-topic.

m-ou-se commented Apr 16, 2025 • edited Loading

m-ou-se commented Apr 16, 2025

BurntSushi commented Apr 16, 2025

dtolnay commented Apr 16, 2025

m-ou-se commented Apr 16, 2025

m-ou-se commented Apr 16, 2025

Amanieu commented Apr 20, 2025

m-ou-se commented Apr 16, 2025 •

edited

Loading

m-ou-se commented Apr 16, 2025 •

edited

Loading