rustdoc: make langstring parsing more robust #14569

skade · 2014-05-31T22:38:49Z

This changes the parsing of the language string
in code examples so that examples with an unrecognized
language string are no longer considered Rust code.
Previously, an example marked `sh` for shell code
was still treated as Rust.

This relieves authors of having to mark those samples
as `notrust`.

Also adds recognition of the positive marker `rust`.

By default, unmarked examples are still considered Rust.
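
A minimal sketch of the rule described above, written in current Rust syntax rather than the 2014 dialect used in the diff. The function name `is_rust_langstring` is hypothetical and is not taken from the patch; the tokenizer reuses the split on non-alphanumeric characters shown in the review below.

```rust
/// Hypothetical helper (not the PR's code): returns true when a code
/// example with this language string should be treated as Rust.
fn is_rust_langstring(langstring: &str) -> bool {
    // Split on every character that is not alphanumeric.
    for token in langstring.split(|c: char| !c.is_alphanumeric()) {
        match token {
            // Empty tokens come from separators or an empty string.
            "" => {}
            // The positive marker keeps the example classified as Rust.
            "rust" => {}
            // `notrust`, `sh`, and any other unrecognized token opt out.
            _ => return false,
        }
    }
    // No token objected, so unmarked examples stay Rust by default.
    true
}
```

Under these assumptions an empty language string and `rust` are classified as Rust, while `sh` and `notrust` are not.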

huonw · 2014-05-31T23:30:03Z

src/librustdoc/html/markdown.rs

+  let mut tokens = string.as_slice().split(|c: char| !c.is_alphanumeric());
+
+  for token in tokens {
+    if token.len() == 0 {


This chain could be written as

match token {
    "" => {}
    "should_fail" => should_fail = true,
    "no_run" => ignore = true,
    ...
    _ => notrust = true,
}

skade · 2014-06-01T08:45:00Z

I fixed and re-indented the code. I also changed the logic a bit: if any of the rustdoc tags are seen, the code is considered Rust, even if unrecognized tags are also present (allowing for `.rust .example`).

alexcrichton · 2014-06-01T17:42:01Z

While rustdoc is usually not tested, this logic seems tricky enough that it would lend itself quite well to a unit test. Could you add a test in this module inside a `mod tests` section?

This changes the parsing of the language string in code examples so that unrecognized examples are not considered Rust code. This was, for example, the case when a code example was marked `sh` for shell code. This relieves authors of having to mark those samples as `notrust`. Also adds recognition of the positive marker `rust`. By default, unmarked examples are still considered rust. If any rust-specific tags are seen, code is considered rust unless marked as "notrust". Adds test cases for the detection logic.
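
The detection rule in this commit message could be sketched roughly as follows. This is an illustration in current Rust, not the merged code: the `LangString` struct, the `parse_langstring` name, and the `no_run` field are assumptions, and the tokenizer here keeps `_` and `-` inside tokens (also an assumption) so that a tag such as `should_fail` survives as a single token.

```rust
/// Hypothetical result type; the field names follow the flags mentioned
/// in the review, but the struct itself is an assumption.
#[derive(Debug, Default, PartialEq)]
struct LangString {
    should_fail: bool,
    no_run: bool,
    ignore: bool,
    notrust: bool,
}

/// Hypothetical parser sketching the rule from the commit message.
fn parse_langstring(s: &str) -> LangString {
    let mut data = LangString::default();
    let mut seen_rust_tags = false;
    let mut seen_other_tags = false;

    // Assumption: `_` and `-` are kept so `should_fail` stays one token.
    for token in s.split(|c: char| !(c == '_' || c == '-' || c.is_alphanumeric())) {
        match token {
            "" => {}
            "should_fail" => { data.should_fail = true; seen_rust_tags = true; }
            "no_run" => { data.no_run = true; seen_rust_tags = true; }
            "ignore" => { data.ignore = true; seen_rust_tags = true; }
            // An explicit opt-out always wins.
            "notrust" => data.notrust = true,
            "rust" => seen_rust_tags = true,
            _ => seen_other_tags = true,
        }
    }

    // Unrecognized tags opt out only when no Rust-specific tag was seen.
    data.notrust |= seen_other_tags && !seen_rust_tags;
    data
}
```

With this sketch, `sh` is not Rust, `{.rust .example}` is Rust, `{.sh .should_fail}` is Rust with `should_fail` set, and `rust,notrust` is not Rust because the explicit opt-out takes precedence.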

skade · 2014-06-01T22:18:23Z

Sure, I added one with all important examples.

…r=alexcrichton This changes the parsing of the language string in code examples so that unrecognized examples are not considered Rust code. This was, for example, the case when a code example was marked `sh` for shell code. This relieves authors of having to mark those samples as `notrust`. Also adds recognition of the positive marker `rust`. By default, unmarked examples are still considered rust.

skade changed the title ~~rustdoc: make langstring parsing robuster~~ rustdoc: make langstring parsing more robust May 31, 2014

skade mentioned this pull request May 31, 2014

Helping people to try out the project skade/knob#5

Closed

huonw reviewed May 31, 2014

bors closed this Jun 2, 2014

bors merged commit 3fef7a7 into rust-lang:master Jun 2, 2014

skade mentioned this pull request Dec 6, 2014

Switch from notrust to not_rust in doc comments for opting out of doc tests rust-lang/rfcs#500

Closed
