Create a more flexible HTML comment parser and raise warning #516

yluobb · 2024-12-08T08:58:51Z

Towards: #501

This PR builds on top of the work from here. It creates a more flexible HTML comment parser so that we no longer rely on spaces being present to successfully parse HTML comments. It also raises warnings when spaces are missing.

linux-foundation-easycla · 2024-12-08T08:58:56Z

✅login: yluobb / (70fa4d2)
✅login: jsuereth / (230a23e)
✅login: jsuereth / (230a23e, b8d7626)
✅login: jsuereth / (230a23e, b8d7626, cbc790e)

The committers listed above are authorized under a signed CLA.

jsuereth · 2024-12-09T12:55:10Z

crates/weaver_semconv_gen/src/parser.rs

+    let (input, result) = take_until("-->")(input)?;
+
+    // Check for spacing issues and warn if found
+    if result.starts_with("semconv") && !input.trim().starts_with("semconv ") {


Three things:

I don't think we need to force spaces here.

For separation of concerns, the validation logic should go in parse_semconv_snippet_directive. Ideally this function JUST parses HTML comments, which have no restrictions on whitespace.

We're not using eprintln! for warnings; instead, you'll want to find a way to return a Diagnostic, e.g. using weaver's result type, WResult. This might require some wiring to make sure non-fatal errors (NFEs) can make it out of the parser. I haven't tried to sort out NFEs with the nom parser bits yet. Let me know if you have issues and I'll see if I can put a scaffold up for you there.
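To make the second and third points concrete, here is a minimal, std-only sketch of the separation being suggested: one function that only extracts the comment body, and a directive parser that returns warnings as data instead of printing them. `Warning` and `WithWarnings` are hypothetical stand-ins invented for this illustration; they are not weaver's `Diagnostic` or `WResult`, and the real parser is built on nom rather than on `str` methods.

```rust
/// Hypothetical non-fatal warning (stand-in for a weaver diagnostic).
#[derive(Debug, PartialEq)]
struct Warning(String);

/// Hypothetical carrier for a parsed value plus any non-fatal warnings.
#[derive(Debug, PartialEq)]
struct WithWarnings<T> {
    value: T,
    warnings: Vec<Warning>,
}

/// Extracts the body of an HTML comment. No whitespace rules are enforced here.
fn parse_html_comment(input: &str) -> Option<&str> {
    let rest = input.trim().strip_prefix("<!--")?;
    let end = rest.find("-->")?;
    Some(&rest[..end])
}

/// Parses a `semconv` directive and reports spacing problems as warnings.
fn parse_semconv_snippet_directive(line: &str) -> Option<WithWarnings<String>> {
    let body = parse_html_comment(line)?;
    // The keyword must be followed by whitespace to count as a directive.
    let args = body.trim().strip_prefix("semconv")?;
    if !args.starts_with(char::is_whitespace) {
        return None;
    }
    let mut warnings = Vec::new();
    // Validation lives here, not in the comment parser.
    if !body.starts_with(char::is_whitespace) || !body.ends_with(char::is_whitespace) {
        warnings.push(Warning(format!(
            "missing space around directive in `{}`",
            line.trim()
        )));
    }
    Some(WithWarnings {
        value: args.trim().to_string(),
        warnings,
    })
}
```

With this shape, `<!--semconv my.group-->` still parses, and the caller decides how to surface the single warning it carries.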

jsuereth · 2024-12-10T12:39:45Z

A quick thought on warnings here:

You can update the update_markdown_contents function to use weaver result.
You can update the logic to something like:

  if parser::is_markdown_snippet_directive(line) {
    ... existing ...
  } else if parser::is_possible_markdown_snippet_directive(line) {
    ... issue warnings ...
  }

Where is_possible_markdown_snippet_directive is a function you can write by reusing the HTML comment parser together with some kind of very flexible misspelling-finding parser.
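One possible shape for that check, as a rough std-only sketch: take the first word of the comment body and flag it when it is close to `semconv` but not an exact match. Everything here is an assumption made for illustration, including the helper names `html_comment_body` and `edit_distance`, the simplified `is_markdown_snippet_directive`, and the arbitrary distance threshold of 2. The real functions live in the nom-based `parser` module and may look quite different.

```rust
/// Extracts the body of an HTML comment, with no whitespace requirements.
fn html_comment_body(line: &str) -> Option<&str> {
    let rest = line.trim().strip_prefix("<!--")?;
    let end = rest.find("-->")?;
    Some(&rest[..end])
}

/// Levenshtein edit distance, computed one row at a time.
fn edit_distance(a: &str, b: &str) -> usize {
    let a: Vec<char> = a.chars().collect();
    let b: Vec<char> = b.chars().collect();
    let mut prev: Vec<usize> = (0..=b.len()).collect();
    for (i, ca) in a.iter().enumerate() {
        let mut cur = vec![i + 1];
        for (j, cb) in b.iter().enumerate() {
            let cost = if ca == cb { 0 } else { 1 };
            let best = (prev[j] + cost).min(prev[j + 1] + 1).min(cur[j] + 1);
            cur.push(best);
        }
        prev = cur;
    }
    prev[b.len()]
}

/// Simplified stand-in for the existing check: first word is exactly `semconv`.
fn is_markdown_snippet_directive(line: &str) -> bool {
    html_comment_body(line).and_then(|b| b.split_whitespace().next()) == Some("semconv")
}

/// True when the comment looks like a misspelled or badly spaced directive.
fn is_possible_markdown_snippet_directive(line: &str) -> bool {
    if is_markdown_snippet_directive(line) {
        return false;
    }
    match html_comment_body(line).and_then(|b| b.split_whitespace().next()) {
        Some(word) => {
            let word = word.to_lowercase();
            // Keyword glued to its argument, or within two edits of `semconv`.
            word.starts_with("semconv") || edit_distance(&word, "semconv") <= 2
        }
        None => false,
    }
}
```

A threshold this loose will also flag unrelated words that happen to be two edits away, so it probably only makes sense for warnings, never for hard errors.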

jsuereth and others added 4 commits December 8, 2024 02:41

Add failing tests and better printlns to discover the issues.

230a23e

Increase flexibility of HTML comment parsing.

b8d7626

Fix clippy/fmt

cbc790e

Add warning message for incorrect format

70fa4d2

yluobb requested a review from a team as a code owner December 8, 2024 08:58

yluobb mentioned this pull request Dec 8, 2024

Raise warnings when special tags are misspelled, or missing spaces #501

Open

jsuereth reviewed Dec 9, 2024
