Skip to content

Strikethrough detection wraps code identifiers in del tags on color-heavy papers #5

@sentientsergio

Description

@sentientsergio

Papers with color formatting and strikethrough in the PDF (P4003R0, P4007R0) produce output where most code identifiers are wrapped in <del> tags inside :::wording-remove blocks. tomd is correctly detecting real PDF formatting that other extractors ignore — P4007R0 has 292 strikethrough spans and 422 colored spans in just the first 5 pages.

The issue is readability: when every std::execution, coroutine_handle<>, set_value, paper reference, and library name is wrapped in <del>, the output is hard to read and hard to search.

Possible approaches:

Full evaluation report: https://github.com/cppalliance/paperlint-eval/blob/main/tomd-eval/report.md (see Check 2b and Limitations §3).

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions