You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Can you elaborate on what you mean by "obvious"? On what basis/heuristic do you think readability should discard text like this?
(reuters' use of visually-hidden text that is semantically non-hidden is heavily frowned upon from an accessibility perspective... but I suppose that's not stopping them. 😞 )
Can you say why you wouldn't use them aside from "purity"? If browser engine developers were not tolerant to issues like this, we wouldn't have the web in today's form at all. To deliver remotely acceptable results any HTML parsing package must work under assumption that everything can happen, that is the standard behaviour accepted in early 90s.
Also I do agree with your point, but it has nothing to do with my issue. If you have to pick if it is proper to support 99% of users or 1%, it is normal to support 99% and not 1%. But I do agree with your idea that 1% should be supported too, just Readability API should have something like VoiceReaderAccessibility boolean parameter to switch its behaviour to properly support that 1% too. I would be very happy to see it and would use that on day 1 when it is implemented.
This is the sample page:
https://www.reuters.com/legal/qualcomm-saw-nuvia-buy-chance-save-14-billion-year-arm-fees-ceo-tells-jury-2024-12-18/
It contains the hidden text in this form:
<span style="border: 0px; clip: rect(0px, 0px, 0px, 0px); clip-path: inset(50%); height: 1px; margin: -1px; overflow: hidden; padding: 0px; position: absolute; width: 1px; white-space: nowrap;">, opens new tab</span>
This is passed as a cleaned span to Readability, though it should be obvious that it is hidden:
<span>, opens new tab</span>
The text was updated successfully, but these errors were encountered: