SlimerDude Wed 8 Jul 2020
Hi,
I know this is pedantic, I mention it only because it may be affecting SEO and the like...
The fantom.org website is declared with an XHTML DOCTYPE, but the <head> uses an empty <meta> void tag (invalid XML).
But beyond this, there seems to be a missing </div> end tag somewhere that makes even the HTML invalid.
It was noticed by a colleague who was using the Fantom site to test a bug fix to HTML Parser.
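To illustrate why an unclosed <meta> tag is a problem under an XHTML DOCTYPE, here is a minimal sketch in Python (not Fantom, and not the HTML Parser mentioned above). The two markup strings are made-up fragments, not the actual fantom.org source, and is_well_formed_xml is a hypothetical helper name.

```python
import xml.etree.ElementTree as ET


def is_well_formed_xml(markup: str) -> bool:
    """Return True if the markup parses as XML, False otherwise."""
    try:
        ET.fromstring(markup)
        return True
    except ET.ParseError:
        return False


# Hypothetical fragments for illustration only.
# An HTML-style void tag: <meta> is never closed, so </head> mismatches.
void_meta = '<html><head><meta charset="utf-8"></head><body/></html>'
# The XML-friendly form: the tag closes itself with "/>".
closed_meta = '<html><head><meta charset="utf-8"/></head><body/></html>'

print(is_well_formed_xml(void_meta))
print(is_well_formed_xml(closed_meta))
```

An HTML parser accepts the first form because <meta> is a void element there, while an XML parser needs the self-closing form.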
matthew Fri 10 Jul 2020
Thanks for reporting. I'll look at fixing both these things next week.
matthew Mon 13 Jul 2020
@SlimerDude - switched the doctype to HTML5 doctype so you won't be able to parse it as XML anymore. I also fixed the unmatched <div> tag.
SlimerDude Tue 14 Jul 2020
Hi Matthew, cool, the homepage is looking good and parsing nicely!
However, parsing this page, and any other forum page, gives:
sys::ParseErr: End tag </span> does not match start tag <h2>
Seems there's an extraneous span end tag making an appearance.
matthew Tue 14 Jul 2020
I'm sure these are not the only cases of unbalanced tags. I pushed a fix to the website, but I might suggest that improperly formatted HTML is a very common thing in general, and that your HtmlParser be able to more gracefully continue parsing in the face of issues like this. If you find an unexpected <end/> tag, log it as an error and keep parsing - or maybe have an optional strict mode or something.
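The "log it and keep parsing, with an optional strict mode" idea could look roughly like the sketch below. It is written in Python on top of the standard html.parser module and is not the Fantom HtmlParser; BalanceChecker, TagMismatch, check and the strict flag are all hypothetical names, and the recovery rule (ignore the stray end tag) is just one simple choice among several.

```python
from html.parser import HTMLParser

# HTML void elements never take an end tag, so they are not tracked.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class TagMismatch(Exception):
    """Raised in strict mode when an end tag does not match."""


class BalanceChecker(HTMLParser):
    """Hypothetical checker: tracks open tags and reports mismatches."""

    def __init__(self, strict: bool = False):
        super().__init__()
        self.strict = strict
        self.stack = []    # currently open tags, innermost last
        self.errors = []   # messages collected in lenient mode

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_startendtag(self, tag, attrs):
        # Self-closing syntax such as <img/> opens nothing.
        pass

    def handle_endtag(self, tag):
        if self.stack and self.stack[-1] == tag:
            self.stack.pop()
            return
        if self.stack:
            msg = (f"End tag </{tag}> does not match "
                   f"start tag <{self.stack[-1]}>")
        else:
            msg = f"End tag </{tag}> has no open start tag"
        if self.strict:
            raise TagMismatch(msg)
        # Lenient mode: record the problem, ignore the stray end tag.
        self.errors.append(msg)


def check(markup: str, strict: bool = False) -> list:
    """Parse the markup and return the list of logged mismatches."""
    checker = BalanceChecker(strict=strict)
    checker.feed(markup)
    checker.close()
    return checker.errors


print(check("<h2>Title</span></h2>"))
```

In lenient mode the stray </span> is recorded and the following </h2> still closes the heading, while strict=True raises on the first mismatch instead.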
SlimerDude Thu 16 Jul 2020
Don't worry @matthew, I'll not bug you again on this matter! :)