Closed
Labels: DomCrawler, RFC (Request For Comments: proposals about features that you want to be discussed)
Description
The DomCrawler component has an optional integration with masterminds/html5
that provides better parsing of HTML5 when the library is available. The DOMDocument::loadHTML
method follows the HTML4 spec, not the HTML5 one, and so has many issues with HTML5 features.
Given how often we get bug reports about broken parsing in dom-crawler, I'm wondering whether we should make masterminds/html5
a required dependency instead of an optional one, so that any HTML using the HTML5 doctype is always parsed with the HTML5 parser.
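For readers unfamiliar with the optional integration, here is a minimal sketch of the idea. It is not DomCrawler's actual code: `parseHtml()` is a hypothetical helper, and the doctype check is a simplified assumption. It uses masterminds/html5 when the class can be autoloaded and the markup starts with the HTML5 doctype, and falls back to `DOMDocument::loadHTML` otherwise.

```php
<?php

// Hypothetical helper illustrating the "optional dependency" pattern.
// The doctype detection below is a simplification, not Symfony's logic.
function parseHtml(string $html): DOMDocument
{
    // Assumption: treat a leading "<!doctype html>" as the HTML5 marker.
    $isHtml5 = preg_match('/^\s*<!doctype html>/i', $html) === 1;

    // Use the HTML5 parser only if masterminds/html5 is installed.
    if ($isHtml5 && class_exists(\Masterminds\HTML5::class)) {
        return (new \Masterminds\HTML5())->loadHTML($html);
    }

    // Fallback: libxml's HTML4-based parser. Warnings about unknown
    // HTML5 tags are collected internally instead of being emitted.
    $dom = new DOMDocument();
    $previous = libxml_use_internal_errors(true);
    $dom->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    return $dom;
}

$dom = parseHtml('<!DOCTYPE html><html><body><p>Hi</p></body></html>');
```

With the library as a required dependency, the `class_exists()` branch would always be taken for HTML5 documents, which is the behavior change this RFC is asking about.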
If the answer is yes, the next question is which version should make that switch.
What do you think?