Are Most Link Extractors (Including WordPress Comment Link Extractor) Wrong?
By Angsuman Chakraborty, Gaea News Network. Wednesday, January 30, 2008
Most link extractors I know of, including the WordPress function that linkifies comments, incorrectly assume that any text containing https:// is a hyperlink. That includes obvious mistakes like the following:
I propose a new protocol starting with phttps://... What do you think?
phttps is not the https protocol, and it should never be treated as a link. Unfortunately, most parsing algorithms are too lazy to detect such cases and linkify the word starting at https://. Thoughts?
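To make the point concrete, here is a minimal sketch in Python of an extractor that refuses to match a scheme glued to a preceding character. It is only an illustration, not how WordPress or any particular library does it; the names URL_RE and extract_links are hypothetical, and the pattern makes no attempt to trim trailing punctuation or validate the URL.

```python
import re

# Hypothetical pattern (an assumption for illustration): the scheme only
# counts when it is NOT immediately preceded by a letter, digit, or one of
# the characters allowed in a scheme name, so "phttps://" is skipped.
URL_RE = re.compile(r"(?<![A-Za-z0-9+.\-])https?://[^\s<>\"']+")


def extract_links(text):
    """Return candidate URLs whose scheme starts at a token boundary."""
    return URL_RE.findall(text)
```

The negative lookbehind is the whole trick: a naive pattern such as `https?://\S+` would happily pick up the tail of phttps://, while this one only matches when http or https begins a token.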
January 30, 2008: 10:01 am
I have noticed this. Personally, I wish there were a way to turn off auto-linking for certain things, especially text within <code> tags. On at least one of my pages I’ve had comments relating to .htaccess code, including hyperlink-like mod_rewrite stuff, and WordPress turns a bunch of inappropriate stuff into links. Another really basic (and specific) thing these programs should be aware of is that https://www.example.com is _never_ a live URL!
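The <code> case mentioned in this comment can be sketched along the following lines in Python. This is a rough illustration under stated assumptions, not WordPress's actual behavior: CODE_SPAN_RE, URL_RE, and linkify_outside_code are hypothetical names, the regex treats <code> blocks as non-nested, and the generated href is not HTML-escaped.

```python
import re

# Assumption: <code> elements are not nested, so a non-greedy match is enough.
CODE_SPAN_RE = re.compile(r"(<code\b[^>]*>.*?</code>)", re.IGNORECASE | re.DOTALL)

# Same hypothetical boundary-aware pattern: skip schemes glued to a prior character.
URL_RE = re.compile(r"(?<![A-Za-z0-9+.\-])https?://[^\s<>\"']+")


def _to_anchor(match):
    """Wrap one matched URL in an anchor tag (no escaping; sketch only)."""
    url = match.group(0)
    return '<a href="{0}">{0}</a>'.format(url)


def linkify_outside_code(html):
    """Linkify URLs in comment text, leaving <code>...</code> spans untouched."""
    parts = CODE_SPAN_RE.split(html)
    # Because the pattern has one capturing group, re.split keeps the <code>
    # spans at odd indexes; only the even-indexed pieces are plain text.
    for i in range(0, len(parts), 2):
        parts[i] = URL_RE.sub(_to_anchor, parts[i])
    return "".join(parts)
```

For example, a comment that quotes a mod_rewrite rule inside <code> keeps that rule verbatim, while a URL in the surrounding sentence still becomes a link.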
Another WordPress Comment Hyperlinking Bug