Are Most Link Extractors (Including WordPress Comment Link Extractor) Wrong?

By Angsuman Chakraborty, Gaea News Network
Wednesday, January 30, 2008

Most link extractors I know, including the function in WordPress to linkify comments, incorrectly assumes that any text containing https:// is a hyperlink including obvious mistakes like the following:
I propose a new protocol starting with phttps://... What do you think?

phttp is not http protocol and it should never be treated as a link. Unfortunately most parsing algorithms are lazy not to detect such cases and linkify the word starting with https://. Thoughts?

Discussion
May 6, 2008: 7:50 am

[...] Did you read about the other hyperlinking defect? [...]

January 30, 2008: 10:01 am

I have noticed this. personally I wish there were a way to turn off auto-linking for certain things, especially text within <code> tags. On at least one of my pages i’ve had comments relating to .htaccess code, including hyperlink-like mod_rewrite stuff, and WordPress turns a bunch of inappropriate stuff into links.

Another really basic (and specific) thing these programs should be aware of is that https://www.example.com is _never_ a live URL!

YOUR VIEW POINT
NAME : (REQUIRED)
MAIL : (REQUIRED)
will not be displayed
WEBSITE : (OPTIONAL)
YOUR
COMMENT :