Last modified: 2014-05-14 18:56:38 UTC
The PHP parser currently recognizes RFC/PMID/ISBN links even if they are buried in unrelated text, for example: fooRFC 1234bar. We should add \b restrictions to the regexp to ensure that magic links stand apart from other text. See also bug 28950, which asks for the whitespace restrictions in magic links to be loosened somewhat.
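To illustrate the effect of the proposed \b restrictions, here is a minimal sketch in Python rather than PHP. The patterns LOOSE and STRICT and the helper find_magic_links are hypothetical simplifications for illustration; they are not the actual regex from includes/parser/Parser.php, and they ignore ISBN handling entirely.

```python
import re

# Hypothetical, simplified patterns (assumption: not the real Parser.php regex).
# LOOSE mimics the reported behavior: it matches anywhere in the text.
LOOSE = re.compile(r"(RFC|PMID) +([0-9]+)")

# STRICT adds word-boundary assertions on both sides, as the report proposes.
STRICT = re.compile(r"\b(RFC|PMID) +([0-9]+)\b")


def find_magic_links(text, pattern):
    """Return every substring of `text` that `pattern` treats as a magic link."""
    return [m.group(0) for m in pattern.finditer(text)]


if __name__ == "__main__":
    for sample in ("fooRFC 1234bar", "See RFC 1234.", "(PMID 42)"):
        print(sample, find_magic_links(sample, LOOSE), find_magic_links(sample, STRICT))
```

With the loose pattern, the link buried in fooRFC 1234bar is still picked up; with the strict pattern it is not, while links set off by whitespace or punctuation keep matching. Note that \b semantics can differ between regex engines (for example in how non-ASCII letters are classified), so the exact behavior in PHP would need to be confirmed with parser tests.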
Source code file name (and path) welcome in case a contributor would like to give fixing the regexes a shot. Marking as easy.
Unfortunately parser changes are never quite so "easy" -- even though the source code change is small, before deployment we need to grep through all our existing wikis to be sure that no one is using RFC links that will be broken. Perhaps some language wiki uses prefixes for possessives and quite likes the current behavior. We won't know until we look. But sure, patch welcome! Just don't be surprised if it's not immediately committed. (Oh, and be sure to include parser tests with your patch.) Source code file name is includes/parser/Parser.php -- here's the code in question: http://git.wikimedia.org/blob/mediawiki%2Fcore.git/2e50b896f1a55667ced32502caa9681c36df7587/includes%2Fparser%2FParser.php#L1387
Gotcha. Thanks for elaborating!