Subject: Re: [nycphp-talk] regexp for URLs (is this correct?)
Christopher R. Merlo
cmerlo at ncc.edu
Mon May 3 17:12:31 EDT 2004
On 2004-05-03 14:45 -0400, James B. Wetterau Jr. <james at surgam.net> wrote:
> A Perl expert of some repute did the hard work of reading all the
> relevant protocol specs, BNF notation, etc., and came up with this
> valid URL parsing regexp. Anything shorter is likely to miss some
> special case.
I'm glad I don't have to debug that.
I like the idea (sorry, I deleted the original post) of checking with
fopen. Perhaps create a DB table with a column for last good result
from fopen. If that column reaches a threshold, chuck the URL.
-c
--
cmerlo at ncc.edu http://turing.matcmp.ncc.edu/~cmerlo
Q: How many software engineers does it take to change a lightbulb?
A: None. We'll document it in the manual.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 232 bytes
Desc: not available
URL: <http://lists.nyphp.org/pipermail/talk/attachments/20040503/ffb2b28b/attachment.sig>
More information about the talk
mailing list