email edit
Jay R. Ashworth
jra at baylink.com
Wed Nov 1 10:35:52 PST 2006
On Wed, Nov 01, 2006 at 12:50:31PM -0500, Brian K. White wrote:
> >50K email addresses off Usenet over a month, and test their edit
> >against them. That's a sufficiently large corpus, I think, to catch
> >all the corner cases.
>
> Indeed, almost anything can be a valid email.
> You don't need even one @
That's actually no longer true. Most mailers can't handle
bang-addresses anymore, and there are vanishingly few mailers with UUCP
links configured at this point.
> >50K email addresses off Usenet over a month, and test their edit
> I wouldn't be surprised if you can have more than one @
You can't.
> You don't need even one .
You do.
> You can have any number of .'s
This is true.
> You can have one @ and no dots
Only on internal networks.
> then theres: network!hosta!hostb!hostc!user
See above; find me a live example of an address like this, and I'll buy
you dinner at Berns.
> and: john%node.bitnet at cunyvm.cuny.edu
This one, I did mention, but they're almost dead now, too; there aren't
too many networks without smart gateways these days.
> Those are all odd cases but perfectly correct if thats the way your
> buisiness happens to be set up.
True. But we're not programming a mailer...
> I think there are only a few bad characters you could really test for and
> safely say it's bad... except then there is utf8 and utf16 multibyte
> encodings coming our way pretty soon and any "bad" character might just be
> part of a multibyte japanese address or something.
At the moment, RHS's must be ASCII. International domain names are
available, but still kinda dodgy.
> The only correct way to validate an email address is to actually validate
> it.
> Which requires the machine running fp to have a working mta.
That's the best way, but you can't really *validate* it, either; "VRFY"
only works on mailers run by Really Stupid Sysadmins.
> Or you could use one of the many 3rd party apps out there that claim to do
> just this.
And they do that how?
> In the end, you _still_ can't really know because you have no control over
> the recipient mail servers.
No, but we're only trying to trap typos.
> really, "none" is a completely valid email address and by rights I have no
> right to delete it
For sufficiently small values of "valid", I guess.
> Thus it's not going to be handled properly by anything as simple as an edit.
> And I happen to rather not touch the data rather than touch it improperly.
There's something to be said for that viewpoint, certainly.
Cheers,
-- jra
--
Jay R. Ashworth jra at baylink.com
Designer Baylink RFC 2100
Ashworth & Associates The Things I Think '87 e24
St Petersburg FL USA http://baylink.pitas.com +1 727 647 1274
"That's women for you; you divorce them, and 10 years later,
they stop having sex with you." -- Jennifer Crusie; _Fast_Women_
More information about the Filepro-list
mailing list