Reply to post: Re: Surely the answer is

It has been 15 years, and we're still reporting homograph attacks – web domains that stealthily use non-Latin characters to appear legit

bombastic bob Silver badge
Devil

Re: Surely the answer is

right - OCR "homonyms" should all translate to the appropriate charset before name lookups happen, or at least before registrars accept them as non-duplicates.

And doing periodic name cleanup might be a good idea, requiring takedowns of any domain that's a lookalike (and assuming they're being used for fraud).

So basically construct a map of UTF-8 chars to ISO8859-1 lookalike chars, then run every domain name through that matrix, see if duplicates show up.

I assume other-than-english lingos might need something similar.

POST COMMENT House rules

Not a member of The Register? Create a new account here.

  • Enter your comment

  • Add an icon

Anonymous cowards cannot choose their icon