Reply to post: The dark shadows of C and Unix...

Hey, AI software developers, you are taking Unicode into account, right ... right?

Anonymous Coward
Anonymous Coward

The dark shadows of C and Unix...

.... where text was not regarded an important part of programming needing a specific support, and programmers thought it could handled like just an "array of bytes" and comparison could be simple byte-by-byte matches. Without understanding it could work only for a very small subset of languages - "ASCII English" only.

It looks this very narrow mindset is still alive today - I would think people interested in languages processing would have known more about how properly "normalize" text before processing it - but once again it looks they know and understand English only.

That's also why any translation that doesn't use English have far bigger chances of being even worse than those from or to English.

POST COMMENT House rules

Not a member of The Register? Create a new account here.

  • Enter your comment

  • Add an icon

Anonymous cowards cannot choose their icon