Update the strip accents and diacritics feature
I have some suggestions about the "Strip accents and diactritics" feature.
Instead if stripping everything unfamiliar in Western Europe, I suggest it should change Unicode characters to legal ones closely resembling them.
For example:
upper case Á should be upper case Å or lower case à
upper case Ú should be lower case ù
upper case Ó should be lower case ò
upper case Í should be lower case ì
upper case Ő should be upper case Ö
upper case Ű should be upper case Ü
lower case á should be lower case à
lower case ú should be lower case ù
lower case ó should be lower case ò
lower case í should be lower case ì
lower case ő should be lower case ö
lower case ű should be lower case ü
Characters stripped but actually legal in normal non-Unicode SMS: ö, Ö, ü, Ü.
-
Eske Rahn commented
I like the idea, but it is a bit complex, as the set of non-unicode characters available is language dependent, by a makrer in the SMS-header. But sure it should be possible to make an algorithm that searches for the language closest to the entered text, and then substitutes for the remaining.
For details, see e.g the "locking shift" sets here: https://en.wikipedia.org/wiki/GSM_03.38