Update the strip accents and diacritics feature

I have some suggestions about the "Strip accents and diactritics" feature.
Instead if stripping everything unfamiliar in Western Europe, I suggest it should change Unicode characters to legal ones closely resembling them.

For example:

upper case Á should be upper case Å or lower case à
upper case Ú should be lower case ù
upper case Ó should be lower case ò
upper case Í should be lower case ì
upper case Ő should be upper case Ö
upper case Ű should be upper case Ü

lower case á should be lower case à
lower case ú should be lower case ù
lower case ó should be lower case ò
lower case í should be lower case ì
lower case ő should be lower case ö
lower case ű should be lower case ü

Characters stripped but actually legal in normal non-Unicode SMS: ö, Ö, ü, Ü.

3 votes

Anonymous shared this idea · Oct 8, 2016 · Report… · Admin →

An error occurred while saving the comment

Eske Rahn commented · October 9, 2016 1:26 AM · Report

I like the idea, but it is a bit complex, as the set of non-unicode characters available is language dependent, by a makrer in the SMS-header. But sure it should be possible to make an algorithm that searches for the language closest to the entered text, and then substitutes for the remaining.

For details, see e.g the "locking shift" sets here: https://en.wikipedia.org/wiki/GSM_03.38

Submitting...

Give Feedback

Update the strip accents and diacritics feature

Your importance score has been recorded.

Feedback

Feedback

Update the strip accents and diacritics feature

We're glad you're here

Your importance score has been recorded.

We're glad you're here

We're glad you're here

We're glad you're here

Feedback

Categories