ascii - How do I translate 8bit characters into 7bit characters? (i.e. Ü to U)

Question

Welcome To Ask or Share your Answers For Others

ascii - How do I translate 8bit characters into 7bit characters? (i.e. Ü to U)

1 Answer

深蓝 · Answer 1 · 2021-10-17T02:50:10+0000

For .NET users the article in CodeProject (thanks to GvS's tip) does indeed answer the question more correctly than any other I've seen so far.

However the code in that article (in solution #1) is cumbersome. Here's a compact version:

// Based on http://www.codeproject.com/Articles/13503/Stripping-Accents-from-Latin-Characters-A-Foray-in
private static string LatinToAscii(string inString)
{
    var newStringBuilder = new StringBuilder();
    newStringBuilder.Append(inString.Normalize(NormalizationForm.FormKD)
                                    .Where(x => x < 128)
                                    .ToArray());
    return newStringBuilder.ToString();
}

To expand a bit on the answer, this method uses String.Normalize which:

Returns a new string whose textual value is the same as this string, but whose binary representation is in the specified Unicode normalization form.

Specifically in this case we use the NormalizationForm FormKD, described in those same MSDN docs as such:

FormKD - Indicates that a Unicode string is normalized using full compatibility decomposition.

For more information about unicode normalization forms, see Unicode Annex #15.

Categories

ascii - How do I translate 8bit characters into 7bit characters? (i.e. Ü to U)

ascii - How do I translate 8bit characters into 7bit characters? (i.e. Ü to U)

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags

Categories

ascii - How do I translate 8bit characters into 7bit characters? (i.e. &#220; to U)

ascii - How do I translate 8bit characters into 7bit characters? (i.e. &#220; to U)

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags

ascii - How do I translate 8bit characters into 7bit characters? (i.e. Ü to U)

ascii - How do I translate 8bit characters into 7bit characters? (i.e. Ü to U)