We ran some java code using cron in Linux to persist thousands of records in production database. The locale charmap in that box was "ANSI_X3.4-1968". Now, we took following steps before persisting those to database.
1. Use StringEscapeUtils.unescapeHtml4 on the text
2. Write the String in UTF-8 format and persist in database
Now the problem is after these steps special characters are showing up as "?". Is it possible to revert it back to the original character?
I have simulated the problem with following steps.
- Change Eclipse encoding to "ANSI_X3.4-1968"
- Write following lines of codes
String insertSpecial = StringEscapeUtils.unescapeHtml4("×");
System.out.println(insertSpecial);
String uni = new String(insertSpecial.getBytes(), "UTF-8");// This value is currently in DB
System.out.println(uni);
Now I want to get back "×" from the String "uni". Any help will be appreciated.
See Question&Answers more detail:
os 与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…