character encoding - All inclusive Charset to avoid "java.nio.charset.MalformedInputException: Input length = 1"?

Question

Welcome To Ask or Share your Answers For Others

character encoding - All inclusive Charset to avoid "java.nio.charset.MalformedInputException: Input length = 1"?

asked Oct 17, 2021 in Technique[技术] by 深蓝 (71.8m points)

character encoding - All inclusive Charset to avoid "java.nio.charset.MalformedInputException: Input length = 1"?

I'm creating a simple wordcount program in Java that reads through a directory's text-based files.

However, I keep on getting the error:

java.nio.charset.MalformedInputException: Input length = 1

from this line of code:

BufferedReader reader = Files.newBufferedReader(file,Charset.forName("UTF-8"));

I know I probably get this because I used a Charset that didn't include some of the characters in the text files, some of which included characters of other languages. But I want to include those characters.

I later learned at the JavaDocs that the Charset is optional and only used for a more efficient reading of the files, so I changed the code to:

BufferedReader reader = Files.newBufferedReader(file);

But some files still throw the MalformedInputException. I don't know why.

I was wondering if there is an all-inclusive Charset that will allow me to read text files with many different types of characters?

Thanks.

See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

1 Answer

深蓝 · Answer 1 · 2021-10-16T23:52:18+0000

You probably want to have a list of supported encodings. For each file, try each encoding in turn, maybe starting with UTF-8. Every time you catch the MalformedInputException, try the next encoding.

Categories

character encoding - All inclusive Charset to avoid "java.nio.charset.MalformedInputException: Input length = 1"?

character encoding - All inclusive Charset to avoid "java.nio.charset.MalformedInputException: Input length = 1"?

Please log in or register to add a comment.

Please log in or register to answer this question.

1 Answer

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags