A friend of mine has been asking me a few questions about encoding for a paper he is working on.
I don’t fully understand his research, but from what I can gather he is
analyzing Japanese texts, and that involves understanding character encodings.
Character encoding is not a topic that most native English-speaking programmers are familiar with.
The most that the average programmer will know is the existence of ASCII and UTF-8 encoding.
The usual rule of thumb: if the text uses anything beyond the English alphabet
and Arabic numerals (i.e. 1, 2, 3, 4, 5, 6, …), use UTF-8; otherwise, ASCII will do.
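
To make that rule of thumb concrete, here is a minimal Python sketch. The helper name `fits_in_ascii` and the sample strings are my own, made up for illustration. It only checks whether a string can be encoded as ASCII, and shows that plain English text produces the same bytes under both encodings, while Japanese text needs UTF-8.

```python
def fits_in_ascii(text: str) -> bool:
    """Return True if every character in `text` can be encoded as ASCII."""
    try:
        text.encode("ascii")
        return True
    except UnicodeEncodeError:
        # Raised when a character falls outside the 128 ASCII code points.
        return False


text_en = "Hello 123"   # English letters and Arabic numerals only
text_ja = "日本語"        # Japanese text (hypothetical sample)

print(fits_in_ascii(text_en))  # ASCII is enough here
print(fits_in_ascii(text_ja))  # ASCII cannot represent these characters

# For ASCII-only text, UTF-8 produces exactly the same bytes as ASCII.
print(text_en.encode("ascii") == text_en.encode("utf-8"))

# UTF-8 uses multiple bytes per character for the Japanese sample.
print(len(text_ja), "characters ->", len(text_ja.encode("utf-8")), "bytes")
```

Because UTF-8 is a superset of ASCII at the byte level, picking UTF-8 for everything costs nothing for English-only text.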