It seemed difficult to believe, but Java seems alarmingly Latin-1.
Quick and Dirty Hack for UTF-8 Support in ResourceBundle
Java UTF–8 international character support with Tomcat and Oracle
You can use native2ascii to convert files to ascii. This tool will expand characters to escaped character sequences. Personally, I think that is ridiculous. It makes the default Java utilities unusable for a majority of the world… Sure, I could write my own… but why don’t they just make the common libraries use UTF8?
Java Bug ID: 4508058 UTF-8 encoding does not recognize initial BOM
Problem outputting utf-8 from java
This makes me wonder, which programming language has the best support for internationalisation?










One Trackback
[…] We have been looking at this framework for an intranet site… it led to my realisation that Java really made things pretty difficult to display international character sets… but that was an earlier post. […]