Forum Discussion
RokuMarkn
10 years agoVisitor
I agree with everything in EnTerr's post except for one point -- it's possible for UTF-8 files to be larger than the equivalent UTF-16. Characters between U+0800 and U+FFFF are represented by 2 bytes in UTF-16 but 3 bytes in UTF-8. So if the file consists primarily of text using these characters with little markup, the UTF-8 file can be up to 50% larger than the UTF-16 version.
--Mark
--Mark