[mkgmap-dev] Far-east asia character encoding
From Tan, Eng Ten tanengten at yahoo.com on Mon Jul 19 02:42:30 BST 2010
Hi all, I dont know if this has been raised before but even mkgmap-1655 could not produce an .IMG with proper Japanese character encoding. I see the code assumes ENCODING_FORMAT9 when the specified --charset falls into the "any" category. The function CodeFunctions.guessCodePage() is also rather limited. To get an .IMG with the correct Japanese encoding, the correct settings are: ENCODING_FORMAT10 img encoding / Codepage 932 / SJIS/MS932 character encoding. I've added this IF-block into CodeFunctions.createEncoderForLBL(): } else if ("sjis".equals(charset) || "ms932".equals(charset)) { funcs.setEncodingType(ENCODING_FORMAT10); funcs.setEncoder(new AnyCharsetDecoder(charset)); funcs.setCodepage(932); } An alternative is to allow explicit setting of encodingType (FORMAT6/9/10) from the command line. I see there is already a --code-page command line parameter and maybe this explicit value (if set) should override guessCodePage(). Cheers ettan -------------- next part -------------- An HTML attachment was scrubbed... URL: http://lists.mkgmap.org.uk/pipermail/mkgmap-dev/attachments/20100718/96ac2b85/attachment.html
- Previous message: [mkgmap-dev] Strange strings in addresses (r1654)
- Next message: [mkgmap-dev] Far-east asia character encoding
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the mkgmap-dev mailing list