[mkgmap-dev] [PATCH] Translating curly quotes into straight ones
From Marko Mäkelä marko.makela at iki.fi on Sat May 9 21:46:54 BST 2009
On Mon, May 04, 2009 at 11:24:00AM +0300, Marko Mäkelä wrote: > I haven’t had problems with äö with mkgmap-r1024 --latin1, but I did > notice that the UTF-8 apostrophe ’ (U+2019), properly mapped to 0x92, > is displayed as the font substitution character by the Edge 705. > It should probably be mapped to the straight single quote ' (U+0027). > I'd suggest this translation for all the funny quotes: > > map U+2018..U+201B (‘’‚‛) to ' (U+0027) > map U+201C..U+201F (“”„‟) to " (U+0022) I tried the attached patch, but it is not working. If I get around to it, I will test this with a smaller input file again (not the whole Finland). I was using --latin1, and according to my understanding of CodeFunctions.createEncoderForLBL(), it should create a new AnyCharsetEncoder("cp1252"). However, the patch is apparently not translating the cp1252-encoded ’ into ' (U+0027). Also, I would rather perform the translation before the utf8-to-cp1252 step or integrate it in a custom utf-8-to-cp1252 translation that does not depend on java.nio.charset.Charset. I’m only posting this patch as “work in progress.” Marko -------------- next part -------------- A non-text attachment was scrubbed... Name: cp1252-quotes.patch Type: text/x-diff Size: 925 bytes Desc: not available Url : http://lists.mkgmap.org.uk/pipermail/mkgmap-dev/attachments/20090509/54b34500/attachment.bin
- Previous message: [mkgmap-dev] What about accentuated letters
- Next message: [mkgmap-dev] What about accentuated letters
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the mkgmap-dev mailing list