<html>
<head>
<style><!--
.hmmessage P
{
margin:0px;
padding:0px
}
body.hmmessage
{
font-size: 12pt;
font-family:Calibri
}
--></style></head>
<body class='hmmessage'><div dir='ltr'>Hi Gert,<br><br>yes, I am sure these problems were partly solved in each company to deduplicate<br>adresses, but we probably cannot code that in Java for the whole world. Besides that I think<br>now that mkgmap is not the right place to do it, or at least not the housenumber2 branch ;-)<br><br>Gerd<br><br><br><br><br><div><hr id="stopSpelling">From: thesurveyor@wolke7.net<br>To: mkgmap-dev@lists.mkgmap.org.uk<br>Date: Sat, 18 Apr 2015 14:40:48 +0200<br>Subject: Re: [mkgmap-dev] address search and case significance of street name<br><br><div style="font-family:Verdana;font-size:12.0px;"><div>
<div>Hi,</div>
<div> </div>
<div>oooh, comparing street names, thats a never ending story :-(</div>
<div> </div>
<div>I assume you will find every typographical error you can think of. And I'm sure you/we won't find a rule to correct those errors. So from my point of view the only thing we can do is to simplify the name to eliminate the typical typos.</div>
<div> </div>
<div>I've done this for a database of street names, just for Germany a long time ago. The data haven't been from OSM, it was long before OSM started.</div>
<div> </div>
<div>In that system we<br>
- we replaced all special chars like, ".,;-<>!§$%&/()=?#*+:" with nothing<br>
- we replaced more than one blank with one blank<br>
- we replaced other white characters, like TAB with a blank<br>
- we replaced the German "ß" (scharfes s) with "s", "ä" with "ae" and so on<br>
- we replaced all double chars e.g. "aa", "bb", "cc", ... with the single char "a", "b", "c", ...</div>
<div>- compared all street names case-insensitiv (in fact we did this by replacing all upper chars with the lower char)<br>
</div>
<div>and then we used that string for all comparisons of the street names. But we displayed the original string.</div>
<div> </div>
<div>Maybe that helps you a little bit.</div>
<div> </div>
<div>Regards,<br>
Gert</div>
<div> </div>
<div>
<div style="padding:10px 0 10px 10px;border-left:2px solid #C3D9E5;word-wrap:break-word;">
<div style=""><b>Gesendet:</b> Samstag, 18. April 2015 um 13:53 Uhr<br>
<b>Von:</b> "Thorsten Kukuk" <kukuk@suse.de><br>
<b>An:</b> mkgmap-dev@lists.mkgmap.org.uk<br>
<b>Betreff:</b> Re: [mkgmap-dev] address search and case significance of street name</div>
<div><br>
Hi Andrzej,<br>
<br>
On Sat, Apr 18, Andrzej Popowski wrote:<br>
<br>
> > If any element processed later has a name like "ABC street" or "abc<br>
> > Street" which we consider as a street name, we will use "Abc Street"<br>
> > again.<br>
><br>
> I'm not sure what for are used names from this table. I don't think<br>
> that case could be important for comparison of street names. But I<br>
> would prefer to see street name on a map with the original spelling.<br>
<br>
But what is the original spelling? And more important, what's the<br>
right one?<br>
name of highway?<br>
name of route=street?<br>
addr:street?<br>
<br>
Yes, some people add addr:street to highways ...<br>
<br>
During the last days I looked at the warnings from mkgmap in<br>
regards to mismatch of street names, and I can only say, that<br>
typos are everywhere.<br>
<br>
Thorsten<br>
<br>
--<br>
Thorsten Kukuk, Senior Architect SLES & Common Code Base<br>
SUSE LINUX GmbH, Maxfeldstr. 5, 90409 Nuernberg, Germany<br>
GF: Felix Imendörffer, Jane Smithard, Jennifer Guild, Dilip Upmanyu, Graham Norton, HRB 21284 (AG Nürnberg)<br>
_______________________________________________<br>
mkgmap-dev mailing list<br>
mkgmap-dev@lists.mkgmap.org.uk<br>
<a href="http://www.mkgmap.org.uk/mailman/listinfo/mkgmap-dev" target="_blank">http://www.mkgmap.org.uk/mailman/listinfo/mkgmap-dev</a></div>
</div>
</div>
</div>
<div> </div>
<div class="ecxsignature"> </div></div>
<br>_______________________________________________
mkgmap-dev mailing list
mkgmap-dev@lists.mkgmap.org.uk
http://www.mkgmap.org.uk/mailman/listinfo/mkgmap-dev</div>                                            </div></body>
</html>