Posts about addressing

Reinventing The Geocoder With Just Three Words

www.vicchi.org rather than 91.146.108.26, because letters and words are easier to remember than numbers, I didn't learn the OS grid reference or the latitude and longitude of my home.

Instead I learned the address. I learned I lived at 45 Ebbisham Road in Worcester Park in London's suburbs. Later, when I learned a bit more about the place I lived in I used to say my full address was 45 Ebbisham Road, Worcester Park, Surrey, KT4 8ND, United Kingdom.

It puzzled me that I knew I lived in England and England was a country but not part of my address. It also puzzled me that I knew I lived in the London Borough of Kingston upon Thames but that also wasn't part of my address.

Little did I know that I was being puzzled by the vague and capricious nature of addressing rules and that years later I'd try to work around these rules as part of my job.

When I was a lot younger than I am now I learned the address of where I was growing up. More about that in a moment. First I want to mention what I didn't learn.

I didn't learn that I was at TQ 23210 65789. Nor did I learn that I lived at 51.377792, -0.23107184. In just the same way that you probably pointed your browser at www.vicchi.org rather than 91.146.108.26, because letters and words are easier to remember than numbers, I didn't learn the OS grid reference or the latitude and longitude of my home.

Instead I learned the address. I learned I lived at 45 Ebbisham Road in Worcester Park in London's suburbs. Later, when I learned a bit more about the place I lived in I used to say my full address was 45 Ebbisham Road, Worcester Park, Surrey, KT4 8ND, United Kingdom.

It puzzled me that I knew I lived in England and England was a country but not part of my address. It also puzzled me that I knew I lived in the London Borough of Kingston upon Thames but that also wasn't part of my address.

Little did I know that I was being puzzled by the vague and capricious nature of addressing rules and that years later I'd try to work around these rules as part of my job.

Just as I learned names rather than coordinates and just as we use domain names rather than IP addresses, there's an unconscious sense that in Europe, North America and elsewhere an address is easy to remember, is hierarchical and structured in a certain consistent way. There’s also a sense that it’s easy to convert an address to and from coordinates and an address always identifies a position on a map precisely.

Which it does, mostly, except when it doesn't.

Last year I was in India and found out first hand just how vague an address can be in a country where there's no postal codes, sometimes no house or property numbers and sometimes not even road names. If someone gave me an address for a hotel in London where the address was North Circular Road, London, I probably wouldn't believe them because I know that there's more to an address in London than just that. Yet an address of Ring Road, New Delhi is absolutely correct. It's also wildly inaccurate but that's all there is. That's the address.

Which brings me, in a roundabout way, to the point of this post and to the problems of geocoding.

In the UK and elsewhere we instinctively know that an address is easy to parse, follows certain rules and identifies a unique location. It therefore follows that as there’s rules to an address, it should be easy to write a geocoder that understands and followed these rules. We know this to be so but in a lot of cases it's wrong. A lot of the time the best a geocoder can do is work out a rough position on a street, based on the fact it knows the start and end numbers on the street. A geocoder can make a good guess of where number 45 is on a street if it knows the street starts at number 1 and ends at number 100. This is often called address interpolation and this is used because gathering the precise coordinates for every single property on a street is a long and arduous task. If you have this sort of data it's called point addressing. It's difficult to make this data and it doesn't come cheap.

In Europe we smile wryly at online address forms which assume that European addresses work the same as US ones. That there's always 2 lines of street address. That there's always a state and ZIP code. We know it doesn't work that way over here.

But if we know that there's a difference in rules for UK and for US addresses why are we then often blind to the fact that other countries don't have street numbers? Or postal codes? Or street names? Or any other combination of the parts of an address we take for granted.

This is why global geocoding is to difficult to do consistently well and it's why most geocoders are really meta geocoders with multiple geocoding engines tuned for different countries or languages or levels in an address hierarchy if that country has such a thing.

Maybe it's time for a different approach. Maybe it's time to stop relying on a method of addressing that seems to have been designed to deliver things to a property or location and look for an alternative that doesn’t need addressing rules, or a hierarchy and works consistently wherever in the world you're taking about.

Here’s how you might do this. Firstly you draw a grid over the surface of the earth. Then you give each of the squares, made by the grid you’ve just drawn, a label. Not an address. Not a number, which would make this easy for computers to work with but not so easy for humans. A label, with words that people can remember and easily get to grips with. Think of it as a location system for humans and not for geocoders.

That’s how you might do it, but in 2013 Chris Sheldrick and co-founder Jack Waley-Cohen did this. The grid they drew was 3m by 3m and the labels they assigned to each square were 3 words. They called it what3words. In early 2014 I met with Chris and I have to say I didn’t quite get the concept. My initial reaction was that what we didn’t need was another unique set of location references. As an industry we’d been there, seen that and done it multiple times. Then a month later I was in India trying to help a cab driver navigate to my hotel whose address was the aforementioned Ring Road, New Delhi.

Then I got it.

w3w-barcelona

What we don’t need is another set of global location references that make it easy for geocoders to do their work. What we don’t need is another attempt to impose postal codes or consistent addressing schemes on the world, whether the world wants it or not.

What we do need is a way to say where something is, quickly, easily and unambiguously and to be able to tell someone. Think about it this way. Which is easier; to say I’m at 41.35636, 2.13491, I’m at Av. Joan Carles I, 64 08908 L’Hospitalet de Llobregat, Barcelona or to say I’m at toffee chest totally? I think the latter is by far the quickest way and one that offers the least potential for confusion when I’m sending this to someone, be that via text message, email or over the phone.

The possibilities for being able to say where you are suddenly got a lot easier and a whole lot wider in scope, allowing you to reference a place, no matter where you are in the world and regardless of what addressing system, if any exists in that place.

It also makes writing a geocoder far far easier too. The names and locations of each triple of words that what3words uses are all algorithmically generated, which makes it small, compact and easy to replicate. It also makes reverse geocoding, turning a set of coordinates into a locations, simpler as you just need to work out which 3m by 3m square you’re in and then looking that up.

Will this ever supplant the current systems out there in the world? Probably not. But does this make life a whole lot easier for people to talk about a place? Yes it does.

You are Here by Roo Reynolds, CC-BY-NC

The Non Golden Rules of Geo (Redux)

last post, that blog is now offline, presumed dead. But one post that seems to keep catching people's imagination is the one in which I, somewhat tongue in cheek, codified the Six Non Golden Rules Of Geo. Much to my satisfaction, it keeps getting mentioned, although the full original post is inaccessible, as is the rest of that blog. Nate Kelso reproduced part of it, as did John Goodwin but until earlier today I'd not been able to find the full post.

Step forward the aforementioned John Goodwin who, with a bit of internet detective work, managed to find a mirror of the post. While I much prefer to link to blog posts rather than reproduce them in full, in this case I'm plagiarising myself and making an exception on the ground of inaccessibility, and have mirrored the post in full here. It's worth mentioning that this post was originally written in February of 2009, when I was still working for Yahoo! so it's a little out of date and was originally posted as ...

Back when I used to work for Yahoo! I wrote a lot of posts for the Geo Technologies blog; for reasons partially explained in my last post, that blog is now offline, presumed dead. But one post that seems to keep catching people's imagination is the one in which I, somewhat tongue in cheek, codified the Six Non Golden Rules Of Geo. Much to my satisfaction, it keeps getting mentioned, although the full original post is inaccessible, as is the rest of that blog. Nate Kelso reproduced part of it, as did John Goodwin but until earlier today I'd not been able to find the full post.

Step forward the aforementioned John Goodwin who, with a bit of internet detective work, managed to find a mirror of the post. While I much prefer to link to blog posts rather than reproduce them in full, in this case I'm plagiarising myself and making an exception on the ground of inaccessibility, and have mirrored the post in full here. It's worth mentioning that this post was originally written in February of 2009, when I was still working for Yahoo! so it's a little out of date and was originally posted as ...

UK Addressing, The Non Golden Rules of Geo or Help! My County Doesn’t Exist

George Bernard Shaw once said the golden rule is that there are no golden rules and at Geo Technologies we understand that there is no one golden rule for geo and so we try to capture and express the world’s geography as it is used and called by the world’s people. Despite the pronouncement on golden rules, a significant proportion of the conversations we have with people about geo lend themselves to the Six Non Golden Rules of Geo, namely that: 1. Any attempt to codify a series of geo rules into a formal, one size fits all, taxonomy will fail due to Rule 2. 2. Geo is bizarre, odd, eclectic and utterly human. 3. People will in the main agree with Rule 1 with the exception of the rules governing their own region, area or country, which they will think are perfectly logical. 4. People will, in the main, think that postal, administrative and colloquial hiearachies are one and the same thing and will overlap. 5. Taking Rule 4 into account, they will then attempt to codify a one size fits all geo taxonomy. 6. There is no Rule 6, see Rule 1.

I codified these rules after a conversation last week, via Twitter and Yahoo! Messenger, with Andrew Woods, a US based developer who was, understandably, confused by the vagaries of the how addresses work in the UK. Andrew’s blog contains the full context but it can be distilled into three key questions: * If the country is The United Kingdom, how come the ISO 3166-2 code is GB? * If the country is The United Kingdom, is England a country? * If England is a country, do I use it in an address?

As a US developer, Andrew is naturally fluent with the US style of addressing, with all of its’ localised and regional exceptions. This is a good example of both Rules 3 and 4 in the real world; most people in the US will use number, street, city, State and ZIP for specifying an address. But how does this transfer to the UK? What’s the equivalent of a State … England, Scotland or Wales? Let’s try to answer some of these problems:

Middlesex In 1824

If the country is The United Kingdom, how come the ISO 3166-2 code is GB?

The UK’s full name is The United Kingdom of Great Britain and Northern Ireland and although the United Kingdom and Great Britain are used interchangeably, Great Britain really refers to England, Scotland and Wales. At the time of writing, both GB and UK are formal ISO 3166-2 codes for the United Kingdom with GB being the assigned code for Great Britain and UK being exceptionally reserved by the United Kingdom.

If the country is The United Kingdom, is England a country?

To be formal and precise, the United Kingdom is a unitary state, not a country, with four “member” countries; England, Northern Ireland, Scotland and Wales.

If England is a country, do I use it in an address?

Normally, no. A full UK address consists of the following: * The addressee’s name, if known or applicable * The company or organisation, if known or applicable * The building name; optional if the building has a number * The number of the building and the name of the street * The locality name;optional * The Post Town * The county; optional if a Post Town and Postcode are supplied * The Postcode

… for example, take our office address of Yahoo! Geo Technologies, 125 Shaftesbury Avenue, London, WC2H 8AD. This address has no building name, a building number and street, no locality name, a Post Town, no county as we have a Post Town and a Post Code, and a Post Code.

Which brings me neatly to another example of Rule 4 and the missing county of this post’s title. The UK’s postal hierarchy and administrative hierarchy are not the same. Since 1996 the first half of a UK postcode, known as the outward code, has been used to help in the sorting of mail but prior to this a set of postal counties were used as part of addresses and these frequently do not match the current set of administrative counties. For example, the county of Middlesex was formally abolished in 1965 with the majority of the county becoming part of Greater London. Despite this and despite the 1996 postcode changes, Middlesex lives on as a postal county and as informal area name with the side effect that it is still possible to send mail, and have it delivered, to places in a county which hasn’t existed for over 40 years.

Oh, and Yahoo! GeoPlanet, naturally, recognises Middlesex and correctly identifies it as a Historical County.