Yahoo! Pipes: Make it work at all?

A prize to anyone who can make a simple Atom entry or a simple RSS entry get geocoded by the Yahoo! Pipes Location Extractor. I’ve spent the last 30 minutes on it, and failed.

Non-working pipe is my attempt. GeoRSS works, but location extraction doesn’t.

Update:

  1. Location Extractor seems to work against the HTML pages referenced by the feed, not the content in the feed.
  2. Minor changes to the HTML page seem to break the parsing — it seems to be very targeted towards Craigslist postings.  A page with my address, but a different map link seems to extract the Sebastapool address, while a page with just a map link doesn’t seem to extract at all.

I guess I don’t need to sell my MetaCarta stock yet… unless I’m way off, this shows that MetaCarta is significantly ahead of the game for extracting locations from unstructured text. Not that this is a surprise to me 😉

3 Responses to “Yahoo! Pipes: Make it work at all?”

  1. Maciej Ceglowski Says:

    Thanks for pointing this out. We’ll take a look and see if we can’t get the location extractor to do a better job on these. Mind if we throw hardcopies of these feeds and pages into our test suite?

  2. A note about Pipes: Location Extractor at will work for doughnuts Says:

    […] I saw that crschmidt.net was having a little bit of trouble with the Pipes Location Extractor and I realized that our documentation is too short. […]

  3. crschmidt Says:

    Maciej: Feel free. Unit tests for all!