Some things I’ve liked recently

Here’s a few things I’ve come across the last few days which are pretty nifty…

A little bit of tab clearing

Continuing to dig Guardian America

A look at the literary journal, Zyzzyva

Cloud Oriented Architecture

Dealing with a geographically distributed Memcache

NeverBlock is a Ruby library that allows developers to write non-blocking, concurrent code in a transparent manner. Meaning that you will keep coding in your traditional ways while you get the benefit of non-blocking IO operations.

Interview with Twitter’s Alex Payne

Using CouchDB with Django

Cyberstar, a look at Adrian Holovaty and EveryBlock

Fourth publisher the charm for LAT?

Huffington Post Chicago

The Huffington Post Chicago site has launched and so far I’m impressed. Looks like a nice collection of opening day stories talking about the greatness of Chicago.

I found a few I liked, The Newspaper is Dead, Long Live the Newspaper, Blackhawk Down: Chicago’s Forgotten Franchise and Chicago Tribune’s Social Media Evolution.

One interesting thing is the right rail being mainly links to the local bloggers. I think that’s an incredible way of generating goodwill and probably bringing people back again and again. The key is distribution and aggregation. It’s something I try to preach here though it doesn’t always seem like people are listening.

A Person of Interest

We were talking about the Bruce Ivins tragedy on the way back from lunch and something about it finally clicked in my brain.

Let me take a step back, I’m a big fan of Law & Order Criminal Intent. I remembered an episode that is eerily similar to this case, the one entitled, A Person of Interest.

In a continuation of a previous episode, the season finale finds Detectives Goren (Vincent D’Onofrio) and Eames (Kathryn Erbe) probing the murder of a former Air Force nurse — but the discovery of anthrax nearby intensifies the search and pushes Goren into the headlines when he is blamed for a suspicious scientist’s suicide.

I’m really surprised people haven’t seen that connection. I found only one reference to it in the comments of this post at HuffPo.

Interesting times we live in…

Trying to run CouchDB

Update: I’m happy to say I got CouchDB working with all the dependencies. In order for things to link correctly, I used the patch that DarwinPorts uses for compilation. The main change seems to be adding -install_name to the linker call. Once that was done, everything fell into place.

I’m trying to get CouchDB up and running. Unfortunately, I’m getting no love even after building Spidermonkey. Here’s what I’m getting:

a21772:~/apache-couchdb-0.8.0-incubating jlucas$ sudo couchdbApache CouchDB 0.8.0-incubating (LogLevel=info)
Apache CouchDB is starting.

dyld: NSLinkModule() error
dyld: Library not loaded: Darwin_DBG.OBJ/libjs.dylib
Referenced from: /usr/local/lib/couchdb/erlang/lib/couch-0.8.0-incubating/priv/lib/couch_erl_driver.so
Reason: image not found
Trace/BPT trap

I wish I could get it going as there are some interesting things I want to experiment with.

Dog Database

We rolled out a new app last week which allows you to dive deep into the wackiness of dog owners, letting you see who names their dogs what and where they live. It’s really interesting and it’s just the start of more data-driven apps, allowing people to dig deeper.

Chrys Wu has a look at the site plus an interview of sorts with Ben Welsh, my co-worker and brother-in-arms with the Cubs.

Times database developer Ben Welsh says the project was a way for him to learn how to navigate through Los Angeles’ complex bureaucracy.

Welsh moved to L.A. from Washington, D.C. several months ago. “When I got here, I knew that learning how many cities make up L.A. County and how the different services get managed was going to be something I needed to get skilled at, so I thought: I need kind of a test case,” he says. The dog names database became his experiment.

The first step was to figure out which offices held the records, then to request the information in accordance with the California Public Records Act. To avoid being turned down for privacy concerns, “I made sure in my earliest communications with people, kind of the first round, to say I don’t want the address of the owner, but I do want their ZIP code,” Welsh says.

Opening the Archives

From Dan Gillmor

First is to open the archives, with permalinks on every story in the database. Newspapers hold more of their communities’ histories and all other media put together, yet they hoard it behind a paywall that produces pathetic revenues and keeps people in the communities from using it — as they would all the time — as part of their current lives. The revenues would go up with targeted search and keyword-specific ads on those pages, I’m absolutely convinced. But an equally important result would be to strengthen local ties.

Oh, you mean something like this?

Creating a Search Engine

Rich Skrenta knows a thing or two about search engines and crawlers. Here’s his easy two step process of building your own one.

Step 1 is to copy the internet onto your cluster. Step 2 is to analyze it..

Search is like 7 hard problems wrapped into a stack. Distributed systems, html analytics, text analytics/semantics, anti-spam, AI/ML, frontend/UI. And scale… Apart from the sexy high end algos there are also the boring 10-year old system libraries and off-the-shelf tools that crack under stress and sometimes need a look. You open the hood and wonder how the thing ever worked in the first place…