Dive into the archives.
- Using Solr’s AbstractSolrTestCase
This past week I worked on utilizing Solr’s AbstractSolrTestCase which extends JUnit’s TestCase. In theory, this makes it easier to create tests that hit an index and run thru the entire search pipeline if necessary.
Unfortunately, there isn’t a ton of docs to help out but there are plenty of examples within Solr’s source to [...]
- Google Flu Trends
Lots of people are linking to it but Google’s Flu Trends is a pretty amazing site.
The things you can figure out when you have the incredible amount of data Google has access to can provide insights into things previously not possible. I really think the idea that the CDC was up to [...]
- Creating a Search Engine
Rich Skrenta knows a thing or two about search engines and crawlers. Here’s his easy two step process of building your own one.
Step 1 is to copy the internet onto your cluster. Step 2 is to analyze it..
…
Search is like 7 hard problems wrapped into a stack. Distributed systems, html analytics, text analytics/semantics, anti-spam, [...]
