Talks, Articles, & Events
- 01 Feb 2011 » Committee Member for O'Reilly 2011 Strata Conference
- 09 Sep 2010 » O'Reilly Media >> Spatial Analytics Workshop Video
- 08 Sep 2010 » Amazon Public Dataset >> Data Wrangling Wikipedia Traffic Statistics V2
- 22 Aug 2010 » Quora >> What are the best blogs about data?
- 14 Aug 2010 » DevNation SF Talk >> Building Data Driven Products
- 29 Apr 2010 » Guest Post: MeasuringMeasures.com >> Datasets and Data-driven Startups
- 20 Apr 2010 » LinkedIn.com Blog >> Data Scientists: Finding Patterns in LinkedIn Data
- 30 Mar 2010 » O'Reilly Where 2.0: Spatial Analytics Workshop
- 18 Mar 2010 » NYC Hadoop meetup >> Rapid Data Exploration with Hadoop at LinkedIn - also see talk slides
- 16 Mar 2010 » TechCrunch >> Big Data Is Less About Size, And More About Freedom
- 11 Mar 2010 » TechCrunch >> Perseids, John Hughes, And G.I. Joe Are Trending Topics On Wikipedia
- 22 Dec 2009 » Sunlight Labs Blog >> Great American Hackathon Wrap-up - worked on Statistically Improbable Phrases
- 02 Oct 2009 » Hadoop World: NYC 2009 - Building Data Intensive Apps: A closer look at TrendingTopics.org
- 28 Sep 2009 » Cloudera Blog >> Grouping Related Trends with Hadoop and Hive
- 31 Jul 2009 » Cloudera Blog >> Tracking Trends with Hadoop and Hive on EC2
- 02 Apr 2009 » Amazon.com >> Finding Similar Items with Amazon EMR, Python, and Hadoop Streaming
- 14 Jan 2009 » Juice Analytics Blog >> Search Competition Among Travel Sites
- 09 Apr 2008 » ReadWriteWeb >> Where to Find Open Data on the Web
- 27 Mar 2008 » PyCon 2008 >> MPI Cluster Programming with Python & Amazon EC2 - also led a session on the Netflix Prize

