Data Science Webinar

David Smith of Revolution Analytics will be presenting a webinar tomorrow, Thursday, November 1, 2012. The webinar is titled "The Rise of Data Science in the Age of Big Data Analytics: Why data distillation and machine learning aren’t enough." Registration for the webinar is free.

David Smith is one of the most popular bloggers on the topics of data science, big data, and R. He blogs at

Drew Conway Monktoberfest Talk

Drew provides a nice analysis of the popularity of programming languages.

Wanna be a Data Scientist at the CIA?

The Central Intelligence Agency is hiring data scientists. They appear to be hiring at multiple levels. Unfortunately, the position description is quite vague. It is tough to know exactly what the CIA is looking for and which technologies are currently being used there.

This is a very good read about data science at Engine Yard. It covers the following topics:

  • What is a data scientist?
  • What does a data scientist do?
  • What are the technologies?
  • Realities of being a data scientist

Free Bayesian and Machine Learning Textbook

David Barber, Computer Science Professor at University College London, is still offering his textbook, Bayesian Reasoning and Machine Learning, for free. This text looks quite extensive. The website also includes MATLAB code for many of the algorithms in the book.

100 Machine Learning Videos

Here is a list of 100 machine learning videos from

Startup Showcase – How did I do?

Yesterday, I made some predictions about the startups I thought would win at the Strata Startup Showcase. Here are the winners.

So how did I do? Well, I got one of the winners correct. I selected Placed. Hopefully videos of the demos will be available. If I find them, I will post some of them to the blog.

Data Startup Showcase

As part of New York City Big Data Week, a startup showcase is being offered. It will consist of 14 startups. Each startup will get to give a quick demo/presentation. Then Tim O’Reilly and Fred Wilson will select 3 winners. Also, numerous investors and journalists will be present. A complete list of the startups presenting is available on the Startup Showcase page.

Which ones do you think will win?

Without seeing any of the presentations, here are my 3 picks.
This might be my dark horse pick, but InfoActive has my vote. I also like Placed for location analytics, and TempoDB is very intriguing as it is simply a time-series database.

Hadoop World/Strata Conference

The 2012 edition of Hadoop World and Strata Conference is underway. The conference is in New York City. If you are not lucky enough to attend, at least you can watch the live video feed.

3 Secrets for Aspiring Data Scientists | Software Advice

Michael Koploy wrote 3 Secrets for Aspiring Data Scientists about what it takes to enter a career as a data scientist. He lays out 3 steps:

  1. Sharpen Your Scientific Saw – Hone your math and science skills
  2. Learn the Language of Business – Data Scientists need to explain the data in business terms
  3. Keep Adding to Your Technical Toolbelt – Learn all the tools you can (NoSQL, Excel, Hadoop,…)

The article is a nice read.