Tag Archives: data scientist

Top 10 Ways to Know You are a Data Scientist

For some humor on this Friday, here is the Top 10 Ways You Know You’re a Data Scientist by Fico Labs.

I would add number 11:

You think the list is funny.


3 Top Data Scientists Change Jobs

Three of the Top Data Scientists have recently changed jobs.

Name Former Company New Company Announcement
Hilary Mason Bit.ly Data Scientist in Residence @ Accel Partners Techcrunch
DJ Patil Greylock Partners VP of Product @ RelateIQ Techcrunch
Monica Rogati LinkedIn VP of Data @ Jawbone Techcrunch

What does a data scientist do?

This is one of the better descriptions, I have seen, for what a data scientist does.

They must find interesting, novel, and useful insights about the real world in the data. And they must turn those insights into products and services, and deliver those products and services at a profit.

Notice, data scientists don’t just need to find insights in data. They also need create profitable products from that insight. I often times feel that data products are not seen as important as improving the machine learning algorithms, but the data products really are the end goal.

The quote came from the Harvard Business Review article, To Work with Data, You Need a Lab and a Factory.

Become a data scientist in 12 intense weeks

Want to learn Data Science in 12 weeks? Zipfian Academy is offering just that.

The inaugural class will begin Fall 2013. Also the schedule is five days a week from 9 a.m. to 7 p.m., so it is a very intensive program. You must be willing to relocate to San Francisco for the 12 weeks. The cost of the data science program is $14,400, but some scholarships and sponsorships are available.

At first the cost seems high, but when you consider the program will prepare you for a different career in just 12 weeks, it does not sound so bad. I think you are paying for 2 things: the immense amount of information and the condensed format. The information planned to be covered does look very extensive, everything from storing data to cleaning data to machine learning.

I am not aware of another program like this existing. If you are not concerned with getting a “university degree” and would like to learn data science, I think Zipfian Academy looks like a good choice.

A Couple Good Python Resources

In just the past month, a couple of great resources for learning python have been created.

  1. Getting started with Python: Tips, Tools and Resources – If you are new to python, this is a great place to start. It contains a brief description and links to books, tutorials, and MOOCs.
  2. Getting Started With Python for Data Scientists – This focuses more on tools specifically for data science.

Combined together, the previous links should provide a person all the resources necessary to begin doing some data science with the python language.

Data Scientist: Consider the Curriculum

A while back James Kobielus wrote the article, Data Scientist: Consider the Curriculum. It contains one of the best descriptions of a data science curriculum I have seen.  Also the article includes a list of algorithms/modeling techniques that should be known by a data scientist. Below is the list from the article.

  • linear algebra
  • basic statistics
  • linear and logistic regression
  • data mining
  • predictive modeling
  • cluster analysis
  • association rules
  • market basket analysis
  • decision trees
  • time-series analysis
  • forecasting
  • machine learning
  • Bayesian and Monte Carlo Statistics
  • matrix operations
  • sampling
  • text analytics
  • summarization
  • classification
  • primary components analysis
  • experimental design
  • unsupervised learning
  • constrained optimization

The list almost looks overwhelming.
Do you think anything is missing from the list?