The excellent and popular Machine Learning class from Coursera and Andrew Ng starts today. This is the 3rd or 4th run of the course.
There is nothing magical about this data. It is just income data. The magic comes from the excellent visualizations, and the story being told. If you need to make your data come to life, this video is an excellent example.
A new Data Science Association has been created. It is only a few months old. For a limited time, a one-year membership is free. The Data Science Association promises to offer networking possibilities, conference discounts, and other benefits.
One of the main contributions the Data Science Association has made so far is a Data Science Code of Conduct. It provides some solid definitions and guidelines for being a data scientist. It is worth reading through.
Not too long ago, I posted a list of 7 Data Science Papers. Since then, I have found a few more interesting and more recent developments in data science.
- Recursive Deep Models … – A recent paper from Stanford about sentiment analysis with deep learning
- Google Spanner – Google’s globally distributed database
- Google Megastore – Another database out of Google, but this one includes SQL-like querying and other differences from Spanner.
- F1 – A distributed SQL database that scales. F1 is built on top of Spanner.
- Apache Drill – A distributed system for ad-hoc query analysis. This journal article comes from the Big Data Journal.
edX, a MOOC site, is offering Learning From Data, a machine learning course from Caltech. The course started yesterday, so there is still time to get started. It has 2 tracks: audit and certificate. It looks great. Good luck.