Note: If you don’t enjoy podcasts, the link contains the entire text for reading as well.
Zipfian Academy, the same company that is creating the 12 week intensive data science training course, will be offering a series of 6 short courses on data science. The courses will be 1.5 hours each and will be taught live in San Francisco. For those of you that cannot be in San Francisco, the courses will be recorded and available online.
The short courses are not free, $35 each or $150 for all, and seating is limited. The seating is limited to allow all students access to the instructors. The first short course starts tomorrow (May 28, 2013), so register now if you are interested. Here are links to all the short courses:
- May 28, 2013: Introduction to Data Science
- June 4, 2013: Introduction to Statistics: Finding the Signal in the Noise
- June 11, 2013: Algorithms for the Rest of Us: A Gentle introduction to Machine Learning
- June 18, 2013: Data Wrangling 101: Looking for Data, and What to Do When You’ve Found it
- June 25, 2013: Computing at Scale: An Introduction to Secrets of Managing Big Data
- July 2, 2013: Introduction to Visualization: Learn How to Effectively Tell your Data Story
Here is a data visualization of the paths of tornadoes in the US over the past 56 years. The brighter the blue, the more intense the tornado. This is also an excellent example of using opendata. The raw data is available at data.gov.
Probabilistic Programming and Bayesian Methods for Hackers is an open source online book. The book is developed with iPython, so it can be read in a variety of formats: web, PDF, or locally with iPython installed.
Also, contributions are welcome via the Github repository for the book (or you can email the authors).
This is the first iPython project I have really looked at, and iPython looks very promising.
Are you confused on what hadoop is? What about Hbase, Pig Hive? Well, this link will help you out.
It provides a nice short explaination for the following terms:
Recently, both NYU and Columbia launched academic programs in data science. Well, another school in New York City is entering the mix. The City University of New York (CUNY) is now offering an online masters degree in data analytics. If you would like more information, there will be an online information session on May 22.
This looks to be a great webinar! It is today.
This Spring, Harvard University ran a data science course. Technically, the name of the course was Stat 221 Statistical Computing and Visualization. The course recently finished, and all the course lecture slides are available.
The slides contain a bunch of useful information, plus they show one possible layout for a data science course.
If you are looking for public data, Enigma.io is a new startup just for you. Enigma searches, finds, and connects a variety of formats of public data. The data is then linked and made accessible. Watch the video below for more details.