Karl Schmitt, Director of Data Sciences at Valparaiso University, has started a blog to share his experiences with building an undergraduate data science program. The blog is titled, From the Director’s Desk. Karl is regularly posting about textbooks, curriculum, visualizations and learning objectives from the perspective of an educator. Tons of great resources!
March 4, 2017 is Open Data Day.
Open Data Day is an annual celebration across the globe. Over 300 groups around the world schedule activities to use open data for their communities. See if there is a gathering in your area. Also, the focus this year is on:
- Open research data
- Tracking public money flows
- Open data for environment
- Open data for human rights
Monica Rogati, a legend in the data science space, recently provided some answers on Quora that are sheer internet gold.
She answers questions involving:
- What is a data science advisor?
- Challenges of Building a data science team?
- Characteristics of a good data scientist?
- and more
They are filled with great advice.
Martin Zinkevich, Research Scientist at Google, just compiled a large list (43 to be exact) of best practices for building machine learning systems.
If you do data engineering or are involved with building data science systems, this document is worth a look.
Kevin Hughes used TensorFlow to train a Neural Network to play MarioKart. He calls it TensorKart. See his post for more details. It is a nice blog post and sounds like a fun project.
Sorry, the video has no sound.
Georgia Tech University just announced a new online master’s degree in Analytics.
The degree will begin in August 2017 and will be fully online. It will offer 3 tracks:
- Big Data
- Analytical Tools
- Business Analytics (coming at a later date)
Deep Learning Summer School, Montreal 2016 is aimed at graduate students and industrial engineers and researchers who already have some basic knowledge of machine learning (and possibly but not necessarily of deep learning) and wish to learn more about this rapidly growing field of research. If that is you, there are plenty of videos to help you learn more.
The Association for Computing Machinery is hosting a Town Hall with Peter Norvig on A.I., Machine Learning, and More. Peter Norvig is the Research Director at Google, and a leader in the field of Artificial Intelligence (AI). Topics he might discuss:
- deep learning
- future of AI
- teaching AI
- academia vs industry
- advice for grad students
The free webinar is Thursday, December 08, 2016 at noon Eastern time.
In data science and statistics, probability distributions can be very important. I have been meaning to create a listing of them. However, I no longer need to since the fine folks at Cloudera have already created a list at Common Probability Distributions: The Data Scientist’s Crib Sheet.
Learn the distributions and pick a favorite. (My favorite of the common ones is the normal distribution. I also like the Cauchy distribution which is much less common.)
Our World in Data is data visualization site for exploring the history of civilization. The site was created by Max Roser. Our World in Data contains tons of information about many aspects of people’s lives. It also includes numerous visuals (like the one below) which can be easily shared or embedded on other sites.
Beware, the site is addicting, and you might spend a lot of time exploring data.