Avrim Blum, John Hopcroft, and Ravindran Kannan wrote the book, Foundations of Data Science (PDF download). It is free and available for download. It can be useful for academic work or in business. It covers topics such as:
The exam can be broken down into 4 components: Machine Learning, Azure ML Studio, Azure Products, and Python. Below is a breakdown of the topics I remember from the exam.
These are topics which would be covered in a traditional machine learning course. Here are some of the specific topics I remember.
Evaluation of Linear Regression
Evaluation of Classification
Fisher’s exact test
Deep learning – high-level, what is is for
Neural Networks (RNN vs CNN vs DCN vs GAN)
Azure ML Studio
Azure ML Studio is a major focus of the exam, so you need to be fluent in how to use it. Questions ranged from the basics of how to import data all the way to specifics about certain modules.
missing data questions
There were a number of questions from this category. The question would present you a scenario problem and ask which products would be useful for solving the problem. The questions did not go very deep into any of the products, but you will need to know the purpose of these products.
Azure Machine Learning Service
Blob storage – specifically how to get data in/out
Azure Cognitive Services (high level)
Data Science Virtual Machine
Python was the language of choice for the exam, so focus on it.
Azure Machine Learning SDK for Python
Not on the exam
The following topics were not covered on my exam. The exam questions are pulled from a pool of questions, so it is possible these topics may be cover on a different person’s exam. In any case, these are definitely not major portions of the exam.
While it is not one of the popular programming languages for data science, The Go Programming Language (aka Golang) has surfaced for me a few times in the past few years as an option for data science. I decided to do some searching and find some conclusions about whether golang is a good choice for data science.
Popularity of Go and Data Science
As the following figure from Google Trends demonstrates, golang and data science became trendy topics at about the same time and grew at a similar rate.
The timely trends may have created the desire to merge the two technologies together.
Golang Projects for Data Science
Some internet searching will reveal a number of interesting Golang/Data Science projects on Github. Unfortunately, many of the projects had good initial traction but have dwindled in activity over the last couple years. Below is a listing of some of the data science related projects for Golang.
Gopher Data – Gophers doing data analysis, no schedule events, last blog post was 2017
Microsoft Azure has an abundance of data science capabilities (and non-data science capabilities). It can be challenging to keep up with the latest updates/releases. Luckily, Azure has a page to let you know exactly what has changed. You just need to know where to find it, and the following video will help you find that page.