Tag Archives: machine learning

Microsoft Weekly Data Science News for May 18, 2018

Here are the latest articles from Microsoft regarding cloud data science products and updates.

  • Azure Content Spotlight – What’s New with Cognitive ServicesThis weeks content spotlight is all about Azure Cognitive Services. Seth Juarez’s AI Show on Channel 9 provides regular updates on all the new AI features on the Azure platform, including Cognitive Services. See below a collection of the latest video’s …[Read More]
  • A Scalable End-to-End Anomaly Detection System using Azure Batch AIThis post is authored by Said Bleik, Senior Data Scientist at Microsoft. In a previous post I showed how Batch AI can be used to train many anomaly detection models in parallel for IoT scenarios … several Azure cloud services and Python code that …[Read More]
  • Azure.Source – Volume 31In addition, Cognitive Services add pre-built, cloud-hosted APIs for developers to add AI capabilities, including new services announced at Build. This post also covers Cognitive Search and Azure Machine Learning (ML) advancements. The Microsoft data …[Read More]
  • Azure Stack: the last mile in Hybrid CloudThese include Microsoft Azure Cognitive Services, exceptionally large HDInsight environments, and Microsoft Azure Data Lake Store. Services which are best consumed in a Hyperscale Cloud will run on Azure, while services that best fit an enterprise …[Read More]
  • Using Azure for Machine LearningI’m interested in learning more about AI, Data Science, and Machine Learning to improve … other interesting and useful products such as Microsoft IoT Hub, SQL Database, and Cognitive Services which I use a lot for Pantrylogs. You can really play …[Read More]
  • Use AU Analyzer for faster, lower cost Data Lake AnalyticsDo you use Data Lake Analytics and wonder how many Analytics Units your jobs should have been assigned? Do you want to see if your job could consume a little less time or money? The recently-announced AU Analyzer tool can help you today! See our recent …[Read More]
  • Simple and robust way to operationalise Spark models on AzureIt gives you everything that Open Source Spark does and then some. I’ve been especially enjoying the effortless ways to move large datasets around and the ease of MLlib for my AI-projects. One of the questions with the simpler models like regressions and …[Read More]
  • New AI Services in Azure for students and academics announced at Build 20181.Object Detection update to custom vision (preview) http://aka.ms/cognitive 2.Video Indexer (Paid Preview) https://azure.microsoft.com/en-us/blog/build-2018-video-indexer-updates/ 1.Bot Builder SDK v4 (preview) Bot Builder homepage or the Bot Builder …[Read More]
  • How Azure IoT helped me buy a new house – Part 1shares a personal story on how he used Azure IoT to figure out a solution to a problem that many of us face – high electric bills. In the series, Steve shares the process and code that he used to implement this solution. Telemetry data is an important …[Read More]

Microsoft Weekly Data Science News for April 27, 2018

Here are the latest articles from Microsoft regarding cloud data science products and updates. This week includes IoT hubs, Time Series Insights, Deep Learning Virtual Machine, Python sample code for cognitive services, and more.

Microsoft Weekly Data Science News for March 30, 2018

Here are the latest articles from Microsoft regarding cloud data science products and updates. Some of the topics this week: Azure ML, AI Research, Intelligent Cloud, and Anomaly Detection.

Reinforcement Learning

Simply stated: Reinforcement Learning deals with actions and rewards (positive or negative). The rewards help to dictate the future actions.

Many children learn via reinforcement learning. Here is a simple example from my childhood. As a child, I was told not to touch the hot stove.

My action: I touched the door of the hot stove when it was open.

My reward (negative): I burned my hand, and it hurt.

My future actions resulted in me staying away from the hot stove.

This was a very simplified example of reinforcement learning. Here are two more great introductory references:

  1. Simple Beginner’s guide to Reinforcement Learning
  2. Deep Learning Research Review: Reinforcement Learning

Conversations with future data scientists (YouTube Playlist)

Last week I spent some time chatting with future data scientists. I set up a camera to record some of the answers. Below are a few of the questions addressed.

  • How did I transition to data science?
  • Why start a data science project?
  • Should a new person focus on machine learning or deep learning?
  • What is an example data science project?
  • Why is real-time important?

Hopefully the videos and answers are helpful to others. Enjoy! And I kept most of the videos fairly short. If you enjoy the videos, please subscribe to the YouTube channel, Learn Data Science. Also, if you have a question you would like answered, please leave a comment below.

Data Science Live Book

Pablo Casas has published a book freely available online, Data Science Live Book. To quote from the book,

It is a book about data preparation, data analysis and machine learning.

The book is open source, and the code examples are written in R.

Columbia University Applied Machine Learning Online

Columbia University’s course Applied Machine Learning Spring 2018 by Andreas C. Müller has all the lecture notes, slides, homework, and videos posted online.

Andreas is also the author of the book Introduction to Machine Learning with Python.

DataCamp Community News Site

DataCamp recently launched a new community site, Data Science News, for sharing and discovering data science news. It is similar to Hacker News if you are familiar with that site.

The 3 Stages of Data Science

Businesses everywhere are racing to extract meaningful insight from their data. Many organizations are spinning up data science teams and attacking problems (some more successful than others). However, one of the challenges is determining the current stage of data science within the organization. Next is determining the desired stage of data science.

Below are 3 stages of a truly mature data science organization.

1. Dashboards

The beginning stage of data science is dashboards. It is all about answering “How much?” and “What happened?” by looking at reports of historical data. If done well, it might even help an organization answer “Why”. Many organizations will refer to this phase as Business Intelligence.

The dashboard stage can be very expensive for an organization, in terms of people-hours and money. It usually involves investments in:

  1. Data Warehouse or some other storage environment, for storing the data in a single location for easy reporting
  2. ETL (Extract Transform Load) Tools for manipulating, combining, and moving data to the data warehouse
  3. Reporting Tools for displaying the results and allowing users to “explore” the data

Here are some common questions that can be answered via traditional dashboards:

  • How many customers live in each region?
  • What were the sales on Black Friday?
  • How many patients visited the hospital last month?

As you can see, there are large amounts of value that can be gained by this phase alone. It is critical for a business to clearly understand past performance. Unfortunately, this phase is where many businesses stop.

2. Machine Learning

The real “science” of data science does not begin until the second stage which is machine learning. It focuses on estimating quantities that cannot be directly observed. This could be what movies a customer will like, the price of a company’s stock tomorrow, or the causal effect of a particular advertising campaign. Machine Learning uses the data from the first phase and applies statistical or other methods to gain additional insights.

Think of machine learning as answering the following:

  • When a customer moves, will he/she spend money at a hardware store?
  • When a credit card purchase is made, what is the probability the charge was fraudulent?
  • What is the expected lifetime value of a new customer?
  • If a hurricane is coming, what will people buy? (pop tarts? it is true).

Notice the connection between an event and some outcome. The value of machine learning comes from estimating the causal outcome of potential events. This phase is filled with terms such as: machine learning, data mining, and statistical modeling. The machine learning stage is all about looking into the future!

3. Actions

Determining the actions to perform, is the third and final phase. It tries to capitalize on the results of machine learning in order to take appropriate actions. The following actions might be suitable for the events identified in the predictive section above.

  • When a customer moves, send a “welcome to the neighborhood” packet with coupons to nearby hardware stores.
  • Decline the fraudulent charge or deactivate the credit card.
  • If the new customer has very high expected lifetime value, provide some special treatment or offers to ensure the customer becomes a customer for life.
  • When a hurricane is approaching, place Pop tarts near the front of the store.

As you can see, good machine learning from the second phase can lead to clear actions.


Claiming success in Data Science is all about conquering all three stages. Each stage builds upon the previous stage. If you have put in the effort to complete the first stage, why not continue to the second and third stages?

5 Data Science Research Papers to read in Summer 2017

In the past, the blog has included 7 Important Data Science Papers and 5 More Data Science Papers. Here is another list if you are looking for something to read over the summer.