Tag Archives: education

Papers for Teaching Undergraduate Data Science

If you work at a university and are considering starting an undergraduate program in data science, then today’s post is for you.

If you know of any other papers, please leave a comment below.

Site For Undergraduate Data Science Programs

Karl Schmitt, Director of Data Sciences at Valparaiso University, has started a blog to share his experiences with building an undergraduate data science program. The blog is titled, From the Director’s Desk. Karl is regularly posting about textbooks, curriculum, visualizations and learning objectives from the perspective of an educator. Tons of great resources!

Georgia Tech Masters in Analytics for Less than $10k

Georgia Tech University just announced a new online master’s degree in Analytics.

Georgia Tech Creates First Online Master of Science in Analytics Degree for Less Than $10,000

The degree will begin in August 2017 and will be fully online. It will offer 3 tracks:

  1. Big Data
  2. Analytical Tools
  3. Business Analytics (coming at a later date)

The Most Popular Skills and Degrees of Today’s Data Scientists

Today, we are lucky to have Daniel Levine of RJMetrics provide a guest post. RJMetrics created an extensive report detailing The State of Data Science. I asked Daniel to provide some results as they relate to the current education of data scientists.

Recently, RJMetrics released a benchmark report that looked to answer many of the questions people have about today’s data scientists, such as how many data scientists are there, what degrees do they have, and what skills do they posses.

From LinkedIn data on the 11,400 data scientists working now, we can get a much better sense of what types of data scientists companies are hiring, and how senior data scientists differ from their junior counterparts.

Education Levels

While it was typical to see data scientists report multiple degrees, when we looked at the percentages of all distinct bachelor’s, master’s, and doctorate degrees, we found that 42% finished their education with a master’s.

Highest Education Level of Data Scientists
Highest Education Level of Data Scientists

The high number of data scientists that receive graduate degrees (79%) is indicative of the increasing demand for specialists and a desire from data scientist for advanced training.

Additionally, these numbers may indicate that data science is simply attracting highly educated educated individuals because of its sexy and lucrative career path.

So what does this distribution look like as you climb the corporate ladder? You may assume that the higher the position, the more PhDs; but in fact, across Junior, Senior, and Chief Data Scientists, we saw the highest ratio of PhDs to Master’s at the Senior level.

Data Scientist's Education Level By Seniority
Data Scientist’s Education Level By Seniority

We speculate that the drop from 43% at the Senior level to 35% at the chief level actually reflects how long those individuals have been in the field. In a study by Heirick & Struggles titled, “Understanding Today’s Chief Data Scientist,” they found that chief Data Scientists “average nearly 15 years of post-degree commercial (PDC) experience.” What we’re likely seeing in this data is the “first crop” of Chief Data Scientists who earned this title in the field, not in the classroom.

Subjects Studied

When we looked at what data scientists studied during their education, we found that besides Business Administration/Management, they were mostly STEM-focused.

Educational Background of Data Scientists
Educational Background of Data Scientists

We believe that Computer Science is so popular because a data scientist without CS skills is at an extreme disadvantage because they won’t be able to extract the data well enough to properly analyze it. DJ Patil and Hilary Mason, in their book Creating a Data Culture, went as far as to say, “a data scientist who lacks the tools to get data from a database into an analysis package and back out again will become a second-class citizen in the technical organization.”

Skills Reported

In analyzing 254,600 records of skills, we found the most popular skills to be more generic than we’d expect. Popular buzz term like “big data” and “hadoop” didn’t crack the top 10, while programming languages like “r” and “python” are extremely popular among data scientists.

Top 20 Data Science Skills
Top 20 Data Science Skills

When the data was sliced by seniority, we saw a major difference between Junior, Senior, and Chief levels. To make these differences easier to digest, we compared each level to the same common denominator: the average data scientist.

Data Science Skills Difference By Seniority
Data Science Skills Difference By Seniority

Again, the chief data scientists data is of particular interest. These C-suite professionals are more likely to list skills like “business intelligence,” “analytics,” “leadership,” “strategy,” and “management” among their skills than both junior and senior data scientists; but less likely to list skills on the more technical side, like “python” and “r”.

While it’s true that chief data scientists may be simply emphasizing skills that are more relevant to their position within the company, we also speculate that many chief data scientists assumed these roles by virtue of being in the field longer or having additional qualifications, such as a business degree. Therefore, it is also possible that some chief data scientists never actually learned many of the skills listed by more junior people.

If you’d like more analysis about this data and a more detailed explanation about our methods, you can check out the full State of Data Science.

Data Science Tech Institute Visiting Faculty

The Data ScienceTech Institute (DSTI) in France is starting 2 new master’s degree programs in data science. Both programs are highly innovative and offer a strong industry focus. Classes begin in October 2015, and each program is limited to 30 students. Therefore, if you are interested, it is important to apply as soon as possible.

The other day, the faculty at DSTI were announced. I am honored to say I was selected as one of the faculty. Thus, I will serve as a visiting faculty member for portions of the program.

DSTI offers 2 master’s degree programs:

  1. Data Scientist Designer – Located in Paris, this 2-year program is part-time and focused on working professionals looking to transition or enhance skills in the data science field. The course will rotate between 2 and 3 days a week.
  2. Executive Big Data Analyst – Located in Nice along the French Riviera, this program is a more traditional intensive 16-month program targeting full-time students.

If you are in France or Europe or interested in studying in France, the programs from DSTI are definitely worth a look.

School without Water, Electricity or Toilets

Sound appealing? Probably not! Unfortunately, this is the sad reality for many children in Sub-Saharan Africa. Even worse, this sad reality is only for those children lucky enough to even attend school. In the world today, there are 58 million out of school children, and 43% of those children will never start attending school.

UIS LeftBehind
UIS LeftBehind

FFunction, a Montreal-based data visualization studio, and UNESCO Institute for Statistics (UIS) recently launched 2 interactive data visualizations. Both are creative and innovative ways to present information.

  • Out of School Children – Explore how gender, income, and location affect a child’s education
  • Left Behind – View how and why African girls struggle to obtain an education

For more on the topic, see my entire guest post on the DataKind blog, Data Visualization for Good – Education in Africa

DataQuest – Free Browser-based Learning for Data Science

DataQuest is a recently launched online data science learning platform for python. The site consists of a gamified series of missions that increase in difficulty as your skills progress. Here are a few other features of the site.

  • Sample Code
  • Live, Interactive Browser-based Coding Environment
  • Step by Step Instructions
  • Instant Feedback
  • Helpful Forums for Q&A

The site is still under development and the founder, Vik Paruchuri, is looking for help developing more content and missions for the site. If that is something of interest to you, get in touch with Vik via the DataQuest website.