Mapping youth well-being worldwide with open data – From DataKind

Mapping youth well-being worldwide with open data – From DataKind

Once again, I was honored to write a guest post for DataKind. This time is was on the spread of open source software by data-do-gooders. A couple years ago, DataKind hosted a DataDive in Washington D.C. and some of the participants created a mapping software project titled DataTools 2.0. Since then, it has been replicated by a number of groups around the globe. Read the full post on the DataKind blog to find out more.

DrivenData – Data Science Competitions for Social Good

DrivenData.org is a relatively new site focused on running data science competitions for social good.

Current and Previous competitions deal with:

  • Finding Clean Water
  • Donating Blood
  • Spending for Education
  • Modeling Healthcare

Give it a try and Good Luck!

Data Science College Programs Across the Globe [Interactive Map]

Continuing this weeks theme on data science colleges, the nice folks at Silk.co created an interactive map of the Data Science University Programs across the globe. Click the map to view the interactive visualization.

Where are the Programs?

Global Map of Data Science Colleges
Map indicating the location of all the data science college programs

Based upon the visualization, it is easy to see most of the programs are in the United States and Western Europe.

What About Degree Types

Data Science Breakdown by Degree Type
Data Science Breakdown by Degree Type

The data can also easily be broken down by degree type and how the degree is delivered (online/on-campus).

What is Silk.co?

To state it simply, Silk.co is a place to very easily store and visualize data. It looks pretty awesome.

Data Science Colleges by US State

US Data Science Colleges
US states with the most data science college/university degree programs

Creating the Awesome Data Science Colleges List has opened the data for some analysis. The above chart shows the states with the most data science programs based upon the total number of colleges in the state (dark red is best, followed by orange, followed by yellow, and finally black is for states with no data science programs).

The Top States

  1. Washington D.C.
  2. Colorado
  3. Massachusetts
  4. South Dakota
  5. Nebraska
  6. Indiana
  7. New Hampshire
  8. Maryland
  9. Illinois
  10. Pennsylvania
  11. New York
  12. Michigan
  13. Arkansas

The States Needing to Create Data Science Programs

  • Alaska
  • Delaware
  • Hawaii
  • Idaho
  • Kansas
  • Maine
  • Mississippi
  • North Dakota
  • New Mexico
  • Vermont
  • Wyoming

See the original post at Need to Learn Data Science? These States are the Best! and the analysis at Create US States Choropleth for Data Science Degrees.

Awesome Data Science Colleges List

I recently compiled a huge list of colleges and universities with data science-related degree programs. The compiled list is available on Github as Awesome Data Science Colleges.

I encourage you to contribute to the list if you know of missing programs.

Free Data Science Book for Ordinary People

A great read for people without an extensive math, statistics or computer science background. And still an interesting read for those people.

The book includes tons of non-technical descriptions for data science terms.

You can download a copy of the book on SlideShare, or you can purchase a paperback copy via Lulu.

$200,000 Cognitive Computing Challenge

HeroX, an organization that runs competitions for big ideas, has recently launched a competition that is relevant to the data science community. It is called the Cognitive Computing Challenge

The challenge sounds fairly simple.

Build a cognitive system that can read a document, then load a database with what it finds.

However, don’t be fooled by the description. Getting a computer to accurately read and “understand” a document is very difficult.

You have until January 11, 2016 to submit a solution, and the winner receives a $200,000 prize.

Anyhow, if you are interested, please join the competition. If you happen to compete and/or win, please leave a comment and I would love to blog about it. Good Luck!

A Guide for Doing Data Science for Good via DataLook

A Guide for Doing Data Science for Good via DataLook

The guide provides some excellent tips on how to get involved.

Yinyang K-Means: A Drop-In Replacement of the Classic K-Means

This week; Yufei Ding, Yue Zhao, Xipeng Shen, Madanlal Musuvathi, and Todd Mytkowicz will be presenting Yinyang K-means at the 2015 International Conference on Machine Learning.

The algorithm guarantees the same results as traditional K-means, but it produces results with an order of magnitude higher performance.

An abstract of the paper and a PDF download can be accessed at Yinyang K-Means: A Drop-In Replacement of the Classic K-Means with Consistent Speedup.

Building an Analytics Team at 500px – Helpful Advice

Organizations everywhere are racing to build analytics/data science teams. Big Data is everywhere and companies don’t want to fall behind. Unfortunately, many organizations are struggling to get started because of questions similar to the following:

  1. How will Analytics help us?
  2. What does an analytics team look like in our organization?
  3. How do we start?

Luckily, the analytics team at 500px, a photography community site, was kind enough to provide a detailed overview, Building Analytics at 500px, of what really happens when building an analytics team. The overview provides:

  • Headaches
  • Infrastructure
  • Evangelism
  • And more

If your organization is considering adding an analytics or data science team, this article is definitely worth reading.