My previous list of Colleges with Data Science Degrees has grown very large, and numerous people have requested the ability to sort and/or filter. Thus, I built a new list. It is available at: Data Science Colleges. As far as I know, this is the most comprehensive list of data science programs available. Here are some of the features it offers:
- Over 200 Programs
- Certificate, Bachelors, Masters, and Doctorate programs included
- Sort and Filter Programs
- US and International
- Program Name
- Online Programs
- Ability to download the raw data as CSV or JSON
Yes, you read that last one correctly. All the data is freely available for you. If you do use the data for something, I would love to know and potentially blog about it.
The list will continue to evolve. If you find any broken links or missing programs, please leave a comment. Also, please leave a comment if you can think of ways to improve the list.
Stanford University has just released a collection of large datasets of network data. When I say network data, I am referring to the mathematical term of networks (think of a collection of nodes and edges). Here are just a few of the possible categories.
- Citation Networks
- Road Networks
- Web graphs
- Social Networks such as twitter
- and many more
If you are looking to study network data, or just want some practice analyzing big data, this just might be a good place to start.
Health Data Consortium is an advocacy group focused on helping the healthcare industry respond to the availability of health data. They are currently focused on innovation and the uses of open health data.
Healthcare is currently undergoing some radical changes and data science is going to play a key role in the future of healthcare. It is great to see the medical field building an official group to define the practice. I hope other industry will follow the lead of the medical field and begin forming their own groups around open data. I am eager to see how the Health Data Consortium progresses over the coming years and months.
Twitter has just released the idea of a Data Grant. You have to login with your twitter account to see the details. The gist is: Twitter will provide you with historical twitter data for research purposes.
What could you do with this data?
The United States White House has developed Project Open Data to encourage government agencies to share and produce opendata.
According the Project:
The White House developed Project Open Data – this collection of code, tools, and case studies – to help agencies adopt the Open Data Policy and unlock the potential of government data.
Here are some other materials in the Project.
- Open Data Licenses
- Data Formats
- Case Studies
- Other Resources
Best of all, the entire project is available on GitHub and contributions are welcomed.
I would love to see other organizations start to do similar things. Some organizations, such as Rackspace, have created support around open source coding projects. However, I am unaware of organizations doing things to standardize and encourage the sharing of opendata.
Does any know of any organizations working to create policies for releasing opendata? If so, please leave a comment.
This information goes along with the post last week, Open Data Could Be Worth $5.4 Trillion Annually. Just last week France released an action plan for open data. Honestly, I have not read the full report, but it is great to see a government create such a plan. See the full report below.
Michael Chui of McKinsey Global Institute provided some clear insights about the benefits of opendata. Here are the 4 characteristics of open data provided by Chui:
- Access by Everyone
- Formatted for Easy Reading by a Computer
- Free(no cost)
- Unlimited Rights to redistribute and reuse
Also, Chui describes how an organization can get the most from their open data. It is not enough to just make the data available, the organizations must provide an ecosystem focused around the open data. Here are some of the strategies he discussed.
- Identify and Prioritize the Correct Data to open
- Get Developers/Data Scientists (internal/external) Playing with the data
- Privacy/Policy Issues
- Platforms & Standards along with metadata
He also mentions the potential economic benefits of open data ranging from $3.2 billion to $5.4 trillion. For more information on open data see the latest Report from McKinsey Global about Open Data and/or watch the video below.