Deep Learning in Java

Deep learning is the hottest topic in all of data science right now. Adam Gibson, cofounder of Blix.io, has created an open source deep learning library for Java named DeepLearning4j. For those curious, the DeepLearning4j source is available on GitHub. Below is a video of Adam introducing deep learning and DeepLearning4j. Also, if you are interested …

CMU Machine Learning Summer School Videos

The CMU Machine Learning Summer School was a 2-week intensive course focused on machine learning for big data. Some of the top academics in machine learning gave presentations. Most of the videos are fairly long (around 1 hour each), but they cover a great deal of material. All of the CMU Machine Learning Summer School videos are on YouTube. Here is …

Want to Learn SQL? Here is a Great Tutorial!

Mode Analytics, a recently launched site for collaborative data science in the cloud, has published an excellent tutorial for learning SQL. The tutorial is named SQL School. This is one of the best SQL tutorials I have seen. Plus, it has the huge added advantage of not requiring you to set up your own database …

Stanford Releases Large Network Datasets

Stanford University has just released a collection of large network datasets. By network data, I mean networks in the mathematical sense: a collection of nodes and edges. Here are just a few of the categories: citation networks, road networks, web graphs, social networks such as Twitter …

An Organization for Opendata and Healthcare

Health Data Consortium is an advocacy group focused on helping the healthcare industry respond to the availability of health data. They are currently focused on innovation and the uses of open health data. Healthcare is undergoing some radical changes, and data science is going to play a key role in its future. …

Analytics Handbook: Book 3 is Free

The team that brought you the Analytics Handbook has freely published the third and final book, titled THE DATA ANALYTICS HANDBOOK RESEARCHERS + ACADEMICS. This book focuses on data science in the research and academic communities. Like the previous 2 books in the series, it includes interviews with top experts in the field. Here are just …

Huge List of Big Data and Machine Learning Technologies

Onur Akpolat has put together A curated list of awesome big data frameworks and resources. The list is very extensive and includes NoSQL databases, machine learning libraries, frameworks, filesystems, and more. On a similar note, Joseph Misiti has compiled a large list of resources specific to machine learning. The list is titled Awesome Machine Learning, and …

Data Science Productivity Platform

Tristan Zajonc, cofounder of Sense Platform, recently gave a thought-provoking talk at Data Driven NYC. He spoke about the future of data science productivity. According to Tristan: In the next 2 or 3 years, everybody doing data science should be using a data science productivity platform...a cloud-based data science platform. In addition to the productivity …