Tag Archives: data

The NFL Should Share this Data

The National Football League begins its regular season tonight. One feature you might not hear about is the addition of 2 RFID sensors on every player. Each stadium is equipped with receivers (not wide receivers) to capture the data emitted from the RFID tags. When the data is collected, it will be able to track players position, movement, speed, and acceleration. A company called Zebra Technologies is implementing the system.

It is a bit early to know exactly what the NFL teams will do with the data, but I think the NFL should open up the data. Analysis could be done for fantasy football. Data scientists could come up with some creative data visualizations. Plus, I think it contains great academic research potential.

As a side note, I am sure someone would start building some apps for the Microsoft Surface tablets.

null
See more at The IoT comes to the NFL

3 Great Data Science Books You Can Read Now…for free

Just this week, I have become aware of 3 free online books for data science.

Data Visualization with Javascript

If you are looking for a tutorial to teach you how to make wonderful visualizations on the web, look no further. Data Visualization with JavaScript is a free online book for learning data visualization with Javascript. It provides tons of examples and step by step instructions for how to create the graphs, charts, and other visualizations. Here is a quick list of the topics:

  • Graphs
  • D3.js
  • Interactive Charts
  • Geographic Plots
  • Timelines

Frontiers in Massive Datasets

Frontiers in Massive Datasets is a report all about how science, business, communications, national security and others need to learn to handle massive amounts of data. Whether the data has been sitting in a database for years or it is now just screaming into the systems, massive data is now a problem for almost every industry. This report covers many of the topics that need to be addressed when dealing with big data. Here is a very brief overview of the topics:

  • Limitations
  • Sampling
  • Building Models from Massive Data
  • Real-time Algorithms
  • 7 Computational Giants of Massive Data Analysis

Foundations of Data Science

Foundations of Data Science is a draft of textbook written by John Hopcroft and Ravindran Kannan. It is intended to be a text for computer science with an emphasis more on probability and statistics rather than discrete mathematics. The authors argue that knowledge of working with data is a necessary skill for computer scientists of the future. This is clearly the most technical and academic of the 3 books, but if that is your thing, your should really enjoy browsing through this book. Here are some of the topics.

  • High-Dimensional Space
  • Clustering
  • Algorithms for Massive Data Problems
  • Singular Value Decomposition
  • Graphical Models

What is a Data Hackathon like?

Here is a video of the final presentations of a data hackathon. You can watch the pitches, questions, and winners. If you are considering attending a data hackathon, this video should give you a good idea of what to expect at the end of a hackathon.

This video comes from the Critical Data Marathon held in London and Boston during September. This specific data hackathon focuses on health and medical data. I hope to post next time Critical Data schedules a hackathon.

Have you attended a data hackathon? What was it like?

Zipfian Academy Launches New Fellowship and Data Engineering Programs

Zipfian Academy, the company that offers the 12-week immersive training for data science, has just announced 2 new programs.

  1. Data Fellows 6-Week Fellowship – A free and intensive program to fill in your knowledge gaps and match you up with a top company. Hurry, applications are due today (June 16, 2014).
  2. Data Engineering 12-Week Immersive – A 12-week program to prepare software engineers to handle big data.

See the original announcement on the Zipfian blog or attend an upcoming virtual information session.

Last week, I got the opportunity to visit the Zipfian Academy office and sit down with the team (Ryan, Jonathan, and Katie). The programs are going well, and they strongly believe in the immersive nature of their program. I would have to agree; the program appears to be working well and graduates have a 91% placement rate and an average starting salary of $115,000. They even referred to it as a new alternative to graduate school. The future of Zipfian is exciting as the team hinted at some plans in the coming months and years. Stay tuned to this blog or join the Zipfian mailing list for future information.

Here is a video of Ryan Orban, one of the Zipfian Academy cofounders, explaining the new programs.

Scientific Data: A new publisher of Data

Nature.com is starting a new publication titled, Scientific Data. The goal is to help researchers publish and discover data. The publication content is called a Data Descriptor. It describes the data, explains the data collection methods, lists the columns, and states other essential information about the dataset.

Unfortunately, the site does not host any of the data. I think it will be interesting to watch how a site like this develops. The publication is currently accepting submissions.