Just this week, I have become aware of 3 free online books for data science.
- Interactive Charts
- Geographic Plots
Frontiers in Massive Datasets
Frontiers in Massive Datasets is a report all about how science, business, communications, national security and others need to learn to handle massive amounts of data. Whether the data has been sitting in a database for years or it is now just screaming into the systems, massive data is now a problem for almost every industry. This report covers many of the topics that need to be addressed when dealing with big data. Here is a very brief overview of the topics:
- Building Models from Massive Data
- Real-time Algorithms
- 7 Computational Giants of Massive Data Analysis
Foundations of Data Science
Foundations of Data Science is a draft of textbook written by John Hopcroft and Ravindran Kannan. It is intended to be a text for computer science with an emphasis more on probability and statistics rather than discrete mathematics. The authors argue that knowledge of working with data is a necessary skill for computer scientists of the future. This is clearly the most technical and academic of the 3 books, but if that is your thing, your should really enjoy browsing through this book. Here are some of the topics.
- High-Dimensional Space
- Algorithms for Massive Data Problems
- Singular Value Decomposition
- Graphical Models