O’Reilly is releasing a free early (unedited) edition of the book, Graph Databases. The book is authored by Ian Robinson, Jim Webber, and Emil Eifrém. All three authors are members of Neo Technology, the maker of the super-popular Neo4j graph database.
This is a wonderful talk by Max DeMarzi (he has a very informative blog as well). If you are new to NoSQL or Graph Databases, I highly recommend this video.
One comment stuck out for me:
You’re never gonna run out of nodes when you get to half a trillion…
That is a really big number, but I wonder how many years that statement will stand. If you have any thoughts, please leave a comment.
Intel Labs built a tool for constructing mathematical graphs out of large datasets. It is Java based and works with Hadoop and MapReduce. Intel has release a whitepaper explaining more about GraphBuilder. The code is available on Github. A big thanks to Mark Nickel for pointing out this project.
ArangoDB is a flexible NoSQL database. It is a document database with the ability to add edges. Thus it can become a graph database. I had a fun time playing around with the online tutorial and demo. ArangoDB also claims to support being a key/value store. The code is available on Github.
Neo Technology, the company behind the graph database Neo4j, is hosting a webinar on Thursday. Pablo Pareja from the Bio4j project will provide an overview of bioinformatics and neo4j, as well as some applications.
Bioinformatics can be viewed as data science for biology. Bioinformatics was cool before data science was even a term.
If you are interested in learning more about bioinformatics and graph databases, the register for this webinar and start learning.