Previously, I blogged about submitting your first solution to Kaggle for the Biological Response Competition. Well, that technique used logistic regression, and the resulting score was not very good. Now, let's try to improve upon that score. In this example, we will use what is called a random forest. Kaggle claims that random forests have … Continue reading Increase Your Kaggle Score With a Random Forest
This helps to solidify MongoDB as the leading NoSQL database. Having used MongoDB myself, I can see why they were able to raise so much funding. MongoDB is fast, simple, and flexible.
The gorilla may be a bit too much, but the rest of the infographic contains valuable information. What do companies do with big data? Also, spending on big data is set to increase rapidly in the next few years. Via: Wikibon Big Data
Hans Rosling does it again. Hans may be the best storyteller of data on earth. He has a real gift for turning data into an exciting story. He also asks great questions and has a good wit. The whole goal of this TED Talk is to answer the following question: Do women of certain religions have … Continue reading Religion and the Number of Babies: A Hans Rosling TedTalk
Yesterday, I wrote a post explaining the Kaggle Biological Response competition. If you don't know, Kaggle is a website for data science competitions. Now it is time to submit a solution. After this post, you should have a spot on the Leaderboard. Granted, it will not be first place, but it won't be last place … Continue reading Your First Kaggle Submission
Yesterday, I posted about the popularity of data hackathons. Well, today let's get started with Kaggle. This is the first of a few simple posts about making your first submission to a Kaggle competition. I also promise you won't be in last place. You won't be in first either. This is an excellent way to start developing … Continue reading Get Started With Kaggle – Description
It seems that competitions and meetups for hacking data are all over the place. Coding challenges have been around for a long time. Recently, it appears that data is being thrown into the mix. I think the idea is great. Instead of just hacking some app, why not hack with some data that might help … Continue reading Hackathons with Data are Everywhere
If you are in New York City or the surrounding area and you want to learn data science, this post is for you. General Assembly (a technology, design, and entrepreneurship campus in New York City) is running a 12-week Intensive Program in Data Science. The course consists of lectures (twice a week), labs, homework, and … Continue reading Data Science Training Program in New York
Dear Future Developers, please store as much data as possible. Do not worry about the cost of the extra storage disks. The value in the data will far outweigh the cost of the hardware. Here are some examples of data that could be stored but typically is not. Start storing data about the order in … Continue reading Challenge To Future Developers: Start Storing More Data
Easel.ly recently launched. It is a site for easily creating infographics. It looks pretty simple, but I am still not sure I have the artistic skills to make a good-looking infographic. Infographics are still great for telling the story of your data.