Increase Your Kaggle Score With a Random Forest

Previously, I blogged about submitting your first solution to Kaggle for the Biological Response Competition. Well, that technique used Logistic Regression and the resulting score was not very good. Now, let's try to improve upon that score. In this example, we will use what is called a Random Forest. Kaggle claims that random forests have … Continue reading Increase Your Kaggle Score With a Random Forest

This helps to solidify MongoDB as the leading NoSQL database. Having used MongoDB myself, I can see why they were able to raise so much funding. MongoDB is fast, simple, and flexible.

Religion and the Number of Babies: A Hans Rosling TedTalk

Hans Rosling does it again. Hans may be the best storyteller of data on earth. He has a real gift of turning data into an exciting story. He also asks great questions and has some good wit. The whole goal of this TedTalk is to answer the following question. Do women of certain religions have … Continue reading Religion and the Number of Babies: A Hans Rosling TedTalk

Get Started With Kaggle – Description

Yesterday, I posted about the popularity of data hackathons. Well, today let's get started with Kaggle. This is the first of a few simple posts about making your first submission to a Kaggle competition. I also promise you won't be last place. You won't be first either. This is an excellent way to start developing … Continue reading Get Started With Kaggle – Description

Data Science Training Program in New York

If you are in New York City or the surrounding area and you want to learn data science, this post is for you. General Assembly; a technology, design, and entrepreneurship campus in New York City; is running a 12-week Intensive Program in Data Science. The course consists of lectures (twice a week), labs, homework, and … Continue reading Data Science Training Program in New York

Challenge To Future Developers: Start Storing More Data

Dear Future Developers Please store as much data as possible. Do not worry about the cost of the extra storage disks. The value in the data will far outweigh the cost of the hardware. Here are some examples of data that could be stored but is typically not. Start storing data about the order in … Continue reading Challenge To Future Developers: Start Storing More Data