Another Big Data startup launches.

Gigaom

Big data startup Skytree emerged from stealth mode on Thursday with its product that is designed to democratize the science of machine learning, while improving significantly on the speed and scale of existing options. Skytree has raised a $1.5 million Series A investment round from Javelin Venture Partners.

Machine learning is a particularly complex approach to big data, and one that has been largely relegated to only the most-advanced companies, such as financial institutions or large web properties. The technique enables systems to get smarter the more data they ingest, which is particularly useful for tasks such as finding hidden patterns or accurately classifying data without human interaction. The libraries and algorithms are out there for anyone to use if they have good enough skills, but deploying a system that can perform the task on large data sets with reasonable performance is the hard part.

That’s the problem Skytree thinks…


Heroku Thinks Sharing Data is Important

Last week, Heroku announced a new feature for its PostgreSQL database service. The feature is called Data Clip, and it lets users share the results of an SQL query. A clip can either keep the exact data from when the query was originally run or be refreshed to return the current data. I can definitely see this being useful for debugging and troubleshooting, which may have been Heroku’s original intent.

I can also see the Data Clip being very useful for data science and for quickly sharing relevant data. I doubt the Data Clip can handle huge result sets, but huge data is not always necessary. Sometimes being able to share query results quickly is just as important. Plus, the Data Clip allows the results to be downloaded in Excel, CSV, JSON, or YAML format, so the data can easily be manipulated from there.
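As a rough sketch of what working with a downloaded result could look like, here is a small Python example that parses CSV text and averages one column by group. The sample data, column names, and helper functions are all made up for illustration; they stand in for whatever a real CSV export contains and are not part of Heroku's service.

```python
import csv
import io
from statistics import mean

# Hypothetical sample standing in for a CSV file downloaded from a Data Clip.
# The column names and values are invented for illustration only.
SAMPLE_CSV = """signup_date,plan,monthly_spend
2012-01-03,free,0
2012-01-04,pro,29
2012-01-04,pro,49
"""


def load_rows(csv_text):
    """Parse CSV text into a list of dicts keyed by the header row."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def average_by(rows, group_col, value_col):
    """Return a dict mapping each group to the mean of value_col."""
    groups = {}
    for row in rows:
        groups.setdefault(row[group_col], []).append(float(row[value_col]))
    return {key: mean(values) for key, values in groups.items()}


rows = load_rows(SAMPLE_CSV)
print(average_by(rows, "plan", "monthly_spend"))
```

With a real export, the same functions could be fed the contents of the downloaded file instead of the sample string.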

See an example in action.

Visual.ly Launches an Infographic Site

I love infographics because they are a great way to convey information about data. They fit well with the idea that data scientists also need to be good storytellers. Visual.ly is a startup aimed at helping people create, share, and discover infographics. Here is a quick example I created about my Twitter account.

My Twitter Infographic

Another Big Data startup launches.

Gigaom

Hot Mountain View, Calif., startup BloomReach emerged from stealth mode on Wednesday with a message about how its marketing-optimization engine will help ensure that companies get their web pages noticed above the noise online. Using a potent brew of big data techniques presented as a software-as-a-service application, BloomReach says it can significantly improve the amount of traffic on product web pages by making them more relevant to consumers.

The problem right now, BloomReach Co-Founder and CEO Raj De Datta told me, is that companies just cannot know how to best present their product catalogs or other content in a way that best aligns with what customers are looking for. In fact, he said, less than 25 percent of web pages see any traffic from natural search or paid search in any given month. Companies are missing out on large swaths of customers because they can’t display their content in a…


Don’t Miss – Stanford Machine Learning

In a matter of days, Stanford will begin the second round of its free online machine learning course. I enrolled in the course last fall, and it exceeded all expectations. Professor Andrew Ng is great. The prerequisites are minimal, so don’t worry if your math is a little rusty. Also, the videos are short (around 8 to 12 minutes), so you don’t need to set aside large blocks of time. Just watch a video or two during lunch and you should be able to keep up. There are optional programming assignments and review questions to go along with the videos.

Don’t worry if you fall behind. The videos will still be there. The material you learn is more important than the pace. If you don’t know machine learning, the Stanford class is a great opportunity to get started.

Here is Professor Ng’s introduction to the class.
