Kaggle, the online data science competition platform, will be hosting a free online conference for people interested in landing their first data science job.
Kaggle CareerCon 2018 on March 20, 2018. There are a bunch of great speakers lined up, and all the talks are focused on preparing someone to get a data science job (resume tips, interview questions, portfolio examples, …).
Recently, a number of resources for publicly available datasets have been announced.
Kaggle becomes the place for Open Data – I think this is big news! Kaggle just announced Kaggle Datasets which aims to be a repository for publicly available datasets. This is great for organizations that want to release data, but do not necessarily want the overhead of running an open data portal. Hopefully it will gain some traction and become an exceptional resource for open data.
NASA Opens Research – NASA just announced all research papers funded by NASA will be publicly available. It appears the research articles will all be available at PubMed Central, and the data available at NASA’s Data Portal.
Google Robotics Data – Google continues to do interesting things, and this topic is definitely that. It is a dataset about how robots grasp objects (Google Brain Robot Data). I am not overly familiar with this topic, so if you want to know more, see their blog post, Deep Learning for Robots.
LinkedIn They turn data into products better than anyone else.
Facebook If you are the type of person that loves to analyze people’s lives, there is no better place.
Twitter Duh, It’s Twitter. lots of data and lots of possibilities
Cloudera Cloudera is a successful Hadoop-based startup. Build tools and explore huge datasets for a variety of industries.
Kaggle If optimizing algorithms and really diving into the data to get every last ounce of information is your thing, then Kaggle is it. Plus, there is nowhere else you will get to work on so many important problems in such a wide range of domains. Unfortunately, Kaggle is not currently hiring any data scientists, but they most likely will be seeking more in the future.
There are many other companies hiring data scientists. Where would you like to be a data scientist?
Kaggle They make data science a sport, enough said.
DataKind DataKind may not technically be a startup because it is a nonprofit, but they are doing cool stuff. They match nonprofit organizations with people that love to analyze data and create visualizations.
Cloudera They call themselves “The Platform for Big Data”. They are working hard to make hadoop easier to use.
Coursera Coursera is an education startup, but with 2 Computer Science Professors as founders, you can bet they are crunching a lot of data about how people learn.
BigML They are trying to make machine learning available to everyone. Machine Learning as a Service!
There is hoards of data science information available on the internet for free. With enough personal motivation, a person could learn all the skills necessary for free (or cheap) online. Coursera is probably a great place to start. There are also other good sites such as Udacity, the Kaggle Wiki, other blogs and websites.
The problem with this approach is knowing exactly what to learn. A course in machine learning is great, but data science is more than just machine learning. How do you know what to learn? It would be really nice to have a collection of data science topics and the associated online training materials.