The twitter hashtag #SoDS is being used in 2015 to help people track and share what they are learning. The hashtag originated on the Becoming Data Scientist blog.
I recently wrote a post for Sense about a number of freely available learning opportunities this summer, Start Learning with the Summer of Data Science. The post covers:
- MOOCs starting soon
- Large list of open-access journals
If you are interested, go check out the post and start your #SoDS. Hurry, many of the opportunities start very soon.
edX has just announced a new series of Big Data courses. The series consists of 2 courses focused around Apache Spark. If you are not familiar with Spark, it is a very fast engine for large-scale data processing. It claims to perform up to 100 times faster than hadoop. Here are the 2 courses:
- Introduction to Big Data with Apache Spark
- Scalable Machine Learning
The first course starts June 1, 2015, and lasts four weeks. The second course starts in late June and lasts five weeks.
The courses are free but verifiable certificates can be purchased for $50 per course.
If you have been hoping to learn Spark, this might be just the opportunity your were waiting for.
Coursera just launched 18 new specializations. Not all of them are relevant to data science, but here are 3 of the specializations that pertain to data science.
All of these specializations will provide great content. They are quite specific though, but if your goals match the topics, it is hard to beat Coursera.
Are you excited about any of the new specializations?
Process mining is a bridge between data mining and business process modeling. Process Mining can be used to study event and log files to extract meaning.
The Coursera course, Process Mining: Data science in Action, starts November 12, 2014.
EdX will be offering Foundations of Data Analysis via the University of Texas at Austin. The course starts November 4, 2014. Here is a list of topics:
- Tutorials on using R
- Descriptive Statistics
- Statistical Models (Regression)
- Inferential Stats
Coursera is offering the course Mining of Massive Datasets from Stanford University. This is a popular course at Stanford and goes along with the book by the same name. The FREE course starts September 29, 2014, and runs for 7 weeks. The prerequisites are some SQL, algorithms, and data structures knowledge.
Thanks to David Trower for the tip on this course
The widely popular Caltech course, Learning from Data, will be offered on EdX this fall. The course starts September 25, 2014, and it will run for 10 weeks. Here is an abbreviated list of the course topics.
- Linear Models
- Neural Networks
- Cross Validation
- and much more
EdX offers a number of other Data Science related courses. See all of them on the Statistics and Data Analysis course list.