Title says it all, Some datasets for teaching data science
Here are the latest articles from Microsoft regarding cloud data science products and updates.
- Partners make industrial IoT factories more capable than ever at Hannover Messe 2018 – For example, ABB has placed a tremendous bet on our Azure cloud. Its intelligent factory in Heidelberg, Germany, has integrated artificial intelligence (AI) and IIoT technology into its robots to self-learn and self-correct. For instance, the various …[Read More]
- Deploying Deep Learning Models on Kubernetes with GPUs – This post is authored by Mathew Salvaris and Fidan Boylu Uz, Senior Data Scientists at Microsoft. One of the major challenges that data scientists often face is closing the gap between training a deep learning model and deploying it at production scale.[Read More]
- Cloud computing and AI help The Nature Conservancy dive deeper into conservation – “Thanks to our work with Microsoft, we have the incredible opportunity to leverage technology to link science to actionable planning … with the technology—cloud-based computing, AI, machine learning, data visualization—now at their fingertips.[Read More]
- Advancing ocean research through data and technology – Seven months ago, we announced the launch of the AI for Earth EU Oceans award … Meanwhile in France, our second grantee IMT Atlantique is developing data-driven and learning-based schemes for the modelling, analysis and reconstruction of ocean atmosphere …[Read More]
- Inside the Microsoft Garage: Ink to Code – “We’re always looking for ways to innovate and had some ideas about trying to use AI to enhance our current designer projects in Visual Studio,” explained Corrado. “We kicked around some ideas and somebody mentioned “what about an ink-based …[Read More]
- Illinois Universities: Driving Innovation Forward – Matt Bragg is Program Manager for Data and Policy at the Illinois Science & Technology Coalition (ISTC). Matt is responsible for overseeing all aspects of the Illinois Innovation Index – the ISTC’s quarterly data publication tracking and …[Read More]
- Celebrating Innovation Day and the Imagine Cup Finals in Japan – Use AI to optimize the feeding for fish farmers … Team Emergensor (The University of Tokyo) Detect emergencies from smartphone data to alert nearby citizens and improve response time, reducing accidents. The technical level of all 10 team solutions …[Read More]
A common popular technique for learning data science is starting a project. Here are the 3 E’s for why building a data science project is a good idea.
A data science project will expose you to all the stages of the data science process. You will need to start with the identification of an interesting question or problem. Then you will have to find and collect the necessary data. After that, cleaning and modeling the data is important. Finally, the result needs to be presented (this is called deployment).
Employers always want experience, and a project can provide that.
Once a project is started, there is always something new to learn. For example, the project will have data. Where do you store that data? It might need a database. Should you use a cloud database or install one locally. You will have to learn how to do that. Another example, after you have collected your data, you might realize some rows are incomplete. Then you will have to look into methods for dealing with missing data. This will require more learning.
A project provides a better learning environment than a list of courses because each new thing you learn has a reason and a purpose. To follow the examples from above: You know why you are creating a database and you know why missing data is important.
Make sure to pick a project that is interesting to you. This decision will make the project more enjoyable. If you enjoy sports, maybe build a project around fantasy sports. Sports are filled with data. If you enjoy reading, build something around books, authors, or magazines. If exercise is your thing, build an app to predict your progress. Plus, if you find an answer to your question, it is always fun to solve something.
Got a question you would like answered? Let me know in the comments.
Here are the latest articles from Microsoft regarding cloud data science products and updates.
- Fábio Scopeta “A friendly geek helping other people make good use of AI” – Fábio Scopeta, from São Paulo, Brazil, is the Director of Digital Transformation and Artificial Intelligence (AI) for Microsoft Latin America. He is passionate about how Artificial Intelligence and the Cloud are transforming the world. His motivation is …[Read More]
- Optimizing defense systems with AI – With data collection expected to increase exponentially over the next decade, this problem is likely to increase. Assisting defense with AI and cognitive services Defense agencies require agility to rapidly handle asymmetric threats in dynamic environments …[Read More]
- Ochsner Health System: Preventing cardiac arrests with AI that predicts which patients will ‘code’ – A trained data scientist, Wilt is also using Epic’s advanced machine learning tools to predict pressure ulcers and hospital-acquired infections. But Epic’s platform enables health systems of all sizes – including those without AI experts – to …[Read More]
- AI bot that helps people prepare for job interviews wins £5,000 prize – A bot that uses facial recognition technology to help people prepare for job interviews has won £5,000 for three UK students. The trio developed Interview Bot, which uses facial analysis and speech recognition to offer real-time feedback on facial …[Read More]
- Rubikloud leverages Azure SQL Data Warehouse to disrupt retail market with accessible AI – Take a look at this example of what’s possible using Microsoft’s big data and advanced analytics products. AI can be complex, this is where Rubikloud comes in. Rubikloud is focused on accessible AI products for retailers and delivering on the promise …[Read More]
- Microsoft HoloLens now available in Singapore – JRCS, a major Japanese maritime services company, deployed mixed reality, the Internet of Things (IoT), and AI to change how shipping crews are trained, how ships are maintained, and how navigational safety and standards are promoted and enforced.[Read More]
- Technology, ethics and the law: Grappling with our AI-powered future – The vast computational power of the cloud and an immense accumulation of data have come together. Artificial intelligence (AI) is growing all around us … needed by technology companies is broadening. “Science, technology, engineering, and math (STEM …[Read More]
- Register for Webinar on 4/26: Using AI to transform the tactical edge – Enhance your predictive intelligence. Advanced capabilities such as artificial intelligence (AI) and machine learning (ML) serve several defense applications, including data analysis, predictive intelligence, and situational insights. Senior …[Read More]
- Connect Intel Edison to Azure IoT Hub (Node.js) – Create an IoT hub. Register a device for Edison in your IoT hub. Run a sample application on Edison to send sensor data to your IoT hub. Connect Intel Edison to an IoT hub that you create. Then you run a sample application on Edison to collect temperature …[Read More]
Here are the latest articles from Microsoft regarding cloud data science products and updates. This week, it covers: new AI training, IoT investments, learning for the deaf with AI, Cosmos DB, Project Roadrunner, and more.
- Aiming to fill skill gaps in AI, Microsoft makes training courses available to the public – The program provides job-ready skills and real-world experience to engineers and others who are looking to improve their skills in AI and data science through a series of online courses that feature hands-on labs and expert instructors. The program is part …[Read More]
- Microsoft will invest $5 billion in IoT. Here’s why. – We’re also getting a look into how both customers and partners overcome the specific challenges of building an IoT solution that harnesses massive amounts of data. Whether they … machine learning, AI, security and privacy. We are committed to helping …[Read More]
- AI technology helps students who are deaf learn – The AI in the Custom Speech Service uses this data to build models for how domain-specific words are pronounced. When a speaker then uses the words, the system recognizes them and displays the text in the real-time transcript. Chris Campbell is an NTID …[Read More]
- A new IP strategy for a new era of shared innovation – This AI-powered system guides surgeons during operations and sends … the statistical programming language used by data science experts around the world. Over 40 percent of the virtual machines running on Azure today use Linux. Some of our existing …[Read More]
- Inventory Optimization Solution in the Azure AI Gallery – This post is co-authored by Dmitry Pechyoni, Senior Data Scientist, Hong Lu and Chenhui Hu … We recently published a cloud-based inventory optimization solution for retail in the Azure AI Gallery. We designed this solution to be flexible, scalable …[Read More]
- Azure Cosmos DB – You can use a spark connector to do Machine Learning and Data Science in Spark using the Spark connector for … and all developers need to focus is on their application logic. Join our Data & AI Partner community call on April 6, where we’ll be …[Read More]
- Like a phone call: XiaoIce, Microsoft’s social chatbot in China, makes breakthrough in natural conversation – Now, Microsoft believes it has created the first technological breakthrough that can allow people to have a conversation with an AI-powered chatbot that is more like that natural experience a person might have when talking on the phone to a friend.[Read More]
- Run your Python script on demand with Azure Container Instances and Azure Logic Apps – An increasing focus has been placed recently on the data science process; a methodology to govern the enterprise-scale effort that goes into the development, deployment and maintenance of data analytics. Data scientists have not been lacking in terms of …[Read More]
- Project Road Runner uses photo-realistic simulation and deep learning to train autonomous driving algorithms – Project Road Runner hopes to make that task more manageable and accessible to the autonomous driving industry, developers, and researchers, by gathering data and training AI through real-world simulation. Simulation provides a means to collect extensive …[Read More]
The University of California at Berkeley is offering its very popular undergraduate data science course, Foundations of Data Science: Computational Thinking with Python, online via EdX. The course can be taken for free and it starts Monday April, 2 , 2018.
This course is actually part of a larger certificate program, Foundations of Data Science. You must pay to receive the certificate.
Here are the latest articles from Microsoft regarding cloud data science products and updates. Some of the topics this week: Azure ML, AI Research, Intelligent Cloud, and Anomaly Detection.
- What is Azure Machine Learning Studio? – Machine Learning Studio is where data science, predictive analytics … Click Gallery and you’ll be taken to the Azure AI Gallery. The Gallery is a place where a community of data scientists and developers share solutions created using components of …[Read More]
- Four Big Bets For Better AI Research: A Personal Journey – While mining those forums, I discovered a clear problem: people want to write scripts for transforming a column of data, but they don’t know how … heuristics that are today manually programmed in AI systems (and not just our program synthesizers).[Read More]
- Training Many Anomaly Detection Models using Azure Batch AI – This post is authored by Said Bleik, Senior Data Scientist at Microsoft. In the IoT world, it’s not uncommon that you’d want to monitor thousands of devices across different sites to ensure normal behavior. Devices can be as small as microcontrollers …[Read More]
- Satya Nadella email to employees: Embracing our future: Intelligent Cloud and Intelligent Edge – First, computing is more powerful and ubiquitous from the cloud to the edge. Second, AI capabilities are rapidly advancing across perception and cognition fueled by data and knowledge of the world. Third, physical and virtual worlds are coming together to …[Read More]
- Windows 10 RS4 Preview for HoloLens and ONNX offline Machine Learning – Yesterday we released the Windows 10 RS4 preview to HoloLens so this now allows Data scientists and developers creating AI models will be able to deploy their innovations to this large user base. From an academic perspective I have lots of HoloLens …[Read More]
- Dynatrace Managed Instance now available for Azure Government – Built with AI technology, Dynatrace provides full stack, all-in-one monitoring and operations analytics for the public sector at massive scale, in the largest government environments. Dynatrace utilizes artificial intelligence to understand the …[Read More]
- Want to sell Azure but don’t know where to start? – It’s the most trusted, open, and flexible cloud platform. Whether it’s for cloud app development, cloud and infrastructure, data and AI, or security, you’ll be able to give your customers an Azure package that works for them. But with such a broad range of …[Read More]
- Why a developer should build a solution with microservices – Isolated Data & State. As you move ahead you will see how this is done. Meanwhile let’s see what are the advantages you derive out of these. You will analyze the advantages from the perspective of the team involved in the actual development of the code and …[Read More]
- PowerPivot for SharePoint 2016 – Error when trying to schedule a data refresh: “Sorry, something went wrong.” – The SharePoint ULS logs might show the following error: UserProfileDBCache_WCFLogging :: ProfileDBCacheServiceClient.GetUserData threw exception: Access is denied. This can occur when the account running the PowerPivot System Service in SharePoint …[Read More]
- 3 intelligent manufacturing IoT trends to watch out for at HMI 2018 – 1: Enhancing productivity through connected IoT infrastructures Manufacturing technologies—aided by advances in machine learning, artificial intelligence (AI), digitization … secure standards such as OPC UA, data related to real-time manufacturing …[Read More]
I have been getting this question a lot lately, so I recorded my current answer. If you are interested in more videos like this, please subscribe to the Youtube Channel, Learn Data Science.
Got a question you would like answered? Let me know in the comments.
Apache Spark, Databricks, Neural Networks, University Partnerships, and water systems: Here are the latest articles from Microsoft regarding cloud data science products and updates.
- Training State-of-the-Art Neural Networks in the Microsoft Azure Cloud – I noticed that you said tested all of the code for your new deep learning book on the Microsoft Data Science Virtual Machine (DSVM). Does that include the chapters on training networks on the ImageNet dataset as well? I work at a university and we’re …[Read More]
- Azure Databricks, industry-leading analytics platform powered by Apache Spark™ – These capabilities have enabled companies such as renewables.AI to boost the productivity of their data science teams by over 50 percent. “Instead of one data scientist writing AI code and being the only person who understands it, everybody uses Azure …[Read More]
- Creating technology for a range of disabilities helps everyone, says Microsoft accessibility expert – “There have been massive advancements in AI over the past few years, fuelled by the rise of big data, created by the devices in our pockets and at home, and stored by the cloud. Microsoft is infusing AI into its products, but we also have AI platforms …[Read More]
- New digital skills university planned for Milton Keynes – Chancellor Philip Hammond used last year’s Budget to announce more money for AI and triple the number of trained computer science teachers, while Microsoft has launched a digital skills programme that aims to ensure the country remains one of the global …[Read More]
- Microsoft and Tsinghua University Work Together on Open Academic Data Research – The construction of the billion-scale OAG is challenging, because of the heterogeneous distribution of academic data that exists in the different … Students from Peking University, University of Science and Technology of China, and Harbin Institute …[Read More]
- Together, AI and nature are protecting Earth’s water systems for the future – This is exactly the kind of gap we’re intent on solving through AI … Science, we’ve deployed an Internet of Things (IoT)-based network of sensors in the IISc campus to efficiently monitor the flow of water from source to consumption. With these new …[Read More]