SAGE Campus Bytesize courses are a series of short courses that teach core data science skills to people who are eager to learn, but short on time.
In Bytesize: Cleaning Data and Preprocessing you will learn how to prepare data so that it is in a format that functions in R or Python can read and work with.
By the end of this bytesize course, you will be able to:
Define what cleaning and preprocessing data are
Explain why these steps are necessary
Identify common cleaning tasks and possible solutions
Use regular expressions to standardize text
Join multiple data sources together
Incorporate best practices into your data science workflow
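As a rough illustration of two of the outcomes above (standardizing text with regular expressions and joining data sources), here is a minimal Python sketch. It is not taken from the course materials: the sample records, the `standardize_phone` helper and the 10-digit phone format are all assumptions made for the example, and only the standard library is used.

```python
import re


def standardize_phone(raw):
    """Return a phone number as XXX-XXX-XXXX, or None if it cannot be parsed.

    Hypothetical helper: assumes a valid number has exactly 10 digits.
    """
    digits = re.sub(r"\D", "", raw)  # drop every non-digit character
    if len(digits) != 10:
        return None  # treat unparseable values as missing
    return f"{digits[:3]}-{digits[3:6]}-{digits[6:]}"


# Hypothetical source 1: contact records with inconsistent formatting
contacts = [
    {"id": 1, "phone": "(555) 123-4567"},
    {"id": 2, "phone": "555.987.6543"},
    {"id": 3, "phone": "n/a"},
]

# Hypothetical source 2: names keyed by the same id
names = {1: "Ada", 2: "Grace"}

# Left join on id: keep every contact, fill in the name where one exists
joined = [
    {
        "id": row["id"],
        "name": names.get(row["id"]),
        "phone": standardize_phone(row["phone"]),
    }
    for row in contacts
]

for record in joined:
    print(record)
```

In practice a library such as pandas would usually handle the join, but the plain-dictionary version keeps the logic visible: clean each value first, then match records on a shared key.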
To successfully complete this course, you will need some knowledge of Python and/or R. We assume that participants can assign variables, direct the flow of control using conditionals, define their own functions and read files. These topics are all covered in SAGE Campus’ Introduction to Data Science with Python and Introduction to Data Science with R. If you have completed these courses, or a similar course, you are ready to take this bytesize course.
You do not need to install any software. All you need to complete this course is a web browser and an internet connection. We do all our programming using JupyterHub, which means that you can code in your browser window.
A computer or laptop with the suggested software and a modern browser, e.g. Internet Explorer 10+ or the latest version of Chrome or Firefox.
You will have access to the course for 3 months.
All of our courses offer a certificate of completion signed by your instructor. You will be able to download this certificate from the Learning Platform when you complete the course.