SAGE Campus Bytesize courses are a series of short courses that teach core data science skills to people who are eager to learn, but short on time.
In Bytesize: Collecting Data from the Web you will learn how to extract data from web resources appropriate to your research question. Special attention will be given to how to obtain permission from hosts, and proper etiquette when using APIs and scraping.
By the end of this bytesize course, you will be able to:
Explain in simple terms how the internet works.
Define and use an API to collect data from the web.
Explain the difference between using an API and web scraping.
Recognize potential legal issues surrounding web scraping.
Use a programming language to collect web data.
To successfully complete this course, you will need some knowledge of Python and/or R. We assume that participants can assign variables, direct the flow of control using conditionals, define their own functions and read files. These topics are all covered in SAGE Campus’ Introduction to Data Science with Python and Introduction to Data Science with R. If you have completed these courses, or a similar course, you are ready to take this bytesize course.
To successfully complete this course, you will need knowledge of Python and/or R.
No. All you need to complete this course is a web browser and an internet connection. We do all our programming using JupyterHub, which means that you can code in your browser window.
A computer or laptop with the suggested software and a modern browser e.g. Internet Explorer 10+ or the latest versions of Chrome and Firefox.
You will have access to the course for 3 months.
All of our courses offer a certificate of completion signed by your instructor. You will be able to download this certificate, from the Learning Platform, when you complete the course.
Can't find what you're looking for? Contact Us