SAGE Campus Bytesize courses are a series of short courses that teach core data science skills to people who are eager to learn, but short on time.
In Bytesize: Text Analysis you will learn the building blocks that serve as the foundation for computational text analysis.
By the end of this bytesize course, you will be able to:
List and justify or criticize common preprocessing steps
Explain the "bag of words" (BoW) model
Define TFIDF value
Define "n-gram" and explain how it improves our language model
Create features suitable for a classification model
Correctly interpret a topic model
To successfully complete this course, you will need some knowledge of Python and/or R. We assume that participants can assign variables, direct the flow of control using conditionals, define their own functions and read files. These topics are all covered in SAGE Campus’ Introduction to Data Science with Python and Introduction to Data Science with R. If you have completed these courses, or a similar course, you are ready to take this bytesize course.
To successfully complete this course, you will need knowledge of Python and/or R.
No. All you need to complete this course is a web browser and an internet connection. We do all our programming using JupyterHub, which means that you can code in your browser window.
A computer or laptop with the suggested software and a modern browser e.g. Internet Explorer 10+ or the latest versions of Chrome and Firefox.
You will have access to the course for 3 months.
All of our courses offer a certificate of completion signed by your instructor. You will be able to download this certificate, from the Learning Platform, when you complete the course.
Can't find what you're looking for? Contact Us