iStock-821918926.jpg
iStock-821918926.jpg

Overview


Practical Data Management with R for Social Scientists 

Next course runs from [[date]]

SCROLL DOWN

Overview


Practical Data Management with R for Social Scientists 

Next course runs from [[date]]

Course description

Data management - the art and science of collecting, cleaning, and manipulating data - is an essential research skill for social scientists. 

This course will seek to demystify these 'dark arts', and provide you with the foundation to tackle a wide range of practical data management tasks using the R programming language.

By the end of this course, you will possess the programming skills and experience to manage data in a wide variety of formats, in a reproducible manner, at scale.

Learning outcomes

This course will teach you:

  • Basic R commands and data structures for manipulating data
  • The ability to read data from multiple formats in and out of R
  • Proficiency using loops, conditional statements, and functions to automate common data management tasks
  • Familiarity with R’s package system for extending its functionality
  • The skills to clean and manage multiple complex datasets
  • The ability to clean and manipulate textual data
  • An understanding of basic web scraping techniques, for both standard web pages and the Twitter API
  • An overview of the techniques and hardware necessary to manage large datasets efficiently

Any questions? - Contact us

For a bulk order of 5 or more learners on any of our courses, you can claim 50% discount. Contact us for more information.

Practical Data Management with R for Social Scientists

Effort
12-20 hours, divided into 5 units. The course is self-paced.
Prerequisites
You do not need to have any prior programming experience as you will be taught everything you need to know within the course.

Basic computer skills, and some experience working with any sort of statistical analysis software (Excel, SAS, Stata, SPSS) will be helpful.

You should have some experience working with datasets, such as datasets that record several different pieces of information about each individual.

Instructors
Matthew Denny
Language
English
 
399.00
Start Date:
Enroll
399.00
Start Date:
Enroll

Course Instructors


Course Instructors


Course Instructor

 
 

How it works


How it works


How It Works

This course is broken up into five units, starting with basic R programming and working up through more advanced data management, web scraping, and finally big data/HPC techniques and issues. Each unit is broken up into a number of discrete topics, with video lectures and supporting materials associated with each topic. At the end of each unit, participants will be given a homework assignment that synthesizes what they have learned in that unit.
 
Each unit should take about a week, with the earlier units requiring a bit more time. Each topic builds on the previous topics, so it is designed to be completed in order, unless you have previous experience with a topic. In the lectures, Matt provides as much context and explanation for each new concept as possible. If you put in the work, and try out the code yourself, you should have a strong foundation to start working on serious data management tasks in R by the end of the course.

We know you’re busy so we don’t set module deadlines. You are free to login and learn whenever suits your schedule. 
If you are able to commit to a few evenings a week, you will have completed the modules within approximately a month. However if you have a busy period, you have the flexibility to spread your module completion over a 3 months access period.  

SAGE Campus will help you with any IT or platform issues you might have.

Syllabus


Syllabus


Syllabus 

Unit One - Introduction to R and RStudio

As well as introducing you to the course, this unit will teach you:

  • How to install R and RStudio
  • Basic R programming skills, such as how to write commands in an R script
  • How to understand the core data structures you need to manage a huge variety of data

Unit Two - R Programming Fundamentals

This unit will teach you about:

  • Data I/O and packages so you can extend the functionality of R
  • Looping and conditional statements so you can automate wildly complex tasks
  • Functions so you don’t have to write the same code over and over again for similar tasks

Unit Three - Data Management in R

This unit will teach you how to:

  • Manage multiple datasets by example
  • Convert long and wide format data
  • Deal with poorly formatted data and/or missing data
  • Automate tasks using functions
  • Work with and manipulate text data

Unit Four - Automated Data Collection

This unit will give you:

  • An overview of web/text scraping and the related legal considerations
  • A basic web scraping example so you can learn how to treat a webpage as a messy text document
  • An understanding of scraping Twitter

Unit Five - Performance and Scalability

This unit will focus on:

  • Giving you an overview of big data and high-performance computing (HPC)
  • Teaching you about performant programming
  • Wrapping up with a range of next steps and ways to extend your skills

Testimonials


Testimonials


what our learners say

FAQs


FAQs


Frequently Asked Questions

Please see below for answers to some of the most frequent questions we get about this course.

Can't find what you're looking for?

How long will I have access to the course for?

You will have access to the course material for 3 months.

Do learners get a certificate?

All of our courses offer a certificate of completion signed by your instructor. You will be able to download this certificate, from the Learning Platform, when you complete the course.

 

Enroll Now


Enroll Now


Enroll Now

Next course starts [[date]]. Book your place today!

Practical Data Management with R for Social Scientists
399.00
Start Date:
Enroll