What do I need for this course?
You will need to bring a computer to class each day. Class sessions are a mix of lecture, demonstration, and live coding. It is essential to have a computer so you can follow along and complete the exercises.
Textbooks/Readings
- R for Data Science – Garrett Grolemund and Hadley Wickham
Completing the exercises in the book? No official solution manual exists, but several can be found online. I recommend this version by Jeffrey B. Arnold. Your exact solutions may vary, but these can be a good starting point.
Additional resources
- DataCamp - DataCamp offers interactive R and Python courses on topics in data science, statistics, and machine learning. These are a supplement to our primary textbook and lessons. Introductory DataCamp courses are free, whereas premium courses generally cost a fee. As a participant in this course, you have free access to all premium courses for the duration of the term. I will send you an email in the first week to activate your account if you wish to create one.
- ggplot2: Elegant Graphics for Data Analysis, 2nd Edition – Hadley Wickham
- Excellent resource for the
ggplot2
graphics library.
- Advanced R – Hadley Wickham
- Hardcover available online for around $55, but the online version is free
- A deep dive into R as a programming language, not just a tool for data science. We will use some chapters in class, but most of this material is best covered on your own after completion of this course
- RStudio Cheatsheets
Software
By the end of the first week (or even better, before the course starts), you should install the following software on your computer:
- R - easiest approach is to select a pre-compiled binary appropriate for your operating system.
- RStudio’s IDE - this is a powerful user interface for programming in R. You could use base R, but you would regret it.
- Git - Git is a version control system which is used to manage projects and track changes in computer files. Once installed, it can be integrated into RStudio to manage your course assignments and other projects.
- \(\LaTeX\) - \(\LaTeX\) is a powerful typesetting system which is the de facto standard for communication and publication of scientific documents. In order to render R Markdown documents as a PDF, you need to install \(\LaTeX\) on your machine.
Comprehensive instructions for downloading and setting up this software (with the exception of \(\LaTeX\)) can be found here.
How will I be evaluated?
Students will complete weekly problem sets with a combination of analytical and computational problems. Each problem set is worth 10 points. Final grades will be determined based on cumulative performance across the problem sets. Specific instructions for submitting homework assignments will be outlined in-class.
Statement on Disabilities
If you need any special accommodations, please provide me (Dr. Soltoff) with a copy of your Accommodation Determination Letter (provided to you by the Student Disability Services office) as soon as possible so that you may discuss with me how your accommodations may be implemented in this course.
This work is licensed under the CC BY-NC 4.0 Creative Commons License.