An Introduction to R for RNA-Seq Analysis¶
RNA-Sequencing (RNA-Seq) is a popular method for determining the presence and quantity of RNA in biological samples. In this 3 hour workshop, we will use R to explore publicly-available RNA-Seq data from the Gene Expression Tissue Project (GTEx). Attendees will be introduced to the R syntax, variables, functions, packages, and data structures common to RNA-Seq projects. We will use RStudio to import, tidy, transform, and visualize RNA-Seq count data. Attendees will learn tips and tricks for making the processes of data wrangling and data harmonization more manageable. This workshop will not cover cloud-based workflows for processing RNA-seq reads or statistics and modeling because these topics are covered in our RNA-Seq Concepts and RNA-Seq in the Cloud workshops. Rather, this workshop will focus on general R concepts applied to RNA-Seq data.
|Est. Time||Lesson name||Description|
|25 min||Introduction||Overview of RStudio and Binder|
|30 min||Import Data||Importing data with
|30 min||Visualize Data||Visualizing data with
|90 min||Wrangle Data||Tyding and transforming data|
In this workshop, you will learn how to use R and RStudio to:
- import and view files commonly associated with RNA-sequencing experiments
- select variables and observations that are relevant to research questions (tidy)
- create and rename variables (transform)
- join data frames by common variables (harmonize)
- visualize data using bar graphs, scatter plots, and box plots
Here are the notes from workshops taught with these materials.
Here is a 3 hours video of this workshop being taught at the May 2022 Hackathon.
Familiarity with R and RNA Sequencing is not required but would be useful. This lesson uses a standardized binder environment, which will work on Windows, Mac, and Linux operating systems, and Firefox, Safari, and Chrome web browsers.