NFCDS Workshop Overview - Programming and Data Analysis with R and BASH

This is a two-part workshop series delivered as part of the NFCDS pedagogdy fellowship program. In this workshop series participants will learn about biological data analysis with R and Unix/Linux tools, with a focus on techniques for creating modular R and BASH scripts.

Given the increasing amount of data being generated today, programmatic data analysis is an important skill for a wide range of fields. The R programming language and Unix/Linux command line can be powerful tools for analyzing data. Even more powerfully, the huge variety of R and command line tools can be used together with scripting to create custom pipelines to analyze large or complex data sets.

This workshop begins with an introduction to best practices in both programming and data analysis, then includes a walkthrough of essential programming and scripting techniques for biological data analysis automation using R and Unix/Linux tools.

Audience

The first lesson of this two-part workshop is designed for anyone interested in learning more about programming best practices, and how to create R and BASH scripts to automate data analysis. No prior programming or data analysis experience is required.

The second lesson of the workshop is designed for anyone interested in learning how to combine R and BASH scripting to automate the analysis of biological data sets. For example, biologists interested in learning how to create pipelines that use a combination of R scripts to analyze and visualize data. Another potential application of the skills learned from this workshop is the ability to create a set of scripts that work together to integrate specific omics tools in a data analysis workflow.

Prerequisites

  1. Participants should be comfortable with using a computer and basic statistical methods. Furthermore, participants should have:
    • used a computer before
    • limited to no experience with BASH
    • limited to no experience with R
  2. Each participant needs to have access to a computer with R, RStudio, and Unix/Linux or the Windows Ubuntu app. To get started, follow the directions in the Setup tab.

Workshop Overview

Day Lesson Description
1 A Beginner’s Guide to Programming and Data Analysis with R and BASH Learn more about programming best practices, and how to create R and BASH scripts to automate data analysis.
2 A Biologist’s Guide to Programming and Data Analysis with R and BASH Learn how to combine R and BASH scripting to automate the analysis of biological data sets.