Summary and Schedule
This is a new lesson built with The Carpentries Workbench.
Setup Instructions | Download files required for the lesson | |
Duration: 00h 00m | 1. What is sampling |
What is sampling? What requirements should a good sample fulfill? |
Duration: 00h 05m | 2. What is a probability distribution? | What is a probability distribution? |
Duration: 00h 17m | 3. The binomial distribution |
What is the binomial distribution? What kind of data is it used on? |
Duration: 00h 22m | 4. Probability distributions in R | How can I calculate probabilities in R? |
Duration: 00h 34m | 5. The Poisson distribution |
What is the Poisson distribution? What kind of data is it used on? |
Duration: 00h 44m | 6. Simulations in R | How can I make my own data in R? |
Duration: 00h 51m | 7. The Gamma-Poisson distribution |
What is the Gamma-Poisson distribution? What kind of data is it used on? |
Duration: 01h 05m | 8. The Gaussian distribution | What is the Gaussian distribution and what kind of data is it used on? |
Duration: 01h 07m | 9. Visualizing distributions |
How can I visualize the distribution of my data? How can I find out whether my data is well described by a certain probability distribution? |
Duration: 01h 12m | 10. Histograms in R |
How can I plot a histogram of my data in R? How can I compare my data to a distribution using histograms? |
Duration: 01h 27m | 11. The cumulative distribution function | What is the empirical cumulative distribution function? |
Duration: 01h 30m | 12. The QQ-plot |
What is the QQ-plot? How can I create a QQ-plot of my data? Why is it useful? |
Duration: 01h 38m | 13. Summary | |
Duration: 01h 43m | 14. Exercises | |
Duration: 03h 13m | Finish |
The actual schedule may vary slightly depending on the topics and exercises chosen by the instructor.
Summary
At the end of this course, you’ll able to:
- describe what sampling and probability distributions are
- list some common distributions of biological data
- visualise the distribution of your data in R
- pick a suitable distribution to model your data with
Time
The episodes can be taught in about 2 hours. Your individual reading time may be different.
For completing the exercises (episode 13), we typically plan 1.5 hours.
Prerequisites
Before starting this course, we recommend you complete a first tutorial on data handling and visualisation or have basic knowledge or R and the tidyverse.
Setup
You need R and RStudio running on your computer, as we will not fix installations during the course.
Links for installation:
- R (install this first): https://cloud.r-project.org/
- RStudio: https://www.rstudio.com/products/rstudio/download/
If you have an EMBL account, an alternative to installation can be using rstudio.embl.de - please check that you can log in, in case you want to use this option.
To install all packages necessary for completing the exercises and running the demonstrations, run the following command from the console in RStudio:
source(“https://www.huber.embl.de/users/kaspar/biostat_2021/install_packages_biostat.R”)