Skip to content

Commit f619172

Browse files
committed
differences for PR #326
1 parent b0d3724 commit f619172

File tree

4 files changed

+2563
-8
lines changed

4 files changed

+2563
-8
lines changed

03-basics-factors-dataframes.md

Lines changed: 10 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -183,6 +183,13 @@ a view of the data in a new tab.
183183

184184
![RStudio data frame view](fig/rstudio_dataframeview.png)
185185

186+
The majority of the columns in the data frame correspond to standard fields found in a
187+
*Variant Call Format (VCF)* file, while others were added during our data processing. The VCF
188+
format is a standard format for storing variant calls (also known as Single Nucleotide Polymorphisms or SNPs),
189+
and you can read more about it, including a description of the fields we have here
190+
in [the VCF specification](https://samtools.github.io/hts-specs/VCFv4.2.pdf)
191+
or [on wikipedia](https://en.wikipedia.org/wiki/Variant_Call_Format).
192+
186193
We can also quickly query the dimensions of the variable using `dim()`. You'll see that the first number `801` shows the number of rows, then `29` the number of columns
187194

188195

@@ -687,17 +694,13 @@ install.packages("ggplot2")
687694
```
688695

689696
``` output
690-
# Downloading packages -------------------------------------------------------
691-
- Downloading ggplot2 from https://packagemanager.posit.co/cran/__linux__/jammy/latest ... OK [8.1 Mb in 0.37s]
692-
Successfully downloaded 1 package in 1.5 seconds.
693-
694697
The following package(s) will be installed:
695698
- ggplot2 [4.0.1]
696699
These packages will be installed into "~/work/genomics-r-intro/genomics-r-intro/renv/profiles/lesson-requirements/renv/library/linux-ubuntu-jammy/R-4.5/x86_64-pc-linux-gnu".
697700
698701
# Installing packages --------------------------------------------------------
699-
- Installing ggplot2 ... OK [installed binary and cached in 1.1s]
700-
Successfully installed 1 package in 1.2 seconds.
702+
- Installing ggplot2 ... OK [linked from cache]
703+
Successfully installed 1 package in 5.9 milliseconds.
701704
```
702705

703706
``` r
@@ -711,7 +714,7 @@ These packages will be installed into "~/work/genomics-r-intro/genomics-r-intro/
711714
712715
# Installing packages --------------------------------------------------------
713716
- Installing dplyr ... OK [linked from cache]
714-
Successfully installed 1 package in 4.5 milliseconds.
717+
Successfully installed 1 package in 4.2 milliseconds.
715718
```
716719

717720
These two packages are among the most popular add on packages used in R, and they are part of a large set of very useful packages called the [tidyverse](https://www.tidyverse.org). Packages in the tidyverse are designed to work well together and are made to work with tidy data (which we described earlier in this lesson).

config.yaml

Lines changed: 88 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,88 @@
1+
#------------------------------------------------------------
2+
# Values for this lesson.
3+
#------------------------------------------------------------
4+
5+
# Which carpentry is this (swc, dc, lc, or cp)?
6+
# swc: Software Carpentry
7+
# dc: Data Carpentry
8+
# lc: Library Carpentry
9+
# cp: Carpentries (to use for instructor training for instance)
10+
# incubator: The Carpentries Incubator
11+
carpentry: 'dc'
12+
13+
# Overall title for pages.
14+
title: 'Data Analysis and Visualization with R for Genomics'
15+
16+
# Date the lesson was created (YYYY-MM-DD, this is empty by default)
17+
created: '2018-03-12'
18+
19+
# Comma-separated list of keywords for the lesson
20+
keywords: 'software, data, lesson, The Carpentries'
21+
22+
# Life cycle stage of the lesson
23+
# possible values: pre-alpha, alpha, beta, stable
24+
life_cycle: 'beta'
25+
26+
# License of the lesson materials (recommended CC-BY 4.0)
27+
license: 'CC-BY 4.0'
28+
29+
# Link to the source repository for this lesson
30+
source: 'https://github.com/datacarpentry/genomics-r-intro'
31+
32+
# Default branch of your lesson
33+
branch: 'main'
34+
35+
# Who to contact if there are any issues
36+
37+
38+
# Navigation ------------------------------------------------
39+
#
40+
# Use the following menu items to specify the order of
41+
# individual pages in each dropdown section. Leave blank to
42+
# include all pages in the folder.
43+
#
44+
# Example -------------
45+
#
46+
# episodes:
47+
# - introduction.md
48+
# - first-steps.md
49+
#
50+
# learners:
51+
# - setup.md
52+
#
53+
# instructors:
54+
# - instructor-notes.md
55+
#
56+
# profiles:
57+
# - one-learner.md
58+
# - another-learner.md
59+
60+
# Order of episodes in your lesson
61+
episodes:
62+
- 00-introduction.Rmd
63+
- 01-r-basics.Rmd
64+
- 02-data-prelude.Rmd
65+
- 03-basics-factors-dataframes.Rmd
66+
- 04-bioconductor-vcfr.Rmd
67+
- 05-dplyr.Rmd
68+
- 06-data-visualization.Rmd
69+
- 07-r-help.Rmd
70+
71+
# Information for Learners
72+
learners:
73+
74+
# Information for Instructors
75+
instructors:
76+
77+
# Learner Profiles
78+
profiles:
79+
80+
# Customisation ---------------------------------------------
81+
#
82+
# This space below is where custom yaml items (e.g. pinning
83+
# sandpaper and varnish versions) should live
84+
85+
86+
url: 'https://datacarpentry.github.io/genomics-r-intro'
87+
analytics: carpentries
88+
lang: en

md5sum.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@
66
"episodes/00-introduction.Rmd" "6c2897028a5b88c54afc90f62a90231d" "site/built/00-introduction.md" "2025-12-10"
77
"episodes/01-r-basics.Rmd" "cd2d5e1bef882767cb199933bdc080c4" "site/built/01-r-basics.md" "2025-12-10"
88
"episodes/02-data-prelude.Rmd" "3c871c508670de40fd3b2a853d67e0aa" "site/built/02-data-prelude.md" "2025-12-10"
9-
"episodes/03-basics-factors-dataframes.Rmd" "8443a46ba31357bbccd110e025904b9f" "site/built/03-basics-factors-dataframes.md" "2025-12-10"
9+
"episodes/03-basics-factors-dataframes.Rmd" "36b39c099d6a7e2e774db064ce6b8d46" "site/built/03-basics-factors-dataframes.md" "2025-12-15"
1010
"episodes/04-bioconductor-vcfr.Rmd" "10eb69b4697d7ecb9695d36c0d974208" "site/built/04-bioconductor-vcfr.md" "2025-12-10"
1111
"episodes/05-dplyr.Rmd" "0eb40a1862a4acb0e68cfa2f89d28b2a" "site/built/05-dplyr.md" "2025-12-10"
1212
"episodes/06-data-visualization.Rmd" "b6a629582ef76b79ecabdd6aeb6df9b1" "site/built/06-data-visualization.md" "2025-12-10"

0 commit comments

Comments
 (0)