Skip to content

Commit 9728d7c

Browse files
authored
Update Post “2023-11-gnomad-v4-0” (#193)
1 parent 21898ab commit 9728d7c

File tree

1 file changed

+12
-10
lines changed

1 file changed

+12
-10
lines changed

content/posts/2023-11-gnomad-v4-0.md

Lines changed: 12 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -51,7 +51,7 @@ We have removed support for subsets from the gnomAD v4 exomes for two major reas
5151

5252
### Updated filtering allele frequency
5353

54-
We have released a new, combined filtering allele frequency (FAF), which integrates variant frequencies across the 734,947 exomes and 76,215 genomes. The exome FAF and genome FAF are still listed on the variant pages as well.
54+
We have released a new, combined filtering allele frequency (FAF), which integrates variant frequencies across the 730,947 exomes and 76,215 genomes. The exome FAF and genome FAF are still listed on the variant pages as well.
5555

5656
### Structural Variants
5757

@@ -168,16 +168,16 @@ In gnomAD [v3.1.2](https://gnomad.broadinstitute.org/news/2021-10-gnomad-v3-1-2-
168168
For both of these methods, we compared samples to the defined distribution and flagged any samples for filtering if the sample's value was **4 median absolute deviations (MADs)**:
169169

170170
* **over** or **under** the following metrics
171-
* * insertion : deletion ratio
172-
* transition : transversion ratio
173-
* singleton transition : transversion ratio
174-
* * We only computed this metric for samples with:
175-
* * 50 nearest neighbors:
176-
* * number of singletons > median number of singletons calculated with the 50 nearest neighbors
177-
* Platform stratification + genetic ancestry PC regression filtering:
178-
* * residual number of singletons > median residual number of singletons for that platform
171+
* insertion : deletion ratio
172+
* transition : transversion ratio
173+
* singleton transition : transversion ratio
174+
* We only computed this metric for samples with:
175+
* 50 nearest neighbors:
176+
* number of singletons > median number of singletons calculated with the 50 nearest neighbors
177+
* Platform stratification + genetic ancestry PC regression filtering:
178+
* residual number of singletons > median residual number of singletons for that platform
179179
* OR **over** the median for:
180-
* * heterozygous : homozygous ratio
180+
* heterozygous : homozygous ratio
181181

182182
Only samples that were filtered by **both** the 50 nearest neighbors and the platform stratification + genetic ancestry PC regression filtering were removed.
183183

@@ -228,3 +228,5 @@ Finally, thanks to critical guidance by members of the [gnomAD steering committe
228228
1. Lu, W., Gauthier, L.D., Poterba, T., Giacopuzzi, E., Goodrich, J.K., Steves, C.R., King, D., Daly, M.J., Neale, B.M. &  Karczewski, K.J. CHARR efficiently estimates contamination from DNA sequencing data. *bioRxiv* 2023.06.28.545801 (2023). <https://doi.org/10.1101/2023.06.28.545801>
229229
2. Koenig, Z., Yohannes, M.T., Nkambule, L.LGoodrich, J.K., Kim, H.A., Zhao, X., Wilson, M.W., Tiao, G., Hao, S.P., Sahakian, N., Chao, K.R., gnomAD Project Consortium, Talkowski, M.E., Daly, M.J., Brand, H., Karczewski, K.J., Atkinson, E.G. & Martin, A.R. A harmonized public resource of deeply sequenced diverse human genomes. *bioRxiv* 2023.01.23.525248 (2023). <https://doi.org/10.1101/2023.01.23.525248>
230230
3. Bergström, A., McCarthy, S. A., Hui, R., Almarri, M. A., Ayub, Q., Danecek, P., Chen, Y., Felkel, S., Hallast, P., Kamm, J., Blanché, H., Deleuze, J. F., Cann, H., Mallick, S., Reich, D., Sandhu, M. S., Skoglund, P., Scally, A., Xue, Y., Durbin, R., … Tyler-Smith, C. (2020). Insights into human genetic variation and population history from 929 diverse genomes. Science (New York, N.Y.), 367(6484), eaay5012. <https://doi.org/10.1126/science.aay5012>
231+
232+
*Updated in August 2025 to fix typo and formatting.*

0 commit comments

Comments
 (0)