filmov
tv
Using base R and testthat to calculate probabilities (CC271)
Показать описание
Watch and code along with Pat as he uses test driven development using testthat and base R to count kmers and calculate probabilities for a naive Bayesian sequence classifier. Pat generates all possible kmers for a sequence and then all sequences in a collection. These kmer counts are then used to generate the word specific priors and genus-specific conditional probabilities that are needed to train the naive Bayesian classifier for 16S rRNA gene sequences. Along the way, Pat continues to use Test Driven Development using the testthat R package and a number of tools from base R. This episode is part of an ongoing effort to develop an R package that implements the naive Bayesian classifier.
Check out the GitHub repository at the:
#rdp #16S #classification #classifier #microbialecology #microbiome
Support Riffomonas by becoming a Patreon member!
You can also find complete tutorials for learning R with the tidyverse using...
0:00 Introduction
6:00 Generate all kmers for a sequence
14:01 Generate kmers across all sequences
21:27 Calculate word-specific priors
27:42 Calculate genus-specific conditional probabilities
Check out the GitHub repository at the:
#rdp #16S #classification #classifier #microbialecology #microbiome
Support Riffomonas by becoming a Patreon member!
You can also find complete tutorials for learning R with the tidyverse using...
0:00 Introduction
6:00 Generate all kmers for a sequence
14:01 Generate kmers across all sequences
21:27 Calculate word-specific priors
27:42 Calculate genus-specific conditional probabilities
Комментарии