GGRaSP (Gaussian Genome Representative Selector with Prioritization) is an R-package which can generate and return a reprentative set of genomes from a large group of genomes with a defined relationship. The reprentative set is select either using user-defined cutoff or cluster number values or else de novo calculates the clusters based on modeling the genome relationships with a Gaussian Mixture Model. The default value returned is the list of reprentative genomes, but the package also allows for multiple outputs including text files, plots, and trees. To allow for high-throughput analysis, we have included an Rscript file that can run GGRaSP from the command line (though it does require GGRaSP to be installed to the default R location).
Installing GGRaSP
GGRaSP can be installed using the install_git command in devtools. However this will not install the command line script, which needs to be installed seperately as shown below. The example data is also pulled for demonstration purposes.
if this is not working, it is also the options to download the ggrasp_1.0.tar.gz file from github and install ggrasp from this
getting the individual files
Using GGRaSP
GGRaSP can be used in two ways. The first uses the R-console to use the GGRaSP functions to load the genomes, cluster the genomes, and report the reprentative genomes. The second uses the command line Rscript. Below we will show the default version of running both with the example data from Chavdra et al 2016.
Using GGRaSP in the R console
GGRaSP is centered around two primary functions to load and analyze the data, with three primary output variables that allow for analysis and reporting of the clustering. For more detailed descriptions of all the GGRaSP R functions, please examine the vigenettes.
To start with, simply load the library and genome-relationship file, here the Chavda ANI similarity matrix provided in the examples file. An offset of 100 is used to transform the simularity matrix to a distance matrix and uses a complete hclust (equivilant to an UPMGA) to make the phylogeny.
viejasxxx vecinasxxx
videos caseros españa follando sin querer
maduras insaciables me follo a mi tia
lecturas porno www.maduras.con
incesto en espanol pajas entre amigos
porno español madre e hijo incesto asiatico
porno peludas se le marca el coño
sexo camara oculta españa cartoon xxx
inciesto español hermanos rusos follando
ariana grande porno follando en el parque
pono gay maduras folladas por el culo
maduros en pajilleros abuelasputas
porno italiano incesto folladas en el metro
jovencita masturbandose se folla a su compañero de piso
maduras tragando semen pornolesvianas
corridas en el culo concursos nudistas
nenas follando videos eroticos italianos
ver pelis eroticas casadas muy putas
peludas españolas violacion real porno
sexo en directo videos gay xx
file(“extdata”, “Enter.ANI.mat”, package=”ggrasp”), file.format=”matrix”, tree.method=”complete”, offset=100)
Now that the the simularity matrix is loaded in, cluster it using the default values, checkout the cutoff and number of distributions using the summary variable, and visualize the Gaussian Distributions:
Running GGRaSP on the command line
GGRaSP also includes an Rscript program that can be run on the command line. Since GGRaSP does allow for multiple parameter changes, the script can take in multiple parameters for (i) input, (ii) clustering, and (iii) outputing. It also can run a simplified default version. For example, the same analysis as above (load in a similarty matrix, make a UPMGA tree, default GMM cluster, print the medoids and save the gmm plot) can be run below:
The output files will be: enter.test.out.gmm.pdf (gmm plot) and: enter.out.medoids.txt (medoid list).
A list of all the available flags for the scripts are as follows:
##Citing GGRaSP
When using GGRaSP in your analysis, please cite GGRaSP as follows:








