In the context of my Ph.D., I am studying the molecular evolution of the mud snail genus Ecrobia. Five Ecrobia species are included in this study:
- Ecrobia grimmi (Ecgr)
- Ecrobia maritima (Ecma)
- Ecrobia spalatiana (Ecspa)
- Ecrobia ventrosa (Ecve)
- Ecrobia truncata (Ectr)
And two outgroups:
- Peringia ulvae (Peul)
- Salenthydrobia ferrerii (Safe)
My data include two mitochondrial markers:
- Cytochrome c oxidase subunit I (COI): 359 individuals
- 16S ribosomal RNA (r16S): 74 individuals
And one nuclear marker:
- Internal transcribed spacer 2 (ITS2): 46 individuals
To estimate maximum likelihood trees with RAxML, this program needs to be previously installed, and the path of the executable needs to be given.
I set the working directory to source file location.
The whole script is still under progress. The following steps are already included:
- Import DNA sequences (including outgroups)
- Exploratory data analyses
- Base frequency
- Base frequency for each individual
- Base frequency between genes for all species pooled - WIP
- Multiple sequence alignment
- Best-fit substitution models estimation
- Saturation test
- Merge the alignments
- Infer trees
- Brute force
- Evolutionary distances
- Distance-based phylogenetic methods
- Maximum parsimony
- Maximum likelihood methods
- Bayesian inference
- Codon model - WIP
- The class 'phylo' - WIP
- Visualization methods - WIP
- Plot trees - WIP
The following steps are not included yet:
- Tree comparison
- Bootstrap
- Divergence time estimation
- Coalescence analysis
- List trees
- Tree space
- phytools
- Sublime Text 2 version 2.0.2
- R version 3.4.1
- RStudio version 1.1.419
- Justine Vandendorpe
My project is funded by the European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement No 642973.
This script was started during the course Phylogenetic Analysis Using R organized by Transmitting Science in March 2017. I would like to thank Emmanuel Paradis and Klaus Schliep for teaching during this course. I also found a lot of information in Emmanuel Paradis's book 'Analysis of Phylogenetics and Evolution with R' (Second Edition).