ORCID ID
0000-0001-9639-2894
Publication Date
2018-03-19
Document Type
Doctoral Dissertation
Department
Department of Bioinformatics and Integrative Biology
First Thesis Advisor
Jeffrey Bailey
Keywords
plasmodium, sequencing, infectious diseases, targeted amplicon
Abstract
The rapid development of DNA sequencing technologies has opened up new avenues of research, including the investigation of population structure within infectious diseases (both within patient and between populations). In order to take advantage of these advances in technologies and the generation of new types of data, novel bioinformatics tools are needed that won’t succumb to artifacts introduced by the data generation, and thus provide accurate and precise results. To achieve this goal I have create several tools.
First, SeekDeep, a pipeline for analyzing targeted amplicon sequencing datasets from various technologies, is able to achieve 1-base resolution even at low frequencies and read depths allowing for accurate comparison between samples and the detection of important SNPs. Next, PathWeaver, a local haplotype assembler designed for complex infections and highly variable genomic regions with poor reference mapping. PathWeaver is able to create highly accurate haplotypes without generating chimeric assemblies. PathWeaver was used on the key protein in pregnancy-associated malaria Plasmodium falciparum VAR2CSA which revealed population sub-structuring within the key binding domain of the protein observed to be present globally along with confirming copy number variation. Finally, the program Carmen is able to utilize PathWeaver to augment the results from targeted amplicon approaches by reporting where and when local haplotypes have been found previously.
These rigorously tested tools allow the analysis of local haplotype data from various technologies and approaches to provide accurate, precise and easily accessible results.
Repository Citation
Hathaway NJ. (2018). A suite of computational tools to interrogate sequence data with local haplotype analysis within complex Plasmodium infections and other microbial mixtures. Morningside Graduate School of Biomedical Sciences Dissertations and Theses. https://doi.org/10.13028/M2039K. Retrieved from https://escholarship.umassmed.edu/gsbs_diss/970
DOI
10.13028/M2039K
DOI Link
Rights and Permissions
Copyright is held by the author, with all rights reserved.