dc.contributor.advisor | Tiemann-Boege, Irene | |
dc.contributor.author | Heinzl, Monika | |
dc.date.accessioned | 2022-03-08T14:23:06Z | |
dc.date.available | 2022-03-08T14:23:06Z | |
dc.date.issued | 2018 | |
dc.date.submitted | 2018-05-29 | |
dc.identifier.uri | https://dspace.jcu.cz/handle/123456789/38558 | |
dc.description.abstract | Duplex sequencing detects ultra-rare mutations by tagging DNA molecules with double-stranded tags. This method creates single-stranded consensus sequences (SSCS) from the reads, which then form duplex consensus sequences (DCS) and are then aligned to the reference genome to call mutations. During this process, a large amount of sequencing data is lost. Therefore, we have developed new algorithms, that give insight in the sequencing data which helps to improve the reads/SSCS/DCS ratios. In addition, a graphical representation of the sequencing data was implemented. The first part of the thesis is focusing on the distribution of sizes of read families. Second, a detailed analysis of the tags is shown by calculating their Hamming distances which can identify sequencing or PCR errors from true molecules. In addition, we can detect artificial produced chimeric reads during PCR. The fourth part includes the application of our algorithms on shorter tag lengths and on only those tags which are involved in the formation of DCSs. Finally, we investigated different sources of read loss during data analysis. | cze |
dc.format | 38 p. | |
dc.format | 38 p. | |
dc.language.iso | eng | |
dc.publisher | Jihočeská univerzita | cze |
dc.rights | Bez omezení | |
dc.subject | duplex sequencing | cze |
dc.subject | mutations | cze |
dc.subject | data analysis | cze |
dc.subject | Hamming distance | cze |
dc.subject | read family size | cze |
dc.subject | chimeric reads | cze |
dc.subject | duplex sequencing | eng |
dc.subject | mutations | eng |
dc.subject | data analysis | eng |
dc.subject | Hamming distance | eng |
dc.subject | read family size | eng |
dc.subject | chimeric reads | eng |
dc.title | Development of algorithms for the analysis of duplex sequencing data | cze |
dc.title.alternative | Development of algorithms for the analysis of duplex sequencing data | eng |
dc.type | bakalářská práce | cze |
dc.identifier.stag | 53028 | |
dc.description.abstract-translated | Duplex sequencing detects ultra-rare mutations by tagging DNA molecules with double-stranded tags. This method creates single-stranded consensus sequences (SSCS) from the reads, which then form duplex consensus sequences (DCS) and are then aligned to the reference genome to call mutations. During this process, a large amount of sequencing data is lost. Therefore, we have developed new algorithms, that give insight in the sequencing data which helps to improve the reads/SSCS/DCS ratios. In addition, a graphical representation of the sequencing data was implemented. The first part of the thesis is focusing on the distribution of sizes of read families. Second, a detailed analysis of the tags is shown by calculating their Hamming distances which can identify sequencing or PCR errors from true molecules. In addition, we can detect artificial produced chimeric reads during PCR. The fourth part includes the application of our algorithms on shorter tag lengths and on only those tags which are involved in the formation of DCSs. Finally, we investigated different sources of read loss during data analysis. | eng |
dc.date.accepted | 2018-07-10 | |
dc.description.department | Přírodovědecká fakulta | cze |
dc.thesis.degree-discipline | Bioinformatics | cze |
dc.thesis.degree-grantor | Jihočeská univerzita. Přírodovědecká fakulta | cze |
dc.thesis.degree-name | Bc. | |
dc.thesis.degree-program | Applied Informatics | cze |
dc.description.grade | Dokončená práce s úspěšnou obhajobou | cze |
dc.contributor.referee | Regl, Alois | |