You are here

Gapless alignment revisited

Download pdf | Full Screen View

Date Issued:
2001
Summary:
The purpose of sequence alignment is to detect mutual similarity, characterized by the so-called "alignment score", between sequences compared. To quantitatively assess the confidence level of an alignment result requires the knowledge of alignment score statistics under a certain null model and is the central issue in sequence alignment. In this thesis, the score statistics of Markov null model were revisited and the score statistics of non-Markov null model were investigated for two state-of-the-art algorithms, namely, the gapless Smith-Waterman and Hybrid algorithms. These two algorithms were further used to find highly related signals in unrelated sequences and in weakly related sequences corresponding, respectively, to Markov null model and non-Markov null model. The confidence levels of these models were also studied. Since the sequence similarity we are interested in comes from evolutionary history, we also investigated the relationship between sequence alignment, the tool to find similarity, and evolution. The average evolution distance between the daughter sequences was found and compared with their expected values, for individual trees and as an average over many trees.
Title: Gapless alignment revisited.
170 views
51 downloads
Name(s): Raley, Elizabeth Anne
Florida Atlantic University, Degree grantor
Yu, Yi-Kuo, Thesis advisor
Type of Resource: text
Genre: Electronic Thesis Or Dissertation
Issuance: monographic
Date Issued: 2001
Publisher: Florida Atlantic University
Place of Publication: Boca Raton, Fla.
Physical Form: application/pdf
Extent: 66 p.
Language(s): English
Summary: The purpose of sequence alignment is to detect mutual similarity, characterized by the so-called "alignment score", between sequences compared. To quantitatively assess the confidence level of an alignment result requires the knowledge of alignment score statistics under a certain null model and is the central issue in sequence alignment. In this thesis, the score statistics of Markov null model were revisited and the score statistics of non-Markov null model were investigated for two state-of-the-art algorithms, namely, the gapless Smith-Waterman and Hybrid algorithms. These two algorithms were further used to find highly related signals in unrelated sequences and in weakly related sequences corresponding, respectively, to Markov null model and non-Markov null model. The confidence levels of these models were also studied. Since the sequence similarity we are interested in comes from evolutionary history, we also investigated the relationship between sequence alignment, the tool to find similarity, and evolution. The average evolution distance between the daughter sequences was found and compared with their expected values, for individual trees and as an average over many trees.
Identifier: 9780493418186 (isbn), 12856 (digitool), FADT12856 (IID), fau:9730 (fedora)
Note(s): Charles E. Schmidt College of Science
Thesis (M.S.)--Florida Atlantic University, 2001.
Subject(s): Bioinformatics
Amino acid sequence--Databases
Markov processes
Held by: Florida Atlantic University Libraries
Persistent Link to This Record: http://purl.flvc.org/fcla/dt/12856
Sublocation: Digital Library
Use and Reproduction: Copyright © is held by the author with permission granted to Florida Atlantic University to digitize, archive and distribute this item for non-profit research and educational purposes. Any reuse of this item in excess of fair use or other copyright exemptions requires permission of the copyright holder.
Use and Reproduction: http://rightsstatements.org/vocab/InC/1.0/
Host Institution: FAU
Is Part of Series: Florida Atlantic University Digital Library Collections.