Electrical & Computer Engineering, Department of

Department of Electrical and Computer Engineering: Faculty Publications

Grammar-Based Distance in Progressive Multiple Sequence Alignment

David Russell, University of Nebraska-Lincoln
Hasan H. Otu, Harvard Medical School
Khalid Sayood, University of Nebraska-Lincoln

ORCID IDs

Hasan H. Otu

Date of this Version

2008

Abstract

Background: We propose a multiple sequence alignment (MSA) algorithm and compare its alignment quality and execution time with those of existing algorithms. The proposed progressive alignment algorithm uses a grammar-based distance metric to determine the order in which biological sequences are pairwise aligned. The progressive alignment proceeds by pairwise aligning each new sequence with an ensemble of the previously aligned sequences.
Results: The performance of the proposed algorithm is validated by comparison with the popular progressive multiple alignment approaches ClustalW and T-Coffee, and with the more recently developed algorithms MAFFT, MUSCLE, Kalign, and PSAlign, using the BAliBASE 3.0 database of amino acid alignment files and a set of longer sequences generated by the Rose software. The proposed algorithm builds multiple alignments comparable to those of the other programs with significant improvements in running time. The results are especially striking for large datasets.
Conclusion: We introduce a computationally efficient progressive alignment algorithm that uses a grammar-based sequence distance and is particularly useful for aligning large datasets.
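
Illustrative sketch (not taken from the paper): the abstract says only that a grammar-based distance determines the order in which sequences are aligned. The Python below shows one way such an ordering could be computed. Everything specific in it is an assumption made for illustration: the LZ78-style phrase count used as the "grammar" size, the normalized distance formula, the greedy nearest-neighbour ordering, and the function names `lz78_phrase_count`, `grammar_distance`, and `alignment_order`. The pairwise alignment step itself is not shown.

```python
from itertools import combinations


def lz78_phrase_count(seq: str) -> int:
    """Count the phrases in a simple LZ78-style parse of seq.

    Assumption: this phrase count stands in for the size of a grammar
    describing the sequence; the paper's own grammar may differ.
    """
    seen = set()
    phrase = ""
    count = 0
    for ch in seq:
        phrase += ch
        if phrase not in seen:
            seen.add(phrase)
            count += 1
            phrase = ""
    if phrase:
        count += 1  # trailing phrase that was already in the dictionary
    return count


def grammar_distance(s: str, t: str) -> float:
    """Hypothetical normalized distance: how many new phrases one sequence
    needs when parsed after the other, relative to the larger parse."""
    cs, ct = lz78_phrase_count(s), lz78_phrase_count(t)
    if max(cs, ct) == 0:
        return 0.0  # both sequences empty
    cst = lz78_phrase_count(s + t)
    cts = lz78_phrase_count(t + s)
    return max(cst - cs, cts - ct) / max(cs, ct)


def alignment_order(seqs: list[str]) -> list[int]:
    """Greedy ordering (an assumption, not the paper's procedure): start
    with the closest pair, then repeatedly add the remaining sequence
    that is closest to any sequence already chosen."""
    n = len(seqs)
    if n < 2:
        return list(range(n))
    dist = {
        (i, j): grammar_distance(seqs[i], seqs[j])
        for i, j in combinations(range(n), 2)
    }

    def d(i: int, j: int) -> float:
        return dist[(i, j)] if i < j else dist[(j, i)]

    order = list(min(dist, key=dist.get))
    remaining = set(range(n)) - set(order)
    while remaining:
        nxt = min(remaining, key=lambda k: min(d(k, j) for j in order))
        order.append(nxt)
        remaining.remove(nxt)
    return order
```

Calling `alignment_order(seqs)` returns the indices of `seqs` in the order they would be fed to a progressive aligner. No alignment is computed along the way, which is where a distance of this kind can save time on large datasets.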

Included in

Electrical and Computer Engineering Commons

DigitalCommons@University of Nebraska - Lincoln
