Back-translation for discovering distant ...
Type de document :
Article dans une revue scientifique: Article original
DOI :
PMID :
Titre :
Back-translation for discovering distant protein homologies in the presence of frameshift mutations
Auteur(s) :
Gîrdea, Marta [Auteur correspondant]
Laboratoire d'Informatique Fondamentale de Lille [LIFL]
Algorithms for large scale sequence analysis [SEQUOIA2]
Noé, Laurent [Auteur correspondant]
Algorithms for large scale sequence analysis [SEQUOIA2]
Laboratoire d'Informatique Fondamentale de Lille [LIFL]
Kucherov, Gregory [Auteur correspondant]
Algorithms for large scale sequence analysis [SEQUOIA2]
Interdisciplinary Scientific Center Poncelet [ISCP]
Laboratoire d'Informatique Fondamentale de Lille [LIFL]
Laboratoire d'Informatique Fondamentale de Lille [LIFL]
Algorithms for large scale sequence analysis [SEQUOIA2]
Noé, Laurent [Auteur correspondant]

Algorithms for large scale sequence analysis [SEQUOIA2]
Laboratoire d'Informatique Fondamentale de Lille [LIFL]
Kucherov, Gregory [Auteur correspondant]
Algorithms for large scale sequence analysis [SEQUOIA2]
Interdisciplinary Scientific Center Poncelet [ISCP]
Laboratoire d'Informatique Fondamentale de Lille [LIFL]
Titre de la revue :
Algorithms for Molecular Biology
Pagination :
6
Éditeur :
BioMed Central
Date de publication :
2010
ISSN :
1748-7188
Discipline(s) HAL :
Sciences du Vivant [q-bio]/Biochimie, Biologie Moléculaire/Biologie moléculaire
Informatique [cs]/Bibliothèque électronique [cs.DL]
Informatique [cs]/Bibliothèque électronique [cs.DL]
Résumé en anglais : [en]
Background<br />Frameshift mutations in protein-coding DNA sequences produce a drastic change in the resulting protein sequence, which prevents classic protein alignment methods from revealing the proteins' common origin. ...
Lire la suite >Background<br />Frameshift mutations in protein-coding DNA sequences produce a drastic change in the resulting protein sequence, which prevents classic protein alignment methods from revealing the proteins' common origin. Moreover, when a large number of substitutions are additionally involved in the divergence, the homology detection becomes difficult even at the DNA level.<br />Results<br />We developed a novel method to infer distant homology relations of two proteins, that accounts for frameshift and point mutations that may have affected the coding sequences. We design a dynamic programming alignment algorithm over memory-efficient graph representations of the complete set of putative DNA sequences of each protein, with the goal of determining the two putative DNA sequences which have the best scoring alignment under a powerful scoring system designed to reflect the most probable evolutionary process. Our implementation is freely available at http://bioinfo.lifl.fr/path/.<br />Conclusions<br />Our approach allows to uncover evolutionary information that is not captured by traditional alignment methods, which is confirmed by biologically significant examples.Lire moins >
Lire la suite >Background<br />Frameshift mutations in protein-coding DNA sequences produce a drastic change in the resulting protein sequence, which prevents classic protein alignment methods from revealing the proteins' common origin. Moreover, when a large number of substitutions are additionally involved in the divergence, the homology detection becomes difficult even at the DNA level.<br />Results<br />We developed a novel method to infer distant homology relations of two proteins, that accounts for frameshift and point mutations that may have affected the coding sequences. We design a dynamic programming alignment algorithm over memory-efficient graph representations of the complete set of putative DNA sequences of each protein, with the goal of determining the two putative DNA sequences which have the best scoring alignment under a powerful scoring system designed to reflect the most probable evolutionary process. Our implementation is freely available at http://bioinfo.lifl.fr/path/.<br />Conclusions<br />Our approach allows to uncover evolutionary information that is not captured by traditional alignment methods, which is confirmed by biologically significant examples.Lire moins >
Langue :
Anglais
Comité de lecture :
Oui
Audience :
Internationale
Vulgarisation :
Non
Collections :
Source :
Fichiers
- https://hal.inria.fr/hal-00784444/document
- Accès libre
- Accéder au document
- https://hal.inria.fr/hal-00784444/file/1748-7188-5-6.xml
- Accès libre
- Accéder au document
- https://almob.biomedcentral.com/track/pdf/10.1186/1748-7188-5-6
- Accès libre
- Accéder au document
- https://hal.inria.fr/hal-00784444/document
- Accès libre
- Accéder au document
- document
- Accès libre
- Accéder au document
- 1748-7188-5-6.pdf
- Accès libre
- Accéder au document
- 1748-7188-5-6.xml
- Accès libre
- Accéder au document
- 1748-7188-5-6
- Accès libre
- Accéder au document
- document
- Accès libre
- Accéder au document
- 1748-7188-5-6.pdf
- Accès libre
- Accéder au document
- 1748-7188-5-6.xml
- Accès libre
- Accéder au document