Document type:
Conference paper with proceedings
Title:
Improved Sensitivity And Reliability Of Anchor Based Genome Alignment
Author(s):
Uricaru, Raluca [Author]
Méthodes et Algorithmes pour la Bioinformatique [MAB]
Michotey, Célia [Author]
Unité Mathématique Informatique et Génome [MIG]
Noé, Laurent [Author]
Laboratoire d'Informatique Fondamentale de Lille [LIFL]
Sequential Learning [SEQUOIA]
Chiapello, Hélène [Author]
Unité Mathématique Informatique et Génome [MIG]
Rivals, Eric [Corresponding author]
Méthodes et Algorithmes pour la Bioinformatique [MAB]
Editor(s) or scientific director(s):
Eric Rivals
Irena Rusu
Title of the scientific event:
JOBIM 2009 - 10es Journées Ouvertes en Biologie, Informatique et Mathématiques
City:
Nantes
Country:
France
Start date of the scientific event:
2009-06-09
Publication date:
2009-06-09
English keyword(s):
Spaced seeds
Anchor based strategy
Global genome alignment
HAL discipline(s):
Computer Science [cs]/Bioinformatics [q-bio.QM]
English abstract: [en]
Whole genome alignment is a challenging problem in computational comparative genomics. It is essential for the functional annotation of genomes, the understanding of their evolution, and for phylogenomics. Many global alignment programs are heuristic variations on the anchor-based strategy, which relies on the initial detection of similarities and their selection in an ordered chain. Considering that alignment tools fail to align some pairs of bacterial strains, we investigate whether this is intrinsically due to the strategy or to a lack of sensitivity of the similarity detection method. For this, we implement and compare six programs based on three different detection methods (from exact matches to local alignments) on a large benchmark set. Our results suggest that the sensitivity of well-known methods, like MGA or Mauve, can be greatly improved in the case of divergent genomes if one exploits spaced seeds at the detection phase. In other cases, such methods yield alignments that cover nearly the whole genome. Then, we focus on the global reliability of alignments: should an aligned pair of segments be included in the global genome alignment? We investigate this reliability according to both segment "alignability" and the inclusion of orthologs. Again, we provide evidence that for both close and divergent genomes, one of our programs, YH, achieves alignments with sometimes a lower coverage, but a higher inclusion of orthologs. It opens the way to the first reliable alignments for some highly divergent species like Buchnera aphidicola or Prochlorococcus marinus.
Language:
English
Peer-reviewed:
Yes
Audience:
National
Popular science:
No
Files
- https://hal-lirmm.ccsd.cnrs.fr/lirmm-00407215/document
- Open access
- jobim26.pdf
- Open access