Improved Sensitivity And Reliability Of ...
Document type :
Communication dans un congrès avec actes
Title :
Improved Sensitivity And Reliability Of Anchor Based Genome Alignment
Author(s) :
Uricaru, Raluca [Auteur]
Méthodes et Algorithmes pour la Bioinformatique [MAB]
Michotey, Célia [Auteur]
Unité Mathématique Informatique et Génome [MIG]
Noé, Laurent [Auteur]
Laboratoire d'Informatique Fondamentale de Lille [LIFL]
Sequential Learning [SEQUOIA]
Chiapello, Hélène [Auteur]
Unité Mathématique Informatique et Génome [MIG]
Rivals, Eric [Auteur correspondant]
Méthodes et Algorithmes pour la Bioinformatique [MAB]
Méthodes et Algorithmes pour la Bioinformatique [MAB]
Michotey, Célia [Auteur]
Unité Mathématique Informatique et Génome [MIG]
Noé, Laurent [Auteur]
Laboratoire d'Informatique Fondamentale de Lille [LIFL]
Sequential Learning [SEQUOIA]
Chiapello, Hélène [Auteur]
Unité Mathématique Informatique et Génome [MIG]
Rivals, Eric [Auteur correspondant]
Méthodes et Algorithmes pour la Bioinformatique [MAB]
Scientific editor(s) :
Eric Rivals
Irena Rusu
Irena Rusu
Conference title :
JOBIM 2009 - 10es Journées Ouvertes en Biologie, Informatique et Mathématiques
City :
Nantes
Country :
France
Start date of the conference :
2009-06-09
Publication date :
2009-06-09
English keyword(s) :
Spaced seeds
Anchor based strategy
Global genome alignment
Anchor based strategy
Global genome alignment
HAL domain(s) :
Informatique [cs]/Bio-informatique [q-bio.QM]
English abstract : [en]
Whole genome alignment is a challenging problem in computational comparative genomics. It is essential for the functional annotation of genomes, the understanding of their evolution, and for phylogenomics. Many global ...
Show more >Whole genome alignment is a challenging problem in computational comparative genomics. It is essential for the functional annotation of genomes, the understanding of their evolution, and for phylogenomics. Many global alignment programs are heuristic variations on the anchor based strategy, which relies on the initial detection of similarities and their selection in an ordered chain. Considering that alignment tools fail to align some pairs of bacterial strains, we investigate whether this is intrinsically due to the strategy or to a lack of sensitivity of the similarity detection method. For this, we implement and compare 6 programs based on three different detection methods (from exact matches to local alignments) on a large benchmark set. Our results suggest that the sensitivity of well known methods, like MGA or Mauve, can be greatly improved in the case of divergent genomes if one exploits spaced seeds at the detection phase. In other cases, such methods yield alignments that cover nearly the whole genome. Then, we focus on global reliability of alignments: should an aligned pair of segments be included in the global genome alignment? We investigate this reliability according to both the segment "alignability" and to inclusion of orthologs. Again, we provide evidence that for both close and divergent genomes, one of our programs, YH, achieves alignments with sometimes a lower coverage, but a higher inclusion of orthologs. It opens the way to the first reliable alignments for some highly divergent species like Buchnera aphidicola or Prochlorococcus marinus.Show less >
Show more >Whole genome alignment is a challenging problem in computational comparative genomics. It is essential for the functional annotation of genomes, the understanding of their evolution, and for phylogenomics. Many global alignment programs are heuristic variations on the anchor based strategy, which relies on the initial detection of similarities and their selection in an ordered chain. Considering that alignment tools fail to align some pairs of bacterial strains, we investigate whether this is intrinsically due to the strategy or to a lack of sensitivity of the similarity detection method. For this, we implement and compare 6 programs based on three different detection methods (from exact matches to local alignments) on a large benchmark set. Our results suggest that the sensitivity of well known methods, like MGA or Mauve, can be greatly improved in the case of divergent genomes if one exploits spaced seeds at the detection phase. In other cases, such methods yield alignments that cover nearly the whole genome. Then, we focus on global reliability of alignments: should an aligned pair of segments be included in the global genome alignment? We investigate this reliability according to both the segment "alignability" and to inclusion of orthologs. Again, we provide evidence that for both close and divergent genomes, one of our programs, YH, achieves alignments with sometimes a lower coverage, but a higher inclusion of orthologs. It opens the way to the first reliable alignments for some highly divergent species like Buchnera aphidicola or Prochlorococcus marinus.Show less >
Language :
Anglais
Peer reviewed article :
Oui
Audience :
Nationale
Popular science :
Non
Collections :
Source :
Files
- https://hal-lirmm.ccsd.cnrs.fr/lirmm-00407215/document
- Open access
- Access the document
- https://hal-lirmm.ccsd.cnrs.fr/lirmm-00407215/document
- Open access
- Access the document
- https://hal-lirmm.ccsd.cnrs.fr/lirmm-00407215/document
- Open access
- Access the document
- https://hal-lirmm.ccsd.cnrs.fr/lirmm-00407215/document
- Open access
- Access the document
- document
- Open access
- Access the document
- jobim26.pdf
- Open access
- Access the document