Introducing a Clustering Step in a Consensus ...
Type de document :
Article dans une revue scientifique
PMID :
URL permanente :
Titre :
Introducing a Clustering Step in a Consensus Approach for the Scoring of Protein-Protein Docking Models
Auteur(s) :
Chermak, Edrisse [Auteur]
King Abdullah University of Science and Technology [Saudi Arabia] [KAUST]
De Donato, Renato [Auteur]
King Abdullah University of Science and Technology [Saudi Arabia] [KAUST]
Lensink, Marc [Auteur]
Unité de Glycobiologie Structurale et Fonctionnelle - UMR 8576 [UGSF]
Petta, Andrea [Auteur]
Università degli Studi di Salerno = University of Salerno [UNISA]
Serra, Luigi [Auteur]
Università degli Studi di Salerno = University of Salerno [UNISA]
Scarano, Vittorio [Auteur]
Università degli Studi di Salerno = University of Salerno [UNISA]
Cavallo, Luigi [Auteur]
King Abdullah University of Science and Technology [Saudi Arabia] [KAUST]
Oliva, Romina [Auteur]
Università degli Studi di Napoli “Parthenope” = University of Naples [PARTHENOPE]
King Abdullah University of Science and Technology [Saudi Arabia] [KAUST]
De Donato, Renato [Auteur]
King Abdullah University of Science and Technology [Saudi Arabia] [KAUST]
Lensink, Marc [Auteur]
Unité de Glycobiologie Structurale et Fonctionnelle - UMR 8576 [UGSF]
Petta, Andrea [Auteur]
Università degli Studi di Salerno = University of Salerno [UNISA]
Serra, Luigi [Auteur]
Università degli Studi di Salerno = University of Salerno [UNISA]
Scarano, Vittorio [Auteur]
Università degli Studi di Salerno = University of Salerno [UNISA]
Cavallo, Luigi [Auteur]
King Abdullah University of Science and Technology [Saudi Arabia] [KAUST]
Oliva, Romina [Auteur]
Università degli Studi di Napoli “Parthenope” = University of Naples [PARTHENOPE]
Titre de la revue :
PLoS One
Nom court de la revue :
PLoS ONE
Numéro :
11
Pagination :
e0166460
Date de publication :
2016-11-15
ISSN :
1932-6203
Mot(s)-clé(s) en anglais :
Protein Structure, Secondary
Consensus
Protein Interaction Mapping
Proteins
Algorithms
Databases, Protein
Protein Binding
Molecular Docking Simulation
Software
Protein Interaction Domains and Motifs
Binding Sites
Research Design
Cluster Analysis
Consensus
Protein Interaction Mapping
Proteins
Algorithms
Databases, Protein
Protein Binding
Molecular Docking Simulation
Software
Protein Interaction Domains and Motifs
Binding Sites
Research Design
Cluster Analysis
Discipline(s) HAL :
Chimie/Chimie théorique et/ou physique
Résumé en anglais : [en]
Correctly scoring protein-protein docking models to single out native-like ones is an open challenge. It is also an object of assessment in CAPRI (Critical Assessment of PRedicted Interactions), the community-wide blind ...
Lire la suite >Correctly scoring protein-protein docking models to single out native-like ones is an open challenge. It is also an object of assessment in CAPRI (Critical Assessment of PRedicted Interactions), the community-wide blind docking experiment. We introduced in the field the first pure consensus method, CONSRANK, which ranks models based on their ability to match the most conserved contacts in the ensemble they belong to. In CAPRI, scorers are asked to evaluate a set of available models and select the top ten ones, based on their own scoring approach. Scorers' performance is ranked based on the number of targets/interfaces for which they could provide at least one correct solution. In such terms, blind testing in CAPRI Round 30 (a joint prediction round with CASP11) has shown that critical cases for CONSRANK are represented by targets showing multiple interfaces or for which only a very small number of correct solutions are available. To address these challenging cases, CONSRANK has now been modified to include a contact-based clustering of the models as a preliminary step of the scoring process. We used an agglomerative hierarchical clustering based on the number of common inter-residue contacts within the models. Two criteria, with different thresholds, were explored in the cluster generation, setting either the number of common contacts or of total clusters. For each clustering approach, after selecting the top (most populated) ten clusters, CONSRANK was run on these clusters and the top-ranked model for each cluster was selected, in the limit of 10 models per target. We have applied our modified scoring approach, Clust-CONSRANK, to SCORE_SET, a set of CAPRI scoring models made recently available by CAPRI assessors, and to the subset of homodimeric targets in CAPRI Round 30 for which CONSRANK failed to include a correct solution within the ten selected models. Results show that, for the challenging cases, the clustering step typically enriches the ten top ranked models in native-like solutions. The best performing clustering approaches we tested indeed lead to more than double the number of cases for which at least one correct solution can be included within the top ten ranked models.Lire moins >
Lire la suite >Correctly scoring protein-protein docking models to single out native-like ones is an open challenge. It is also an object of assessment in CAPRI (Critical Assessment of PRedicted Interactions), the community-wide blind docking experiment. We introduced in the field the first pure consensus method, CONSRANK, which ranks models based on their ability to match the most conserved contacts in the ensemble they belong to. In CAPRI, scorers are asked to evaluate a set of available models and select the top ten ones, based on their own scoring approach. Scorers' performance is ranked based on the number of targets/interfaces for which they could provide at least one correct solution. In such terms, blind testing in CAPRI Round 30 (a joint prediction round with CASP11) has shown that critical cases for CONSRANK are represented by targets showing multiple interfaces or for which only a very small number of correct solutions are available. To address these challenging cases, CONSRANK has now been modified to include a contact-based clustering of the models as a preliminary step of the scoring process. We used an agglomerative hierarchical clustering based on the number of common inter-residue contacts within the models. Two criteria, with different thresholds, were explored in the cluster generation, setting either the number of common contacts or of total clusters. For each clustering approach, after selecting the top (most populated) ten clusters, CONSRANK was run on these clusters and the top-ranked model for each cluster was selected, in the limit of 10 models per target. We have applied our modified scoring approach, Clust-CONSRANK, to SCORE_SET, a set of CAPRI scoring models made recently available by CAPRI assessors, and to the subset of homodimeric targets in CAPRI Round 30 for which CONSRANK failed to include a correct solution within the ten selected models. Results show that, for the challenging cases, the clustering step typically enriches the ten top ranked models in native-like solutions. The best performing clustering approaches we tested indeed lead to more than double the number of cases for which at least one correct solution can be included within the top ten ranked models.Lire moins >
Langue :
Anglais
Audience :
Non spécifiée
Établissement(s) :
CNRS
Université de Lille
Université de Lille
Équipe(s) de recherche :
Computational Molecular Systems Biology
Date de dépôt :
2020-02-12T15:11:20Z
2021-03-18T09:11:40Z
2021-03-18T09:11:40Z
Fichiers
- pone.0166460.pdf
- Version éditeur
- Accès libre
- Accéder au document