A novel multi-objectivisation approach for ...
Document type :
Communication dans un congrès avec actes
Title :
A novel multi-objectivisation approach for optimising the protein inverse folding problem
Author(s) :
Nielsen, Sune Steinbjorn [Auteur]
University of Luxembourg [Luxembourg]
Danoy, Gregoire [Auteur]
Jurkowski, Wiktor [Auteur]
The Genome Analysis Centre [TGAC]
Laredo, Juan Luis Jiménez [Auteur]
Laboratoire d'Informatique, de Traitement de l'Information et des Systèmes [LITIS]
Schneider, Reinhard [Auteur]
University of Luxembourg [Luxembourg]
Talbi, El-Ghazali [Auteur]
Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 [CRIStAL]
Parallel Cooperative Multi-criteria Optimization [DOLPHIN]
Bouvry, Pascal [Auteur]
University of Luxembourg [Luxembourg]
University of Luxembourg [Luxembourg]
Danoy, Gregoire [Auteur]
Jurkowski, Wiktor [Auteur]
The Genome Analysis Centre [TGAC]
Laredo, Juan Luis Jiménez [Auteur]
Laboratoire d'Informatique, de Traitement de l'Information et des Systèmes [LITIS]
Schneider, Reinhard [Auteur]
University of Luxembourg [Luxembourg]
Talbi, El-Ghazali [Auteur]

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189 [CRIStAL]
Parallel Cooperative Multi-criteria Optimization [DOLPHIN]
Bouvry, Pascal [Auteur]
University of Luxembourg [Luxembourg]
Conference title :
EvoStar‘2015 8th European Conf. on the Applications of Evolutionary Computation
City :
Copenhagen
Country :
Danemark
Start date of the conference :
2015
Journal title :
Lecture Notes in Computer Science
HAL domain(s) :
Informatique [cs]/Recherche opérationnelle [cs.RO]
English abstract : [en]
In biology, the subject of protein structure prediction is of continued interest, not only to chart the molecular map of the living cell, but also to design proteins of new functions. The Inverse Folding Problem (IFP) is ...
Show more >In biology, the subject of protein structure prediction is of continued interest, not only to chart the molecular map of the living cell, but also to design proteins of new functions. The Inverse Folding Problem (IFP) is in itself an important research problem, but also at the heart of most rational protein design approaches. In brief, the IFP consists in finding sequences that will fold into a given structure, rather than determining the structure for a given sequence - as in conventional structure prediction. In this work we present a Multi Objective Genetic Algorithm (MOGA) using the diversity-as-objective (DAO) variant of multi-objectivisation, to optimise secondary structure similarity and sequence diversity at the same time, hence pushing the search farther into wide-spread areas of the sequence solution-space. To control the high diversity generated by the DAO approach, we add a novel Quantile Constraint (QC) mechanism to discard an adjustable worst quantile of the population. This DAO-QC approach can efficiently emphasise exploitation rather than exploration to a selectable degree achieving a trade-off producing both better and more diverse sequences than the standard Genetic Algorithm (GA). To validate the final results, a subset of the best sequences was selected for tertiary structure prediction. The super-positioning with the original protein structure demonstrated that meaningful sequences are generated underlining the potential of this work.Show less >
Show more >In biology, the subject of protein structure prediction is of continued interest, not only to chart the molecular map of the living cell, but also to design proteins of new functions. The Inverse Folding Problem (IFP) is in itself an important research problem, but also at the heart of most rational protein design approaches. In brief, the IFP consists in finding sequences that will fold into a given structure, rather than determining the structure for a given sequence - as in conventional structure prediction. In this work we present a Multi Objective Genetic Algorithm (MOGA) using the diversity-as-objective (DAO) variant of multi-objectivisation, to optimise secondary structure similarity and sequence diversity at the same time, hence pushing the search farther into wide-spread areas of the sequence solution-space. To control the high diversity generated by the DAO approach, we add a novel Quantile Constraint (QC) mechanism to discard an adjustable worst quantile of the population. This DAO-QC approach can efficiently emphasise exploitation rather than exploration to a selectable degree achieving a trade-off producing both better and more diverse sequences than the standard Genetic Algorithm (GA). To validate the final results, a subset of the best sequences was selected for tertiary structure prediction. The super-positioning with the original protein structure demonstrated that meaningful sequences are generated underlining the potential of this work.Show less >
Language :
Anglais
Peer reviewed article :
Oui
Audience :
Internationale
Popular science :
Non
Collections :
Source :