logo-InriaRennes logo-Irisa

Structural fragments: comparison, predictability from sequence and application to identification of viral proteins

Clovis Galiez

équipe-projet DYLISS

Soutenance de thèse - Mardi 8 décembre 2015

L'exposé [37:17 mn]
img-logoPdfsmall Les transparents
Le Jury
index vidéo des soutenances de thèse


Cliquer sur l'image pour lancer la vidéo (fichier téléchargeable (MP4) img-flecheHAUT

Les membres du Jury:

Anne-Claude CAMPROUX, professeure à l'Université Paris-Diderot : rapporteur
Dave RITCHIE, directeur de Recherche chez Inria Nancy : rapporteur
Rumen ANDONOV, professeur à l’Université de Rennes 1 : examinateur
Marc SEBBAN, professeur à l'Université de Saint-Étienne : examinateur
Jacques NICOLAS, directeur de Recherche chez Inria Rennes : directeur de thèse
François COSTE, chargé de Recherche chez Inria Rennes : encadrant de thèse



This thesis investigates the local characterization of protein families at both structural and sequential level. A formal framework is introduced to describe local relationships between the primary and tertiary structure of proteins.

Building on this formalism, we introduce contact fragments (CF) as portions of protein structure that conciliate spatial locality together with sequential neighborhood. We show that the predictability of CF from the sequence is better than that of contiguous fragments and of structurally distant pairs of fragments. In order to structurally compare CF, we introduce ASD, a novel alignment-free dissimilarity based on Fourier transform of the matrix of inter-atomic distances. This measure respects triangular inequality while being tolerant to sequence shifts and indels. We show that ASD can be used for standard fragments comparison and outperforms classical scores on practical experiments such that unsupervised classification and structural mining.

Ultimately, by integrating the identification of CF from the sequence into a statistical machine learning framework, we developed VIRALpro, a tool that enables the detection of sequences of viral structural proteins.


© 2015 Pôle audiovisuel Inria Rennes- Bretagne Atlantique