DOCTORAT DE L'UNIVERSITÉ DE TOULOUSE
Université Toulouse III Paul Sabatier (UT3 Paul Sabatier)
Specialty: Systèmes Automatiques
Lyamine Hedjazi
Thursday, 8 December 2011
Outil d'aide au diagnostic du cancer à partir d'extraction d'informations issues de bases de données et d'analyses par biopuces
École doctorale: Systèmes (EDSYS)
Research unit: LAAS-CNRS
Thesis supervisors: Florence Dalenc and Marie-Véronique Le Lann
Jury: Sylvie Galichet, Sylvie Charbonnier, Isabelle Bloch, Boutaib Dahhou, Gilles Favre, Yijun Sun, Joseph Aguilar-Martin (invited member)

Acknowledgments

The work presented in this thesis was carried out at the Laboratoire d'Analyse et d'Architecture des Systèmes of the Centre National de la Recherche Scientifique (LAAS-CNRS), within the Diagnostic, Supervision et Conduite (DISCO) group. I therefore wish to thank the successive directors of LAAS, Raja CHATILA, Jean-Louis SANCHEZ and Jean ARLAT, for welcoming me to LAAS-CNRS. I also thank Louise TRAVE-MASSUYES, head of the DISCO group, for welcoming me into her research group during these years of doctoral work. I express all my gratitude to my thesis supervisor, Marie-Véronique LE LANN, for supervising my research; this thesis owes her a great deal. I thank her for her constant and effective availability, her support and her encouragement. My gratitude also goes to my thesis co-supervisor, Florence DALENC, for her precious advice, her patience and the confidence she placed in me to carry this research through to completion. I also wish to thank Joseph AGUILAR-MARTIN for his remarks, his advice, his availability and his support throughout this thesis. My thanks also go to Tatiana KEMPOWSKY-HAMON for the help she provided in carrying out this work. Reading and judging a thesis is no easy task.
I would therefore particularly like to express my gratitude to all the members of my thesis jury:
• Boutaib DAHHOU, Professor at Université Paul Sabatier, for presiding over the jury.
• Sylvie GALICHET, Professor at Université de Savoie, and Sylvie CHARBONNIER, Maître de Conférences at Université Joseph Fourier, for doing me the honor of accepting the demanding task of reviewing this thesis.
• Isabelle BLOCH, Professor at Télécom ParisTech, and Gilles FAVRE, Professor and hospital practitioner at Université Paul Sabatier, for agreeing to judge my work and for their pertinent remarks.
• Yijun SUN, Assistant Scientist (University of Florida), for his insightful reading of the manuscript and for hosting me in his laboratory (ICBR) during a research stay.
A BIG thank-you to my family, who have supported me for 28 years, from near and far. Finally, I cannot conclude without thanking my friends for their unconditional support.

Abstract

Cancer is one of the most common causes of death in the world. Breast cancer is currently the most frequent of female cancers. Despite the significant improvements made in recent decades in cancer management, more accurate management tools are still needed to help physicians take the necessary treatment decisions, thereby reducing treatment-related adverse effects as well as expensive medical costs. This work addresses the use of machine learning techniques to develop such breast cancer management tools. Clinical factors, such as patient age and histo-pathological variables, are still the basis of day-to-day decision making in cancer management. However, with the emergence of high-throughput technology, gene expression profiling is gaining increasing attention as a means of building more accurate predictive tools for breast cancer.
Nevertheless, several challenges have to be faced in developing such tools, mainly: (1) the high dimensionality of data issued from microarray technology; (2) the low signal-to-noise ratio of microarray measurements; (3) the membership uncertainty of patients with respect to cancer groups; and (4) the heterogeneous (or mixed-type) data usually present in clinical datasets. In this work we propose several approaches to deal appropriately with these challenges. A first approach addresses the problem of high data dimensionality by exploiting the capabilities of ℓ1-norm learning to design an embedded feature selection algorithm for SVM (ℓ1-SVM) based on a gradient descent technique. The main idea is to transform the initial constrained convex optimization problem into an unconstrained one through the use of an approximated loss function. A second approach handles all the challenges simultaneously and therefore allows the integration of several data sources (clinical, microarray, …) to build more accurate predictive tools. To this end, a unified principle for dealing with the data heterogeneity problem is proposed. This principle is based on mapping the different types of data from their initially heterogeneous spaces into a common space through an adequacy measure. To take membership uncertainty into account and increase model interpretability, this principle is formulated within a fuzzy logic framework. Besides, in order to alleviate the problem of high-level noise, a symbolic approach is proposed, suggesting the use of an interval representation to model the noisy measurements. Since all data are mapped into a common space, they can be processed in a unified way, whatever their initial type, for different data analysis purposes. Based on this principle, we designed in particular a supervised fuzzy feature weighting approach. The weighting process is mainly based on the definition of a membership margin for each sample.
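The first approach above — recasting the constrained ℓ1-SVM training problem as an unconstrained one via an approximated loss and minimizing it by plain gradient descent — can be sketched as follows. This is only an illustrative reconstruction under assumed choices (a Huber-style smooth hinge and the approximation |w| ≈ √(w² + ε)), not the algorithm of Chapter 3:

```python
import numpy as np

def smooth_hinge(z, delta=0.5):
    # Huber-style smooth approximation of the hinge loss max(0, z)
    out = np.where(z >= delta, z - delta / 2, 0.0)
    return np.where((z > 0) & (z < delta), z ** 2 / (2 * delta), out)

def smooth_hinge_grad(z, delta=0.5):
    # derivative of smooth_hinge with respect to z
    return np.where(z >= delta, 1.0, np.where(z > 0, z / delta, 0.0))

def l1_svm_gd(X, y, lam=0.1, eps=1e-6, lr=0.1, n_iter=2000):
    """Gradient descent on the unconstrained surrogate
         mean(smooth_hinge(1 - y * X @ w)) + lam * sum(sqrt(w_j**2 + eps)),
       where sqrt(w**2 + eps) smoothly approximates the l1 penalty |w|."""
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(n_iter):
        margins = 1.0 - y * (X @ w)
        coef = y * smooth_hinge_grad(margins)        # per-sample loss slope
        g_loss = -(X * coef[:, None]).mean(axis=0)   # data-fit gradient
        g_reg = lam * w / np.sqrt(w ** 2 + eps)      # smoothed-l1 gradient
        w -= lr * (g_loss + g_reg)
    return w
```

Because the smoothed ℓ1 term keeps pressing uninformative weights towards zero, the learned weight vector doubles as an embedded feature ranking: features whose weights stay near zero can be discarded.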
It then optimizes a membership-margin-based objective function using a classical optimization approach, thereby avoiding a combinatorial search. An extension of this approach to the unsupervised case is carried out to develop a weighted fuzzy rule-based clustering algorithm. The effectiveness of all the approaches has been assessed through extensive experimental studies and compared with well-known state-of-the-art methods. Finally, several breast cancer applications have been developed based on the proposed approaches. In particular, predictive and prognostic models were derived from microarray and/or clinical data and compared with existing genetic and clinical approaches.

Résumé

Cancer is one of the most frequent causes of death in the world. Breast cancer is currently the most widespread of female cancers. Despite the significant advances made in recent decades to improve cancer management, more accurate tools are still needed to help oncologists choose the treatment required for curative purposes or for preventing recurrence, while reducing the adverse effects of these treatments as well as their high costs. This work concerns the use of machine learning techniques to develop such breast cancer management tools. Clinical factors, such as patient age and histo-pathological variables, still form the daily basis of decision making in breast cancer management. However, with the emergence of high-throughput technology, gene expression profiling is attracting growing interest for building more accurate breast cancer prediction tools.
Nevertheless, several challenges must be met to develop such tools, mainly: (1) the dimensionality of the data issued from microarray technology, (2) the low signal-to-noise ratio of microarray measurements, (3) the uncertainty of patients' membership in the different cancer groups, and (4) the heterogeneity of the data usually present in clinical databases. In this work, we propose several approaches to overcome these challenges appropriately. A first approach tackles the problem of high data dimensionality by using so-called ℓ1-norm learning capabilities to design a feature selection algorithm embedded in the SVM (support vector machine) method, based on a gradient technique. A second approach makes it possible to handle all the problems simultaneously, in particular the integration of several data sources (clinical, DNA microarrays, …) to build more accurate predictive tools. To this end, a unified principle is proposed to overcome the data heterogeneity problem. To take membership uncertainty into account and increase model interpretability, this principle is formulated within the framework of fuzzy logic. Moreover, in order to mitigate the problem of high-level noise, a symbolic approach is proposed, suggesting the use of an interval representation to model the noisy measurements. Based on this principle, we designed in particular a supervised fuzzy feature weighting approach. The weighting process relies essentially on the definition of a membership margin for each sample. It optimizes a membership-margin-based objective function so as to avoid a combinatorial search.
An extension of this approach to the unsupervised case is carried out in order to develop an automatic clustering algorithm based on the weighting of fuzzy rules. The effectiveness of all the approaches has been assessed through extensive experimental studies and compared with well-known state-of-the-art methods. Finally, a last part is devoted to applications of the proposed approaches in the breast cancer domain. In particular, predictive and prognostic models were extracted from DNA microarray and/or clinical data, and their performance was compared with that of existing genetic and clinical approaches.

Contents

Introduction ................................................................................ 1
1. Cancer Management and Treatment ................................................ 7
1.1 Cancer detection and diagnosis .................................................... 8
1.2 Cancer prognosis ...................................................................... 9
1.3 Systemic treatment responsiveness prediction ............................... 13
1.4 Conclusion ............................................................................. 16
2. Machine Learning for Cancer Management and Treatment .................. 19
2.1 Supervised classification ........................................................... 21
2.1.1 Artificial neural networks ....................................................... 21
2.1.2 Decision trees ...................................................................... 22
2.1.3 Discriminant analysis ............................................................ 23
2.1.4 k-nearest neighbor ................................................................ 24
2.1.5 Support vector machines ........................................................
24
2.2 Unsupervised classification (Clustering) ...................................... 25
2.2.1 Hierarchical clustering ........................................................... 25
2.2.2 Partitioning clustering ........................................................... 26
2.3 Feature selection ..................................................................... 28
2.3.1 Filter methods ...................................................................... 29
2.3.2 Wrapper methods ................................................................. 29
2.3.3 Hybrid methods .................................................................... 29
2.3.4 Embedded methods ............................................................... 30
2.4 Recent challenges in breast cancer management ........................... 30
2.4.1 Data heterogeneity ............................................................... 30
2.4.2 High feature-to-sample ratio .................................................. 32
2.4.3 Noise and uncertainty ........................................................... 32
2.5 Conclusion ............................................................................. 35
3. Embedded Feature Selection for SVM by Gradient Descent Methods .... 37
3.1 Gradient descent based method for solving ℓ1-regularized problems .. 38
3.2 Implementation details ............................................................. 42
3.2.1 Hybrid conjugate gradient ...................................................... 43
3.2.2 Computational complexity ......................................................
44
3.3 Numerical experiments ............................................................. 44
3.3.1 Experimental setup ............................................................... 45
3.3.2 Experimental results ............................................................. 46
3.4 Conclusion ............................................................................. 50
4. Towards a Unified Principle for Reasoning about Heterogeneous Data: A Fuzzy Logic Framework ... 51
4.1 Simultaneous mapping for single processing principle ..................... 52
4.2 Homogeneous spaces of features ................................................ 54
4.3 Membership functions .............................................................. 56
4.3.1 Quantitative type features ..................................................... 56
4.3.2 Interval type features ........................................................... 57
4.3.3 Qualitative type features ....................................................... 59
4.4 Common membership space ...................................................... 60
4.5 Conclusion ............................................................................. 61
5. Supervised Learning based on SMSP principle ................................. 63
5.1 Fuzzy rule-based classifier for mixed-type data ............................. 64
5.2 Weighted fuzzy rule-based classifier for mixed-type data ................ 65
5.3 Membership Margin .................................................................. 66
5.4 Membership margin based feature selection: Membas .....................
67
5.4.1 Fuzzy feature weight estimation .............................................. 68
5.4.2 Membas Algorithm ................................................................ 69
5.4.3 Membas for multiclass problems .............................................. 70
5.5 Experiments and comparisons .................................................... 71
5.5.1 Feature selection methods ...................................................... 71
5.5.2 Experimental setup ............................................................... 73
5.5.3 Experiments on low-dimensional datasets ................................. 74
5.5.4 Experiments on high-dimensional datasets ................................ 83
5.6 Conclusion ............................................................................. 86
6. Unsupervised Learning based on SMSP principle ............................. 89
6.1 Iterative membership function updating ...................................... 90
6.1.1 Quantitative type features ..................................................... 90
6.1.2 Interval type features ........................................................... 91
6.1.3 Qualitative type features ....................................................... 91
6.2 Online fuzzy clustering for mixed-type data ................................. 92
6.3 Online fuzzy feature weighting for heterogeneous data clustering .... 94
6.4 Experimental results ................................................................
99
6.4.2 Real data ............................................................................ 99
6.5 Conclusion ........................................................................... 103
7. Breast Cancer Applications ....................................................... 105
7.1 Cancer prognosis based on clinical data and/or microarray data ..... 105
7.1.1 Cancer prognosis application based on clinical data .................. 105
7.1.2 Cancer prognosis application based on microarray data ............. 110
7.1.3 Hybrid signature derivation by integrating clinical and microarray data for cancer prognosis ... 116
7.1.4 Symbolic gene selection to defy low signal-to-noise ratio for cancer prognosis ... 121
7.2 Systemic responsiveness prediction to neoadjuvant treatment in breast cancer patients ... 126
7.3 Conclusion ........................................................................... 131
Conclusion and future work .......................................................... 133
Glossary of cancer terms ............................................................. 137
Appendices ............................................................................... 141
References ............................................................................... 161

List of Figures

Figure 1.1. Breast cancer prognosis 10
Figure 1.2. Traditional prognostic and predictive tools for breast cancer 11
Figure 1.3. Adjuvant setting for prediction of treatment benefit 14
Figure 1.4. Neoadjuvant setting for prediction of treatment benefit 15
Figure 2.1.
Artificial Neural Network 22
Figure 2.2. Decision tree 23
Figure 2.3. Support Vector Machines 24
Figure 2.4. Hierarchical clustering 26
Figure 2.5. Partitioning clustering 27
Figure 2.6. The scatter plot of gene expression pairs: (a) experiment pair on the same sample, (b) experiment pair on two different samples 33
Figure 3.1. Running time of DGM-ℓ1SVM and LPNewton performed on eight benchmark datasets using different λ values 48
Figure 3.2. Precision of DGM-ℓ1SVM and LPNewton performed on eight benchmark datasets using different λ values 49
Figure 4.1. SMSP principle 54
Figure 5.1. Classification errors obtained by LAMDA on UCI datasets using Membas, I-Relief, Relief and Simba 79
Figure 5.2. Classification errors obtained by k-NN on UCI datasets using Membas, I-Relief, Relief and Simba 80
Figure 5.3. Classification errors obtained by SVM on UCI datasets using Membas, I-Relief, Relief and Simba 81
Figure 5.4. Feature weights obtained by Membas, I-Relief, Relief and Simba on the Heart dataset 82
Figure 5.5. Feature weights obtained by Membas, I-Relief, Relief and Simba on the Ljubljana dataset 82
Figure 5.6. Feature weights obtained by Membas, I-Relief, Relief and Simba on the Diabetes dataset 83
Figure 5.7. Classification errors obtained by LAMDA on DNA microarray datasets using Membas, I-Relief, Relief and Simba 85
Figure 5.8. Classification errors obtained by k-NN on DNA microarray datasets using Membas, I-Relief, Relief and Simba 85
Figure 5.9. Classification errors obtained by SVM on DNA microarray datasets using Membas, I-Relief, Relief and Simba 86
Figure 6.1. (a) Clustering results; (b) fuzzy feature weights 99
Figure 6.2. Fuzzy feature weights obtained by WFCA 102
Figure 7.1. (left) Feature weights by Membas; (center) feature weights by Simba; (right) dependency measure by NRS 107
Figure 7.2. Classification errors obtained by LAMDA on the Ljubljana dataset using Membas, NRS and Simba 107
Figure 7.3.
Classification errors obtained by k-NN on the Ljubljana dataset using Membas, NRS and Simba 107
Figure 7.4. Class prototypes obtained by clustering for the interval features "Age, Tumor size, Invaded nodes" 109
Figure 7.5. Class prototypes obtained by clustering for the qualitative feature "Ablation ganglia" 109
Figure 7.6. Class prototypes obtained by clustering for the interval feature "Irradiation" 109
Figure 7.7. Class prototypes obtained by clustering for the interval feature "Malignancy degree" 109
Figure 7.8. Class prototypes obtained by classification for the interval features "Age, Tumor size, Invaded nodes" 110
Figure 7.9. Class prototypes obtained by classification for the qualitative feature "Ablation ganglia" 110
Figure 7.10. Class prototypes obtained by clustering for the interval feature "Irradiation" 110
Figure 7.11. Class prototypes obtained by clustering for the interval feature "Malignancy degree" 110
Figure 7.12. ROC curves of the clinical, 20-gene and 70-gene signatures 112
Figure 7.13. Kaplan-Meier estimation of the probabilities of remaining metastasis-free for the good and poor prognosis groups (20-gene signature) 113
Figure 7.14. ROC curves of the hybrid, clinical and 70-gene signatures 118
Figure 7.15. Kaplan-Meier estimation of the probabilities of remaining metastasis-free for the good and poor prognosis groups (hybrid signature) 119
Figure 7.16. ROC curves of the GenSym, clinical, 20-gene and 70-gene approaches 123
Figure 7.17. Kaplan-Meier estimation of the probabilities of remaining metastasis-free for the good and poor prognosis groups (GenSym signature) 124
Figure 7.18. Marker weights obtained by Membas 127
Figure 7.19. Profiles of the negative and positive classes 130
Figure 7.20. ROC curves of the three approaches 130

List of Tables

Table 2.1. Cancer diagnosis dataset used for supervised classification 21
Table 3.1. Summary of datasets 46
Table 3.2. CPU time of the two algorithms performed on the eight datasets for all λ values 47
Table 3.3.
CPU time of the two algorithms performed on the eight datasets for all λ values 47
Table 4.1. Group of patterns characterized by three mixed-type features 56
Table 5.1. Summary of used datasets 74
Table 5.2. Optimal Testing Errors (%) and corresponding number of features on the ten datasets with LAMDA 75
Table 5.3. Optimal Testing Errors (%) and corresponding number of features on the ten datasets with k-NN 75
Table 5.4. Optimal Testing Errors (%) and corresponding number of features on the ten datasets with SVM 76
Table 6.1. Summary of used datasets 100
Table 6.2. Clustering error of the proposed and FCM approaches 101
Table 7.1. Optimal Testing Errors (%) and corresponding optimal number of factors on the Ljubljana dataset 108
Table 7.2. Clustering error for the Ljubljana dataset 109
Table 7.3. Classification performance using the 20-gene signature, 70-gene signature, all clinical markers, the St Gallen consensus and the NIH criteria 111
Table 7.4. Notation and description of the 20-gene signature 113
Table 7.5. Comparative results between the hybrid, clinical and genetic signatures 117
Table 7.6. Notation and description of the hybrid signature 119
Table 7.7. Comparative results between the GenSym, clinical and genetic signatures 122
Table 7.8. Notation and description of the GenSym signature 124
Table 7.9. List of ranked predictive factors obtained by Membas 128
Table 7.10. Comparative results between the 4-marker, 2-marker and all-data approaches 130

Introduction - Résumé

Breast cancer is currently the most frequent of female cancers. Worldwide, each year, more than 1 050 000 new cases are diagnosed and more than 400 000 deaths are caused by breast cancer. In France alone, it is expected that nearly 53 000 new breast cancer cases will be diagnosed and that 11 500 patients will die of breast cancer in 2011 (Institut de Veille Sanitaire, 2011).
Despite the significant advances made in recent decades to improve cancer management, more accurate diagnostic and prognostic tools are still needed to help oncologists choose the treatment required for curative purposes or for preventing recurrence. Breast cancer management can be summarized by three main issues: diagnosis, prognosis and prediction of therapy benefit. Although breast cancer diagnosis can be fully handled by medical imaging tools, prognosis and prediction of therapy benefit appear to be more difficult tasks. Indeed, because of the heterogeneity and complexity of cancer, patients with the same symptoms may have very different disease courses. Traditional approaches are based mainly on a small set of clinical and histo-pathological variables. However, these prognostic and predictive tools are far from perfect, and more accurate models are needed to improve breast cancer management. The emergence of high-throughput technologies in the last decade, such as microarray (DNA chip) technology, has made possible the simultaneous measurement of the expression of thousands of genes. These technologies have brought with them the hope of gaining new insights into cancer biology and of improving current cancer management tools. However, they have also brought serious challenges related to the intrinsic characteristics of the data produced, such as: (1) the high dimensionality of the data and (2) the noisy nature of the measurements. Measurement uncertainty is not, however, the only type of uncertainty faced when applying machine learning methods to real-world problems.
Because of the great complexity of breast cancer disease, a patient's tumor may indeed belong simultaneously to different molecular cancer groups, each with a certain degree of membership. Moreover, to avoid the problem of the small number of patients for whom information is available, it would be preferable to use all the available microarray databases. Nevertheless, this raises several problems, such as differences between the populations and the microarray technologies used, which require membership uncertainty to be taken into account in the decision process. Since traditional statistical methods are ill-suited to such problems, machine learning methods were chosen as a good alternative for overcoming these challenges. Recent studies have demonstrated the potential value of gene expression signatures in assessing the risk of post-surgical disease recurrence. However, these studies attempt to develop prognostic tools based on genetic markers to replace the existing clinical criteria, which suggests that each approach should be used independently. Besides the fact that proceeding in this way completely disregards the wealth of information contained in the clinical markers established over decades of cancer research, clinicians may face the critical situation in which a patient has a clinical pathological criterion that contradicts the outcome provided by the gene signature. A typical approach would then be to integrate both types of information (clinical and gene expression) into the decision-making process.
However, in addition to the previously stated challenges related mainly to microarray data, other difficulties, such as the data heterogeneity characterizing clinical data, must be confronted in order to integrate both types of information. The clinical factors used to describe a patient's state are indeed generally represented in different ways depending on the physician's perception. Consequently, what is really needed to improve current cancer management is the development of machine learning approaches capable of addressing all the problems stated above. To summarize, three main challenges must be faced: the first is related to the high dimensionality of the data, in particular those issued from microarray technology; the second is the problem of noise and uncertainty generally associated with the data; and the last is related to the presence of mixed-type data in clinical databases. It is this set of problems that we have addressed in this thesis, within a machine learning framework, with the aim of designing more accurate cancer management tools to assist physicians in their decisions.

Introduction

Cancer is one of the most common causes of death in the world. Due to the rapid increase in cancer cases, cancer will soon replace heart disease as the leading cause of death worldwide. Breast cancer is currently the most frequent of female cancers. Worldwide, each year, there are more than 1 050 000 newly diagnosed cases and more than 400 000 deaths caused by breast cancer. In France alone, it is expected that around 53 000 new breast cancer cases will be diagnosed and 11 500 patients will die from breast cancer in 2011 (Institut de Veille Sanitaire, 2011).
Despite the significant improvements made in recent decades in cancer management, more accurate cancer diagnosis and prognosis tools are still needed to help physicians take the necessary treatment decisions, thereby reducing treatment-related adverse effects as well as expensive medical costs. Breast cancer management can be summarized by three main issues: diagnosis, prognostication and prediction of therapy benefit. An early breast cancer diagnosis improves the chances of cure and may avoid distant metastasis development, i.e. the development of new tumors in different organs. A prognostic tool would enable physicians to forecast the likely course of the disease (e.g. relapse or remission) and therefore spare patients from unnecessary toxic anti-cancer treatments such as chemotherapy. A predictive tool, on the other hand, would enable the tumor's response to a particular treatment to be predicted, and therefore the optimal tailored treatment to be prescribed for each patient. Although breast cancer diagnosis can be fully assured by imaging modalities and computer-aided detection tools, breast cancer prognostication and prediction of therapy benefit seem to be more challenging tasks. Indeed, due to the high heterogeneity and complexity of cancer, patients with the same symptoms can have very different evolutions and outcomes. Traditional approaches are based mainly on a small set of clinical and histo-pathological variables (e.g. tumor size and lymph node status). However, these prognostic and predictive tools are far from perfect, and more accurate models are needed to improve breast cancer management. Clinical practitioners have rapidly grasped the urgent need for new, accurate tools, as well as for a good understanding of the biological mechanisms involved in breast cancer progression. The emergence of high-throughput technologies in the last decade, such as microarray technology, has made possible the simultaneous measurement of the expression of thousands of genes.
These technologies have carried with them the hope of gaining new insights into cancer biology and improving current tools for cancer management. Meanwhile, they have also brought serious challenges related to intrinsic characteristics of the resulting data, such as: (1) high data dimensionality (thousands of gene expressions for a small number of samples); and (2) the noisy nature of the measurements. Since traditional statistical methods are ill-suited to such problems, machine learning approaches have been adopted as a good alternative to overcome these difficulties. However, measurement uncertainty is not the only type of uncertainty faced by machine learning approaches in real-world problems. Due to the high complexity of breast cancer, a patient's tumor can belong simultaneously to several cancer groups with some degree of membership. Moreover, to alleviate the problem of small sample size, it would be preferable to use all available microarray datasets. Nevertheless, this raises several problems, such as differences among populations and the use of different microarray technologies, requiring membership uncertainty to be taken into account in the decision-making process. Recent studies have demonstrated the potential value of gene expression signatures in assessing the risk of post-surgical disease recurrence. However, these studies attempt to develop genetic-marker-based prognostic tools to replace the existing clinical criteria, suggesting that each approach should be used independently. Besides the fact that doing so ignores the rich information contained in clinical markers established over decades of cancer research, clinicians can face the critical situation where a patient has a clinico-pathological criterion in contradiction with the gene signature outcome. One typical approach would be to integrate both types of information (clinical and gene-expression) in the decision-making process.
However, in addition to the challenges stated previously, related mainly to microarray data, other dilemmas, such as data heterogeneity in clinical data, must be faced in order to integrate both types of information. The clinical features used to describe the patient's state are generally represented in different ways according to the physician's perception (one physician may, for example, record a patient's age as a quantitative value (e.g. age = 35) whereas another prefers a symbolic value (e.g. age < 35)). Therefore, what is really needed to improve current cancer management is to develop machine learning approaches capable of handling all the challenges stated above. To summarize, three main challenges must be faced: the first is related to the high dimensionality of the data, especially data issued from microarray technology; the second is the problem of the noise and uncertainties usually associated with both kinds of data; and the last is related to the presence of mixed-type data in routinely produced clinical datasets. Addressing these problems efficiently is urgently needed, given that in some cancer applications the three challenges may even be faced simultaneously (e.g. the integration of clinical and microarray data). Indeed, in order to improve the accuracy of current predictive tools, recent trends in bioinformatics and biomedicine are directed towards the integration of an increasing number of data sources. This thesis addresses these problems within a machine learning framework, with the aim of designing more accurate cancer management tools to help physicians in their decision-making process. This work is the result of a collaboration initiated four years ago between the DISCO group of LAAS and the Institut Claudius Regaud, first of all through a joint PhD scholarship obtained after competition from the Université Paul Sabatier («bourse dite du Président») and through participation in the project named ONCOMATE (labelled by the fondation INNABIOSANTE).
This project aimed to develop a novel technological platform for the detection of cancer marker proteins by combining three major technologies: molecular imprints of target marker proteins into sugar hybrid polymers, a label-free sensor chip based on the diffraction of light by nanoscale structures, and machine learning algorithms fed with the screening of a cancer tissue database with fully anonymized patient records. The manuscript is organized as follows. The first chapter provides a brief overview of the most important tasks in breast cancer management: cancer diagnosis, prognosis and prediction of treatment benefit. We briefly describe their evolution over decades of cancer research and their challenging aspects from a medical point of view. We explain their medical aspects and the approaches usually used to deal with them. The second chapter reviews the state of the art of machine learning in cancer research. We describe the three machine learning tasks most used in cancer management: supervised classification, clustering and feature selection. A few examples of the best-known approaches for each task are briefly described, highlighting their advantages and drawbacks. Some application examples of such approaches in breast cancer management are then provided. This chapter ends with a description of the recent challenges that have to be faced to improve cancer management and treatment. In particular, we give further details about the problems of data heterogeneity, high dimensionality, low signal-to-noise ratio and membership uncertainty. The third chapter addresses the problem of data dimensionality by taking advantage of ℓ1 learning capabilities. We propose in particular an embedded feature selection approach for the SVM problem using gradient descent techniques, without resorting to any dual formulation. The basic idea is the transformation of the initial convex SVM optimization problem into an unconstrained non-convex one.
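To make this ingredient concrete, the sketch below runs plain gradient descent on an ℓ1-penalized linear SVM objective in which the hinge loss is smoothed by a Huber-type approximation. It is only a minimal, hypothetical illustration of the general technique (synthetic data, a simple sign subgradient for the ℓ1 term, arbitrary step size and smoothing width h), not the algorithm developed in the thesis.

```python
import numpy as np

def huber_hinge(z, h=0.5):
    # Huber-smoothed hinge loss of the margin z = y * (w . x):
    # zero above 1+h, linear (1 - z) below 1-h, quadratic in between.
    return np.where(z > 1 + h, 0.0,
           np.where(z < 1 - h, 1.0 - z, (1 + h - z) ** 2 / (4 * h)))

def huber_hinge_grad(z, h=0.5):
    # Derivative of the smoothed loss with respect to the margin z.
    return np.where(z > 1 + h, 0.0,
           np.where(z < 1 - h, -1.0, -(1 + h - z) / (2 * h)))

def l1_svm_gd(X, y, lam=0.1, lr=0.01, n_iter=500, h=0.5):
    """Gradient descent on (1/n) sum_i L(y_i w.x_i) + lam * ||w||_1,
    using sign(w) as a subgradient of the l1 penalty (illustrative only)."""
    n, d = X.shape
    w = np.zeros(d)
    for _ in range(n_iter):
        z = y * (X @ w)
        grad = X.T @ (y * huber_hinge_grad(z, h)) / n + lam * np.sign(w)
        w -= lr * grad
    return w
```

On toy data where only the first feature is informative, the ℓ1 penalty keeps the uninformative weight close to zero while the informative one grows, which is the embedded-selection effect the chapter exploits.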
The non-differentiability of the hinge loss function is overcome by using a Huber loss approximation. We show that this approach guarantees the global optimality of the solution while exhibiting good computational efficiency compared to other approaches solving the same problem. Large-scale numerical experiments have been conducted to support these claims. Chapter four deals simultaneously with the problems of data heterogeneity and membership uncertainty. To this end, a unified principle, referred to as SMSP (Simultaneous Mapping for Single Processing), is introduced to cope with the problem of data heterogeneity within a fuzzy logic framework. This principle is based on a simultaneous mapping of data from initially heterogeneous spaces into a single homogeneous space using an appropriate measure of typicality (or membership). Once the heterogeneous data are represented in a unified space, a single processing step can be carried out for various analysis purposes, such as machine learning tasks. We consider the three most used types of features: (1) quantitative; (2) interval; and (3) qualitative. In chapter five, the problem of supervised learning based on the SMSP principle is addressed. A new feature weighting method is proposed for mixed-type and high-dimensional data, based on a membership margin, to improve the performance of fuzzy-rule-based classifiers. To this end, a weighted fuzzy rule concept is introduced and a membership-margin-based objective function is defined. A classical optimization approach is used to avoid a heuristic combinatorial search. Large-scale experiments have also been conducted to compare the proposed approach with some well-known feature selection approaches on three state-of-the-art classifiers. In chapter six, the problem of unsupervised learning based on the SMSP principle is considered. We propose a novel approach based on online feature weighting for the clustering of heterogeneous data.
The proposed algorithm is an extension of our supervised feature weighting algorithm. To cope with the problem of data heterogeneity, the SMSP principle is also extended here to reason in a unified way about heterogeneous data in an unsupervised framework. An extensive experimental study has then been performed on artificial and real-world problems to prove the effectiveness of the proposed approach. Finally, some breast cancer applications of the proposed approaches are presented in chapter seven. In particular, the work presented there develops: (1) cancer prognosis based only on clinical data; (2) the derivation of a 20-gene signature for cancer prognosis based on microarray data; (3) the derivation of a hybrid signature for cancer prognosis based on the integration of clinical and microarray data; (4) the derivation of a more robust prognostic signature (referred to as GenSym) based on a symbolic approach, modeling the different noises as symbolic intervals; and (5) the derivation of a 4-marker signature for the prediction of neoadjuvant treatment benefit in HER2-overexpressing breast cancer patients.

Chapter 1 (Summary): Cancer Management and Treatment

Cancer is one of the most common causes of death in the world. According to the last edition of the World Cancer Report (WCR) of the International Agency for Research on Cancer, due to the rapid increase in cancer cases, cancer will soon replace heart disease as the leading cause of death worldwide. The WCR projects that 12.4 million people will be diagnosed with some form of cancer each year and that 7.6 million people will die of it. The most common cancers in the world in terms of incidence were: lung (1.52 million cases), breast (1.29 million) and colorectal (1.15 million).
Because of its poor prognosis, lung cancer was also the most frequent cause of death (1.31 million), followed by stomach cancer (780 000 deaths) and liver cancer (699 000 deaths). In our work, we focus on breast cancer, one of the malignancies most frequently diagnosed in women. Cancer management strategies are urgently needed to reduce cancer morbidity and mortality and to improve the quality of life of patients suffering from this disease. Considerable research has been carried out in recent decades in the hope of bringing new insights into cancer biology and improving the approaches currently used for cancer management, in particular for breast cancer. Breast cancer management can be summarized in three main successive tasks: 1- early detection and efficient diagnosis of the cancer; 2- efficient prognostication to predict the risk of developing metastases (new tumors in different organs) without systematic treatment; 3- the choice of an optimal, personalized treatment according to the aggressiveness of the cancer by predicting the therapeutic benefit. The first chapter of this thesis describes each task and briefly traces its evolution over the decades of cancer research. We have tried to underline their challenging aspects from a medical point of view by explaining the questions of interest and the approaches usually used to address them. Despite the many attempts made in cancer research, it can be observed that the breast cancer diagnosis task is still based mainly on imaging detection tools (Hayat, 2008).
However, unlike cancer diagnosis, the tasks of prognosis and prediction of treatment benefit have undergone a true revolution over the last decades. The traditional approaches used to perform these two tasks were based essentially on the qualitative knowledge acquired over several decades of cancer research. This knowledge is generally formulated as rules based on certain clinical factors such as age, histological grade and hormone receptor status. Among these approaches we note the NIH index adopted in the United States (Eifel et al., 2001) and the St Gallen criteria in Europe (Goldhirsh et al., 2003). More sophisticated models have also been proposed, such as Adjuvant! (Olivotto et al., 2005) and the Nottingham Prognostic Index (NPI (Galea et al., 1992)) and its improved version (Belle et al., 2010). Nevertheless, these approaches fail to provide accurate cancer management. However, the recent introduction of new advanced technologies has shed some light on the biological processes underlying the great heterogeneity of breast cancer. In particular, microarray technology has strongly marked cancer research during the current century, opening the door to a tailored, personalized management of breast cancer based on the extraction of molecular genetic signatures. High-impact works include, but are not limited to, the Amsterdam (Van’t Veer et al., 2002) and Rotterdam (Wang et al., 2005a) signatures for the prognosis task, and the relapse-free survival prediction signature for the prediction task (Ma et al., 2004).
However, these significant technological advances have brought with them serious challenges related to the enormous amount of data produced by these technologies, and have also required a similar revolution in the approaches used to process these data. This is the subject of the next chapter, in which we focus on the analysis of one of the most widely used families of approaches (machine learning methods) for performing the three cancer management tasks. On this basis, we describe the recent challenges in this field, which constitute the problems addressed in this thesis.

CHAPTER 1: Cancer Management and Treatment

Cancer is one of the most common causes of death in the world. According to the last edition of the World Cancer Report (WCR) from the International Agency for Research on Cancer, due to the rapid increase in cancer cases, cancer will soon replace heart disease as the leading cause of death worldwide. The WCR projected that 12.4 million people will be diagnosed with some form of cancer each year and that 7.6 million people will die. The WCR said: “The global cancer burden doubled in the last 30 years of the 20th century, and it is estimated that this will double again between 2000 and 2020 and nearly triple by 2030”. According to the WCR, 26.4 million people per year may be diagnosed with cancer by 2030, with 17 million people dying from it. There will be a 1% increase in cancer incidence each year, with larger increases in China, Russia and India. The adoption of tobacco use and higher-fat diets, together with demographic changes, including a projected population increase of 38% in less-developed countries between 2008 and 2030, are the main reasons for the increase in cancer cases in these countries. The most common cancers in the world in terms of incidence were: lung (1.52 million cases), breast (1.29 million) and colorectal (1.15 million).
Because of its poor prognosis, lung cancer was also the most common cause of death (1.31 million), followed by stomach cancer (780 000 deaths) and liver cancer (699 000 deaths). We focus in our work on breast cancer, one of the most frequently diagnosed malignancies in women in the world. Cancer management strategies are urgently needed to reduce the morbidity and mortality from cancer and to improve the quality of life of cancer patients. Tremendous research work has been performed in recent decades in the hope of bringing new insights into cancer biology and improving current approaches for breast cancer management. Breast cancer management can be summarized in three main successive tasks:

• Early detection and efficient cancer diagnosis,

• Efficient prognostication to predict the risk of developing metastases (new tumors in different organs) without systematic treatment,

• Selection of an optimal and personalized treatment according to cancer aggressiveness by predicting the therapy benefit.

Usually, traditional clinical tools based on histo-pathological factors such as patient age and tumor size are used to perform these tasks. Recently, however, new advanced high-throughput technologies, such as gene expression profiling through microarrays, have been introduced extensively in this field. This chapter describes each task and briefly outlines their evolution over decades of cancer research. We try to highlight their challenging aspects from a medical point of view. We explain the medical questions of interest and the approaches usually used to deal with them.

1.1 Cancer detection and diagnosis

Early cancer detection plays a key role in decreasing death rates from cancer and achieving a better prognosis (Hayat, 2008).
Indeed, different sources (Institut National du Cancer, 2011; Association pour la Recherche sur le Cancer, 2011) show that treating breast cancer at an early stage of development can significantly increase the patient's survival chances. Moreover, early breast cancer detection increases the chances that conservative surgery can be carried out instead of radical mastectomy, the only solution in advanced-stage breast cancers (Haffty et al., 1991). However, the main aim of this task should be not only to detect the existence of the cancer but also to identify the cancer class among the pre-established classes and to discover new cancer subclasses. For decades, many techniques have been proposed to perform an accurate breast cancer diagnosis. The most widely used techniques are based on imaging detection tools (e.g. mammography, considered the most cost-effective method for detecting breast cancer (Hayat, 2008)). However, due to the complex structure of the breast, thousands of mammograms must be processed to detect a few cancers (Gallardo-Caballero et al., 2007). This task can be tedious and stressful, and can cause radiologist confusion leading to diagnosis errors (Hayat, 2008). Moreover, despite the availability and recommended use of mammography as a routine screening method for women older than 50 years of age, it is still inefficient and insufficient to identify the cancer class accurately (Antman and Shea, 1999; Hayat, 2008). For this reason, other techniques that could be used individually or in combination with existing modalities for cost-effective breast cancer screening have been investigated. In addition, there is a wide spectrum of cancer morphology, and many tumors are atypical or lack the morphologic features that are useful for differential diagnosis. Therefore, with the increasing need for accurate cancer detection, the search is on for reliable markers that will be clinically helpful in the diagnosis of small tumors.
To this end, a large number of blood tumor markers have been proposed for breast cancer detection, including CEA (Carcino-Embryonic Antigen) and ESR (Erythrocyte Sedimentation Rate) (Cheung et al., 2000; Li et al., 2002), but they have not been widely adopted in clinical practice. Moreover, due to the high heterogeneity of cancer, the currently accepted clinical diagnostic markers fall short of classifying the disease into subtypes, and there is a critical need to identify novel diagnostic markers (Golub et al., 1999). Golub and co-authors pointed out that the cancer classification task can be divided into two challenges: class discovery and class prediction. Class discovery refers to defining previously unrecognized tumor subtypes, whereas class prediction refers to the assignment of particular tumor samples to already-defined subtypes (or classes). Reliable markers are therefore required to gain new insights into cancer biology and can be clinically helpful in the diagnosis of small tumors. It has recently been established that cancer diseases, including breast cancer, result from the accumulation of mutations, chromosomal instabilities and epigenetic changes that together facilitate an increased rate of cellular evolution and damage, progressively impairing the cell's detailed and complex regulation of cell growth and death. This fact initially motivated cancer researchers to investigate the importance of one or only a few genes at a time in order to improve cancer detection and diagnosis (Matsumura and Tarin, 1992). Although hundreds of such studies have pointed out differences in the expression of one or a few genes, none of them has provided a comprehensive study of gene expression in cancer cells (Zhang et al., 1997; Ramaswamy et al., 2001).
Recent advances in high-throughput technologies, such as microarrays and mass spectrometry (see Appendix 1), have made it possible to answer such questions through the simultaneous analysis of the expression patterns of thousands of genes and proteins (Golub et al., 1999; Ramaswamy et al., 2001; Li et al., 2002). These technologies are considered promising for gaining new insights into cancer biology.

1.2 Cancer prognosis

After the diagnosis of breast cancer, the next important step is the prognosis, which aims to predict the survival of a patient, or her risk of developing metastases without treatment (Figure 1.1) (Haibe-Kains, 2009). Roughly speaking, prognosis attempts to accurately forecast the evolution or outcome of a specific situation (e.g. relapse or remission) using input information obtained from a concrete set of variables that potentially describe that situation (Gómez-Ruiz et al., 2004). Naturally, this task depends strongly on the diagnosis task presented previously, as an accurate diagnosis provides information about the likely evolution of the disease. Moreover, prognosis can also be extremely important because it assists oncologists, as described in the next subsection, in selecting the optimal treatment required for a breast cancer patient (chemo-, hormone- or other systematic therapies) and in deciding which patients can be treated with loco-regional treatment alone (Haibe-Kains, 2009).

Fig. 1.1. Breast cancer prognosis (timeline from diagnosis through breast surgery +/- radiotherapy, adjuvant therapy and 5-10 years of follow-up to the outcome: recurrence or remission).

Similarly to cancer diagnosis, many approaches have been proposed in the literature to perform cancer prognosis. For a long time, cancer prognosis was guided by the clinical and histo-pathological knowledge gained from many decades of cancer research.
In these approaches, the risk of recurrence is primarily determined by the age of the patient, nodal status, tumor size, histological grade, the expression status of the hormonal receptors, i.e. estrogen (ER) and progesterone (PgR), as quantified by immunohistochemistry (IHC), the status of the HER2 oncogene, vascular emboli, the proliferation index and the histologic type (Haibe-Kains, 2009) (see Glossary and Appendix 1 for definitions). Many cancer prognosis criteria based on these variables have been proposed to assist clinicians in their decision-making; among them we find the National Institutes of Health index in the USA (Eifel et al., 2001) and the St Gallen consensus criteria in Europe (Goldhirsh et al., 2003) (see Figure 1.2). However, using only one variable at a time (e.g. histological grade) has been found insufficient and not accurate enough (Perez et al., 2006). To improve prognosis accuracy, more sophisticated models based on a combination of these variables have also been proposed, such as multivariable outcome prediction models (e.g. Adjuvant! (Olivotto et al., 2005)), the Nottingham Prognostic Index (NPI (Galea et al., 1992)) and its improved version (Belle et al., 2010) (see Figure 1.2). However, prognosis accuracy is far from perfect, and more accurate models are needed before it will be possible to clearly identify whether a patient will relapse, especially for patients with early breast cancer (node-negative, i.e. nodal status equal to 0), in order to spare them from receiving unnecessary systematic therapy and to reduce the related expensive medical costs. It is reported that a third of breast cancer patients are over-treated, making them undergo the side effects of treatment in the short and long term. Conversely, a smaller number of patients are under-treated because their risk of distant recurrence is underestimated, and they are therefore wrongly spared from systemic adjuvant treatment.
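For concreteness, the NPI cited above (Galea et al., 1992) is commonly reported in the literature as a simple linear combination of tumor size, lymph node stage and histological grade, with cut-offs near 3.4 and 5.4 separating the good, moderate and poor prognosis groups. The sketch below follows that commonly quoted form; it is illustrative only, and the exact input coding and thresholds used clinically should be checked against the original references.

```python
def nottingham_prognostic_index(size_cm, node_stage, grade):
    """NPI as commonly quoted: 0.2 * tumor size (cm) + node stage + grade.

    size_cm    -- tumor size in centimetres
    node_stage -- lymph node stage: 1 (no nodes), 2 (1-3 nodes), 3 (>3 nodes)
    grade      -- histological grade: 1, 2 or 3
    """
    return 0.2 * size_cm + node_stage + grade

def npi_group(npi):
    # Commonly used cut-offs; exact thresholds vary between studies.
    if npi <= 3.4:
        return "good"
    elif npi <= 5.4:
        return "moderate"
    return "poor"
```

A 2 cm, node-negative, grade 1 tumor thus scores 0.2 * 2 + 1 + 1 = 2.4, falling in the good-prognosis group; this illustrates how such multivariable indices combine the individual factors that, taken alone, were found insufficient.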
Moreover, two patients with exactly the same clinical and pathological characteristics can have different outcomes. A more accurate prognosis could therefore avoid the adverse side effects of adjuvant therapies and their related high costs.

Fig. 1.2. Traditional prognostic and predictive tools for breast cancer (prognostic and/or predictive factors such as tumor size, age, ER, PgR, HER-2, lymph node status and histological grade feed tools such as the NPI, AOL and the NIH / St Gallen criteria).

To this aim, two protein biomarkers known as uPA/PAI-1 (respectively, Urokinase-type Plasminogen Activator and Plasminogen Activator Inhibitor type 1) were first shown to be effective in identifying subclasses of patients as a function of their recurrence risk (Janicke et al., 2001). A retrospective study has shown that these biomarkers have superior prognostic power to other classical factors (age, hormone receptors, grade). Meanwhile, recent advances in high-throughput technologies have also opened the door to new research directions in this field, aiming to achieve a more accurate cancer prognosis. Indeed, clinical investigators have found that these technologies can not only be useful for gaining new insights into cancer biology but can also be a powerful prognostic tool. Unlike traditional clinical variables, which are usually limited to a few, these technologies offer the advantage of providing simultaneously the differential expression of thousands of genes, with the aim of deriving prognosis models based only on a set of genetic markers. A first outstanding work in this direction was performed by the Netherlands Cancer Institute (NKI), which conducted a comprehensive study in order to derive a more accurate tool for early breast cancer prognosis (Van’t Veer et al., 2002). In this work, the Agilent microarray technology was used to extract a set of genes differentially expressed between two groups of patients having different survival outcomes.
The first group included patients who developed distant metastases within five years of diagnosis, whereas the second did not. In this study, a set of genes was identified including mainly genes involved in the cell cycle, invasion, metastasis and angiogenesis. This signature is known under the name “Amsterdam genomic signature” and makes it possible to classify node-negative breast cancer patients, with a tumor size inferior or equal to 5 cm (stage I or II) and aged less than 61, into either a high- or a low-risk group. A supplementary study was also performed on a new, large population of patients from the same institution, including node-negative, node-positive, treated and untreated breast cancers, to validate the predictive power of this signature (Van de Vijver et al., 2009). This signature was also compared to classical clinical criteria (i.e. NIH, St Gallen). In this study, the authors showed the superiority of this genetic signature over the NIH and St Gallen criteria in terms of the predictive power of patients' outcomes. In the conclusion of this work, it was pointed out that this predictive ability could spare a large number of patients from over-treatment or from unnecessary chemotherapy toxicity. In recent studies, many attempts have also been made in the same direction to identify new gene signatures. A gene-expression signature known as the Recurrence Score signature, including only 21 genes, has been derived, allowing the stratification of ER-positive and node-negative breast cancer patients receiving tamoxifen in the adjuvant setting to be refined (Paik et al., 2004). Three risk levels have been defined: low risk, intermediate risk and high risk. We also distinguish the signature known as the Rotterdam signature (Wang et al., 2005a), where 76 genes were identified for the same purpose as the signature designed by (Van’t Veer et al., 2002) for node-negative patients who did not receive a systematic treatment.
However, this study was performed using the Affymetrix technology and has been shown to better identify patients with poor prognosis compared to classical clinical criteria. Despite the major contribution of such retrospective studies in opening new directions for cancer practitioners, they still need to be validated prospectively in randomized trials to obtain a sufficient LOE (Level Of Evidence) and therefore be used in routine practice (Institut National du Cancer, 2009). A randomized clinical trial called MINDACT (for ‘Microarray In Node-negative Disease may Avoid ChemoTherapy’) is ongoing, comparing the predictive accuracy of the Amsterdam signature with clinical and pathological criteria such as Adjuvant! Online, to identify women with node-negative breast cancer with a low risk of relapse. Another randomized trial, called TAILORx, is also now under consideration to validate the Recurrence Score signature by putting it in competition with classical factors having a level of evidence LOE 1, such as ER, HER2 status and uPA/PAI-1. The results expected from these studies would attribute to these signatures the LOE 1 predictive power required for clinical implementation (Institut National du Cancer, 2009). Despite the potential of the studies presented above for breast cancer research, several critical reviews of such genomic approaches can be found in the literature (Reis-Filho et al., 2006; Koscielny, 2008). For instance, Reis-Filho et al. (2006) pointed out that clinicians may face the situation where a patient has a clinico-pathological criterion corresponding to a poor prognosis together with a good gene signature. One typical approach would be to integrate both types of information (clinical and gene-expression) in the decision-making process, which has recently been shown to be effective in improving the prognosis task (Gevaert et al., 2006; Sun et al., 2007a).
1.3 Systemic treatment responsiveness prediction

The prediction task aims to predict the response of a breast cancer patient to a treatment. In other words, for each patient, we need to decide which therapy will be the most effective. To this end, as in the case of cancer prognosis and diagnosis, this task also consists in identifying a set of markers that could predict the response of a given patient to a particular drug (predictive factors). This would spare patients from receiving unnecessary treatment and decrease the associated medical and financial costs. We can distinguish two settings for treatment responsiveness prediction in breast cancer: the adjuvant (Figure 1.3) and neo-adjuvant (Figure 1.4) settings (Mauri et al., 2005). In the last decade, systematic adjuvant treatment has usually been prescribed with the aim of decreasing the recurrence risk of breast cancer patients. An important consequence of such a procedure is over-treatment, resulting from the administration of adjuvant therapy to patients for whom surgery alone would be sufficient (Straver et al., 2009). This mainly exposes the patients to the adverse side effects of the treatment while increasing its associated cost. In this case, prediction is similar to the prognosis task as illustrated in Figure 1.1, except that a treatment is selected for each patient in the end. Precisely, in this setting we try to predict whether the administration of a particular adjuvant therapy to a patient after surgery will be beneficial after some years of follow-up (generally more than 5 years). However, although the response to a treatment in advanced breast cancer can be assessed by tumor measurement, it is still relatively difficult to characterize in the case of early-stage breast cancers after surgery (Chang et al., 2005).
An accepted practice in this case is to administer adjuvant chemotherapy even though we know that it is not beneficial for a significant number of patients (Chang et al., 2005).

Fig. 1.3. Adjuvant setting for prediction of treatment benefit

With respect to the neo-adjuvant setting, a biopsy of the breast tumor is first performed before the administration of the neo-adjuvant therapy (pre-operative therapy including chemotherapy and hormone therapy, Figure 1.4). Then the tumor is removed by surgery to assess the benefit from the treatment, such as a decrease in tumor size and axillary lymph node involvement. Indeed, although both settings (adjuvant and neoadjuvant) were reported equivalent in terms of survival and overall disease progression, neoadjuvant therapy was found to be a safe approach allowing mastectomy to be avoided in a significant number of women (Makris et al., 1998; Cleator et al., 2004; Mauri et al., 2005). The benefit from a treatment for patients in these cases is usually characterized in terms of pathological complete response (pCR), defined as the complete disappearance of cancer cells in the breast and lymph nodes. Even though the concern in this case is to analyze the response or resistance to the treatment without paying much attention to the survival issue, it has been pointed out that the response to some neoadjuvant therapies (e.g. chemotherapy) correlates closely with improved clinical outcome (Fisher et al., 1998). However, in both settings the identification of a set of biomarkers that accurately predicts the response to treatment is not an easy task. Indeed, after over 20 years of cancer research for new important markers, we still have very few biomarkers that accurately predict the response to particular therapies.
Mainly, there are two biomarkers currently used in day-to-day clinical practice: Hormone Receptors (HR: Estrogen Receptor ER and Progesterone Receptor PgR) and the HER2/ERBB2 receptor (Chang et al., 2005; Colozza et al., 2005). Hormone receptors are effective factors for the prediction of hormone therapy response, whereas HR-negative status is considered a powerful predictive factor of chemotherapy response in the neoadjuvant setting. HER2-positive status predicts patient responsiveness to anti-HER2 treatments. Although several attempts have also been made to identify additional biomarkers, they remain to date unconvincing, due especially to the huge heterogeneity of breast cancer (Konecny et al., 2004; Colozza et al., 2005).

Fig. 1.4. Neoadjuvant setting for prediction of treatment benefit

Similarly to cancer prognosis and diagnosis, these limitations have pushed cancer researchers to take advantage of genomic approaches to develop more accurate markers that predict the response to particular regimens. For the adjuvant setting, at least two gene signatures derived by gene expression profiling can be found in the literature. The first one concerns the prediction of relapse-free survival (RFS, see Glossary for details), developed by Ma et al. (2004), whereas the second is the 16-gene signature which can predict the risk of recurrence in patients receiving adjuvant tamoxifen (Wang et al., 2005a). Both signatures were derived using a set of patients treated with adjuvant hormone therapy, i.e. treated after surgery, which makes it possible to address the prognosis issue (appearance of metastases) as well as the prediction of response to treatment.
Another gene-expression signature also to be mentioned is the Recurrence Score signature, which includes only 21 genes for the prediction of hormone therapy responsiveness (Paik et al., 2004). Concerning the neo-adjuvant setting, Chang et al. (2003a) used gene expression profiling to derive a 92-gene signature that predicts the response to neoadjuvant docetaxel in primary breast cancer patients. A neoadjuvant approach was also assessed to analyze the change of gene expression during chemotherapy (Buchholz et al., 2002). Another neoadjuvant study reported a 74-gene signature using microarray data to predict the response to some therapies (Ayers et al., 2004). These encouraging results strongly suggest that microarray profiling has a promising future in optimal neoadjuvant treatment selection. Several works have been reported recently within the neoadjuvant setting framework (Lee et al., 2007; Straver et al., 2009). In (Straver et al., 2009), the predictive capacity of the 70-gene signature (Van’t Veer et al., 2002) was assessed on neoadjuvant chemotherapy treatment in breast cancer.

1.4 Conclusion

In this chapter we provided an overview of the main tasks in breast cancer management: diagnosis, prognostication and prediction of treatment benefit. We briefly described each task and its most important research works. Their challenging aspects have also been highlighted from a medical point of view. Despite the many attempts made in cancer research fields, the breast cancer diagnosis task is still based mainly on imaging detection tools. However, unlike cancer diagnosis, the prognosis and treatment response prediction tasks have undergone a real revolution over the last decades. Traditional approaches to both tasks have been based mainly on the qualitative knowledge gained over many decades of cancer research.
This knowledge is usually reformulated in the form of rules about some clinical factors, but falls short of providing accurate cancer management. However, advanced technologies have made it possible to gain some insight into the biological processes underlying the high heterogeneity of breast cancer. In particular, microarray technology has widely marked cancer research in the current century by opening the door to tailored and personalized management of breast cancer based on the derivation of molecular signatures. However, such advances have brought with them serious challenges related to the huge amount of data issued by these technologies, and have thereby also required a similar revolution in terms of approaches for processing these data. For that reason, in the next chapter we focus on reviewing one of the most widely used families of approaches (machine learning) for performing the three cancer management tasks. On this basis, we describe the recent challenges related to this field, which constitute the concerns of the present thesis.

CHAPTER 2 - Summary: Machine Learning Methods for Cancer Management and Treatment

Cancer management and the choice of its appropriate treatment have long been carried out on the basis of qualitative knowledge held by experts or by using various medical guidelines. However, cancer has proved to be a complex and very heterogeneous disease, which makes the qualitative approach insufficient and the decision-making process very complicated. As an example, the prognosis task involves several oncologists using different biomarkers and clinical factors. Usually, in such cases, many types of qualitative information are integrated by the participating clinicians to reach a reasonable decision about the prognosis.
This is not an easy task, even for the most skilled clinicians. If we add to this the increased need to explore the large amount of biological data now available (proteomic and genomic measurements), more efficient approaches have become indispensable to help physicians in their decisions. Recently, machine learning approaches have proved very effective in supporting decision-making by providing more accurate prediction and efficient classification models. The first use of this type of approach in the cancer field dates back about 25 years, with popular methods such as neural networks and decision trees (Simes, 1985; Maclin et al., 1991). With the introduction of high-throughput technology, resorting to more computationally intensive methods has become indispensable. In this chapter we describe the state of the art on the use of machine learning methods in the cancer field, highlighting their advantages and drawbacks. This use can be summarized in three main tasks:
• Classifying new patients into predefined cancer classes using a model obtained by learning, known as supervised classification.
• Grouping patients with similar properties into subgroups, known as unsupervised classification. The approaches used to perform this task can be divided into two categories: hierarchical and partition-based.
• Selecting relevant biomarkers using feature selection approaches, in either a supervised or an unsupervised context.
Some applications of these approaches to breast cancer management are then reported.
Despite their successful use in breast cancer management based on classical clinical factors, it has been observed that most of them fail to cope with the recent challenges brought by the introduction of data issued from advanced technologies. We can for instance mention the problem of overfitting in supervised classification methods, often due to the low sample-to-feature ratio (number of patients versus number of attributes). This requires resorting to feature selection methods, which have been widely studied and developed to overcome this problem. We briefly reviewed the considerable research work carried out in this direction. It was found, however, that feature selection is not only useful for reducing the dimensionality of the problem, but also allows major progress in acquiring new knowledge about cancer biology using gene expression profiles. Thanks to these approaches, tailored and personalized methods are now being developed, based on the extraction of several gene signatures, to improve the accuracy of cancer management. Finally, we described unsupervised learning approaches and their applications in breast cancer management, in particular through their use in identifying groups of co-expressed genes. The chapter ends with a description of the recent challenges that must be faced to improve cancer management and treatment. We mainly considered the problems of data heterogeneity, high dimensionality, low signal-to-noise ratio and membership uncertainty. Data heterogeneity is related to the daily use of mixed-type variables in the creation of databases, a common practice in many cancer problems.
Despite the considerable amount of work devoted to solving the problem of high data dimensionality, it is still considered an open research problem and one of the main challenges in statistical learning theory. The problem of low signal-to-noise ratio, meanwhile, is related to the reproducibility problem of high-throughput technologies (DNA microarrays, mass spectrometry), mainly due to variations in experimental and biological conditions. To the best of our knowledge, this problem has never been addressed by the machine learning community. We also noted that noise is not the only source of uncertainty in cancer data: the uncertainty of a tumor's membership in each cancer subtype is an evident reality, and it is gaining increasing attention in recent studies that use data collected with different technologies by different medical centers (Haibe-Kains et al., 2010).

CHAPTER 2 Machine Learning for Cancer Management and Treatment

For a long time, cancer management and treatment were performed based on qualitative expert knowledge held by individuals or using diverse medical guidelines. However, cancer has been shown to be a complex and very heterogeneous disease, which makes the qualitative approach insufficient and the decision-making process very complicated. Breast cancer diagnosis, for instance, is based on the analysis of thousands of mammograms issued by imaging detection tools. This important task is very complex and tiring, and can even lead radiologists to commit diagnosis errors. Furthermore, the prognosis task usually involves multiple physicians with different skills using different biomarkers and clinical factors.
Typically, in such cases, many types of qualitative information are integrated by the attending physicians, based on their own intuition, to come up with a reasonable decision about the prognosis. This is not an easy task, even for the most skilled clinicians. If we add to that the increased need to explore the large amount of biological data now available (proteomic and genomic measurements), more efficient approaches to help physicians in their day-to-day practice have become indispensable. Recently, machine learning has been shown to be very effective in helping physicians in their decision-making by constructing more accurate prediction and classification models. Machine learning is a branch of artificial intelligence that employs a variety of statistical, probabilistic and optimization techniques allowing computers to “learn” from past examples and to detect hard-to-discern patterns in large, noisy, heterogeneous or complex datasets (Baldi and Brunak, 2001; Cruz and Wishart, 2006). Although machine learning was originally closely related to statistics, it nowadays offers a powerful means to deal with statistically ill-posed problems such as the curse of dimensionality (small sample size combined with high feature dimensionality (Bellman, 1961)) and noisy measurements (Mitchell, 1997; Duda et al., 2001). For nearly 25 years, artificial neural networks (ANN) and decision trees (DTs) have been widely used for cancer detection and diagnosis (Simes, 1985; Maclin et al., 1991; Cicchetti, 1992). More recently, machine learning methods have also been used increasingly for cancer prognosis and treatment planning. Initially, machine learning approaches were used mainly to perform cancer prognosis and diagnosis, as explained in the previous chapter, based on some clinical and histo-pathological factors, including histological grade, tumor size and patient age (Cochran, 1997; Gómez-Ruiz et al., 2004).
With the development of high-throughput genomic (DNA microarrays, sequencing) and proteomic (protein chips, immuno-histology) technologies, physicians have found themselves faced with thousands of genetic, cellular and clinical markers. In this situation, where human intuition and traditional statistics fail, resorting to more computationally intensive methods such as machine learning approaches is unavoidable. This helps physicians to analyze and interpret data, and to gain new insights into cancer biology. To this end, machine learning approaches have recently seen widespread use in cancer research to cope with such complex experimental data for different purposes (diagnosis, prognosis and treatment planning) (Khan et al., 2001; Guyon et al., 2002). Generally, machine learning methods are used to analyze medical datasets organized in table form, containing a set of patients (individuals or patterns) described in terms of their properties (attributes, features, variables). The use of machine learning methods in cancer research can be summarized in three main tasks:
• Classifying new patients, based on trained models, into already-defined cancer classes, known as supervised classification within the machine learning community.
• Grouping patients having similar properties into subgroups, known as unsupervised classification or clustering within the machine learning community.
• Selecting relevant biomarkers using feature selection approaches, either in a supervised or an unsupervised context.
However, not every machine learning method is appropriate for every cancer research problem. For instance, some machine learning methods scale very well to the size of the data, while others do not. Likewise, some methods may have data requirements and assumptions that render them inappropriate for the problem under investigation. This is not necessarily a weakness of machine learning; it only highlights the attention that should be paid to choosing a suitable method for a particular problem (Cruz and Wishart, 2006).
This chapter describes each task and briefly outlines its associated challenging aspects in the bioinformatics context. We explain the medical questions of interest, the approaches usually used, and the state of bioinformatics research. This chapter ends with a description of the main challenges that have to be faced to improve cancer management and treatment.

2.1 Supervised classification

Classification is considered one of the fundamental problems in machine learning. Duda and Hart (2001) define it as the problem of assigning an element or instance to one of several pre-specified categories. The only available information is a set of patterns, each characterized by a set of features and assigned to a predefined class. Each pattern is classified based on a set of classification rules, which are often unknown in many real-life situations (Baldi and Brunak, 2001). As a simple example, we can cite the problem of breast cancer diagnosis as a supervised classification problem (Wolberg et al., 1994). The elements to be classified form a set of patients, as shown in Table 2.1.

Tab. 2.1 Cancer diagnosis dataset used for supervised classification.

ID number   Clump thickness   Uniformity of cell size   …   Mitoses   Class
842302      17.99             10.38                     …   0.11890   Malignant
842517      20.57             17.77                     …   0.08902   Malignant
…           …                 …                         …   …         …
926954      16.6              28.08                     …   0.78200   Malignant
927241      20.6              29.33                     …   0.12400   Malignant
92751       7.76              24.54                     …   0.07039   Benign

The attributes (features) of a given patient are around thirty variables computed from a digitized image of a fine needle aspirate (FNA) of a breast mass. They describe characteristics of the cell nuclei present in the image. The outcome of each patient, whether or not a breast cancer is diagnosed, represents its predefined class. This simple example has long been used to assess the performance of newly proposed machine learning approaches.
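To make the setting concrete, the sketch below classifies a new patient with a nearest-centroid rule, one of the simplest supervised classifiers. It is a minimal illustration in pure NumPy; the values are illustrative numbers loosely shaped like Table 2.1, not the real dataset.

```python
import numpy as np

# Illustrative toy values only (NOT the real diagnosis dataset)
X_train = np.array([[17.99, 10.38], [20.57, 17.77], [16.60, 28.08],
                    [7.76, 24.54], [8.20, 16.10]])   # two features per patient
y_train = np.array(["malignant", "malignant", "malignant",
                    "benign", "benign"])

def nearest_centroid_predict(x, X, y):
    """Assign x to the class whose mean feature vector (centroid) is closest."""
    classes = np.unique(y)
    centroids = np.array([X[y == c].mean(axis=0) for c in classes])
    return classes[np.argmin(np.linalg.norm(centroids - x, axis=1))]

new_patient = np.array([19.0, 15.0])
print(nearest_centroid_predict(new_patient, X_train, y_train))  # → malignant
```

The classification rule is learned from labeled examples (the class centroids) and then applied to unseen patients, which is exactly the supervised setting described above.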
Compared to other fields, oncology is possibly the area in which the most applications of machine learning have been performed (Vellido and Lisboa, 2007). Almost all machine learning approaches applied to this problem employ supervised learning, such as artificial neural networks (Rumelhart et al., 1986), decision trees (Quinlan, 1986), discriminant analysis (Fisher, 1936), k-nearest neighbor (Cover and Hart, 1967) and Support Vector Machines (Vapnik, 1998). We list below some of the most used supervised machine learning approaches in cancer research.

2.1.1 Artificial neural networks

Artificial neural networks (ANN) were originally inspired by the human brain, which works with interconnected neurons (Figure 2.1). The strength of each neural connection, characterized by a weight, is determined through a learning process on labeled data (Cruz and Wishart, 2006). In an ANN, the neurons are organized in layers, in such a way that usually only neurons belonging to two consecutive layers are connected.

Fig. 2.1. Artificial Neural Network

During the classification process, ANNs can perform statistical operations (linear, logistic and non-linear regression) and logical operations or inferences (AND, XOR, NOT, IF-THEN) (Mitchell, 1997; Rodvold et al., 2001). The perceptron (Rosenblatt, 1962) is the simplest neural network: using a threshold activation function, it separates two classes by a linear discriminant function. Adjustment of the connection strengths is usually based on an optimization approach called the backpropagation algorithm (Rumelhart, 1986). One of the first applications of machine learning approaches in cancer research was through neural networks (Maclin et al., 1991; Cicchetti, 1992).
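The perceptron rule just described can be sketched in a few lines; the logical AND problem below stands in for any linearly separable two-class dataset.

```python
import numpy as np

def train_perceptron(X, y, lr=0.1, epochs=50):
    """Rosenblatt perceptron: threshold unit, error-driven weight updates.
    y takes values in {0, 1}; returns the weight vector w and bias b."""
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            pred = 1 if xi @ w + b > 0 else 0
            # weights change only when the prediction is wrong
            w += lr * (yi - pred) * xi
            b += lr * (yi - pred)
    return w, b

# Toy linearly separable problem: the logical AND function
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([0, 0, 0, 1])
w, b = train_perceptron(X, y)
preds = [1 if x @ w + b > 0 else 0 for x in X]
print(preds)  # → [0, 0, 0, 1]
```

For separable data, the perceptron convergence theorem guarantees that this loop terminates with a correct linear separator; for non-separable data (e.g. XOR), hidden layers and backpropagation are needed.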
Recently, their use has also been extended to other cancer applications such as cancer prognosis and treatment planning (Gómez-Ruiz et al., 2004; Jerez et al., 2004; Ripley et al., 2004; Mian et al., 2005). Some limitations of ANNs are their lack of interpretability and the problem of overfitting, especially when high-dimensional data are faced (e.g. microarray data) (Cruz and Wishart, 2006).

2.1.2 Decision trees

A decision tree is a structured graph or flow chart of decisions (nodes) and their possible consequences (leaves or branches) used to create a plan to reach a goal (Quinlan, 1986) (Figure 2.2). In a classification tree, pattern classification starts from the root node by successively asking questions about each of its properties (features). The different exclusive links from a node correspond to the different possible values of the property (feature). According to the answer, this process is followed until a leaf node, which has no further question, is reached. The pattern is finally assigned to the class represented by this node.

Fig. 2.2. Decision tree

A variety of approaches can be found for choosing the appropriate order of features in the decision tree and for pruning large trees. Decision trees are very well accepted in medical applications owing to their high model transparency and comprehensive interpretability. This is explained by the fact that decision trees are a sort of rule-based method, which provides a comprehensible interpretation. Indeed, the factor of interpretability should not be underestimated in real medical practice, where “most physicians are not even accustomed to the idea of computer-aided problem solving” (Lucas, 1997).
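As an illustration of how such a tree is read, the function below encodes a small hypothetical treatment rule in the spirit of Figure 2.2 (node status tested first, then tumor size); the threshold and decisions are illustrative only, not clinical guidance.

```python
def treatment_decision(lymph_nodes_positive: bool, tumor_size_cm: float) -> str:
    """Hypothetical two-level decision tree: root node tests lymph node status,
    the second node tests tumor size. Purely illustrative thresholds."""
    if lymph_nodes_positive:          # root node
        return "chemo"
    if tumor_size_cm < 1.0:           # second node, for node-negative patients
        return "no chemo"
    return "chemo"

print(treatment_decision(False, 0.5))  # → no chemo
print(treatment_decision(False, 2.0))  # → chemo
```

Each call traverses one root-to-leaf path, which is exactly why the resulting decision can be explained to a clinician as a chain of readable if-then rules.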
Decision trees were also one of the first methods applied in breast cancer research, for instance for predicting breast cancer survivability (Delen et al., 2005), diagnosis (Lee et al., 2010a) and treatment planning (Khan et al., 2008). Some potential limitations affecting the application of decision trees in cancer research are their difficulty in scaling to high-dimensional data (e.g. microarray data) and the strong assumption of mutual exclusivity of classes (Cruz et al., 2006).

2.1.3 Discriminant analysis

Fisher linear discriminant analysis (Fisher, 1936) constructs a linear hyperplane based on the maximization of the between-group to within-group scatter ratio. Assuming a multivariate normal distribution and homogeneity of covariance matrices, the hyperplane is described by a linear discriminant function which equals zero on the hyperplane. In this case, the hyperplane is defined by the geometric mean between the centroids (i.e. the centers of the classes) (Baldi and Brunak, 2001). Recently, a variety of non-linear discriminant analysis approaches were proposed based on the kernel concept to improve classification performance (Mika et al., 1999). This approach has found its place in some breast cancer applications (Miller et al., 2005; Michiels et al., 2005; Reid et al., 2005; Sun et al., 2007a). However, it suffers from several limitations, such as the small sample size problem due to within-class matrix singularity (Fukunaga, 1990). This problem arises whenever the number of samples is smaller than their dimensionality (the case of cancer classification with gene expression profiling, characterized by thousands of genes and fewer than one hundred patients).

2.1.4 k-nearest neighbor

The k-nearest neighbor method classifies each unlabelled sample by the majority label among its k nearest neighbors in the training set (Cover and Hart, 1967).
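The k-NN rule can be written directly from its definition; the sketch below uses Euclidean distance and a majority vote over toy two-dimensional data.

```python
import numpy as np
from collections import Counter

def knn_predict(x, X_train, y_train, k=3):
    """Classify x by majority vote among its k nearest training points."""
    distances = np.linalg.norm(X_train - x, axis=1)
    neighbors = np.argsort(distances)[:k]            # indices of k closest points
    votes = Counter(y_train[i] for i in neighbors)
    return votes.most_common(1)[0][0]

# Toy two-class training set (illustrative values)
X_train = np.array([[0.0, 0.0], [0.2, 0.1], [0.1, 0.3],
                    [1.0, 1.0], [0.9, 1.1], [1.2, 0.8]])
y_train = np.array(["benign", "benign", "benign",
                    "malignant", "malignant", "malignant"])

print(knn_predict(np.array([0.15, 0.2]), X_train, y_train))  # → benign
```

Note that no model is fitted in advance: all computation happens at prediction time, which is why the method slows down on large training sets, as discussed below.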
This makes it very well suited to non-linear classification problems. One advantage of this approach is that it does not make any assumption about the data distribution. A variety of breast cancer studies based on this approach can be found in the literature (Parry et al., 2002; Olshen and Jain, 2002; Zheng et al., 2010). Though simple, the k-NN classifier is known to be very sensitive to the presence of irrelevant features. Moreover, this method tends to be slow for large training datasets, because the nearest neighbors must be searched for over all instances (Baldi and Brunak, 2001).

2.1.5 Support vector machines

The key idea of this approach is that, by an appropriate mapping into a sufficiently high-dimensional space, it is always possible to define a hyperplane that separates the data from two categories (Vapnik, 1998) (Figure 2.3).

Fig. 2.3. Support Vector Machines

The mapping is performed using specific functions (known as kernel functions), which are chosen by the user among a variety of functions (Gaussian, polynomial, linear, …) according to the problem under investigation. The goal in all cases is to find the separating hyperplane in the resulting space with the largest margin, expecting that the larger the margin, the better the generalization of the classifier (Vapnik, 1998). This problem is generally reformulated as a constrained optimization problem and solved by resorting to its dual reformulation. Various applications using SVMs have been reported in breast cancer research (Liu et al., 2003; Chang et al., 2003b; Land and Verheggen, 2009). The SVM approach is known to be very robust to noisy features, and the overfitting problem is unlikely to occur. This has recently encouraged its use in many breast cancer studies using microarray data (Guyon et al., 2002; Buness et al., 2009; Lee, 2010b).
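As a simplified sketch of the large-margin idea, the code below trains a linear SVM by subgradient descent on the regularized hinge loss. This is an assumption-laden simplification: real SVM solvers work on the dual problem and support kernels, and the data and hyperparameters here are illustrative.

```python
import numpy as np

def linear_svm(X, y, lam=0.01, lr=0.05, epochs=2000):
    """Minimize lam/2*||w||^2 + mean hinge loss by subgradient descent.
    y takes values in {-1, +1}; returns the separating hyperplane (w, b)."""
    w, b = np.zeros(X.shape[1]), 0.0
    n = len(X)
    for _ in range(epochs):
        viol = y * (X @ w + b) < 1              # points inside the margin
        # subgradient of the regularized hinge objective
        w -= lr * (lam * w - (y[viol][:, None] * X[viol]).sum(axis=0) / n)
        b -= lr * (-y[viol].sum() / n)
    return w, b

# Two well-separated synthetic classes
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 0.5, (25, 2)), rng.normal(4, 0.5, (25, 2))])
y = np.array([-1] * 25 + [1] * 25)
w, b = linear_svm(X, y)
print(np.mean(np.sign(X @ w + b) == y))  # training accuracy
```

Only the margin-violating points (`viol`) contribute to the update, which mirrors the fact that the final hyperplane depends only on the support vectors.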
Despite its demonstrated efficiency in a wide range of classification problems, the SVM presents some limitations, such as the problem of selecting a suitable kernel function, its parameters and the penalty term (Baldi and Brunak, 2001).

2.2 Unsupervised classification (clustering)

Clustering is considered one of the fundamental research problems in various data analysis fields such as machine learning and pattern recognition (Jain and Dubes, 1988; Jain et al., 1999; Xu and Wunsch, 2005). Cluster analysis seeks to organize a set of patterns (e.g. patients or genes) into clusters such that patterns within a given cluster have a high degree of similarity, whereas patterns belonging to different clusters have a high degree of dissimilarity (Duda et al., 2001). Unlike supervised classification, the outcome of each element in the unsupervised context is unknown, making the learning task more challenging. One typical example in cancer research is the clustering of gene expression data (Belle et al., 2010). In a microarray experiment, the expression values of thousands of genes are obtained for only a few patients. Extracting genes co-expressed across different samples from these data is of great importance, as it may allow new insights into cancer biology to be gained. This is typically a clustering problem, where co-expressed genes should be grouped into the same cluster (Baldi and Brunak, 2001). Many algorithms have been proposed to address this problem for different purposes (Jain and Dubes, 1988; Jain et al., 1999; Baraldi et al., 1999; Xu and Wunsch, 2005). Clustering techniques can be roughly divided into two main categories: hierarchical and partitioning.

2.2.1 Hierarchical clustering

Hierarchical clustering produces a nested series of partitions in the form of a tree diagram or dendrogram (Jain and Dubes, 1988; Jain et al., 1999).
In hierarchical clustering we can distinguish two situations between two groups from different partitions: either they are disjoint, or one group wholly contains the other (Figure 2.4). Two clusters are merged based on a distance or dissimilarity measure, such as the Minkowski and Mahalanobis measures (Jain and Dubes, 1988; Jain et al., 1999). Two families of algorithms exist to establish a hierarchical tree: agglomerative and divisive. Hierarchical clustering is the most commonly used method to summarize data structures in bioinformatics generally, and in breast cancer research specifically (Baldi and Brunak, 2001). Many studies on the use of this clustering approach can be found in the cancer research literature, especially for microarray data analysis. In (Sotiriou et al., 2003), the use of hierarchical cluster analysis made it possible to distinguish between two groups of patients based on their ER status. This approach has also been used in the famous Stanford study to identify subgroups of cancers with separate gene expression profiles (Perou et al., 2000). Alizadeh et al. (2000) were able to identify formerly unknown types of B-cell lymphoma with distinct clinical behaviour by using hierarchical clustering of expression data. The use of this approach, however, was not limited to cancer class discovery; prognosis and treatment responsiveness prediction were targeted in (Belle et al., 2010) and (Rouzier et al., 2005), respectively.

Fig. 2.4: Hierarchical clustering

2.2.2 Partitioning clustering

Partitioning clustering identifies a single partition of the data that optimizes an appropriate objective function (kernel, spectral, fuzzy or classical) (Jain and Dubes, 1988; Jain et al., 1999; Xu and Wunsch, 2005) (Figure 2.5).
The clustering can be either hard (each pattern belongs to only one class) or fuzzy (each pattern belongs, with a certain degree of membership, to each resulting cluster) (Jain et al., 1999). Fuzzy clustering offers the advantage of providing a basis for constructing rule-based fuzzy models with a simple representation and good performance on non-linear problems (Yao et al., 2000).

Fig. 2.5 Partitioning clustering

The k-means algorithm (MacQueen, 1967) is one of the most popular partitioning clustering algorithms. It is based on a “hard” partition of the data into k clusters through the minimization of the within-group sum of squares. A direct extension of the k-means algorithm is Fuzzy C-means (FCM) (Bezdek, 1981), where the fuzzy set notion is introduced into the class definition. In this case, each element belongs to a given class with a certain membership degree. Likewise, FCM minimizes the within-group sum of squares, but takes into account the membership degree of each element. Another interesting clustering approach is the Self-Organizing feature Map (SOM) (Kohonen, 1982). In this approach the data are represented by means of codevectors on a grid with fixed topology. Codevectors are adapted according to the input distribution, but the adaptation is propagated along the grid to neighboring codevectors, according to a specific neighborhood function (Filippone et al., 2008). These clustering approaches are widely used in breast cancer research. For instance, a molecular classification of tumor samples can be achieved using unsupervised methods like k-means clustering (Bertucci et al., 2002; Wang et al., 2003; Wiseman et al., 2005) or SOMs (self-organizing maps) (Covell et al., 2003). Tamayo et al. (1999) also used SOMs on DNA array data to differentiate subtypes of acute leukaemia.
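The k-means iteration (Lloyd's algorithm) can be sketched as follows, on two well-separated synthetic clusters; the initialization is deliberately naive, whereas real implementations use random restarts or k-means++.

```python
import numpy as np

def kmeans(X, k, n_iter=100):
    """Lloyd's algorithm: alternately assign each point to its nearest
    centroid, then move each centroid to the mean of its assigned points."""
    centroids = X[:k].copy()   # naive init: first k points
    for _ in range(n_iter):
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)                       # assignment step
        new_centroids = np.array([X[labels == j].mean(axis=0)
                                  for j in range(k)])       # update step
        if np.allclose(new_centroids, centroids):           # converged
            break
        centroids = new_centroids
    return labels, centroids

# Two well-separated synthetic clusters (20 points each)
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 0.2, (20, 2)), rng.normal(5, 0.2, (20, 2))])
labels, centroids = kmeans(X, k=2)
print(labels)
```

Each iteration can only decrease the within-group sum of squares, so the loop terminates; however, it converges to a local optimum that depends on the initialization, which is why multiple restarts are common in practice.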
Clustering approaches have also been used to cluster genes into groups and establish relations between the co-expressed genes in each group (De Souto et al., 2008). Many studies can also be found in which the clustering is performed in both directions, i.e. over patients and genes, an approach called biclustering (Cheng and Church, 2000; Sheng et al., 2003). However, the use of different methods may yield different results. Therefore, those approaches should be used with caution according to the problem under consideration.

2.3 Feature selection

In many learning domains, potentially useful attributes, also called features, for pattern description are defined without prior knowledge of their relevance. However, not all of these features are important for the learning task (supervised or unsupervised): some of them can be irrelevant, some may be redundant, and some can even mislead the learning results. The problem of selecting important features is known in the literature as feature selection. Feature selection is defined as the problem of choosing a small subset of features that ideally is necessary and sufficient to describe the target concept (Kira and Rendell, 1992a). Cancer research is a field in which feature selection is extensively employed. With the involvement of high throughput technology in breast cancer management, feature selection has become a necessary step in order to discard the huge number of irrelevant genes. The most important objectives of feature selection are: (a) to avoid overfitting and improve model accuracy, i.e. classification performance in supervised learning and better cluster detection in the case of clustering, (b) to reduce the training time of the model, and (c) to gain deeper insight into the underlying processes that generated the data (Saeys et al., 2007).
A typical feature selection task consists of four basic steps: subset generation, subset evaluation, stopping criterion and result validation (Liu and Yu, 2005). Subset generation produces candidate feature subsets for evaluation based on a certain search strategy. According to the evaluation criterion, each new subset is either retained to replace the previous best subset or rejected. This process is repeated until a given stopping criterion is satisfied. The winning feature subset is finally validated on a real-world dataset (see (Liu and Yu, 2005) and references therein for a review). Many research efforts have been directed in the last two decades towards developing efficient feature selection methods in a supervised framework (Kira and Rendell, 1992a; Weston et al., 2001; Gilad-Bachrach et al., 2004). However, only a few works have addressed this problem in unsupervised learning and clustering. This is mainly due to the absence of class labels, unlike in supervised learning, with which to assess the importance of a subset of features. Most unsupervised feature selection algorithms are based on information or consistency measures (Mitra et al., 2002; Dy and Brodley, 2004; Wei and Billings, 2007). In the context of classification, existing feature selection methods are traditionally categorized as filter, wrapper, hybrid or embedded methods, with respect to the criterion used to search for relevant features (Kohavi and John, 1997; Guyon and Elisseeff, 2003). We describe these approaches below and review some of their advantages and drawbacks.

2.3.1 Filter methods

In filter methods an independent evaluation function, generally based on a measure of information content, is used to select a set of features that maximizes this function, regardless of their effects on model performance. Different classification methods can then be applied using only this subset of features.
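A per-feature filter criterion of the kind used for microarray data can be sketched as follows, using the common two-class Fisher criterion (squared mean difference over summed within-class variances); the toy expression values are illustrative, not from any cited study:

```python
def fisher_score(x_pos, x_neg):
    """Two-class Fisher criterion for one feature:
    (mean difference)^2 / (sum of within-class variances)."""
    def mean(v):
        return sum(v) / len(v)
    def var(v):
        m = mean(v)
        return sum((x - m) ** 2 for x in v) / len(v)
    return (mean(x_pos) - mean(x_neg)) ** 2 / (var(x_pos) + var(x_neg) + 1e-12)

# Each entry: expression of one gene in positive / negative samples (toy values).
genes = {
    "gene_A": ([5.0, 5.2, 4.8], [1.0, 1.1, 0.9]),   # well separated -> relevant
    "gene_B": ([2.0, 4.0, 3.0], [2.5, 3.5, 3.0]),   # overlapping -> irrelevant
}
ranking = sorted(genes, key=lambda g: fisher_score(*genes[g]), reverse=True)
print(ranking)  # → ['gene_A', 'gene_B']
```

Each gene is scored independently of the classifier and of the other genes, which is exactly what makes filters fast but blind to feature interactions.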
Filter approaches are computationally very efficient and scale well with high-dimensional data. Thanks to these computational properties, many cancer research studies have resorted to filter approaches, especially for microarray data analysis. In these approaches all the genes are evaluated individually, e.g. through the t-test or the Fisher score (Dudoit et al., 2002; Li et al., 2004). However, filter approaches present some limitations related to the problem of interactions between features. Furthermore, they often do not guarantee maximum classification performance, because they totally ignore the effect of the selected subset of features on the classifier and thereby sometimes perform very poorly.

2.3.2 Wrapper methods

Wrapper methods use the performance of a learning method to assess the relative usefulness of the selected feature subset (e.g. by cross-validation) (Kohavi and John, 1997; Guyon and Elisseeff, 2003). In other words, a wrapper method requires a learning method (e.g. decision trees, SVM, k-NN, …) and uses its performance as the evaluation criterion. For the feature subset search step, an exhaustive procedure can be performed if the number of features is not too large. But with tens of thousands of features, the combinatorial search required by wrapper methods quickly becomes intractable. A wide range of search strategies can be used instead, including best-first, branch-and-bound, simulated annealing and genetic algorithms (see (Kohavi and John, 1997) for a review). With the aim of improving classification accuracy in cancer applications, wrapper methods have also rapidly gained widespread use (Blanco et al., 2004; Wang et al., 2005b). In (Sun et al., 2007a) a more advanced approach that avoids the computational issue by optimizing a margin-based objective function has been used for gene selection. A relevant comparative study between filter and wrapper methods for gene selection has been performed in (Inza et al., 2004).
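A wrapper scheme of the kind described above can be sketched as follows. The evaluation criterion is the leave-one-out accuracy of a 1-NN classifier and the search strategy is greedy forward selection; both are illustrative choices among those cited, and the toy data are invented:

```python
def loo_accuracy_1nn(X, y, subset):
    """Wrapper evaluation: leave-one-out accuracy of a 1-NN classifier
    restricted to the candidate feature subset."""
    def dist(a, b):
        return sum((a[j] - b[j]) ** 2 for j in subset)
    hits = 0
    for i in range(len(X)):
        nn = min((k for k in range(len(X)) if k != i), key=lambda k: dist(X[i], X[k]))
        hits += y[nn] == y[i]
    return hits / len(X)

def wrapper_forward(X, y):
    """Greedy forward search driven by the learner's own performance."""
    selected, best = [], 0.0
    while True:
        # Subset generation: extend the current subset by one unused feature.
        cands = [selected + [j] for j in range(len(X[0])) if j not in selected]
        if not cands:
            return selected, best
        cand = max(cands, key=lambda S: loo_accuracy_1nn(X, y, S))
        acc = loo_accuracy_1nn(X, y, cand)
        if acc <= best:            # stopping criterion: no further improvement
            return selected, best
        selected, best = cand, acc

# Feature 0 separates the two classes; feature 1 is pure noise.
X = [[0.0, 5.0], [0.2, -4.0], [1.0, 4.8], [1.2, -4.2]]
y = [0, 0, 1, 1]
print(wrapper_forward(X, y))  # → ([0], 1.0)
```

The noise feature is rejected because adding it degrades the learner's cross-validated accuracy, something a univariate filter score cannot observe.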
2.3.3 Hybrid methods

In hybrid feature selection approaches a filter method is first used to reduce the initial feature dimension, and a wrapper approach is then applied on the reduced subset of features (Das, 2001). Nevertheless, the search in this approach is time-consuming and depends on the learning approach used by the wrapper method. Some attempts can be found in the cancer literature using such approaches for microarray data analysis (Xing et al., 2001).

2.3.4 Embedded methods

In these approaches the feature selection task is incorporated into the learning process. Just like wrapper approaches, embedded approaches therefore require a learning algorithm. Embedded approaches have the advantage of integrating the interaction with the learning method, while at the same time being less computationally intensive than wrapper methods. Embedded methods are not new in machine learning, as some of the oldest decision tree methods such as CART (Breiman et al., 1984) encompass a built-in mechanism to perform feature selection. Weston and his co-authors have proposed an embedded feature selection approach for SVM methods (Weston et al., 2001). Recursive Feature Elimination (RFE) (Guyon et al., 2002) is a well-known feature selection method designed specifically for microarray data analysis. It works by iteratively training an SVM classifier with the current set of features, and then heuristically removing the features with small feature weights.

2.4 Recent challenges in breast cancer management

In spite of the intensive research performed in the machine learning field (see previous sections) in past decades, many challenges still need to be seriously addressed to improve cancer management. These challenges are mainly related to the characteristics of the data used in the decision-making process.
Three challenges are mainly faced: the first is related to the presence of mixed-type data in daily produced clinical datasets, the second is related to the high dimensionality of data, especially those issued from microarray technology, and the last is the problem of noise and uncertainties usually associated with both kinds of data. Addressing those problems efficiently is urgently needed, given that in some cancer applications the three challenges can even be faced simultaneously (e.g. the integration of clinical and microarray data to improve breast cancer management (Sun et al., 2007a; Gevaert et al., 2006)). We describe hereafter in detail the three challenges which form the focus of the present thesis.

2.4.1 Data heterogeneity

Features used by physicians for patient state description are generally represented in different ways. The most used representation is the purely quantitative one, which assumes complete accuracy of the information. Taken as it appears, a real number contains an infinite amount of precision, whereas human knowledge is finite and discrete. So, there is a need to use data represented by symbolic values to fit with human perception. Data can therefore be represented in different ways: quantitative values (e.g. Age = 50), symbolic intervals (e.g. age belongs to the interval [40, 60]) or qualitative values (e.g. old, young, menopause, …). Thus, the development of an automatic mechanism for medical support is faced with this problem of data heterogeneity (quantitative, qualitative and interval data). Indeed, daily produced medical datasets are commonly characterized by a subset of heterogeneous (mixed-type) features. For instance, many datasets from the popular UCI machine learning repository (Blake and Merz, 1998) are described by heterogeneous features.
During the last decades, a few research works have been directed at the issue of representation multiplicity for data analysis purposes (Michalski and Stepp, 1980; Mohri and Hidehiko, 1994; Hu et al., 2007). However, to the best of our knowledge, no standard principle has been proposed in the literature to handle heterogeneous data in a unified way. Indeed, many of the proposed techniques process quantitative and qualitative data separately. In feature selection tasks, for example, they are based either on distance measures for the former type (Kira and Rendell, 1992a) or on information or consistency measures for the latter (Dash and Liu, 2003). In classification and clustering tasks, often only a Hamming distance is used to handle qualitative data (Aha, 1989; Aha, 1992; Kononenko, 1994). Other approaches are originally designed to process only quantitative data, and therefore arbitrary transformations of qualitative data into a quantitative space are performed without taking into account their nature in the original space (Cover and Hart, 1967; Kira and Rendell, 1992a; Weston et al., 2001). The inverse practice is to emphasize the qualitative aspect and discretize the quantitative value domain into several intervals; objects in the same interval are then labeled with the same qualitative value (Liu et al., 2002; Hall, 2000). Obviously, both approaches introduce distortion and end up with information loss with respect to the original data. Moreover, none of the previously proposed approaches combines, in a fully adequate way, the processing of symbolic intervals simultaneously with quantitative and qualitative data. An interesting approach would be to unify the different heterogeneous spaces into one homogeneous space and then reason in a unified way about the whole data to make the appropriate decision. To avoid any type of distortion and/or information loss, the space-unification process should be performed appropriately for each type of data.
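To make the difficulty concrete, one naive way to compare mixed-type patient descriptions is to combine a per-type term for each feature, in the spirit of heterogeneous overlap/Euclidean metrics. The combination below (normalized difference for quantitative features, 0/1 overlap for qualitative ones, endpoint gap for intervals) and the toy patient records are illustrative choices only, not the unified approach proposed in this thesis:

```python
def mixed_distance(a, b, types, ranges):
    """Illustrative heterogeneous dissimilarity: normalised absolute
    difference for quantitative features, 0/1 overlap (Hamming-style) for
    qualitative ones, and a normalised endpoint gap for interval features."""
    total = 0.0
    for x, z, t, r in zip(a, b, types, ranges):
        if t == "quantitative":
            total += (abs(x - z) / r) ** 2
        elif t == "qualitative":
            total += 0.0 if x == z else 1.0
        else:  # interval feature given as (lo, hi)
            total += (max(abs(x[0] - z[0]), abs(x[1] - z[1])) / r) ** 2
    return total ** 0.5

types = ["quantitative", "qualitative", "interval"]
ranges = [60.0, None, 60.0]                  # value ranges used to normalise
p1 = (50.0, "menopause", (40.0, 60.0))       # toy patient: age, status, age interval
p2 = (35.0, "young", (30.0, 45.0))
print(round(mixed_distance(p1, p2, types, ranges), 3))  # → 1.061
```

Note how each feature type needs its own ad-hoc term and normalisation, which illustrates why a principled unification of the heterogeneous spaces is preferable to such piecewise constructions.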
2.4.2 High feature-to-sample ratio (curse of dimensionality)

The recent introduction of high throughput technology in breast cancer management has brought with it a new challenge related to the high dimensionality of microarray data. Indeed, this problem, known as the curse of dimensionality (or high feature-to-sample ratio), is still considered one of the principal challenges in statistical machine learning (Lafferty and Wasserman, 2006). As pointed out in section 2.2, due to the presence of a large number of irrelevant genes, many traditional classification approaches either present some limitations (e.g. overfitting) or incur an important computational cost (e.g. k-NN). Even though feature selection approaches can help to alleviate this problem, most of them become impractical when the dimensionality problem is combined with the problem of heterogeneity (section 2.4.1) or the noisy nature of microarray measurements (detailed in the next section). Therefore, there is a need to develop new approaches able to deal efficiently and simultaneously with such problems.

2.4.3 Noise and uncertainty

On the other hand, the features used to describe a patient state can also be corrupted by several types of noise and uncertainty due to measurement, human approximations or biological interactions. For instance, it has been reported recently that the major difficulty in deciphering high throughput gene expression experiments comes from the noisy nature of the data (Tu et al., 2002). Indeed, data issued from high throughput technology are not only characterized by the dimensionality problem but also present another challenging aspect related to their low signal-to-noise ratio. The noise in this type of data has multiple sources: biological noise, measurement noise, slide manufacturing errors, hybridization errors, and scanning errors of the hybridized slide (Tu et al., 2002; Nykter et al., 2006).
Biological errors are typically due to internal stochastic noise of the cells and to error sources related to sample preparation (Blake et al., 2003). This type of intrinsic noise is present in all measurements, regardless of the measurement technology. Measurement errors, on the other hand, include error sources that are directly related to the measurement technology and its limitations (e.g. bias due to the dyes used) (Nykter et al., 2006). The properties of this kind of extrinsic noise depend on the measurement technology (Blake et al., 2003). Slide manufacturing errors are related to microarray slide images. These include variations in spot position and size. In addition, marks left by a print tip and deformations of the spot shape can be produced (Nykter et al., 2006). Hybridization errors include background noise, spot bleeding, scratches, and air bubbles (Nykter et al., 2006). Another possible source of error is the digitization of the hybridized slide by scanning. Since the hybridized slide is read by scanning each dye color separately, the channels may not align perfectly (Nykter et al., 2006). Many studies have been performed to examine the different effects of experimental, physiological, and sampling variability (Lee et al., 2000; Novak et al., 2002). An interesting study has been performed in (Tu et al., 2002) to analyze the quantitative noise in gene expression microarray experiments. The authors have shown, through two illustrative concrete examples, the differences in gene expression due to experimental noise. In the first example, a comparison between gene expression values measured on the same sample was performed. Figure 2.6a shows the overall difference between two measured gene expression profiles due to measurement error alone, as provided in (Tu et al., 2002).
The deviation of the scattered points from the diagonal line represents the difference between the two measured transcriptomes. In the second example, two samples from different cultures are compared, as shown in Figure 2.6b, so that the measured expression value differences contain the combined effect of genuine gene expression differences and measurement error.

Fig. 2.6: Scatter plots of gene expression pairs: (a) pair of experiments on the same sample; (b) pair of experiments on two different samples. Figure taken from (Tu et al., 2002).

Although panels (a) and (b) appear similar, the deviations of the expression values from the diagonal line are of a completely different nature. The first is due only to gene expression measurement error, whereas the second is due to the combined effect of gene expression differentiation and measurement error. Therefore, it is crucial to distinguish the difference caused purely by experimental measurement from the expression differentiation due to the difference between the two cultures. All existing feature selection and classification approaches assume that microarray data are perfect, without questioning their reliability. One common practice to deal with this problem is to transform the gene-expression levels in a non-linear way in a preprocessing phase, so that the variance across experiments becomes comparable for each gene (Huber et al., 2002). A drawback of this approach is that a global transformation does not adequately account for the fact that the same gene may be measured with different precision in different experiments (Huber et al., 2002). Another drawback is that a complex non-linear transformation of the data complicates the interpretation of the measurements compared with a global transformation. Machine learning approaches can also offer a powerful tool to tackle this problem.
An interesting approach would be to use symbolic data analysis (SDA), popularized by Bock and Diday (Bock and Diday, 2000). Within this framework, an interval data representation can be used to take into account the uncertainty and noise usually inherent in measurements (Billard, 2008). Symbolic interval features are extensions of pure real data types, in the sense that each feature may take an interval of values instead of a single value (Gowda and Diday, 1992). In this framework, the value of a quantity x (e.g. a gene expression value) is expressed as a closed interval [x⁻, x⁺] whenever x is noisy or uncertain, representing the information that x⁻ ≤ x ≤ x⁺. However, the introduction of the interval representation makes the data processing task more complex than when only a numerical value is considered, especially when the high-dimensionality problem is faced jointly. It is worth noting that the interval data representation can also be useful for many other real-world problems in the cancer field. Measurement uncertainty is not the only type of uncertainty to be faced in real-world problems generally and in the medical field specifically. Another uncertainty type of great interest is the membership uncertainty of patients to each class: a patient's tumor can belong simultaneously to several cancer groups with some degree of membership. Taking this uncertainty into account can make decision-making mechanisms more reproducible and robust, since clinically relevant cancer groups have been identified in several public datasets using different populations of breast cancer patients. Indeed, breast cancer has been shown to be a highly heterogeneous disease, requiring the consideration of such uncertainty in the decision-making process. Even in day-to-day practice, physicians naturally incorporate such uncertainty in their decision process for any disease management. Fuzzy set theory, introduced by Lotfi Zadeh (Zadeh, 1965), represents an appropriate framework to deal with membership uncertainties.
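As a toy sketch of membership uncertainty, a tumor profile can be given graded memberships to subtype prototypes, here using the standard fuzzy c-means membership formula (inverse-distance weighting). The two "subtype" prototypes and the profile are invented for illustration:

```python
def fuzzy_memberships(x, prototypes, m=2.0):
    """Graded membership of sample x to each prototype, using the fuzzy
    c-means membership formula with fuzzifier m; degrees sum to 1."""
    d = [sum((xi - pi) ** 2 for xi, pi in zip(x, p)) ** 0.5 for p in prototypes]
    if any(di == 0.0 for di in d):              # sample lies exactly on a prototype
        return [1.0 if di == 0.0 else 0.0 for di in d]
    inv = [(1.0 / di) ** (2.0 / (m - 1.0)) for di in d]
    s = sum(inv)
    return [v / s for v in inv]

# Two illustrative subtype prototypes in a toy 2-gene expression space.
luminal, basal = (1.0, 0.0), (0.0, 1.0)
u = fuzzy_memberships((0.8, 0.2), [luminal, basal])
print([round(v, 3) for v in u])  # → [0.941, 0.059]
```

Instead of a hard subtype label, the tumor keeps a degree of membership to each group, which is precisely the information a crisp classifier discards.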
Medicine was one of the first fields in which Zadeh's fuzzy set theory was applied, to deal with vagueness in perceptions of real phenomena (Zadeh, 1969). Although fuzzy approaches have started to gain increasing attention in a wide range of cancer applications (Ressom et al., 2003; Andrews et al., 2003; Haibe-Kains et al., 2010), their scalability to the recent challenges is still far from convincing compared to other classical machine learning approaches. Therefore, efficient fuzzy approaches to deal with such problems may make a major contribution to the improvement of cancer management.

2.5 Conclusion

In this chapter we have reviewed the state of the art of machine learning in cancer research. We have described the three machine learning tasks most used in cancer management: supervised classification, clustering and feature selection. A few examples of the most famous approaches for each task have been briefly described, highlighting their advantages and drawbacks. Then some applications of such approaches in breast cancer management have been provided. Despite their successful use in breast cancer management based on traditional clinical factors, we have noticed that most of them fail to deal with the recent challenges brought by the introduction of data issued from advanced technologies. We can for instance mention the problem of overfitting in supervised classification methods, usually due to the high feature-to-sample ratio. This requires resorting to feature selection approaches, which have been extensively studied and developed to overcome this problem. We briefly reviewed the tremendous research work that has been made in that direction. We have noticed, however, that feature selection is not only useful for dimension reduction but has also enabled major advances in gaining new insights into cancer biology by using gene expression profiles.
Thanks to feature selection approaches, a tailored and personalized cancer management is today underway through the derivation of several genetic signatures for different purposes. We have finally described the unsupervised learning approaches and their applications in breast cancer management, especially through their use in the identification of groups of co-expressed genes. This chapter ends with a description of the recent challenges that have to be faced to improve cancer management and treatment. We considered mainly the problems of data heterogeneity, high dimensionality, low signal-to-noise ratio and membership uncertainties. Data heterogeneity is related to the use of mixed-type features in daily produced datasets, a common practice in many cancer problems. Despite the important number of works devoted to the problem of high data dimensionality, it is still considered an open research problem and one of the principal challenges in statistical learning theory. The problem of low signal-to-noise ratio, for its part, is related to the problem of reproducibility in high-throughput technologies (microarray, mass spectrometry), due mainly to variations in experimental and biological conditions. To the best of our knowledge, this problem has never been addressed by the machine learning community. We also noted that noise is not the only source of uncertainty in cancer: the membership uncertainty of a tumor to cancer subtypes is an evident reality and is gaining increasing attention in recent studies using datasets gathered by different medical centers with different technologies. In the next chapter we address the problem of high dimensionality through the development of an embedded feature selection method for SVM based on a gradient descent method.
CHAPTER 3 - Summary
Embedded feature selection in support vector machines by a gradient method

High-throughput technologies routinely produce databases characterized by an unprecedented number of features representing each individual. Thanks to its ability to provide sparse solutions, ℓ1-regularized learning has been shown to be a promising method for feature selection in classification problems. Among the wide range of applications of ℓ1-type regularization, we can distinguish ℓ1-regularized logistic regression (Ng, 2004), LASSO (Tibshirani, 1996) and the ℓ1-SVM (Bradley and Mangasarian, 1998; Zhu et al., 2003). In this chapter we focus on the ℓ1-SVM regularization problem in order to develop a so-called embedded feature selection approach to overcome the problem of high dimensionality. Despite its attractive properties, the fast implementation of ℓ1-SVM algorithms for high-dimensional data has long been considered a difficult problem, because the resulting objective function is non-differentiable. Generic methods used to solve non-differentiable convex problems, such as gradient-based methods, are typically very slow. Various advanced optimization techniques have been exploited to develop dozens of algorithms able to handle medium- and large-scale problems. In past years, very few works have been devoted to solving this problem. In particular, one can distinguish the work of Zhu and his co-authors (Zhu et al., 2003) and, more recently, the works of Fung and Mangasarian (Fung and Mangasarian, 2004; Mangasarian, 2006).
In the first work, the ℓ1-SVM problem is formulated as a linear programming problem so that standard software can be used to solve it, whereas in the second a Newton method is used to solve the dual problem as an exterior penalty problem. The latter method is, however, characterized by a high complexity owing to the number of parameters to tune (five, including the regularization parameter), which makes it unusable by non-specialist users. Moreover, we show in this chapter that the Newton method does not always guarantee a globally optimal solution. It was pointed out by Chapelle (Chapelle, 2007), however, that the initial optimization problem can also be solved efficiently without going through a dual formulation, since in both cases the same result is obtained. We propose here to use a generic approach based on a gradient descent technique, denoted DGM (Direct Gradient Method), to solve the initial ℓ1-SVM problem. This method has proved effective for solving ℓ1-regularized logistic regression problems (Cai et al., 2010a). However, it assumes that the objective function is differentiable, which is not the case for the ℓ1-SVM problem. To overcome this issue, the loss function is replaced by a differentiable approximation known as the Huber loss. The initial constrained convex optimization problem is then transformed into an unconstrained problem, for which a gradient descent method guarantees a globally optimal solution. This method was implemented in Matlab and compared with the LPNewton method proposed in (Fung and Mangasarian, 2004; Mangasarian, 2006) on eight high-dimensional datasets to demonstrate its effectiveness.
It was shown that this method outperforms the LPNewton method in terms of execution time (CPU time) and in terms of the accuracy reached when varying the regularization parameter. As an example, on a prostate cancer dataset (Stephenson et al., 2005) containing 97 patients characterized by the expression of 22,291 genes, the proposed method reaches the target cost in an execution time of 40.3 seconds whereas the LPNewton method requires 2,147 seconds; it is therefore about 50 times faster. It was furthermore observed that, in most cases, the LPNewton method fails to converge for some values of the regularization parameter. It should be noted that this work was carried out during a research stay at the ICBR (Interdisciplinary Center for Biotechnology Research) laboratory at the University of Florida, co-funded by the doctoral school EDSYS, Université Paul Sabatier and the DISCO (Diagnostic et Conduite des Systèmes) research group of LAAS. High data dimensionality is, however, not the only problem encountered in practical cancer applications. Problems such as data heterogeneity, uncertainty and noise can also be encountered jointly with the high-dimensionality problem. Consequently, more efficient methods are needed to cope with all these problems simultaneously. This issue will be our subject of interest in the following chapters, in which we develop appropriate approaches able to handle such problems simultaneously.

CHAPTER 3
Embedded Feature Selection for SVM by Gradient Descent Methods

High-throughput technologies routinely produce large datasets characterized by an unprecedented number of features representing each data sample.
ℓ1 regularized learning, due to its ability to produce sparse solutions, has been shown to be a promising method for feature selection in classification problems. Among the wide range of ℓ1 regularization applications, we can distinguish ℓ1 regularized logistic regression (Ng, 2004), LASSO (Tibshirani, 1996) and the ℓ1-SVM (Bradley and Mangasarian, 1998; Zhu et al., 2003). We focus in the present work on the problem of the ℓ1 regularized SVM in the primal domain. Despite its attractive properties, the fast implementation of ℓ1-SVM algorithms for high-dimensional data has long been considered a difficult computational problem, since the resulting objective function is non-differentiable. Generic methods for non-differentiable convex problems, such as sub-gradient methods, are typically very slow. Various advanced optimization techniques have been exploited to develop dozens of algorithms capable of handling medium- and large-scale problems. In the last few years only a few works have been devoted to solving this problem. In (Zhu et al., 2003) the ℓ1-SVM problem is formulated as a linear programming problem and a standard software package was used to solve it, whereas in (Fung and Mangasarian, 2004; Mangasarian, 2006) a Newton method was used to solve the dual linear program formulation as an exterior penalty problem. The basic idea of this approach is to set up the 1-norm SVM problem as an unconstrained minimization problem in the dual space. This method has been tested on a wide variety of datasets and compared with other methods (Fung and Mangasarian, 2004; Mangasarian, 2006). However, it ends up with a high complexity due to the number of resulting parameters to be adjusted (five, including the regularization parameter), which makes it impracticable for non-expert users. Furthermore, as shown in this chapter, reaching a globally optimal solution with the adopted Newton method is not always guaranteed.
Nevertheless, it has been pointed out recently by (Chapelle, 2007) that the dual and primal optimization problems are two equivalent ways of reaching the same result. Indeed, it has been shown that the primal problem can be solved efficiently without the need to pass through the dual formulation. A generic approach based on a gradient descent technique has recently been proposed, referred to as DGM for Direct Gradient Method, capable of solving various ℓ1 regularized learning problems, provided that the loss function is differentiable (Cai et al., 2010a). It has been shown, through an application to ℓ1 regularized logistic regression, that this method has the advantage of providing a simple and fast implementation. We show in the present work that the ℓ1-SVM problem can be solved easily in the primal domain by using a generic gradient-descent technique based on DGM (Cai et al., 2010b). The basic idea is to transform a convex optimization problem with a non-differentiable objective function into an unconstrained one. It has been proved theoretically therein that if the initial point is properly selected, DGM provides a globally optimal solution. We take advantage here of this property to extend DGM to solve one of the important problems in machine learning: the ℓ1-SVM problem in the primal domain. This extension is, however, not straightforward, since the hinge loss function in the objective function is non-differentiable. To overcome this problem, we replace the non-differentiable hinge loss function by its differentiable approximation, the Huber loss function. It has indeed been pointed out that the SVM using this loss function provides the same sparse solution as the SVM with the hinge loss function under certain conditions (Chapelle, 2007). We then transform the initial constrained convex optimization problem into an unconstrained problem in the primal domain.
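A toy sketch of this scheme: the hinge loss is replaced by a Huber-smoothed hinge, the weight vector is split as w = u - v with u, v ≥ 0 so that ‖w‖₁ = Σu + Σv (the nonnegative reformulation detailed in the next section), and projected gradient descent is run on the result. The step size, smoothing parameter, iteration count and data below are illustrative choices; this is not the Matlab implementation evaluated later in the chapter:

```python
def huber_hinge_grad(yt, h=0.5):
    """Derivative w.r.t. the margin yt of a Huber-smoothed hinge loss
    (quadratic for |1 - yt| <= h, linear below, zero above)."""
    if yt > 1 + h:
        return 0.0
    if yt < 1 - h:
        return -1.0
    return -(1 + h - yt) / (2 * h)

def l1_svm(X, y, lam=0.05, lr=0.1, iters=2000):
    """Projected gradient descent on the split formulation:
    w = u - v with u, v >= 0, so that ||w||_1 = sum(u) + sum(v)."""
    J, N = len(X[0]), len(X)
    u, v, b = [0.0] * J, [0.0] * J, 0.0
    for _ in range(iters):
        gw, gb = [0.0] * J, 0.0
        for xn, yn in zip(X, y):
            t = sum((u[j] - v[j]) * xn[j] for j in range(J)) + b
            g = huber_hinge_grad(yn * t) * yn / N   # chain rule through yn*t
            for j in range(J):
                gw[j] += g * xn[j]
            gb += g
        # Gradient step on (u, v, b), then projection back onto u, v >= 0.
        u = [max(0.0, u[j] - lr * (gw[j] + lam)) for j in range(J)]
        v = [max(0.0, v[j] - lr * (-gw[j] + lam)) for j in range(J)]
        b -= lr * gb
    return [u[j] - v[j] for j in range(J)], b

# Toy data: only feature 0 is informative; feature 1 is noise.
X = [[2.0, 0.3], [1.5, -0.2], [-2.0, 0.1], [-1.6, -0.3]]
y = [1, 1, -1, -1]
w, b = l1_svm(X, y)
pred = [1 if sum(wj * xj for wj, xj in zip(w, xn)) + b > 0 else -1 for xn in X]
print(pred)
```

The ℓ1 penalty appears as the constant `lam` added to both gradient components, and the projection `max(0, ·)` is what drives redundant coordinates exactly to zero, yielding the sparsity discussed above.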
Some numerical experiments were performed to compare the proposed approach with the Newton family of approaches proposed by (Fung and Mangasarian, 2004; Mangasarian, 2006). We demonstrate that our algorithm, though simple, outperforms this method in terms of computational efficiency and of the optimality of the obtained solution. It is worthwhile to note that this work was performed during a research stay at the ICBR (Interdisciplinary Center for Biotechnology Research) of the University of Florida, under the supervision of Dr. Yijun Sun. The chapter is organized as follows. Section 3.1 describes the main idea of the DGM approach. Section 3.2 presents the detailed implementation of the ℓ1-SVM method. Section 3.3 presents some numerical experiments comparing the new approach with one of the well-known state-of-the-art algorithms.

3.1 Gradient descent based method for solving ℓ1 regularized problems

This section describes the main idea of the gradient descent method for solving ℓ1 regularized problems. This description has been taken from (Cai et al., 2010a). Let $D = \{(\mathbf{x}^{(n)}, y_n)\}_{n=1}^{N}$ denote a training dataset, where $\mathbf{x}^{(n)} \in \Re^J$ is the n-th pattern and $y_n \in \Re$ is the corresponding class label. We seek an optimal solution $(\mathbf{w}^*, b^*)$ to the following ℓ1 regularized learning problem:

$$\min_{\mathbf{w}, b} \; f_1(\mathbf{w}, b) = \frac{1}{N} \sum_{n=1}^{N} L\left(y_n, \mathbf{w}^T \mathbf{x}^{(n)} + b\right) + \lambda \|\mathbf{w}\|_1 \qquad (3.1)$$

where $\|\mathbf{w}\|_1 = \sum_j |w_j|$, $w_j$ is the j-th element of $\mathbf{w}$, $L(\cdot)$ is a loss function and $\lambda$ is a regularization parameter that controls the sparseness of the solution. We require here that $L(\cdot)$ be convex and differentiable with respect to its second argument. The above formulation encompasses a wide range of learning algorithms, including LASSO (Tibshirani, 1996) and the ℓ1 regularized logistic regression algorithm (Ng, 2004).
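The objective in Eq. (3.1) is straightforward to evaluate directly; below is a minimal numpy sketch (the function names are ours, not from the cited works) for a generic differentiable loss:

```python
import numpy as np

def l1_objective(w, b, X, y, loss, lam):
    """Objective of Eq. (3.1): average loss over the data plus the l1 penalty.

    X is an (N, J) data matrix, y an (N,) label vector, `loss` a vectorized
    function of (y, margin), and `lam` the regularization parameter lambda.
    """
    margins = X @ w + b
    return np.mean(loss(y, margins)) + lam * np.sum(np.abs(w))

# Example with the squared loss (any convex, differentiable loss works).
squared = lambda y, a: (y - a) ** 2
```

At `w = 0` the penalty vanishes and only the average loss remains, which makes the sparseness/fit trade-off controlled by `lam` easy to inspect.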
If a modified hinge loss is used (see, for example, (Rennie and Srebro, 2005; Chapelle, 2007)), equation (3.1) represents an approximate formulation of ℓ1-SVM. The above formulation has a very appealing property for high-dimensional data analysis. It has been proved in (Rosset et al., 2004) that solving problem (3.1) leads to a globally optimal solution $\mathbf{w}^*$ with at most N non-zero elements. When $N \ll J$, it provides an explicit mechanism to perform feature selection and significantly reduce model complexity. This property, however, comes at a price. Unlike ℓ2 regularization, $\|\mathbf{w}\|_1$ is a non-differentiable function of $\mathbf{w}$, and the efficient implementation of ℓ1 regularized formulations poses a serious challenge to the machine learning community. We show below how a simple gradient descent technique can be used to efficiently solve ℓ1 regularized learning problems.

Denote $\tilde{\mathbf{x}}^{(n)} = [(\mathbf{x}^{(n)})^T, -(\mathbf{x}^{(n)})^T]^T$ and consider the following optimization problem:

$$\min_{\mathbf{w}, b} \; f_2(\mathbf{w}, b) = \frac{1}{N} \sum_{n=1}^{N} L\left(y_n, \mathbf{w}^T \tilde{\mathbf{x}}^{(n)} + b\right) + \lambda \sum_{i=1}^{2J} w_i \quad \text{s.t. } \mathbf{w} \ge 0 \qquad (3.2)$$

The following lemma shows that the solution to (3.1) can be recovered from the solution to (3.2).

Lemma 3.1. Let $(\mathbf{w}^*, b^*)$ be an optimal solution to (3.2), where $\mathbf{w}^* = [(\mathbf{w}^{*(1)})^T, (\mathbf{w}^{*(2)})^T]^T$ and $\mathbf{w}^{*(1)}, \mathbf{w}^{*(2)} \in \Re^J$. Then $(\mathbf{w}^{*(1)} - \mathbf{w}^{*(2)}, b^*)$ is an optimal solution to (3.1). Conversely, if $(\mathbf{w}^*, b^*)$ is an optimal solution to (3.1), then there exist $\mathbf{w}^{o(1)}$ and $\mathbf{w}^{o(2)}$ such that $\mathbf{w}^* = \mathbf{w}^{o(1)} - \mathbf{w}^{o(2)}$ and $([(\mathbf{w}^{o(1)})^T, (\mathbf{w}^{o(2)})^T]^T, b^*)$ is an optimal solution to (3.2). Proof. See Appendix 2.

The following lemma shows that at least half of the elements of the optimal solution to (3.2) are zero. We will exploit this property in our algorithm implementation in Section 3.2.

Lemma 3.2. Let $(\mathbf{w}^*, b^*)$ be an optimal solution to (3.2) and $\mathbf{w}^* = [(\mathbf{w}^{*(1)})^T, (\mathbf{w}^{*(2)})^T]^T$.
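The variable-splitting trick behind (3.2) and Lemma 3.1 can be illustrated in a few lines of numpy (helper names are hypothetical): augmenting the data as $\tilde{\mathbf{x}} = [\mathbf{x}, -\mathbf{x}]$ and recovering $\mathbf{w} = \mathbf{w}^{(1)} - \mathbf{w}^{(2)}$ leaves the margins unchanged, and when at least one element of each pair is zero (Lemma 3.2) the linear penalty $\sum_i w_i$ equals $\|\mathbf{w}\|_1$:

```python
import numpy as np

def split_features(X):
    """Augment the data as x_tilde = [x, -x], so that w = w1 - w2, w1, w2 >= 0."""
    return np.hstack([X, -X])

def recover_w(w_aug):
    """Recover the solution of (3.1) from a solution of (3.2), as in Lemma 3.1."""
    J = w_aug.size // 2
    return w_aug[:J] - w_aug[J:]
```

For instance, with `w_aug = [0.5, 0, 0, 0.3]` (non-negative, one of each pair zero), the recovered `w` is `[0.5, -0.3]`, the linear penalty `w_aug.sum()` equals `|w|_1`, and `split_features(X) @ w_aug == X @ recover_w(w_aug)`.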
Then, $\forall j \in [\mathfrak{I}] = \{1, \ldots, J\}$, either $w_j^{*(1)}$ or $w_j^{*(2)}$, or both, equal zero. Proof. See Appendix 2.

The conversion from (3.1) to (3.2) is a standard step that has been previously used in many algorithms (e.g. (Schmidt et al., 2007) and (Duchi et al., 2008)). Note that Eq. (3.2) is a constrained convex optimization problem with a differentiable objective function. In order to use gradient descent, we convert it into an unconstrained optimization problem. Let $w_j = v_j^2$, $\forall j \in 2[\mathfrak{I}]$. Then, (3.2) can be rewritten as

$$\min_{\mathbf{v}, b} \; f(\mathbf{v}, b) = \frac{1}{N} \sum_{n=1}^{N} L\left(y_n, \sum_{j=1}^{2J} v_j^2 \tilde{x}_j^{(n)} + b\right) + \lambda \sum_{j=1}^{2J} v_j^2 \qquad (3.3)$$

After the above transformation, the objective function of (3.3) is no longer a convex function, which is usually an undesirable property in optimization, except in some rare cases (Evtushenko and Zhadan, 1996; Faybusovich, 1991). We show next that the transformation is nevertheless beneficial, in the sense that it not only preserves the global convergence property of the original problem, but also enables the removal of irrelevant features. Taking the derivative of f with respect to $\mathbf{v}$ and b, respectively, yields

$$\frac{\partial f}{\partial \mathbf{v}} = 2\left(\frac{1}{N} \sum_{n=1}^{N} \frac{\partial L\big(y_n, \sum_{j=1}^{2J} v_j^2 \tilde{x}_j^{(n)} + b\big)}{\partial t} \, \tilde{\mathbf{x}}^{(n)} + \lambda \mathbf{1}\right) \otimes \mathbf{v}, \qquad \frac{\partial f}{\partial b} = \frac{1}{N} \sum_{n=1}^{N} \frac{\partial L\big(y_n, \sum_{j=1}^{2J} v_j^2 \tilde{x}_j^{(n)} + b\big)}{\partial t} \qquad (3.4)$$

where $\partial L(\cdot)/\partial t$ is the derivative of L with respect to its second argument, $\mathbf{1}$ is the all-ones vector and $\otimes$ is the Hadamard (element-wise) product. For convenience, we denote $\bar{\mathbf{v}} = [\mathbf{v}^T, b]^T$ and $\mathbf{g} = [(\partial f/\partial \mathbf{v})^T, \partial f/\partial b]^T$. Let $\bar{\mathbf{v}}^{(k)}$ be the estimate of $\bar{\mathbf{v}}$ in the k-th iteration and $\mathbf{g}^{(k)}$ be the value of $\mathbf{g}$ at $\bar{\mathbf{v}}^{(k)}$. A gradient descent method uses the following updating rule:

$$\bar{\mathbf{v}}^{(k+1)} = \bar{\mathbf{v}}^{(k)} - \eta \, \mathbf{g}^{(k)} \qquad (3.5)$$

where $\eta$ is determined via a line search.

Theorem 3.1. Let $f(\mathbf{w}, b)$ be a differentiable convex function of $\mathbf{w}$ and b, where $\mathbf{w} \in \Re^J$, $\mathbf{w} \ge 0$, $b \in \Re$. Let $G(\mathbf{v}) = f(\mathbf{w}, b)$, where $\mathbf{w} = [w_1, \ldots, w_J]^T = [v_1^2, \ldots, v_J^2]^T$ and $b = v_{J+1}$.
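The gradient (3.4) under the substitution $w_j = v_j^2$ can be sketched and verified against a numerical gradient. The sketch below uses our own naming and the squared loss as an example of a differentiable $L$:

```python
import numpy as np

def grad_f(v, b, Xt, y, dloss, lam):
    """Gradient of the reparameterized objective (3.3), as in Eq. (3.4).

    Xt is the augmented (N, 2J) data matrix, `dloss(y, a)` the derivative
    of the loss with respect to its second argument, `lam` the parameter
    lambda. Returns (df/dv, df/db).
    """
    margins = Xt @ (v ** 2) + b
    g = dloss(y, margins)                          # dL/dt evaluated per sample
    grad_v = 2.0 * ((Xt.T @ g) / len(y) + lam) * v  # chain rule: d(v_j^2)/dv_j = 2 v_j
    grad_b = g.mean()
    return grad_v, grad_b
```

A finite-difference check on a tiny example confirms that the analytic gradient matches Eq. (3.4) term by term.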
If $\left.\frac{\partial G}{\partial \mathbf{v}}\right|_{\mathbf{v} = \mathbf{v}^+} = 0$, then $\mathbf{v}^+$ is not a local minimizer, but a saddle point or a global minimizer of $G(\mathbf{v})$. If the Hessian $H(\mathbf{v}^+)$ is positive semi-definite, then $\mathbf{v}^+$ is a global minimizer.

Theorem 3.2. For $G(\mathbf{v})$ and $\mathbf{v}^+$ defined above, if $\mathbf{v}^+$ is found through gradient descent with a line search satisfying the following conditions:
1. interval condition: the line search splits the section under search into a finite number of intervals;
2. descending condition (see the definition below);
3. greedy condition (see the definition below);
and an initial point $\mathbf{v}^{(0)}$ satisfying $v_j^{(0)} \neq 0$, $\forall j \in [\mathfrak{I}]$, then, with probability one, $\mathbf{v}^+$ is a global minimizer of $G(\mathbf{v})$.

Here we give the definitions of the descending and greedy conditions for a line search:

Definition 3.1 (descending condition). Let $G(\mathbf{v})$ be an objective function, $\mathbf{g}(\mathbf{v})$ its gradient, $\mathbf{v}^{(k)}$ the solution obtained in the k-th iteration, and $-\mathbf{d}^{(k)}$ the descending direction. A line search is said to satisfy the descending condition if the chosen step length $\eta$ satisfies

$$\mathbf{d}^{(k)T} \, \mathbf{g}\left(\mathbf{v}^{(k)} - \eta \mathbf{d}^{(k)}\right) > 0.$$

Definition 3.2 (greedy condition). Given $G(\mathbf{v})$ and $\mathbf{g}(\mathbf{v})$ defined above, and $\varepsilon^{(k)}$ the length of the intervals at the k-th iteration, a line search is said to satisfy the greedy condition if the chosen step length $\eta$ satisfies

$$G\left(\mathbf{v}^{(k)} - \eta \mathbf{d}^{(k)}\right) \le G\left(\mathbf{v}^{(k)} - (\eta + \varepsilon^{(k)}) \mathbf{d}^{(k)}\right),$$

or $\mathbf{v}^{(k)} - (\eta + \varepsilon^{(k)}) \mathbf{d}^{(k)}$ is excluded from the line search; and

$$G\left(\mathbf{v}^{(k)} - \eta \mathbf{d}^{(k)}\right) \le G\left(\mathbf{v}^{(k)} - (\eta - \varepsilon^{(k)}) \mathbf{d}^{(k)}\right),$$

or $\mathbf{v}^{(k)} - (\eta - \varepsilon^{(k)}) \mathbf{d}^{(k)}$ is excluded from the line search.

The proofs of Theorems 3.1 and 3.2 are given in Appendix 2. The interval condition prevents the algorithm from over-exploring along a gradient descent direction. The descending and greedy conditions ensure that a line search approaches a local optimum along the descending direction, but never hits or goes beyond it.
With these conditions, a gradient descent method can improve its solution quality step by step and remain immune to misleading gradient information. Golden section search (Kiefer, 1953) is an example that satisfies the greedy condition and splits the section into a finite number of intervals according to the golden section rule.

3.2 Implementation details

We present below the detailed implementation of DGM for solving the ℓ1-SVM problem in the primal domain. In ℓ1-SVM, the following hinge loss function is usually used:

$$L(y, a) = \max(0, 1 - ya) \qquad (3.6)$$

However, the hinge loss function is not differentiable, and therefore the application of the DGM method is not straightforward in this case. To overcome this problem, we replace the non-differentiable hinge loss by its differentiable approximation, the Huber loss, as suggested by (Chapelle, 2007), given by

$$L(y, a) = \begin{cases} 0 & ya > 1 + h \\[4pt] \dfrac{(1 + h - ya)^2}{4h} & 1 - h \le ya \le 1 + h \\[4pt] 1 - ya & ya < 1 - h \end{cases} \qquad (3.7)$$

where h is a tunable parameter. If h is sufficiently small, an SVM using the Huber loss provides the same sparse solution as an SVM with the hinge loss (Chapelle, 2007). Hence, the DGM method described in Section 3.1 can be used directly to solve ℓ1-SVM in the primal domain. Writing $a_n = \mathbf{w}^T \tilde{\mathbf{x}}^{(n)} + b$, the gradients of f in Eq. (3.3) with respect to $\mathbf{v}$ and b are in this case given by

$$\frac{\partial f}{\partial \mathbf{v}} = 2\left(-\frac{1}{2Nh} \sum_{n:\, 1-h \le y_n a_n \le 1+h} y_n \left(1 + h - y_n a_n\right) \tilde{\mathbf{x}}^{(n)} \;-\; \frac{1}{N} \sum_{n:\, y_n a_n < 1-h} y_n \tilde{\mathbf{x}}^{(n)} \;+\; \lambda \mathbf{1}\right) \otimes \mathbf{v} \qquad (3.8)$$

and

$$\frac{\partial f}{\partial b} = -\frac{1}{2Nh} \sum_{n:\, 1-h \le y_n a_n \le 1+h} y_n \left(1 + h - y_n a_n\right) \;-\; \frac{1}{N} \sum_{n:\, y_n a_n < 1-h} y_n \qquad (3.9)$$

where samples with $y_n a_n > 1 + h$ do not contribute. The gradient descent step (3.5) is then applied. In each step, we first apply a back-tracking line search (Nocedal and Wright, 1999) to obtain an end point $\bar{\mathbf{v}}^{(e)}$ where $f(\bar{\mathbf{v}}^{(e)}) \le f(\bar{\mathbf{v}}^{(k)})$, and then apply a golden section line search on the section between $\bar{\mathbf{v}}^{(k)}$ and $\bar{\mathbf{v}}^{(e)}$.
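A short sketch of the Huber loss (3.7) and of its derivative with respect to the margin, as used inside (3.8)-(3.9); the function names are ours:

```python
import numpy as np

def huber_hinge(ya, h):
    """Differentiable Huber approximation of the hinge loss, Eq. (3.7).
    `ya` is the product y*a of label and margin."""
    return np.where(ya > 1 + h, 0.0,
           np.where(ya < 1 - h, 1.0 - ya,
                    (1 + h - ya) ** 2 / (4 * h)))

def huber_hinge_deriv(y, ya, h):
    """Derivative of Eq. (3.7) with respect to the margin a (chain rule in y)."""
    return np.where(ya > 1 + h, 0.0,
           np.where(ya < 1 - h, -y,
                    -y * (1 + h - ya) / (2 * h)))
```

At the joints the pieces match: at $ya = 1 - h$ the quadratic branch equals $(2h)^2/(4h) = h$, which is also the value of the linear branch $1 - ya$, and the derivative there is $-y$ on both sides, so the loss is continuously differentiable.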
3.2.1 Hybrid Conjugate Gradient

Because a simple gradient descent method is known to zig-zag on some function contours, the Fletcher-Reeves conjugate gradient method (Fletcher, 1997) can be used to enhance the performance of the algorithm:

$$\bar{\mathbf{v}}^{(k+1)} = \bar{\mathbf{v}}^{(k)} - \eta \, \mathbf{d}^{(k)}, \quad \mathbf{d}^{(1)} = \mathbf{g}^{(1)}, \quad \mathbf{d}^{(k)} = \mathbf{g}^{(k)} + \beta \, \mathbf{d}^{(k-1)}, \quad \beta = \frac{\langle \mathbf{g}^{(k)}, \mathbf{g}^{(k)} \rangle}{\langle \mathbf{g}^{(k-1)}, \mathbf{g}^{(k-1)} \rangle}, \quad \forall k > 1 \qquad (3.10)$$

where $\mathbf{d}^{(k)}$ is the conjugate gradient direction and $\langle \cdot, \cdot \rangle$ is the inner product. Note that the conjugate gradient method does not ensure that the objective function decreases monotonically. Hence, when $\langle \mathbf{g}^{(k)}, \mathbf{d}^{(k)} \rangle \le 0$, one usually replaces $\mathbf{d}^{(k)}$ with $\mathbf{g}^{(k)}$ as the search direction to ensure that the algorithm always proceeds in a descending direction. In all our implementations, we adopt a hybrid gradient descent scheme. Denote by $f^{(k)}$ the objective function value obtained in the k-th iteration and by $\angle(\mathbf{a}, \mathbf{b})$ the angle between vectors $\mathbf{a}$ and $\mathbf{b}$. If $(f^{(k)} - f^{(k-1)})/f^{(k)} < \theta_1$ and $\angle(\mathbf{g}^{(k)}, \mathbf{d}^{(k)}) < \theta_2$, we use $-\mathbf{d}^{(k)}$ as the descending direction, and $-\mathbf{g}^{(k)}$ otherwise. In all our implementations, we set $\theta_1 = 0.01$ and $\theta_2 = 5\pi/12$. It should be noted that, with the descending condition, the global convergence property also holds for conjugate gradient descent.

We stated in the previous section that the solution $\mathbf{w}^*$ has at most min(N, J) non-zero elements. We exploit this property to speed up the implementation. Note in (3.4) that if $v_j = 0$, then the gradient is zero on the j-th element, and $v_j$ will remain zero thereafter. Hence, if some elements of $\mathbf{v}$ are extremely small, the corresponding features can be eliminated from further consideration with a negligible impact on the subsequent iterations and on the final solution. In all our implementations, the criterion for eliminating small-valued weights is $w_j < 10^{-10} \|\mathbf{w}\|_\infty$, where $\|\mathbf{w}\|_\infty = \max_j \{w_j\}$.
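The hybrid direction rule just described can be sketched as follows. This is an illustration under our own assumptions: we read the progress test as a relative-decrease test with an absolute value, and we guard the angle computation numerically:

```python
import numpy as np

def fr_direction(g, g_prev, d_prev):
    """Fletcher-Reeves conjugate direction, Eq. (3.10)."""
    beta = (g @ g) / (g_prev @ g_prev)
    return g + beta * d_prev

def hybrid_direction(g, g_prev, d_prev, f, f_prev, theta1=0.01, theta2=5 * np.pi / 12):
    """Hybrid rule: use the conjugate direction only when progress has slowed
    (relative change in f below theta1) and the angle between g and d stays
    below theta2; otherwise fall back to the plain gradient."""
    d = fr_direction(g, g_prev, d_prev)
    cos_angle = (g @ d) / (np.linalg.norm(g) * np.linalg.norm(d) + 1e-300)
    if abs(f - f_prev) / abs(f) < theta1 and np.arccos(np.clip(cos_angle, -1, 1)) < theta2:
        return d
    return g
```

The fallback to `g` also enforces the descending condition when the conjugate direction is no longer a descent direction.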
3.2.2 Computational Complexity

With conjugate gradient descent, the flops needed to compute the gradient in each iteration are O(NJ), and the memory required is O(N + J), where N is the sample size and J the data dimensionality. The pseudo-code of the DGM algorithm is given below.

DGM Algorithm
1. Initialize $\mathbf{v}^{(0)} = \mathbf{1}/(2J)$, $b^{(0)} = 0$, $k = 0$, the stopping criterion $\delta$, and the parameters $\theta_1, \theta_2$
2. $\bar{\mathbf{v}}^{(0)} = [(\mathbf{v}^{(0)})^T, b^{(0)}]^T$
3. Compute $f^{(0)}$ using Eq. (3.3)
4. Repeat
   a. $k = k + 1$
   b. Compute $\mathbf{g}^{(k)}$ using Eqs. (3.8)-(3.9)
   c. If $k > 1$ and $f^{(k)} - f^{(k-1)} < \theta_1$ then
         compute $\mathbf{d}^{(k)}$ using Eq. (3.10);
         if $\angle(\mathbf{g}^{(k)}, \mathbf{d}^{(k)}) \ge \theta_2$ then $\mathbf{d}^{(k)} = \mathbf{g}^{(k)}$
      Else $\mathbf{d}^{(k)} = \mathbf{g}^{(k)}$
      End if
   d. Update $\bar{\mathbf{v}}^{(k)} = \bar{\mathbf{v}}^{(k-1)} - \eta^{(k)} \mathbf{d}^{(k)}$, where $\eta^{(k)}$ is determined via line search
   e. If $v_j^{(k)} < 10^{-5} \|\mathbf{v}^{(k)}\|_\infty$ for some $j \in 2[\mathfrak{I}]$, then set $v_j^{(k)} = 0$
5. Until $|f^{(k)} - f^{(k-1)}| < \delta$
6. $\mathbf{w}^{(1)} = [(v_1^{(k)})^2, \ldots, (v_J^{(k)})^2]^T$
7. $\mathbf{w}^{(2)} = [(v_{J+1}^{(k)})^2, \ldots, (v_{2J}^{(k)})^2]^T$
8. $\mathbf{w} = \mathbf{w}^{(1)} - \mathbf{w}^{(2)}$
9. $b = v_{2J+1}^{(k)}$

3.3 Numerical experiments

We present in the following some numerical experiments comparing DGM-ℓ1SVM with a recent state-of-the-art method, namely the generalized LPNewton family of algorithms proposed in (Fung and Mangasarian, 2004; Mangasarian, 2006). Indeed, the method described in (Mangasarian, 2006) is only a special case of the one proposed by (Fung and Mangasarian, 2004), where the penalty parameter α is fixed to one. It was stated in (Mangasarian, 2006) that this value of α leads to an exact solution of the SVM problem. This method has been tested on a wide variety of data sets and compared with other methods, including standard software packages (Fung and Mangasarian, 2004; Mangasarian, 2006).

3.3.1 Experiment Setup

Each algorithm has been applied to eight datasets using a specified set of λ values.
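Putting the pieces together, a minimal Python sketch of the DGM algorithm above. A plain backtracking line search stands in for the backtracking-plus-golden-section scheme of Section 3.2, so this is an illustration rather than the thesis implementation (which was written in Matlab):

```python
import numpy as np

def dgm_l1svm(X, y, lam, h=1e-3, delta=1e-8, max_iter=2000):
    """Sketch of DGM for l1-SVM in the primal: augment the data
    (x_tilde = [x, -x]), substitute w_j = v_j**2, run gradient descent
    with backtracking, eliminate tiny weights, then recover (w, b)."""
    N, J = X.shape
    Xt = np.hstack([X, -X])                      # augmented (N, 2J) matrix
    v = np.full(2 * J, 1.0 / (2 * J))            # step 1: nonzero initial point
    b = 0.0

    def fval(v, b):                              # objective (3.3) with Huber loss (3.7)
        ya = y * (Xt @ (v ** 2) + b)
        loss = np.where(ya > 1 + h, 0.0,
               np.where(ya < 1 - h, 1.0 - ya, (1 + h - ya) ** 2 / (4 * h)))
        return loss.mean() + lam * np.sum(v ** 2)

    def grads(v, b):                             # gradients (3.8)-(3.9)
        ya = y * (Xt @ (v ** 2) + b)
        dl = np.where(ya > 1 + h, 0.0,
             np.where(ya < 1 - h, -y, -y * (1 + h - ya) / (2 * h)))
        return 2.0 * ((Xt.T @ dl) / N + lam) * v, dl.mean()

    f_prev = fval(v, b)
    for _ in range(max_iter):
        gv, gb = grads(v, b)
        eta = 1.0                                # backtracking line search
        while fval(v - eta * gv, b - eta * gb) > f_prev and eta > 1e-12:
            eta *= 0.5
        v, b = v - eta * gv, b - eta * gb
        v[np.abs(v) < 1e-5 * np.abs(v).max()] = 0.0   # step 4e: drop tiny weights
        f_new = fval(v, b)
        if abs(f_prev - f_new) < delta:          # step 5: stopping criterion
            break
        f_prev = f_new
    return v[:J] ** 2 - v[J:] ** 2, b            # steps 6-9: recover w and b
```

On a small linearly separable toy problem this sketch recovers a separating hyperplane; the hybrid conjugate direction of Section 3.2.1 is omitted here for brevity.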
Each algorithm is stopped when the achieved objective function is within a desired precision of the optimal solution. However, only a locally optimal solution may be achieved, which makes a comparison in terms of CPU time unbalanced in this case. In order to make a fair comparison, in our experiments, for every dataset and λ value, we first ran both algorithms to within $10^{-6}$ precision. At the end of this stage, each algorithm provided one solution $(\mathbf{w}^*, b^*)$. Obviously, the better solution is the one with the minimal value of the objective function of the original SVM problem to be minimized (Eq. 3.1). Then, we set the minimal cost achieved over both algorithms as the target value and ran each algorithm until the achieved objective function was within $10^{-6}$ precision of this target cost. However, it is possible that an algorithm diverges and never achieves the desired target cost. To overcome this problem, the maximum number of iterations for each λ value was fixed at $5 \times 10^{3}$. The CPU time consumed to achieve the target cost was then recorded and compared. By using this experimental protocol, we verified that the solution obtained by DGM-ℓ1SVM was, as proved theoretically, a global minimizer. The LPNewton algorithm was programmed in Matlab as provided by (Fung and Mangasarian, 2004). For a fair comparison, we also developed DGM-ℓ1SVM in Matlab. The LPNewton algorithm requires the specification of many parameters, including the regularization parameter. It is worthwhile to note that for DGM-ℓ1SVM the only requisite is to specify the regularization parameter λ, since the parameter h must simply be set very small (here we take $h = 10^{-8}$) in order to guarantee the same sparse solution as would be obtained with a hinge loss function (Chapelle, 2007).
In our experiments, the values of λ (or equivalently $1/\nu$ in the LPNewton algorithm) are taken in the range $[2^{-7}, 2^{7}]$, $\varepsilon = 10^{-1}$, and δ belongs to the interval $[10^{-3}, 10^{3}]$, as suggested by (Fung and Mangasarian, 2004). For the parameter α, it must be noted that we found empirically that this method performed poorly in the special case α = 1. For that reason, we opted for $\alpha = 10^{3}$, as also suggested by (Fung and Mangasarian, 2004). All experiments were performed on a personal computer with an Intel Core 2, 2.26 GHz CPU, 1.98 GB of memory, and a Windows operating system.

3.3.2 Experimental results

We have compared the two algorithms on eight medium and large-scale data sets with feature dimensionality ranging from 1,000 to 44,932. Seven among them are cancer microarray data: Colon, Leukemia, Internet Ads. (Koh et al., 2007; Lee et al., 2006), prostate cancer (Stephenson et al., 2005), GSE4922 (Ivshina et al., 2006), Arcene (Guyon et al., 2005), ETABM77 (Buyse et al., 2006). The Linear data set is an artificially generated binary classification problem, with each class having 200 samples characterized by $10^{4}$ features. The first 500 features are drawn from two normal distributions, N(−1, 1) and N(1, 1), depending on the class label. The remaining features are drawn from the standard normal distribution and thus provide no discriminant information. The Internet Ads. data set has a sparse data matrix where only a few features have non-zero values, whereas all other datasets have a dense data matrix. A summary of the data is given in Table 3.1. For each dataset, standardization was performed on the data matrix to reduce the effect of mean shift in microarray profiling.

Table 3.1 Summary of datasets

Dataset            No. of features    No. of samples
Colon cancer       2000               62
Leukemia           7129               72
Internet Ads.      1430               2359
Prostate cancer    22291              79
TABM77             1145               291
GSE4922            44932              249
Arcene             10000              200
Linear             10000              400

We applied the two algorithms to each dataset and recorded in Table 3.2 the total running time summed over fifteen λ values uniformly spaced on a logarithmic scale over the interval $[2^{-7}, 2^{7}]$. Indeed, in practical applications the regularization parameter λ is usually estimated through a cross-validation procedure; hence, the total running time summed over all candidate λ values is an important criterion for evaluating an algorithm. We also plot the CPU time and the corresponding precision in terms of cost for each λ value, as shown in Figures 3.1 and 3.2. It can be observed that:
1. Figure 3.1 shows that DGM-ℓ1SVM outperforms the LPNewton algorithm for all λ values on all datasets. The overall CPU times reported in Table 3.2 confirm this result.
2. The precision and CPU times plotted in Figures 3.1 and 3.2 show that the minimal cost value is always achieved by DGM-ℓ1SVM, which is consistent with our theoretically supported claim that it provides a global minimum.
3. Except for the Linear data set, when λ is large the LPNewton algorithm fails to converge to the target optimal cost, which explains the generally poor precision obtained when $\lambda > 2^{3}$, as shown in Figure 3.2.
4. DGM-ℓ1SVM always converges in finite time to a solution corresponding to the minimal cost, whatever the value of λ.
To further demonstrate the effectiveness of our approach, we also report in Table 3.3 the overall summed CPU time for only the eleven λ values in $[2^{-7}, 2^{3}]$ for which both methods converge (within $5 \times 10^{3}$ iterations). These results confirm that the proposed approach outperforms the LPNewton approach even in the cases where the latter converges to a finite solution. One possible explanation is that the solution provided by DGM-ℓ1SVM is closer to the optimum than the one obtained by the LPNewton approach.

Tab. 3.2.
CPU time (in seconds) of the two algorithms on the eight data sets for all λ values. The algorithm stops when the achieved objective function is within $10^{-6}$ precision of the target cost.

Data/Method    Colon   Leuk.   Internet Ads.   Prostate   ETABM77   GSE4922   Arcene   Linear
DGM            4.1     11      57.7            40.3       21.3      497.9     2149     86.8
LPNewton       317     806     3012            2147       1105.4    11532     41631    5051.1

Tab. 3.3. CPU time (in seconds) of the two algorithms on the eight data sets for only eleven λ values. The algorithm stops when the achieved objective function is within $10^{-6}$ precision of the target cost.

Data/Method    Colon   Leuk.   Internet Ads.   Prostate   ETABM77   GSE4922   Arcene
DGM            3.6     10      50.5            36.2       20.3      462.2     1794.6
LPNewton       114.8   132     766.1           438.5      366.9     2865.9    2228.6

Fig. 3.1. Running time (in seconds) of DGM-ℓ1SVM and LPNewton on eight benchmark data sets using different λ values.

Fig. 3.2. Precision of DGM-ℓ1SVM and LPNewton on eight benchmark data sets using different λ values.

3.4 Conclusion

We have proposed in this chapter an efficient approach to solve the ℓ1-SVM problem in the primal. We have shown that the proposed method, though simple, performs very well in practical situations. The basic idea is to take advantage of the global solution optimality that can be achieved using gradient descent techniques. First, the hinge loss function is replaced by its Huber loss approximation to overcome its non-differentiability. Then, the initial convex optimization problem is transformed into an unconstrained non-convex problem for which, via gradient descent, reaching a globally optimal solution is guaranteed.
We have conducted large-scale numerical experiments to demonstrate the theoretical claim and to establish the computational efficiency over a well-known state-of-the-art method. High data dimensionality, however, is not the only problem faced in cancer applications. Other major issues, such as data heterogeneity, uncertainty and noise, can be encountered jointly with the high-dimensionality problem. Therefore, more efficient methods are needed to cope simultaneously with all the problems stated above. This problematic is our subject of interest in the next chapters, where we attempt to develop appropriate approaches capable of handling such problems.

CHAPTER 4 - Summary
Towards a Unified Principle for Reasoning about Heterogeneous Data

For a good understanding of a process's behavior, human reasoning usually deals with incomplete and heterogeneous knowledge. Consequently, appropriate methods for representing the process with partial knowledge are needed. The most widely used representation is the purely quantitative one, which assumes complete exactness of the information. However, a real number carries an infinite amount of precision, whereas human knowledge is finite and discrete. It is therefore necessary to use data represented by symbolic values to fit human perception. Incomplete knowledge about the data can be represented in different ways: symbolic intervals or qualitative values. Consequently, the development of an automatic mechanism for reasoning about the data is confronted with this multiplicity of possible representations. In this chapter we address one of the main difficulties encountered in data analysis tasks: the diversity of information types.
Such information is represented by qualitative data, nominal or ordinal, mixed with quantitative and interval data. Our objective is to propose a unified principle for establishing various reasoning mechanisms that simultaneously use three types of data: purely quantitative, symbolic interval and qualitative. Many situations that lead to well-conditioned algorithms for quantitative data become very complex when some of the information is in qualitative form. In a non-exhaustive list, we can mention rule-based deduction, classification, clustering and dimensionality reduction. To overcome this problem, a classical approach would be to reason about each type of data separately in order to derive partial decisions. However, this raises another serious problem, similar to our initial one, related to how such partial decisions should be integrated in order to end up with a global decision for the whole data set. Another interesting approach would be to unify the different heterogeneous spaces into one homogeneous space, and then to reason in a unified way about the whole data set in order to take the appropriate decision. To avoid any kind of distortion and/or loss of information, the space unification process must be carried out carefully for each type of data. To this end, we introduce here a unified principle for reasoning about heterogeneous data, called SMSP for Simultaneous Mapping for Single Processing. The principle is initially based on an appropriate, simultaneous mapping of the heterogeneous data into a unified space. This mapping can be obtained by using a characteristic function for each type of data to bring them into a homogeneous space.
These functions can be designed in such a way that they express a relative measure, for example the degree of adequacy (or typicality) of each value of a variable with respect to existing partitions. For instance, within the framework of fuzzy set theory, this measure is technically synonymous with a membership measure, which is a real number in the unit interval I = [0, 1]. When a relative measure other than the adequacy to each class must be considered, other alternative solutions can be envisaged. One possible solution is to use the concept of a kernel function (Atkeson et al., 1997), popularized in the framework of statistical learning theory, to design characteristic functions suited to each type of data. Once these appropriate functions have been chosen and all the data are represented in a homogeneous space, a single processing can be carried out using a single reasoning mechanism. In order to take membership uncertainty into account, the SMSP principle is proposed here within the framework of fuzzy set theory. Once suitable membership functions characterizing the adequacy to each class have been chosen according to the variable types, a fuzzy partition of the variables can be built from the empirical data. As shown in this chapter, each individual of the initial database, described by m variables of several types (qualitative, quantitative, interval), is represented by m membership degrees, i.e. m numbers in the unit interval. The transformed data are thus included in a homogeneous space isomorphic to a unit hypercube. Consequently, a single, simple fuzzy reasoning mechanism can be used to reason about the resulting data, whatever their initial type.
It will be demonstrated in the following chapters that, using this principle, it is possible to perform a variety of data analysis tasks (classification, dimensionality reduction, clustering, ...).

CHAPTER 4
Towards a Unified Principle for Reasoning about Heterogeneous Data: A Fuzzy Logic Framework

For a good understanding of any process behavior, human reasoning usually deals with incomplete and heterogeneous knowledge. Therefore, appropriate methods for representing the process with partial knowledge are required. The most used representation is the purely quantitative one, which assumes complete exactness of the information. Taken as it appears, a real number contains an infinite amount of precision, whereas human knowledge is finite and discrete. There is thus a need to use data represented by symbolic values to fit human perception. Incomplete knowledge about the data can be represented in different ways: symbolic intervals or qualitative values. Thus, the development of an automatic mechanism for reasoning about data is faced with this multiplicity of possible representations. We address here one of the main difficulties encountered in data analysis tasks: the diversity of information types. Such information is given by qualitative data, which can be nominal or ordinal, mixed with quantitative and interval data. Our focus is to propose a unified principle for establishing various reasoning mechanisms that simultaneously use three types of data: purely quantitative, symbolic interval and purely qualitative modalities. Many situations leading to well-conditioned algorithms for quantitative information become very complex whenever some of the data are given in qualitative form.
In a non-exhaustive list, we can mention rule-based deduction, classification, clustering and dimensionality reduction. Although the problem of representation multiplicity has been addressed within the machine learning framework in some works (Michalski and Stepp, 1980; Mohri and Hidehiko, 1994; Hu et al., 2007), no standard principle has been proposed in the literature to handle heterogeneous data in a unified way. The proposed methods use distance and information-content measures, respectively, to process quantitative and qualitative data separately in dimension reduction tasks (Kira and Rendell, 1992a; Dash and Liu, 2003), whereas a Hamming distance is usually used to handle qualitative data in classification and clustering tasks (Aha, 1989; Aha, 1992; Kononenko, 1994). Other approaches are originally designed to process only quantitative data, and therefore arbitrary transformations of qualitative data into a quantitative space are proposed without taking their nature in the original space into account (Cover and Hart, 1967; Kira and Rendell, 1992a; Weston et al., 2001). For example, the feature color can take values in the discrete unordered set {red, black, green, white}. These values may be transformed to the quantitative values 1, 2, 3 and 4, respectively; however, we could equally choose to transform them to 4, 1, 2 and 3. This can represent a potential source of information loss. Conversely, the transformation of quantitative values into qualitative objects by discretizing the quantitative value domain into several intervals (Hall, 2000; Liu et al., 2002) also introduces distortion and information loss with respect to the original data, since objects in the same interval are labeled with the same qualitative value.
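The information loss caused by arbitrary integer codings of nominal modalities can be made concrete with the color example above (the one-hot alternative shown below is a common remedy, not a method proposed in the cited works):

```python
import numpy as np

colors = ["red", "black", "green", "white"]

# Two arbitrary integer codings impose an order that does not exist in the data:
coding_a = {c: i for i, c in enumerate(colors, start=1)}   # red=1, ..., white=4
coding_b = dict(zip(colors, [4, 1, 2, 3]))                 # another valid choice

dist_a = abs(coding_a["red"] - coding_a["white"])          # 3 under coding A
dist_b = abs(coding_b["red"] - coding_b["white"])          # 1 under coding B

# A one-hot encoding keeps every pair of distinct modalities equidistant:
onehot = {c: np.eye(len(colors))[i] for i, c in enumerate(colors)}
d1 = np.linalg.norm(onehot["red"] - onehot["white"])
d2 = np.linalg.norm(onehot["black"] - onehot["green"])
```

The "distance" between red and white is 3 under one coding and 1 under the other, although the modalities are unordered; any distance-based learner is therefore sensitive to an arbitrary choice.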
Although extensive studies have been performed to process interval-type data within the Symbolic Data Analysis framework (Bock and Diday, 2000), they have generally focused on clustering tasks (Gowda and Diday, 1992; De Carvalho et al., 2010). Indeed, no standard principle has been proposed in the literature to handle heterogeneous data in a unified way and, furthermore, to combine in a fully adequate way the processing of symbolic intervals simultaneously with quantitative and qualitative data for different analysis purposes. In this chapter we present a general principle, introduced here as "Simultaneous Mapping for Single Processing" (SMSP), which enables reasoning in a unified way about heterogeneous data for several data analysis purposes (Hedjazi et al., 2010a; Hedjazi et al., 2011a). The only requisite is to define characteristic functions that characterize a relative measure based on the available knowledge about each feature. Once these functions are chosen appropriately, the initial heterogeneous space, where the information is of mixed nature, is transformed into a homogeneous space. Consequently, a single reasoning mechanism can be used to reason about the resulting data, whatever its initial type. We introduce below this SMSP principle, and an example of simultaneous mapping of mixed features into a common space is presented within a fuzzy logic framework.

4.1 Simultaneous mapping for single processing principle

Many learning problems usually involve data of mixed type, characterized within different heterogeneous spaces. The lack of analogy between such spaces makes the reasoning task of extracting reliable knowledge rather complex. To overcome this issue, due mainly to the heterogeneity of the spaces, one typical approach is to reason about each type of data separately and derive separate partial decisions.
However, this raises another issue as serious as the initial one: how to integrate such partial decisions to end up with a single global decision for the whole data. Another interesting approach is to unify the different heterogeneous spaces into one homogeneous space and then reason in a unified way about the whole data to take the appropriate decision. To avoid any type of distortion and/or information loss, the space-unification process should be performed appropriately for each type of data. To this aim, we introduce here a unified principle for reasoning about heterogeneous data, referred to as Simultaneous Mapping for Single Processing (SMSP). The principle is based initially on an appropriate simultaneous mapping of heterogeneous data into a unified space. This mapping can be obtained by using a characteristic function for each type of data to bring them into a homogeneous space. These functions can be designed in such a way that they express a relative measure, for example the appropriateness (adequacy, typicality) of each feature value of a pattern to the existing partitions. For instance, in the fuzzy set theory framework, this measure is technically synonymous with the term membership measure, which is a number of the real unit interval I = [0,1]. When a relative measure other than the pattern's appropriateness to each class is considered, other alternative solutions can be envisaged. One possible solution is to use the kernel function concept (Atkeson et al., 1997), extensively studied in statistical learning theory, to design suitable characteristic functions for each type of data. Once suitable functions are chosen and all data are represented in a homogeneous space, a single processing can be performed using a unique reasoning mechanism. The general concept of the SMSP principle is illustrated in Figure 4.1.
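As a concrete illustration of the principle (a minimal sketch, not the thesis's implementation), the following Python fragment maps one quantitative, one qualitative and one interval feature into [0,1] membership values through per-type characteristic functions, and then applies a single aggregation to the resulting homogeneous vector. The three functions and all numeric values are illustrative placeholders.

```python
# Sketch of the SMSP idea: map heterogeneous feature values into [0, 1]
# via per-type characteristic functions, then process them uniformly.
# The three functions below are illustrative placeholders, not the
# thesis's exact membership models.
import math

def f_quantitative(x, mean, std):
    """Gaussian-like adequacy of a real value to a class prototype."""
    return math.exp(-((x - mean) ** 2) / (2 * std ** 2))

def f_qualitative(x, freq):
    """Adequacy of a modality = its observed frequency in the class."""
    return freq.get(x, 0.0)

def f_interval(x, proto):
    """Jaccard overlap between a sample interval and a class interval."""
    lo, hi = max(x[0], proto[0]), min(x[1], proto[1])
    inter = max(0.0, hi - lo)
    union = max(x[1], proto[1]) - min(x[0], proto[0])
    return inter / union if union > 0 else 0.0

# One mixed-type pattern mapped into the homogeneous space (hypercube):
mdv = [
    f_quantitative(0.17, mean=0.18, std=0.11),
    f_qualitative("red", {"red": 3 / 7, "yellow": 2 / 7}),
    f_interval((9.5, 10.5), (9.0, 11.0)),
]
# Single processing: one aggregation works for every original type.
adequacy = sum(mdv) / len(mdv)
print([round(m, 3) for m in mdv], round(adequacy, 3))
```

Whatever their original space, the three features are now commensurable numbers of the unit interval, so a single reasoning mechanism (here, a plain average) can process them.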
In order to take membership uncertainty into account, the SMSP principle is proposed in the present work within the fuzzy set theory framework to reason about heterogeneous data. Once suitable membership functions that characterize the adequacy to each class are chosen according to the feature types, a fuzzy partition of the features can be performed based on empirical data. As will be shown hereafter, each pattern of the initial data, described by m features of possibly several types (qualitative, quantitative, symbolic intervals), will be represented by m membership degrees, i.e. m numbers of the unit interval; the transformed dataset is therefore included in a homogeneous space isomorphic to a unit hypercube. Thus, a unique and simple fuzzy reasoning mechanism can be used to reason about the resulting data, whatever their original type. It will be shown in the next chapters that, based on this principle, it is possible to perform a wide variety of analyses (classification, dimensionality reduction, clustering, ...).

Fig. 4.1. SMSP principle: features living in a quantitative space, a qualitative space and an interval space are simultaneously mapped, through characteristic functions f_n(x,.), f_Q(x,.) and f_I(x,.), into a homogeneous space (hypercube), where a single processing (classification, feature selection, clustering, ...) is performed.

4.2 Homogeneous space of features

Basically, we consider the three above-mentioned types of data:
(a) Quantitative features: real numbers that can be normalized into the unit interval [0,1].
(b) Symbolic intervals of the real line, with no restriction on relative position (regular or overlapping).
(c) Qualitative features, which can be ordinal or nominal modalities.

According to (Dubois and Prade, 1997), three main semantics of fuzzy membership functions can be distinguished in the fuzzy literature.
Among them, we find similarity (or distance) and uncertainty, widely used in fuzzy pattern recognition and applied to the estimation of membership functions from data (Medasani and Kim, 1998). For instance, Bezdek (Bezdek, 1981) makes use of the similarity semantics to define a relation r between two objects x^(1) and x^(2) as fuzzy (i.e. r ∈ [0,1]) if r = ρ(x^(1), x^(2)), where ρ is a metric (distance measure); otherwise it is considered crisp (i.e. r ∈ {0,1}). Regarding the uncertainty semantics, it is reported in (Dubois and Prade, 1997) that uncertainty is often measured in terms of the frequency of observed situations in a random experiment. However, this approach leads to probability theory when the repeated observations are precise. In that sense, probability assignments to the elements of a referential set U can be viewed as special membership functions such that the sum of the membership grades is one (Dubois and Prade, 1997). In our work, both semantics have been used to define an adequate membership function according to the type of data. However, in mixed data types the features are images of unrelated concepts. To bypass this difficulty, we take advantage of the commensurability assumption in the framework of fuzzy logic (Dubois and Prade, 1997) to end up with a unique space (unit hypercube) where all the features are represented by their memberships to a reference fuzzy partition. Therefore, a single processing of their membership degrees for data analysis purposes is straightforward, based on aggregation in the resulting space. Let D = {(x^(n), C_k)}_{n=1..N} ⊂ X × C be a dataset, where x^(n) = [x_1^(n), x_2^(n), ..., x_m^(n)] is the n-th pattern (item) and N is the number of patterns.
Each pattern is represented by m features, possibly of different types (quantitative, qualitative or symbolic interval), and C_k is the class label assigned to each pattern in the pre-established partitions, k = 1, 2, ..., l. Based on an appropriate data-driven process using the training dataset D, to each feature correspond l fuzzy sets representing the membership functions to each class. Namely, let {mff_1^i, mff_2^i, ..., mff_l^i} be the l fuzzy sets that form a fuzzy partition for the i-th feature. The fuzzy set mff_k^i is defined by its membership function µ_k^i on the range X_i of the i-th feature, depending on a parameter θ_k^i, as follows:

µ_k^i(x_i) = f_i(x_i, θ_k^i); k = 1, 2, ..., l (4.1)

where θ_k^i represents the i-th prototype of class C_k and can be estimated from the i-th feature values of the patterns belonging to class C_k in the training dataset D. For each feature type, a particular learning process can be adopted to estimate its membership functions from data. We present next how this fuzzy partition is performed here, using a particular membership function for each type of feature, and we support it by the following toy example.

Example: Consider the set of samples shown in Table 4.1. This set is classified into two classes {C1, C2} and is described by three types of features: x1 (quantitative feature), x2 (interval feature) and x3 (qualitative feature).

Tab. 4.1.
Group of patterns characterized by three mixed feature types

Sample | x1 (quantitative) | x2 (interval)  | x3 (qualitative) | Class
x(1)   | 5                 | [1.5, 2.5]     | Red              | C1
x(2)   | 5.22              | [3.5, 4.5]     | Yellow           | C1
x(3)   | 6.78              | [5.5, 6.5]     | Red              | C1
x(4)   | 7                 | [7.5, 8.5]     | Black            | C1
x(5)   | 6                 | [9.5, 10.5]    | Yellow           | C1
x(6)   | 5.5               | [11.5, 12.5]   | Red              | C1
x(7)   | 7                 | [13.5, 14.5]   | Black            | C1
x(8)   | 9                 | [7.5, 8.5]     | Black            | C2
x(9)   | 8                 | [9.5, 10.5]    | White            | C2
x(10)  | 10.5              | [11.5, 12.5]   | Red              | C2
x(11)  | 8.5               | [13.5, 14.5]   | Black            | C2
x(12)  | 9.5               | [15.5, 16.5]   | Black            | C2
x(13)  | 10                | [17.5, 18.5]   | Red              | C2
x(14)  | 11                | [19.5, 20.5]   | Black            | C2

4.3 Membership functions

4.3.1 Quantitative type features

It will generally be assumed that the universe of discourse of each quantitative feature is included in a compact interval; either the bounds of this interval are known, or they can be induced from the dataset. Therefore, without loss of information, its numerical values can be normalized within the interval [x_i^min, x_i^max]. This linear re-scaling of the feature into the interval [0,1] is performed by:

x_i = (x̂_i − x̂_i^min) / (x̂_i^max − x̂_i^min) (4.2)

where x̂_i is the i-th raw feature value and x_i is its normalized value. In the case of quantitative features, several membership functions proposed by (Aguado and Aguilar-Martin, 1999) can be used for µ_k^i(.). Among them we find:

a. Gaussian-like membership function

µ_k^i(x_i) = exp(−(x_i − φ_k^i)² / (2σ_i²)) (4.3)

where φ_k^i and σ_i are respectively the mean and the standard deviation of the i-th feature values, based on the samples belonging to class C_k. Therefore, the resulting prototype of class C_k is the mean vector of dimension m, noted φ_k = [φ_k^1, φ_k^2, ..., φ_k^m]. When too few samples are available, as in many real applications, the standard deviation vector σ = [σ_1, σ_2, ..., σ_m] may be estimated over all the training samples.

b.
Binomial membership function

µ_k^i(x_i) = (φ_k^i)^(x_i) (1 − φ_k^i)^(1 − x_i) (4.4)

where φ_k^i is the mean of the i-th feature values based on the samples belonging to class C_k.

c. Centered binomial membership function

µ_k^i(x_i | ϑ_k^i, φ_k^i) = (φ_k^i)^(1 − |x_i − ϑ_k^i|) (1 − φ_k^i)^(|x_i − ϑ_k^i|) (4.5)

where ϑ_k^i is a prototype for class C_k, and the parameter φ_k^i measures the proximity to the prototype, so that ∀x_i ≠ ϑ_k^i: µ_k^i(ϑ_k^i | ϑ_k^i, φ_k^i) ≥ µ_k^i(x_i | ϑ_k^i, φ_k^i), and for φ_1 ≤ φ_2 and ∀x_i ≠ ϑ_k^i we have the ordered memberships µ_k^i(x_i | ϑ_k^i, φ_2) ≥ µ_k^i(x_i | ϑ_k^i, φ_1).

An example of the resulting fuzzy partition for quantitative features using the Gaussian-like membership function is given below.

Example: If we consider the Gaussian-like membership function (4.3) for the quantitative feature x1, the obtained parameters of the membership functions with respect to the two classes, after normalization, are φ_1^1 = 0.1786, φ_2^1 = 0.7979 and σ = 0.1123.

4.3.2 Interval type features

To take into account various uncertainties (noise) and/or to reduce large datasets, the interval representation of data has seen widespread use in recent years (Billard, 2008). In this work, a fuzzy similarity measure is proposed to handle this type of feature in such a way that its symbolic nature is preserved. The membership function for interval-type features is taken as the similarity between the symbolic interval value x_i of the i-th feature and the interval ρ_k^i = [ρ_k^(i−), ρ_k^(i+)] representing class C_k:

µ_k^i(x_i) = S(x_i, ρ_k^i) (4.6)

Symbolic interval features are extensions of pure real data types, in that each feature may take an interval of values instead of a single value (Gowda and Diday, 1992).
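The quantitative-feature partition can be sketched in a few lines of Python: normalize x1 from Table 4.1 with (4.2), estimate the per-class means, and evaluate the Gaussian-like membership (4.3). How σ is pooled is an assumption of this sketch; the thesis's quoted parameter values were obtained with its own estimation procedure and may differ slightly.

```python
# Sketch of the quantitative-feature fuzzy partition: normalize x1 from
# Table 4.1 with (4.2), estimate per-class means, and evaluate the
# Gaussian-like membership (4.3). Pooling sigma over all samples is an
# assumption; the thesis's quoted parameters may differ slightly.
import math
import statistics

x1 = [5, 5.22, 6.78, 7, 6, 5.5, 7, 9, 8, 10.5, 8.5, 9.5, 10, 11]
labels = ["C1"] * 7 + ["C2"] * 7

lo, hi = min(x1), max(x1)
xn = [(v - lo) / (hi - lo) for v in x1]          # eq. (4.2)

means = {c: statistics.mean([v for v, l in zip(xn, labels) if l == c])
         for c in ("C1", "C2")}
sigma = statistics.pstdev(xn)                    # pooled over all samples

def mu(x, c):
    """Gaussian-like membership (4.3) of a normalized value x to class c."""
    return math.exp(-((x - means[c]) ** 2) / (2 * sigma ** 2))

print(round(means["C1"], 4), round(means["C2"], 4))
print(round(mu(xn[4], "C1"), 3), round(mu(xn[4], "C2"), 3))  # sample x(5)
```

The class-C1 prototype reproduces the φ_1^1 = 0.1786 of the example above; sample x(5) obtains a much higher membership to C1 than to C2, as expected from Table 4.1.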
In this framework, the value of a quantity x is expressed as a closed interval [x−, x+] whenever only incomplete knowledge about it is available, representing the knowledge that x− ≤ x ≤ x+ (Kuipers, 1994).

Definition 4.1: Let us consider a universe of discourse V as a compact subset of the real line R, which can be continuous or discrete. Any fuzzy subset is defined by its membership function, X = µ_X(ξ), ξ ∈ V ⊂ R.

We denote by ϖ the measure of a fuzzy set; on a discrete universe, it is its scalar cardinality. Here the sigma-count ϖ[X] = Σ_{ξi ∈ V} µ_X(ξi) has been chosen; its extension to a continuous universe is ϖ[X] = ∫_V µ_X(ξ) dξ.

Let us define a fuzzy interval Ã = µ_Ã(ξ) as a fuzzy set such that ∀ξ ∉ A: µ_Ã(ξ) = 0, where A is a crisp interval called the base of Ã. It must be noticed that for a non-fuzzy interval X, its measure is given by its length, ϖ[X] = upper_bound(X) − lower_bound(X).

Given two crisp intervals A = [a−, a+] and B = [b−, b+], let us define their distance ∂ as:

∂[A, B] = max(0, max{a−, b−} − min{a+, b+}) (4.7)

Then the similarity measure between two fuzzy intervals Ã and B̃ is defined in the discrete case by:

S(Ã, B̃) = (1/2) [ Σ_V µ_(Ã∩B̃)(ξi) / Σ_V µ_(Ã∪B̃)(ξi) + 1 − ∂[A, B] / ϖ[V] ] (4.8)

and its extension to the continuous case is:

S(Ã, B̃) = (1/2) [ ∫_V µ_(Ã∩B̃)(ξ) dξ / ∫_V µ_(Ã∪B̃)(ξ) dξ + 1 − ∂[A, B] / ϖ[V] ] (4.9)

This similarity measure combines two terms. The first term corresponds to the well-known Jaccard similarity measure (Jaccard, 1908), which computes the similarity when the intervals overlap; we add to it a second term which takes the similarity into account when the intervals do not overlap.
It shall be remarked that if only crisp intervals are considered, this similarity measure can be written as given in (Hedjazi et al., 2011b):

S(A, B) = (1/2) [ ϖ[A ∩ B] / ϖ[A ∪ B] + 1 − ∂[A, B] / ϖ[V] ] (4.10)

For the learning step, let us consider a class C_k having N_k samples. The parameters that characterize this class, for the interval-type features, are estimated by an appropriate learning procedure such that the class is represented by a vector whose components are intervals. The bounds of the interval of each component of this vector are given by the following arithmetic means:

ρ_k^(i−) = (1/N_k) Σ_{j=1..N_k} x_i^(j)−, and ρ_k^(i+) = (1/N_k) Σ_{j=1..N_k} x_i^(j)+ (4.11)

where x_i^(j)− is the i-th feature lower bound of the j-th sample and x_i^(j)+ is its upper bound. Therefore, class C_k is represented by the interval ρ_k^i = [ρ_k^(i−), ρ_k^(i+)], and its similarity to the i-th interval feature value of the n-th sample is given by S[x_i^(n), ρ_k^i] according to formulas (4.8), (4.9) or (4.10). Consequently, the resulting class prototype for the r interval features is given by the vector of intervals ρ_k = [ρ_k^1, ρ_k^2, ..., ρ_k^r]^T. For a better conditioning of magnitudes and to minimize processing time, a normalization within the interval [0,1] is also performed:

x_i^− = (x̂_i^− − x̂_i^(−min)) / (x̂_i^(+max) − x̂_i^(−min)), x_i^+ = (x̂_i^+ − x̂_i^(−min)) / (x̂_i^(+max) − x̂_i^(−min)) (4.12)

where x_i = [x_i^−, x_i^+] is the normalized interval value of x̂_i = [x̂_i^−, x̂_i^+]. Consequently, the domain V_i becomes the unit interval [0,1] for all features. This normalization does not introduce distortion, owing to the linearity of the normalizing transform. It is worthwhile to note here that the function S(A, B) fulfills the properties commonly used to characterize a similarity measure:
i. 0 ≤ S(A, B) ≤ 1;
ii. S(A, B) = 1 if and only if A equals B;
iii. S(A, B) = S(B, A).
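The crisp-interval similarity (4.10) and the class-prototype estimation (4.11) can be sketched as follows. The universe V is taken as [0,1] (its measure is 1), as after the normalization (4.12); the example intervals are made up for illustration.

```python
# Sketch of the crisp-interval similarity (4.10) and the class-prototype
# estimation (4.11). The universe V is assumed to be [0, 1] after the
# normalization (4.12), so its measure is 1.
def gap(a, b):
    """Distance (4.7): 0 when the intervals overlap, else the gap length."""
    return max(0.0, max(a[0], b[0]) - min(a[1], b[1]))

def similarity(a, b, v_measure=1.0):
    """Similarity (4.10): Jaccard term plus non-overlap penalty term."""
    inter = max(0.0, min(a[1], b[1]) - max(a[0], b[0]))
    union = max(a[1], b[1]) - min(a[0], b[0])
    jaccard = inter / union if union > 0 else 1.0
    return 0.5 * (jaccard + 1.0 - gap(a, b) / v_measure)

def class_prototype(intervals):
    """Prototype interval (4.11): mean lower and mean upper bounds."""
    n = len(intervals)
    return (sum(i[0] for i in intervals) / n,
            sum(i[1] for i in intervals) / n)

print(round(similarity((0.2, 0.5), (0.4, 0.7)), 3))       # overlapping -> 0.6
print(round(similarity((0.0, 0.1), (0.9, 1.0)), 3))       # disjoint -> 0.1
print(class_prototype([(0.1, 0.2), (0.3, 0.4)]))
```

Note how the second term keeps the measure informative even for disjoint intervals, where the Jaccard term alone would be identically zero, and how identical intervals reach the maximum S = 1 (property ii).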
Example: If we consider the interval feature x2 in the set of patterns described in Table 4.1, the resulting class parameters for the interval feature, after normalization, are ρ_1^2 = [0.3158, 0.3684] and ρ_2^2 = [0.6316, 0.6842].

4.3.3 Qualitative type features

In the qualitative case, the possible values of the i-th feature form a set of modalities:

D_i = {Q_1^i, ..., Q_j^i, ..., Q_(M_i)^i} (4.13)

Frequency is a quantity that has been used for measuring fuzzy set membership in several fuzzy applications (Dubois and Prade, 1997). Let Φ_k^(ij) be the frequency of modality Q_j^i for class C_k. The membership function of the qualitative feature x_i can then be specified as:

µ_k^i(x_i) = (Φ_k^(i1))^(q_i1) × ... × (Φ_k^(iM_i))^(q_iM_i) (4.14)

where q_ij = 1 if x_i = Q_j^i and q_ij = 0 if x_i ≠ Q_j^i.

Obviously, the class parameters are represented by Ω_k^i = [Φ_k^(i1), ..., Φ_k^(ij), ..., Φ_k^(iM_i)], and the resulting fuzzy partition in the qualitative-feature case is illustrated by the following example.

Example: The resulting parameters of the fuzzy partitions for the qualitative feature x3 in the set of samples given in Table 4.1 (for the modality ordering {Yellow, Red, Black, White}) are Ω_1^3 = [0.2857, 0.4286, 0.2857, 0] and Ω_2^3 = [0, 0.2857, 0.5714, 0.1429].

4.4 Common membership space

A consequence of the fuzzy partition described previously is the mapping of the different types of features from completely heterogeneous spaces into a common space, the membership space. Thus, starting from a v-dimensional quantitative space, a q-dimensional qualitative space and an r-dimensional interval space, the resulting membership space with respect to each class is m-dimensional (R^m), with m = v + q + r the total number of features. In the case of dichotomy problems, only one R^m space is necessary, as the other can be obtained by complementarity of membership.
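The qualitative partition of Section 4.3.3 reduces to counting modality frequencies per class: since exactly one exponent q_ij equals 1 in (4.14), the membership of an observed modality is simply its frequency in the class. The following sketch recomputes the Ω parameters for feature x3 of Table 4.1.

```python
# Sketch of the qualitative-feature partition (4.13)-(4.14): the class
# parameters are modality frequencies, and the membership of an observed
# modality reduces to its frequency in the class.
from collections import Counter

x3 = ["Red", "Yellow", "Red", "Black", "Yellow", "Red", "Black",   # C1
      "Black", "White", "Red", "Black", "Black", "Red", "Black"]   # C2
labels = ["C1"] * 7 + ["C2"] * 7

def class_frequencies(values, labels, cls):
    """Omega_k parameters: frequency of each modality within class cls."""
    in_class = [v for v, l in zip(values, labels) if l == cls]
    counts = Counter(in_class)
    return {m: counts[m] / len(in_class) for m in set(values)}

def mu(x, freq):
    """Membership (4.14): frequency of the observed modality, per eq. form."""
    return freq.get(x, 0.0)

omega1 = class_frequencies(x3, labels, "C1")
omega2 = class_frequencies(x3, labels, "C2")
print({m: round(f, 4) for m, f in sorted(omega1.items())})
print(round(mu("Yellow", omega1), 4), round(mu("Yellow", omega2), 4))
```

The computed frequencies reproduce the Ω_1^3 and Ω_2^3 components of the example above; the Yellow memberships (0.2857 to C1, 0 to C2) reappear as the third components of the MDVs of pattern x(5) in the next section.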
Definition 4.2: Membership Degree Vector. A Membership Degree Vector (MDV) of dimension m can be associated with a given pattern x^(n) for each class as follows:

U_n^ck = [µ_k^1(x_1^(n)), µ_k^2(x_2^(n)), ..., µ_k^m(x_m^(n))]^T; k = 1, 2, ..., l (4.15)

where µ_k^i(x_i^(n)) (i.e. µ_k^i(x_i = x_i^(n))) is the membership function of class C_k evaluated at the given value x_i^(n) of the i-th feature of pattern x^(n). If we consider the previous example, using Definition 4.2 two MDVs are obtained for the fifth pattern x^(5) with respect to its class C1 and the alternative class C2 as follows:

U_5^ck = [µ_k^1(x_1^(5) = 6), µ_k^2(x_2^(5) = [9.5, 10.5]), µ_k^3(x_3^(5) = Yellow)]^T; k = 1, 2

which yields

U_5^c1 = [0.6370, 0.4703, 0.2857]^T and U_5^c2 = [0.3000, 0.4208, 0.0000]^T

The MDV is an m-dimensional image of pattern x^(n) with respect to the considered class. All the components of the MDV are positive numbers in the unit interval [0,1]; therefore, U_n^ck can be considered as a discrete fuzzy subset, and the function ψ(U_n^ck) = Σ_i µ_k^i(x_i^(n)) represents its scalar cardinality (power or sigma-count) as defined in (Zwick et al., 1987) and (Wygralak, 2000). Once all the features are simultaneously mapped into a common space, they can henceforth be processed similarly, whether for classification, feature selection or clustering. We show in the next chapters the usefulness of the SMSP principle for performing these data analysis tasks.

4.5 Conclusion

In this chapter, a unified principle has been introduced to cope with the problem of data heterogeneity. This principle is based on a simultaneous mapping of data from initially heterogeneous spaces into a single homogeneous space using appropriate characteristic functions. Once the heterogeneous data are represented in a unified space, a single processing can be performed for various analysis purposes, such as machine learning tasks.
We considered here the three most used types of features, namely quantitative, interval and qualitative features. In the present work, this principle is proposed within the fuzzy set theory framework to reason about heterogeneous data. Once suitable membership functions that characterize the adequacy (typicality, appropriateness) of a pattern to each class are chosen according to the feature types, a fuzzy partition of the features can be performed based on empirical data. Two well-known semantics (the similarity and uncertainty semantics) have been adopted to define an adequate membership function according to the feature type: the first has been used for quantitative and interval data, whereas the latter has been adopted for qualitative data. We take advantage of the commensurability assumption in the framework of fuzzy logic to end up with a unique space (unit hypercube) where all the features are represented by their memberships to a reference fuzzy partition. We show in the next chapters that, by employing this principle within a fuzzy logic framework, a simple fuzzy reasoning mechanism suffices to perform several machine learning tasks such as classification, feature selection and clustering.

CHAPTER 5 – Summary: Supervised learning based on the SMSP principle

It is recognized in practice that most of the medical knowledge employed for decision making is usually expressed in the form of qualitative rules. This is the main reason why rule-based systems are so well accepted by practitioners. Fuzzy rule-based systems can be of particular interest, as they offer high transparency and interpretability while allowing the noisy, imprecise or incomplete information often present in many real-world problems to be handled.
The relationship between the classification result and the original variable is generally nonlinear and complex. However, if the original variable is appropriately "fuzzified", the relationship may be approximated by a linear function and a simple classifier may be used (Li and Wu, 2008). Recently, fuzzy (if-then) rule-based systems have been applied to classification problems in which non-fuzzy (numerical) input vectors must be assigned to one of the existing classes (Ishibuchi et al., 1992; Chiu, 1997; Abe and Thawonmas, 1997). However, this class of classifiers becomes unusable as soon as a high-dimensional problem and/or heterogeneous data are encountered. This case is frequent in cancer applications, which are our subject of interest. We first show in this chapter that a simple fuzzy rule-based classifier can be designed according to the SMSP principle introduced in the previous chapter to cope with data heterogeneity. Then, still based on the same principle, a feature-weighting approach is designed and integrated into the fuzzy classifier in order to adapt it to high-dimensional problems. In this work, each fuzzy set in the premise of each fuzzy if-then rule is weighted in order to characterize the importance of each proposition, and hence of the corresponding variable. To justify such an operation, the weight estimation process is based on the maximization of membership margins, so as to estimate a fuzzy weight for each variable in the membership space.
It is also shown that defining the objective function on the basis of the margin concept can efficiently reduce the computational complexity through the use of standard optimization techniques, which avoid a combinatorial heuristic search. An extension of the method to multi-class problems is also proposed. An extensive experimental study was carried out to demonstrate the efficiency of the proposed method on two groups of datasets, the first characterized by data heterogeneity and the second by high dimensionality. The method was compared with feature-weighting methods well known in the literature: Relief (Kira and Rendell, 1992a), I-Relief (Sun, 2007b) and Simba (Gilad-Bachrach et al., 2004). To ensure an unbiased comparison, the two popular classifiers k-NN (Cover and Hart, 1967) and SVM (Vapnik, 1998) were also used in addition to the fuzzy classifier we propose. The results obtained show that the proposed method brings significant improvements when combined with the fuzzy classifier. In particular, we observed that the proposed fuzzy weighting approach significantly improves the classifier's performance on almost all of the heterogeneous datasets. A significant performance gain is obtained by keeping only a few variables rather than the full set of original variables. For example, a gain of nearly 5% in performance is achieved by using only the first four variables instead of the nine original variables of the Ljubljana breast cancer prognosis dataset.
It was also found that the fuzzy weighting method provides results comparable to, or even better than, the other classical weighting methods on almost all of the heterogeneous datasets when the two other classifiers (k-NN and SVM) are used. For a more rigorous comparison between the three feature selection methods, a statistical analysis was also carried out. Regarding the experiments on the second group of datasets, characterized by the presence of a large number of irrelevant variables, the results provided by the proposed method are encouraging. In particular, it was found that, although the method has a low numerical complexity, it significantly reduces the number of genes needed to perform diagnosis and/or prognosis tasks. For example, on the prostate cancer dataset, characterized by the measured expression of 10,509 genes, the proposed method outperforms the other weighting methods by providing a minimal classification error of 5% with just 10 genes, using the fuzzy classifier. This result suggests that using the 10 selected genes instead of the full set of 10,509 genes achieves the maximal classification performance for the prognosis of this type of cancer.

CHAPTER 5: Supervised learning based on the SMSP principle

It is recognized in medical practice that most of the physicians' knowledge employed for decision making is usually expressed in the form of rules. This is the main reason why rule-based systems are very well accepted by medical practitioners. Fuzzy rule-based systems can be of particular interest, as they offer high transparency and comprehensive interpretability while allowing noisy, imprecise or incomplete information, often present in many real-world problems, to be dealt with.
They indeed provide a good trade-off between the empirical precision of traditional engineering techniques and high interpretability. Fuzzy rule-based systems have been widely used in control problems (Lee, 1990; Sugeno, 1997). From this point of view, fuzzy logic can be seen as more appropriate than other classical methods, which fail when the system model is high-dimensional and nonlinear. This is mainly due to its attractive properties, which enable imprecise and noisy data to be handled. Usually, the relationship between the classification result and the original feature is nonlinear and complicated. However, if the original feature is appropriately fuzzified, the relationship may be approximated by a linear function and a simple classifier may be used (Li and Wu, 2008). Recently, fuzzy rule-based systems have often been applied to classification problems where non-fuzzy (or numerical) input vectors have to be assigned to one of a given set of classes (Ishibuchi et al., 1992; Chiu, 1997; Abe and Thawonmas, 1997). However, this class of classifiers becomes impracticable whenever high-dimensional and/or heterogeneous problems have to be faced. This case commonly occurs in cancer applications, which are our subject of interest. Traditional fuzzy classifiers commonly rely on an arbitrary choice of the number of linguistic terms for the fuzzified features, which is not always possible, nor accurate enough, when a huge number of features is encountered. We first show in this chapter that a simple fuzzy rule-based classifier can be designed based on the previously introduced SMSP principle to deal with data heterogeneity. Then, based on the same principle, a feature-weighting approach is designed and integrated into the fuzzy rule-based classifier with the aim of making it scalable to high-dimensional problems.
Indeed, weighting fuzzy if-then rules to improve classification performance is a common practice in fuzzy rule-based classifier systems (Ishibuchi and Nakashima, 2001; Ishibuchi and Yamamoto, 2005; Jahromi and Taheri, 2008; Sanz et al., 2010). However, this weighting usually aims to characterize the importance of each fuzzy rule by a scalar weight. For example, Ishibuchi and Yamamoto (2005) proposed a heuristic automatic way to estimate the rule weights based on sample memberships to each class in a supervised context. In the present work, each antecedent fuzzy set in the fuzzy if-then rule is weighted to characterize the importance of each proposition, and therefore of the corresponding feature (Hedjazi et al., 2011c). To justify such an operation, the weight estimation process is based on membership margin maximization, so as to estimate a fuzzy weight for each feature in the membership space. As will be shown, the margin concept can efficiently decrease the computational complexity through the use of standard optimization techniques, avoiding a combinatorial search. Experiments on high- and low-dimensional datasets are performed to demonstrate that the proposed approach can significantly improve the performance of fuzzy rule-based as well as state-of-the-art classifiers, and can even outperform classical feature selection approaches. We start by describing the fuzzy rule-based classifier for mixed-type data, and then describe the integration of the weights into the classifier.

5.1 Fuzzy rule-based classifier for mixed-type data

In this section, we cast the problem of heterogeneous data classification as a reasoning problem in a common space based on the SMSP principle. Indeed, once the different types of features have been mapped into a common space, it is possible to establish a unified reasoning scheme for classification purposes.
This approach is based on using the fuzzy partitions resulting from the mapping described in Chapter 4 to establish a fuzzy inference engine. We describe next the fuzzy rule-based classifier for mixed-type data. We consider the following type of fuzzy if-then rules for an m-dimensional problem:

R_k: If x_1 is A_1 and x_2 is A_2 ... and x_m is A_m then x belongs to class C_k

where the antecedent fuzzy sets A_i correspond to the membership functions µ_k^i(x_i) for each class C_k defined in Section 4.3, according to the type of the i-th feature. It must be noticed here that the set of features used to evaluate each fuzzy if-then rule can be of mixed types (quantitative, qualitative or interval-valued). Then, the truth value of the consequent of each rule is determined by a fuzzy logic implication function which consists in a linear interpolation between a T-norm and a T-conorm. Finally, the sample is assigned to the class corresponding to the maximum membership, obtained using the following fuzzy inference engine:

R* = Arg max_{R_k} { α γ[µ_k^1(x_1), ..., µ_k^m(x_m)] + (1 − α) β[µ_k^1(x_1), ..., µ_k^m(x_m)] }; k = 1, ..., l

where γ and β are dual fuzzy aggregation functions (a T-norm and its dual T-conorm) that combine the memberships (given by the components of the MDV U_n^ck) of the feature values of a sample x^(n) = [x_1^(n), x_2^(n), ..., x_m^(n)] to a class C_k (Piera and Aguilar, 1991). The parameter α, called the exigency, allows the compensation between the union and intersection operators to be adjusted; it can be pre-specified by the user or estimated through cross-validation on training data. Without the unification of the feature space, this simple inference mechanism could not be applied, and the influence of the different types of features would not be equal. Such a classifier is referred to here as LAMDA (Aguilar and Lopez De Mantaras, 1982; Isaza et al., 2004; Hedjazi et al., 2009; Hedjazi et al., 2010b).
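The inference engine above can be sketched in a few lines, using min/max as the T-norm/T-conorm pair (one common choice; the thesis leaves the dual pair γ/β generic) and the MDVs of pattern x(5) from the example of Section 4.4 as input:

```python
# Sketch of the LAMDA-style inference engine: a linear interpolation
# (exigency alpha) between a T-norm (min) and its dual T-conorm (max)
# over the components of each class's MDV. The min/max pair and the
# alpha value are illustrative choices.
def global_adequacy(mdv, alpha):
    """alpha * T-norm + (1 - alpha) * T-conorm of the memberships."""
    return alpha * min(mdv) + (1 - alpha) * max(mdv)

def classify(mdvs, alpha=0.6):
    """Assign the sample to the class with maximal global adequacy."""
    scores = {c: global_adequacy(m, alpha) for c, m in mdvs.items()}
    return max(scores, key=scores.get), scores

# MDVs of pattern x(5) from the example of Section 4.4:
mdvs = {"C1": [0.6370, 0.4703, 0.2857],
        "C2": [0.3000, 0.4208, 0.0000]}
label, scores = classify(mdvs)
print(label, {c: round(s, 4) for c, s in scores.items()})  # -> C1
```

Because every MDV component is a number of [0,1] regardless of the original feature type, this single mechanism handles the quantitative, interval and qualitative features of x(5) at once.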
5.2 Weighted fuzzy rule-based classifier for mixed-type data

In many learning domains, potentially useful features for sample description are defined arbitrarily. Nevertheless, not all features have equal importance for the classification task; some of them can be irrelevant and can even hurt classification performance. We describe in this section how a feature-weighting process can easily be integrated into the previously described fuzzy rule-based classifier, through a weighted-fuzzy-rule concept, with the aim of improving its performance. The weighted fuzzy rule concept introduced here consists in weighting each proposition of the fuzzy rule to characterize the importance of each feature.

Definition 5.1: Weighted Fuzzy If-Then Rules (WFR). A weighted if-then rule is similar to a conventional rule, with the exception that a weight is assigned to each antecedent proposition. A WFR is defined as:

R: IF a THEN c, wf

where a = {a_1, a_2, ..., a_m} is the antecedent portion, composed of one or more propositions connected by "AND" or "OR". Each proposition a_i (1 ≤ i ≤ m) can have the format "x_i is F_i", where F_i is a fuzzy set corresponding to the type of the i-th feature established in the learning step. The feature value x_i can be of quantitative, qualitative or interval-valued type. wf = {wf_1, wf_2, ..., wf_m} is a weight vector; the weight wf_i of proposition a_i expresses the degree of importance of a_i in contributing to the consequence c, and therefore the importance of the i-th feature value x_i for the classification task. Thus the classification rule becomes:

R_k: If x_1 is (wf_1) A_1 and x_2 is (wf_2) A_2 ... and x_m is (wf_m) A_m then x belongs to class C_k

where the antecedent fuzzy sets A_i correspond to the fuzzy sets established in the universe of discourse of the i-th feature.
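The evaluation of such a weighted rule can be sketched as below. Modulating each membership by exponentiation (µ^wf) is one standard way to apply a proposition weight in fuzzy systems and is an assumption of this sketch; the thesis's exact modulation scheme is tied to the margin-based weight estimation that follows.

```python
# Sketch of a weighted fuzzy rule (Definition 5.1): each antecedent
# membership is modulated by its feature weight before aggregation.
# Exponentiation (mu ** wf) is an assumed modulation scheme, shown here
# only to illustrate how weights enter the rule evaluation.
def weighted_adequacy(mdv, weights, alpha=0.6):
    """Evaluate a weighted rule: modulate memberships, then aggregate
    with the alpha-interpolated min/max of the unweighted classifier."""
    modulated = [mu ** wf for mu, wf in zip(mdv, weights)]
    return alpha * min(modulated) + (1 - alpha) * max(modulated)

mdv = [0.64, 0.47, 0.29]
uniform = [1.0, 1.0, 1.0]      # conventional (unweighted) rule
focused = [1.0, 0.2, 0.1]      # downweight two less relevant features
print(round(weighted_adequacy(mdv, uniform), 3))
print(round(weighted_adequacy(mdv, focused), 3))
```

With uniform weights the rule reduces to the conventional one; shrinking a weight towards 0 drives its modulated membership towards 1, so an irrelevant feature no longer penalizes the rule's truth value.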
The remaining issue is to evaluate appropriately the weight of each feature, taking into account that these weights will be used to modify the membership to the antecedent fuzzy sets of each classification rule. A natural idea is to estimate these weights in the membership space based on the SMSP principle, which justifies such an operation.

5.3 Membership margin

In classical feature weighting methods, feature relevance is estimated in a space assumed to be quantitative. This requires that other feature types be transformed arbitrarily, without any consideration of their original space. In contrast, the SMSP principle achieves an appropriate mapping of the different features into a common space, which allows bypassing the assumption of purely quantitative features. In recent machine learning theory, the margin concept plays an important role in estimating the confidence of a decision (Vapnik, 1998). In the following, we define a membership margin which enables the feature weights to be estimated in the membership space whatever their type and number.

Definition 5.2: Membership Margin (MM). Let us consider a class c = C1 and its complement c~ = C2. We assume that the nth data sample x(n) = [x1(n), x2(n), ..., xm(n)] is labeled by class c. The membership margin of sample x(n) is defined by:

β_n = ψ(U_n^c) − ψ(U_n^c~)   (5.1)

where U_n^c and U_n^c~ are respectively the membership degree vectors of sample x(n) to classes c and c~, computed with respect to all samples contained in D excluding x(n) ("leave-one-out margin"), and ψ is an aggregation function. We define here ψ(Y) = ∑_i Y_i, which can be extended to any other aggregation function. Thus, in our case the function ψ is given as follows:

ψ(U_n^k) = ∑_i µ_ki(xi(n))   (5.2)

The membership of a feature can be seen as the contribution (relevance) of this feature to the rule's consequence for a given class.
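A direct transcription of (5.1)-(5.2), with the leave-one-out membership degree vectors assumed already computed (the values below are made up for illustration):

```python
# Sketch of the membership margin (5.1)-(5.2): the margin of a sample
# is the difference between the scalar cardinalities of its membership
# degree vectors to its own class c and to the complement class.

def psi(U):
    # Aggregation function: arithmetic sum (sigma count) of the MDV.
    return sum(U)

def membership_margin(U_c, U_cbar):
    """U_c, U_cbar: membership degree vectors of sample x(n) to its
    class c and to the complement class, computed leaving x(n) out."""
    return psi(U_c) - psi(U_cbar)

# A positive margin means the sample is correctly classified by the
# maximum-membership rule (5.3).
print(membership_margin([0.9, 0.6, 0.7], [0.3, 0.4, 0.2]) > 0)  # True
```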
Consequently, when an assignment decision is necessary, if the average contribution of all the features of sample x(n) to its class is greater than its average contribution to the alternative class, it is natural to assign it to the class with maximum contribution (which corresponds to its correct class). Therefore, sample x(n) is considered correctly classified if β_n > 0. The arithmetic sum given by (5.2) defines a compromise aggregation between membership functions and lies halfway between union and intersection (Dubois and Prade, 1988). On the other hand, if the MDV U_n^k is considered as a discrete fuzzy subset, the function ψ represents the scalar cardinality (power or sigma count) of U_n^k as defined by (Zwick et al., 1987; Wygralak, 2000). Therefore, the membership margin β_n given by (5.1) is the difference of the scalar cardinalities of the resulting fuzzy subsets.

Intuitive interpretation: the membership margin is a measure of how much the feature memberships can be modified in the membership space before a sample x(n) is misclassified. According to the margin types described in (Crammer et al., 2002), this margin can also be considered as a hypothesis-margin. Note that the membership margin is affected by the selected subset of features through the function ψ. It is worthwhile to note that our feature weighting approach is also implicitly based on the maximum membership rule to label a pattern with an existing class. The membership margin for a pattern x(n) ∈ c is based on the aggregation ψ(U_n^c) computing its global membership to the class c. Of course, other alternatives could also be adopted using different types of aggregation functions. We stated previously that x(n) is considered correctly classified if β_n > 0. This is equivalent to writing:

C_n = arg max_{c, c~} {ψ(U_n^c), ψ(U_n^c~)}   (5.3)

which is the maximum membership rule: x(n) belongs to the class with maximum global membership.
Therefore, this approach implicitly encompasses the decision process in the feature selection task.

5.4 MEmbership Margin Based feAture Selection: MEMBAS

As for the classification task, since all features are simultaneously mapped into a common space thanks to the SMSP principle, they can henceforth be processed in a unified way for the feature weighting task. In the case of the compromise aggregation via the arithmetic sum described by (5.2), importance assignment is easily incorporated in the aggregation through a weighted sum (Dubois and Prade, 1988; Cross and Sudkamp, 2002).

Definition 5.3: Fuzzy Feature Weight (FFW). The FFW is defined as the relative degree of usefulness of each feature in the membership space for the discrimination between two classes. These fuzzy feature weights are non-negative numbers expressing the discriminative power of the fuzzy sets between the existing classes. It follows that the FFW is a vector, referred to as wf = [wf1, ..., wfm] ∈ R^m, assigned in the membership space, from which the term 'fuzzy weight' comes.

Definition 5.4: Weighted adequacy of a pattern. Given a vector of positive fuzzy feature weights wf = [wf1, ..., wfm] ∈ R^m, the weighted adequacy of the nth pattern is defined by the cardinality of the new fuzzy set that takes into account the weight of each feature in the membership space. It is given by the scalar product:

ψ(U_n^k | wf) = wf^T U_n^k = ∑_i wfi µ_ki(xi(n))   (5.4)

5.4.1 Fuzzy Feature Weight Estimation

The basic idea for calculating the fuzzy feature weights is to scale the feature memberships in the membership space so as to minimize the leave-one-out error. The margin given by (5.1) in the weighted membership space thus becomes:

β_n(wf) = ψ(U_n^c | wf) − ψ(U_n^c~ | wf)   (5.5)

The remaining problem is to find a procedure to estimate the weight vector wf.
One approach among others is to take advantage of the membership margin definition (5.5) to define a margin-based objective function and then reformulate the problem as an optimization problem in the membership space, as is usually done in the large margin theory framework.

a) Problem statement

It has been proved within the margin theory framework (Vapnik, 1998) that a classifier based on minimizing a margin-based error function generalizes well on unseen test data. For this reason, margin-based criteria have also been extended to feature selection purposes (Weston et al., 2001; Freund and Schapire, 1997). The originality of the present work lies in the use of the membership margin concept. The problem described above can be transformed into the following optimization problem in the feature membership space:

Min_{wf} ∑_{n=1}^{N} h(β_n(wf) < 0)   (5.6)

where β_n(wf) is the margin of x(n) computed with respect to wf and h is an indicator function. To solve this problem, we define an objective function such that the averaged membership margin in the resulting weighted feature membership space is maximized:

Max_{wf} ∑_{n=1}^{N} β_n(wf) = ∑_{n=1}^{N} { ∑_{i=1}^{m} wfi µ_ci(xi(n)) − ∑_{i=1}^{m} wfi µ_c~i(xi(n)) }   (5.7)

subject to the following constraints:
1. ||wf||² = 1,
2. wf ≥ 0.

The first constraint bounds the modulus of wf so that the maximization does not diverge to infinite values, whereas the second guarantees the non-negativity of the obtained weight vector. Then (5.7) can be simplified as:

Max_{wf} wf^T s   subject to ||wf||² = 1, wf ≥ 0   (5.8)

where

s = ∑_{n=1}^{N} {U_n^c − U_n^c~}   (5.9)

In the statement of this optimization problem, we must assume that there exists at least one feature i ≤ m such that s_i > 0.

b) Lagrangian optimization approach

This is a classical optimization problem that can be solved in the framework of Lagrange multipliers (see Appendix 3).
Taking advantage of the fact that this framework provides an analytical solution, we finally obtain a closed form for wf:

w*f = s⁺ / ||s⁺||   (5.10)

with s⁺ = [max(s1, 0), ..., max(sm, 0)]^T.

5.4.2 MEMBAS Algorithm

We present below the algorithm of the proposed approach. We consider here the online learning version of Membas rather than the batch one, due to its attractive properties. Although both approaches are equivalent in terms of the final result, an online algorithm is known to be computationally more efficient than its batch version when the amount of training data is large. Moreover, it also enables the weights to be updated with the information brought by a new sample that was not available when the training started. The computational complexity of Membas is O(N m), where N is the sample size and m the data dimensionality. It can be summarized by the following algorithm:

1. Initialize the fuzzy weight vector to zero and set T, the number of iterations (T = N when the training is performed over all patterns).
2. For t = 1 ... T:
   a) Select randomly a sample x(n) from D.
   b) Determine the fuzzy partition of each feature according to its type with respect to D \ {x(n)}.
   c) Calculate the membership degree vectors (MDVs) U_n^c and U_n^c~ for sample x(n).
   d) Update the vector s as s = s + {U_n^c − U_n^c~}.
3. Calculate the optimal fuzzy weight vector as w*f = s⁺ / ||s⁺||, with s⁺ = [max(s1, 0), ..., max(sm, 0)]^T.

5.4.3 MEMBAS for multiclass problems

The extension of the MEMBAS method to multiclass problems is considered in this section. Once the membership function parameters of each class have been determined from the training dataset, the feature space is partitioned into a number of fuzzy sets equal to the number of classes. Consequently, an equal number of membership degree vectors results.
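Before turning to the multiclass case, the online algorithm above can be sketched as follows. The `mdv` callback, which stands in for steps 2b-2c, is an assumption: it must return the leave-one-out membership degree vectors of a sample, whose actual computation depends on the feature types and the fuzzy partitions of chapter 4; the toy memberships below are purely illustrative.

```python
# Sketch of the online MEMBAS weight estimation: accumulate the
# difference of leave-one-out membership degree vectors (step 2d),
# then project onto the non-negative orthant and normalize (5.10).
import math, random

def membas(samples, labels, mdv, seed=0):
    """mdv(x, y) must return the leave-one-out membership degree
    vectors (U_c, U_cbar) of sample x labelled y (steps 2b-2c)."""
    rng = random.Random(seed)
    m = len(samples[0])
    s = [0.0] * m
    order = list(range(len(samples)))
    rng.shuffle(order)                        # step 2a: random order
    for n in order:                           # step 2d: accumulate s
        U_c, U_cbar = mdv(samples[n], labels[n])
        s = [si + uc - ub for si, uc, ub in zip(s, U_c, U_cbar)]
    s_plus = [max(si, 0.0) for si in s]       # step 3: s+, then
    norm = math.sqrt(sum(si * si for si in s_plus))
    return [si / norm for si in s_plus]       # w* = s+ / ||s+||

# Toy example: feature 0 separates the classes, feature 1 is useless.
X = [[1.0, 0.5], [0.9, 0.5], [0.1, 0.5], [0.2, 0.5]]
y = [1, 1, 0, 0]
def toy_mdv(x, label):                        # crude stand-in memberships
    mu1 = [x[0], 0.5]                         # memberships to class 1
    mu0 = [1.0 - x[0], 0.5]                   # memberships to class 0
    return (mu1, mu0) if label == 1 else (mu0, mu1)
w = membas(X, y, toy_mdv)
print(w[0] > w[1])  # True: the discriminative feature gets the weight
```

The multiclass extension of section 5.4.3 only changes step 2d, replacing the two-class difference by the difference with respect to the nearest alternative class, as in (5.12).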
Therefore, a margin definition for multiclass problems similar to the one given in (Sun, 2007b) can be used for the same purpose, by taking the margin with respect to the nearest of all classes other than class c:

β_n = min_{c~ ∈ C, c~ ≠ C(x_n)} {ψ(U_n^c) − ψ(U_n^c~)}   (5.11)

Thus (5.9) becomes:

s = ∑_{n=1}^{N} min_{c~ ∈ C, c~ ≠ C(x_n)} {U_n^c − U_n^c~}   (5.12)

By using this last expression of s and following the same steps, we arrive at a form of w*f similar to the one obtained in (5.10). Finally, the estimated w*f in the membership space can now be used to weight each proposition of the fuzzy rule-based classifier with the objective of improving its performance. Nevertheless, although a direct feature-weight assignment to each proposition can be very useful to improve the performance of the fuzzy rule-based classifier for relatively low-dimensional problems, it becomes undesirable for high-dimensional problems, such as bioinformatics problems characterized by thousands of features. Feature-weight assignment can, however, be regarded as a generalization of feature selection (Wang et al., 2004). In the present work we focus on the feature selection problem rather than on direct feature-weight assignment. This amounts to activating or deactivating the proposition corresponding to each feature in the fuzzy if-then rule according to whether it was deemed important or not by Membas. On the other hand, we formulated the weight computation procedure so that the proposed approach approximates the leave-one-out cross-validation error. Therefore, this approach chooses features only if they contribute to the overall classification performance, regardless of their redundancy or correlation. It is often reported that redundant features can deteriorate classification performance and that removing them is necessary.
However, it has been pointed out recently, in some applications such as DNA microarrays, that the ultimate goal is not always the identification of a small gene subset with good predictive power, but to help physicians gain a good insight into the relationship between genes and certain diseases (Jenssen and Hovig, 2005). Discovering redundant (or co-regulated) genes may provide useful information about their interactions.

5.5 Experiments and Comparisons

In this section, we show how the proposed method can improve the performance of fuzzy rule-based classifiers as well as of other well-known state-of-the-art classifiers on some real-world problems. To further demonstrate its effectiveness, several comparisons have been performed: Membas versus three well-known feature selection approaches, using three different classifiers to avoid a biased comparison. They concern experiments on low-dimensional datasets (fewer than 50 features) and high-dimensional datasets (more than 1000 features).

5.5.1 Feature selection methods

For comparison purposes we used three methods widely employed for the validation of newly proposed feature selection approaches: Relief (Kira and Rendell, 1992a), I-Relief (Sun, 2007b) and Simba (Gilad-Bachrach et al., 2004). We give below a brief description of these three methods. Relief is considered one of the most successful feature selection methods due to its simplicity and effectiveness (Dietterich, 1997). In Relief, the feature weights are estimated iteratively according to their discrimination ability, based on neighbouring samples. At each iteration, a sample is selected randomly and its two nearest neighbours are found: one from the same class (nearest hit) and the other from the alternative class (nearest miss). An extension of Relief to multiclass problems has been presented in (Kononenko, 1994).
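For reference, the basic Relief iteration just described can be sketched as follows (a simplified sketch: plain Euclidean distances, no feature normalization, and a toy two-class example with made-up values):

```python
# Simplified sketch of Relief: at each iteration pick a random sample,
# find its nearest hit (same class) and nearest miss (other class),
# and reward features on which the miss is farther than the hit.
import random

def relief_weights(X, y, n_iter=None, seed=0):
    rng = random.Random(seed)
    m = len(X[0])
    w = [0.0] * m

    def sqdist(a, b):
        return sum((ai - bi) ** 2 for ai, bi in zip(a, b))

    for _ in range(n_iter or len(X)):
        n = rng.randrange(len(X))
        others = [j for j in range(len(X)) if j != n]
        hit = min((j for j in others if y[j] == y[n]),
                  key=lambda j: sqdist(X[j], X[n]))
        miss = min((j for j in others if y[j] != y[n]),
                   key=lambda j: sqdist(X[j], X[n]))
        for i in range(m):
            w[i] += abs(X[n][i] - X[miss][i]) - abs(X[n][i] - X[hit][i])
    return w

# Toy data: feature 0 separates the classes, feature 1 is noise.
X = [[0.0, 0.3], [0.1, 0.7], [1.0, 0.4], [0.9, 0.6]]
y = [0, 0, 1, 1]
w = relief_weights(X, y)
print(w[0] > w[1])  # True: feature 0 is ranked higher
```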
Moreover, it has been proven recently that Relief is not a heuristic filter method, as long considered, but an online learning algorithm that solves an optimization problem (Sun, 2007b). In the same work, Sun (2007b) proposed an efficient iterative version of Relief, referred to as I-Relief, using an Expectation-Maximization algorithm. I-Relief searches for the real nearest neighbor in the weighted feature space, unlike Relief, which assumes that the nearest neighbor in the original space is the same as in the weighted feature space. Theoretical convergence analyses of I-Relief and its online version have also been provided to prove its superiority over the Relief family of algorithms. I-Relief has one free parameter, the width of the Gaussian kernel function, to be defined by the user; this parameter should be selected properly to guarantee the convergence of I-Relief. The Simba method performs a gradient ascent to maximize a margin-based evaluation function. Like Relief, Simba is based on neighbouring samples: at each iteration, for a randomly selected sample, the feature weights are updated using the rule obtained by the gradient ascent procedure. I-Relief, Relief and Simba are recognized by the machine learning community as efficient margin-based approaches and are widely used in the literature to prove the effectiveness of newly proposed feature selection methods (Dietterich, 1997). The three approaches are distance-based methods that maximize a 1-NN margin. The Relief algorithm used here for comparison is the multiclass version proposed and used by (Gilad-Bachrach et al., 2004). In contrast, most existing probabilistic and information-theoretic approaches are of the filter type (Wettschereck and Aha, 1995; Mitra et al., 2002).
It is now a recognized fact within the machine learning community that such filter approaches are computationally more efficient but tend to perform worse than wrapper methods (Kohavi and John, 1997; Guyon and Elisseeff, 2003). Moreover, extensive comparative studies performed in recent decades have shown the superiority of the Relief family over filter approaches on a wide range of real-world problems. For example, Gilad-Bachrach et al. (2004) compared Relief and Simba with a mutual-information based approach, and many works can be found in the literature comparing Relief and Simba with information-theoretic and probabilistic approaches (Wettschereck and Aha, 1995; Wettschereck et al., 1997; Robnik-Šikonja and Kononenko, 2003; Li and Lu, 2009).

5.5.2 Experimental setup

Here, the three feature selection methods are evaluated through the classification error obtained using their selected feature subsets. Besides the fuzzy rule-based classifier LAMDA, we also used the well-known k-NN classifier (Cover and Hart, 1967). Moreover, in order to achieve a more accurate assessment of classification performance, the methods are also compared using an SVM classifier (Vapnik, 1998). The k-NN method classifies each unlabelled sample by the majority label among its k nearest neighbors in the training set (Cover and Hart, 1967). The k-NN classifier is known to be very sensitive to the presence of irrelevant features and is therefore adequate for comparing feature selection methods (see chapter 2). The Support Vector Machine method finds the separating hyper-plane with the largest sample-margin (Vapnik, 1998). Unlike k-NN, SVM is well known to be robust against noise, so the presence of a few irrelevant features in the original feature set should not significantly affect its performance (see chapter 2). Consequently, SVM may perform similarly with the different feature selection methods in this case.
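The k-NN majority-vote rule mentioned above can be sketched in a few lines (Euclidean distance and made-up toy points; in the real experiments k is tuned by cross-validation):

```python
# Sketch of the k-NN majority-vote rule used in the comparison: an
# unlabelled sample takes the most common label among its k nearest
# training samples.
from collections import Counter

def knn_predict(X_train, y_train, x, k=3):
    by_dist = sorted(range(len(X_train)),
                     key=lambda j: sum((a - b) ** 2
                                       for a, b in zip(X_train[j], x)))
    votes = Counter(y_train[j] for j in by_dist[:k])
    return votes.most_common(1)[0][0]

X = [[0.0, 0.0], [0.1, 0.2], [1.0, 1.0], [0.9, 0.8]]
y = ["neg", "neg", "pos", "pos"]
print(knn_predict(X, y, [0.2, 0.1], k=3))  # neg
```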
The main reason for using these three different classifiers is to assess whether this approach can improve, in addition to the fuzzy rule-based classifier LAMDA, other state-of-the-art classifiers. This comparative study was performed on two dataset collections. The first collection consists of six UCI Repository datasets (Blake and Merz, 1998): Diabetics, Thyroid, WDBC, Ljubljana, Twonorm and Heart. Two of these datasets (Ljubljana, Heart) have mixed feature types (quantitative, qualitative as well as interval). Moreover, 50 independent, normally distributed irrelevant features with zero mean and unit variance were added to the original features of all datasets to assess the robustness of the newly proposed method against irrelevant features. The second collection consists of four DNA microarray datasets: DLBCL (Shipp et al., 2002), Lung cancer (Bhattacharjee et al., 2001), Prostate cancer (Singh et al., 2002) and SRBC (Khan et al., 2001). The main characteristics of these datasets are their high feature dimensionality (several thousand to more than ten thousand) and their small sample size (tens to about a hundred). It must also be noted that some of the datasets are multiclass. Additional information about each dataset is given in Table 5.1.

Tab. 5.1. Summary of the datasets used.

Dataset         | No. Train | No. Test | No. Features | Classes | Task description
Diabetics       | 615       | 153      | 8            | 2       | Diabetes onset forecast
Thyroid         | 172       | 43       | 5            | 3       | Thyroid disease diagnosis
WDBC            | 456       | 113      | 30           | 2       | Breast cancer diagnosis
Ljubljana       | 222       | 55       | 9            | 2       | Breast cancer prognosis
Twonorm         | 371       | 7029     | 20           | 2       | Artificial dataset
Heart           | 216       | 54       | 13           | 2       | Heart disease diagnosis
DLBCL           | 77        | /        | 7129         | 2       | Outcome prediction of Diffuse Large B-cell lymphoma
Lung cancer     | 203       | /        | 12600        | 5       | Diagnosis of four lung cancer types
Prostate cancer | 102       | /        | 10509        | 2       | Prostate cancer prognosis
SRBC            | 83        | /        | 2308         | 4       | Small, Round Blue-Cell tumors diagnosis

In all cases, the classification error was used as the criterion to evaluate the performance of the compared methods. I-Relief has one free parameter to be defined by the user; Sun (2007b) suggested selecting it greater than 0.5 in order to guarantee the convergence of I-Relief (we set it to 0.7 in the present experiments). As mentioned in the previous section, Simba suffers from a local-maxima problem because it performs a gradient ascent. To overcome this problem, Simba performs the gradient ascent from several initial points predefined by the user; the number of points is set here to the Simba default value of 5 (Gilad-Bachrach et al., 2004). Concerning SVM, the standard binary classifier was used for binary-class problems, whereas a "one against one" multiclass SVM was used for multiclass problems (Vapnik, 1998). As the focus of this work is the comparison between feature selection methods, only a simple linear kernel for binary-class problems and a polynomial kernel for multiclass ones have been used.

5.5.3 Experiments on low-dimensional datasets

As mentioned above, experiments were first performed on the six UCI datasets to compare Membas with the Simba, Relief and I-Relief methods. These datasets contain mixed feature-type data, and each classifier has one parameter which was adjusted through a cross-validation process (i.e. the exigency index ensuring the linear interpolation between the fuzzy logic connectives for the fuzzy rule-based classifier LAMDA, the number of nearest neighbours k for the k-NN method and the regularization parameter for SVM).
For this purpose, each dataset was randomly partitioned into training and test subsets as detailed in Table 5.1. The three parameters were estimated through cross-validation on the training dataset (70% vs 30% split): the optimal parameter values are those yielding the smallest classification error on the remaining 30% of the training subset. Then, the classification error is calculated on the test subset, consisting of samples unseen by the three classification methods. To eliminate statistical variation and make the comparisons between the different methods more balanced, this procedure was repeated 20 times for each dataset. The error averaged over the 20 runs is taken as the classification error for a given feature subset. The averaged testing error for the LAMDA, k-NN and SVM methods is plotted as a function of the top-ranked features in Figures 5.1, 5.2 and 5.3 respectively. Moreover, the optimal averaged classification errors obtained with the three classifiers and the corresponding numbers of features selected by each feature selection method are reported in Tables 5.2 to 5.4.

Tab. 5.2. Optimal testing errors (%) and corresponding numbers of features on the ten datasets with LAMDA. The last row (W/T/L) summarizes the Win/Tie/Loss counts when comparing MEMBAS with the other approaches at significance level 0.05.
Dataset         | MEMBAS   | SIMBA     | RELIEF    | I-RELIEF  | p (M vs RELIEF) | p (M vs SIMBA) | p (M vs I-RELIEF)
Diabetics       | 25.5 (5) | 27.2 (5)  | 28.2 (2)  | 25.6 (3)  | 0.00 | 0.00 | 0.31
Thyroid         | 4.9 (4)  | 5.6 (3)   | 6 (5)     | 5.3 (5)   | 0.25 | 0.89 | 0.90
WDBC            | 5 (21)   | 6.7 (30)  | 7.3 (29)  | 5 (28)    | 0.00 | 0.00 | 0.01
Ljubljana       | 24.6 (4) | 29.9 (8)  | 34.9 (8)  | 25.3 (4)  | 0.00 | 0.00 | 0.09
Twonorm         | 2.4 (20) | 4.4 (19)  | 3.4 (20)  | 2.5 (20)  | 0.91 | 0.80 | 0.97
Heart           | 15.3 (7) | 25.8 (13) | 23.2 (13) | 28 (11)   | 0.00 | 0.00 | 0.00
DLBCL           | 5.2 (80) | 8.5 (25)  | 6.5 (60)  | 3.4 (300) | 0.00 | 0.00 | 0.27
Lung cancer     | 4.4 (70) | 5.5 (70)  | 4.4 (100) | 7.4 (100) | 0.17 | 0.52 | 0.03
Prostate cancer | 5 (10)   | 13.3 (40) | 11 (20)   | 6.8 (50)  | 0.00 | 0.00 | 0.07
SRBC            | 0 (20)   | 0 (100)   | 0 (160)   | 0 (60)    | 0.19 | 0.53 | 0.38
W/T/L           |          |           |           |           | 6/4/0 | 6/4/0 | 3/7/0

Tab. 5.3. Optimal testing errors (%) and corresponding numbers of features on the ten datasets with k-NN. The last row (W/T/L) summarizes the Win/Tie/Loss counts when comparing MEMBAS with the other approaches at significance level 0.05.

Dataset         | MEMBAS    | SIMBA     | RELIEF    | I-RELIEF  | p (M vs RELIEF) | p (M vs SIMBA) | p (M vs I-RELIEF)
Diabetics       | 24.8 (5)  | 26.5 (6)  | 25.3 (8)  | 26.1 (6)  | 0.23 | 0.07 | 0.11
Thyroid         | 4.9 (4)   | 6 (3)     | 5.8 (3)   | 5.8 (3)   | 0.41 | 0.72 | 0.96
WDBC            | 6.8 (17)  | 7.3 (9)   | 6.7 (12)  | 6.6 (22)  | 0.02 | 0.00 | 0.02
Ljubljana       | 25.8 (5)  | 28.2 (4)  | 29.2 (9)  | 25.8 (7)  | 0.00 | 0.00 | 0.55
Twonorm         | 4.2 (19)  | 6.1 (18)  | 5.2 (20)  | 4 (20)    | 0.90 | 0.80 | 0.97
Heart           | 30.3 (5)  | 35.7 (4)  | 36.6 (12) | 28 (10)   | 0.00 | 0.03 | 0.00
DLBCL           | 4.7 (120) | 6.8 (40)  | 5.3 (90)  | 5.3 (200) | 0.84 | 0.89 | 0.01
Lung cancer     | 5.7 (80)  | 8.4 (180) | 7.3 (200) | 6.2 (300) | 0.32 | 0.45 | 0.09
Prostate cancer | 7.5 (15)  | 18 (90)   | 17.8 (40) | 17.2 (30) | 0.00 | 0.00 | 0.00
SRBC            | 0 (40)    | 0 (140)   | 0 (40)    | 0 (40)    | 0.43 | 0.81 | 0.27
W/T/L           |           |           |           |           | 3/6/1 | 4/6/0 | 2/6/2

Tab. 5.4. Optimal testing errors (%) and corresponding numbers of features on the ten datasets with SVM. The last row (W/T/L) summarizes the Win/Tie/Loss counts when comparing MEMBAS with the other approaches at significance level 0.05.
Dataset         | MEMBAS   | SIMBA     | RELIEF    | I-RELIEF  | p (M vs RELIEF) | p (M vs SIMBA) | p (M vs I-RELIEF)
Diabetics       | 23.8 (5) | 25.7 (8)  | 24.5 (7)  | 24.9 (7)  | 0.20 | 0.00 | 0.05
Thyroid         | 4 (4)    | 4 (3)     | 4.4 (4)   | 4 (3)     | 0.27 | 0.79 | 0.94
WDBC            | 2.4 (22) | 3.9 (20)  | 5.2 (30)  | 4.6 (30)  | 0.00 | 0.01 | 0.00
Ljubljana       | 27 (4)   | 28.5 (7)  | 29.4 (8)  | 27.6 (3)  | 0.00 | 0.00 | 0.10
Twonorm         | 3.6 (20) | 5.4 (20)  | 4.7 (20)  | 3.6 (20)  | 0.88 | 0.81 | 0.99
Heart           | 14.2 (9) | 18.5 (13) | 17.3 (12) | 15.9 (11) | 0.00 | 0.00 | 0.14
DLBCL           | 1.3 (30) | 3.8 (90)  | 2.6 (50)  | 1.2 (50)  | 0.29 | 0.05 | 0.88
Lung cancer     | 2.9 (40) | 5.4 (180) | 7.4 (300) | 4.9 (300) | 0.07 | 0.26 | 0.15
Prostate cancer | 2.9 (40) | 7.1 (300) | 4.9 (200) | 4.9 (140) | 0.00 | 0.00 | 0.11
SRBC            | 0 (50)   | 0 (25)    | 0 (40)    | 0 (120)   | 0.62 | 0.98 | 0.30
W/T/L           |          |           |           |           | 4/6/0 | 6/4/0 | 2/8/0

A comparison of the obtained results leads mainly to the following observations:

1. Concerning the fuzzy rule-based classifier, we can observe from Figure 5.1 that, except for the Twonorm dataset, the proposed fuzzy weighting approach significantly improves the classifier performance on almost all datasets. A significant performance gain is achieved by retaining only a few features rather than the whole set of original features. For instance, almost 5% of performance gain is achieved using only the four top-ranked features rather than the nine original features of the Ljubljana dataset. Similarly, for the Heart dataset, we gain almost 5% in terms of classification performance with only seven features.

2. Although Relief, Simba and I-Relief are based on the 1-NN principle, Membas performs similarly to or better than I-Relief on nearly all datasets regardless of the classifier used (LAMDA, k-NN or SVM), and outperforms Relief and Simba. For a more rigorous comparison between the feature selection methods, a Student's paired two-tailed t-test was also performed. The p-value of the t-test reported in each row of Tables 5.2 to 5.4 represents the probability that the two sets of compared results come from distributions with equal means.
The smaller the p-value, the more significant the difference between the two average values is. At the 0.05 p-value level, with the fuzzy rule-based classifier LAMDA, Membas wins against Relief and Simba in four cases out of six on the UCI datasets (Diabetics, WDBC, Ljubljana, Heart), wins in two cases against I-Relief, and ties on the remaining cases. With k-NN, Membas wins in two cases against Relief and three cases against Simba, and loses in two cases against I-Relief. With SVM, Membas wins in four cases against Simba, three cases against Relief and two cases against I-Relief, and ties on the remaining cases.

3. Moreover, Membas performs well on the UCI datasets containing mixed-type data (e.g. Heart, Ljubljana), especially when the classifier handles mixed-type data appropriately, as can be observed with the fuzzy rule-based classifier LAMDA. To further demonstrate these interesting properties of the proposed method, we focus on three datasets: Heart (6 qualitative and 7 quantitative features), Ljubljana (6 qualitative and 3 interval features) and Diabetics (8 quantitative features). Figures 5.4, 5.5 and 5.6 give, for the three datasets respectively, the fuzzy weights obtained in one run. For ease of comparison, each weight vector is normalized so that its maximum value is 1. For the Heart dataset, we observe that I-Relief, Relief and Simba assign zero weights not only to the irrelevant features (the last 50 features in Figure 5.4) but also to the first six qualitative features, which are presumed useful. Membas, in contrast, not only succeeds in identifying these qualitative features: its top-ranked feature (feature No. 6) is itself qualitative. The classification errors obtained on this dataset, shown in Figures 5.1, 5.2 and 5.3, prove that Membas can significantly improve the performance of the three classifiers.
One possible explanation is that the top-ranked features obtained by Membas are more useful for the classification task. As expected, Membas leads to significant improvements in classification performance in the case of mixed feature-type data, due mainly to an appropriate and uniform processing of each type of data with minimal loss of information. Let us now focus on the Ljubljana dataset, which includes interval and qualitative features and to which the other feature selection methods cannot be applied directly. As the interval feature values of this dataset are regular (not overlapped), they were transformed into ordered numbers to enable I-Relief, Relief and Simba to handle them. It is worthwhile to note that Membas handles the interval features in their original form without any restriction on their relative positions (overlapped or regular); no arbitrary mapping is therefore required. Let us recall that I-Relief, Relief and Simba could not handle the intervals if they were overlapped. We observe from Figure 5.5 that Membas correctly identifies the 9 presumably useful features (whatever their type) and assigns approximately zero weights to the last 50 added irrelevant features, whereas Simba and Relief mistakenly identify some irrelevant features as relevant ones. From Tables 5.2 to 5.4, we observe that the minimal classification error on this dataset is obtained with the fuzzy rule-based classifier LAMDA when only the first four top-ranked features are used. This result also highlights the care that must be devoted to choosing an adequate classifier when the data are of mixed type. We finally focus on the Diabetics dataset (Figure 5.6), for which all the feature selection methods succeed in identifying presumably useful features, sharing at least the first top-ranked feature.
The obtained classification errors illustrated in Figures 5.1, 5.2 and 5.3 prove the efficiency of Membas in processing quantitative features as well as symbolic data. We point out that I-Relief, Relief and Simba are typically well-conditioned for processing quantitative features, but are not proficient at handling a dataset of mixed-type data. As expected, SVM performs better than the other classifiers on this dataset, especially when only the features selected by Membas are used (see Figure 5.3 and Table 5.4).

Fig. 5.1. Classification errors obtained by LAMDA on the UCI datasets using Membas, I-Relief, Relief and Simba.

Fig. 5.2. Classification errors obtained by k-NN on the UCI datasets using Membas, I-Relief, Relief and Simba.

Fig. 5.3. Classification errors obtained by SVM on the UCI datasets using Membas, I-Relief, Relief and Simba.

Fig. 5.4. Feature weights obtained by Membas, I-Relief, Relief and Simba on the Heart dataset (50 irrelevant quantitative features added). The first 13 features are the original ones.

Fig. 5.5. Feature weights obtained by Membas, I-Relief, Relief and Simba on the Ljubljana dataset (50 irrelevant quantitative features added). The first 9 features are the original ones.

Fig. 5.6. Feature weights obtained by Membas, I-Relief, Relief and Simba on the Diabetics dataset (50 irrelevant quantitative features added).
The first 8 features are the original ones.

5.5.4 Experiments on high-dimensional datasets

In this section, Membas is compared with I-Relief, Relief and Simba on four microarray datasets. Due to the limited number of samples, leave-one-out cross-validation was performed to assess the performance of each algorithm. We noticed in the previous section that Membas processes quantitative data as well as, or better than, the other methods on small datasets; in this section we aim to illustrate how the proposed method performs in the presence of a huge number of irrelevant features. The classification errors obtained using LAMDA, k-NN and SVM for the top 400 ranked features are plotted in Figures 5.7 to 5.9 respectively, and the corresponding optimal classification performances are reported in Tables 5.2 to 5.4. It can be observed that Membas performs similarly to or better than I-Relief, and outperforms Relief and Simba in nearly all datasets with the three classifiers. For the Prostate cancer dataset, Membas outperforms Relief and Simba over all ranges by 5 to 20 percent with the three classifiers, and yields a better optimum (test error, number of genes) than I-Relief: a classification error of 5% for only 10 genes with Membas against 6.8% for 50 genes with I-Relief. For SRBC, with the LAMDA classifier, Membas converges with only 20 genes whereas I-Relief converges with 60 genes, Simba with 100 genes and Relief with 160 genes. With the k-NN classifier, Membas, I-Relief and Relief converge together at 40 genes, but Membas provides the minimal classification error before convergence is reached. For example, on SRBC with the LAMDA and k-NN classifiers, with 5 genes the error for Membas is 7 percent compared to more than 20 percent for Relief and I-Relief.
One possible explanation is that Membas ranks the genes according to their real relevance to this problem, so that the k-NN classifier performance is maximized. Note also that Membas in SRBC with k-NN and SVM reaches a near-zero error with only 10 genes. For the DLBCL dataset, Membas performs better than Relief and Simba with the SVM and LAMDA classifiers, yields nearly similar results to these two approaches with the k-NN classifier, and achieves similar or slightly better results than I-Relief. For lung cancer, we observe that with both the LAMDA and k-NN classifiers the error obtained by Membas converges with 70 genes, whereas with SVM the classification error reaches its minimal value with only 40 genes. One important issue when using feature selection algorithms for gene selection is determining a cut-off threshold in the ranked gene list. For some feature weighting approaches (e.g. Relief), a heuristic threshold computed as a function of the number of features has been proposed for this purpose (Kira and Rendell, 1992b). A more commonly used method is cross-validation, which uses a training data subset to estimate the cut-off threshold simultaneously with the classification parameters, and then uses the estimated parameters to classify the held-out testing samples.

Fig. 5.7. Classification errors obtained by LAMDA on DNA microarray datasets using Membas, Relief and Simba.

Fig. 5.8. Classification errors obtained by k-NN on DNA microarray datasets using Membas, Relief and Simba.

Fig. 5.9. Classification errors obtained by SVM on DNA microarray datasets using Membas, Relief and Simba.
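The cross-validation procedure for choosing a cut-off in a ranked gene list can be sketched as follows. This is an illustrative toy implementation, not the thesis code: the 1-NN classifier, the fold scheme and the synthetic data are all assumptions; features are assumed to be pre-ranked by some weighting method (e.g. Membas), so the cut-off is simply the number of top-ranked features kept.

```python
# Toy sketch: pick the cut-off (number of top-ranked features) that
# minimizes the cross-validated error of a simple 1-NN classifier.
import random

def one_nn_error(train, test, top_k):
    """Classify each test pattern by its nearest training pattern,
    using only the top_k ranked features."""
    errors = 0
    for x, y in test:
        best, best_d = None, float("inf")
        for xt, yt in train:
            d = sum((x[i] - xt[i]) ** 2 for i in range(top_k))
            if d < best_d:
                best, best_d = yt, d
        if best != y:
            errors += 1
    return errors / len(test)

def select_cutoff(data, candidate_ks, n_folds=5):
    """Return the cut-off with the smallest cross-validated error."""
    folds = [data[i::n_folds] for i in range(n_folds)]
    best_k, best_err = None, float("inf")
    for k in candidate_ks:
        err = 0.0
        for i in range(n_folds):
            test = folds[i]
            train = [p for j, fold in enumerate(folds) if j != i for p in fold]
            err += one_nn_error(train, test, k)
        err /= n_folds
        if err < best_err:
            best_k, best_err = k, err
    return best_k, best_err

# Toy data: feature 0 is informative (ranked first), features 1-4 are noise,
# so a small cut-off should give the lowest cross-validated error.
random.seed(0)
data = [([y * 2.0 + random.gauss(0, 0.3)] + [random.gauss(0, 1) for _ in range(4)], y)
        for y in [0, 1] * 30]
k, err = select_cutoff(data, candidate_ks=[1, 2, 3, 4, 5])
```

In practice the candidate cut-offs would span the ranked gene list (e.g. 5, 10, ..., 400 genes) and the classifier would be the one used for the final evaluation.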
5.6 Conclusion

In this chapter, we have proposed a new feature weighting method for mixed-type data, based on a membership margin, to improve the performance of fuzzy rule-based classifiers. Thanks to the SMSP principle described in the previous chapter, all the features are mapped from completely heterogeneous spaces to a common space, the membership space. Processing the mapped data in a unified way then becomes straightforward for feature weighting. To this end, we introduced a new concept of weighted fuzzy rules, in which each antecedent fuzzy set in the fuzzy if-then classification rule is weighted to characterize the importance of each proposition, and thereby the importance of the corresponding feature to the rule's consequence. This fuzzy rule weighting operation is naturally justified by the estimation of the weights in the membership space based on the membership margin concept. To avoid any heuristic combinatorial search, these fuzzy weights are estimated by optimizing an objective function within the membership margin framework. An extension of the proposed method to multiclass problems has also been presented. The advantages of the proposed method were first illustrated and compared with some well-known feature weighting approaches on low-dimensional real-world datasets characterized by the presence of mixed-type data. The experimental results show that this method leads to a significant improvement of the classification performance of fuzzy rule-based classifiers as well as other state-of-the-art classifiers. The proposed method is moreover distinguished from other feature weighting methods by its ability to process symbolic intervals without any restriction on their relative position (regular or overlapping intervals).
Further experiments on high-dimensional datasets (DNA microarray data) have also proved the effectiveness of the proposed method on high-dimensional data. Unlike the supervised case, feature weighting in the unsupervised case turns out to be more challenging due to the absence of pattern labels. In the next chapter we extend this weighting approach to the unsupervised case.

CHAPTER 6 - Summary: Unsupervised Learning based on the SMSP Principle

Pattern labels are not always available at the time of decision making in many practical problems. In such situations, resorting to unsupervised learning is a common practice. However, to maintain easy interpretation and high transparency in medical applications, the use of rule-based unsupervised approaches can also be of great interest. Fuzzy unsupervised learning in particular offers the advantage of providing a basis for building fuzzy rule-based models with a simple representation and good performance on non-linear problems (Yao et al., 2000). On the one hand, according to how the data are processed, clustering approaches can be divided into two classes: batch and online. Batch algorithms process at once all available data, represented as a table of patterns, offline. In online algorithms, by contrast, patterns are received online and the data partition must be adapted iteratively over time using the information brought by new patterns. It is now well recognized by the machine learning community that an online algorithm is computationally more efficient than a batch one (Cai et al., 2009).
An online approach is adaptive in the sense that each time a new pattern is received, either a new cluster is created, due for instance to the appearance of a new mode, or only the existing clusters are updated. Online clustering therefore requires unsupervised, incremental learning able to incorporate new information into the evolving partition as each new pattern is received. Online approaches are more general than batch approaches in the sense that the former can also be used to process a table of patterns iteratively. On the other hand, as in the supervised classification context, not all features are useful for the clustering task, and thus only the subset of features that help guide the clustering process should be selected. The problem is, however, more complex than when a reference partition is available to assess the importance of the features. Compared with supervised learning, few works have addressed the feature selection problem in the unsupervised learning context. Most unsupervised feature selection algorithms are based on information or consistency measures (Mitra et al., 2002; Dy and Brodley, 2004; Wei and Billings, 2007). As an unsupervised approach, PCA (Principal Component Analysis), for example, finds the subset of components that are useful for data representation. Nevertheless, these components are not necessarily useful for discriminating between groups in a clustering task (Duda et al., 2001).
If some of these features, regardless of their relevance and importance, are of mixed type, the clustering task becomes much more complicated. In this chapter, we propose a new approach based on online feature weighting for the clustering of heterogeneous data. The proposed algorithm is an extension of the feature weighting algorithm previously developed for supervised classification. To cope with data heterogeneity, the SMSP principle is extended here as well, to process heterogeneous data in a unified way in an unsupervised framework. However, the step of mapping the data into a common space must be carried out incrementally, to take into account the new pattern received at each iteration of the learning process. For this reason, an iterative version of the characteristic function introduced in the supervised case is provided for each feature type. First, the online incremental clustering method based on fuzzy rules is described. Then, as in the supervised context, we study the integration of the feature weighting task into the clustering process, designing our approach on the basis of the weighted fuzzy rule concept. This approach also relies on the iterative maximization of the membership margin. An extensive experimental study was then carried out on artificial and real-world problems to prove the effectiveness of the proposed approach. On an artificial example, this approach correctly identified the set of clusters as well as the set of irrelevant features. On real-world problems, it outperformed the Fuzzy C-Means (FCM) method in 12 out of 14 cases.
This algorithm nevertheless fails on high-dimensional problems (e.g. microarray data). This is probably due to the very large number of irrelevant features (thousands) compared with the relevant ones (a few tens at most).

CHAPTER 6: Unsupervised Learning based on the SMSP Principle

Pattern labels are not always available at the time of decision making in many practical problems. In such situations, resorting to unsupervised learning capabilities is a common practice. However, to maintain easy interpretation and high transparency in medical applications, the use of rule-based unsupervised approaches can also be of great interest. Fuzzy unsupervised learning in particular offers the advantage of providing a basis for constructing rule-based fuzzy models with a simple representation and good performance on non-linear problems (Yao et al., 2000). On the one hand, according to how the data are processed, clustering approaches can be divided into two classes: batch and online. Batch algorithms process at once all available data, represented by a table of patterns, offline. In online algorithms, by contrast, patterns are received online and the data partition should be adapted iteratively over time using the information brought by new patterns. It is now well recognized by the machine learning community that an online algorithm is computationally more efficient than a batch one (Cai et al., 2009). An online approach is adaptive in the way that, each time a new pattern is received, it either generates a new cluster, due for instance to the appearance of a new mode, or only updates the existing clusters. Online clustering therefore requires unsupervised, incremental learning rules that make it possible to incorporate new information into the evolving partition over time.
Online approaches are more general than batch approaches in the sense that they can also be used to process a table of patterns in an iterative manner. On the other hand, similarly to the supervised classification context, not all the features are important for the clustering task, and therefore only the set of features that help to guide the clustering process should be selected. However, the problem is more complex than when a reference partition of patterns is available to assess the importance of the features. Compared with supervised learning, only a few works have been devoted to the feature selection problem for unsupervised learning. Most unsupervised feature selection algorithms are based on information or consistency measures (Mitra et al., 2002; Dy and Brodley, 2004; Wei and Billings, 2007). As an unsupervised approach, PCA (Principal Component Analysis), for instance, makes it possible to find a subset of components useful for data representation. Nevertheless, these components are not necessarily useful to discriminate between clusters in a clustering task (Duda et al., 2001). If some of the original features, regardless of their relevance and importance, are of mixed type, the clustering task becomes even more challenging. In this chapter we propose a novel approach based on online feature weighting for the clustering of heterogeneous data. The proposed algorithm is built as an extension of our previously developed supervised feature weighting algorithm. First, to cope with the problem of data heterogeneity, the SMSP principle presented in chapter 4 is extended here as well, to reason in a unified way about heterogeneous data in an unsupervised framework. However, the mapping step must be performed in an incremental fashion, to take into account the new pattern received at each iteration of the learning process.
To this end, an iterative version of the mapping function introduced in chapter 4 is provided here for each feature type. We first describe an online incremental clustering algorithm based on a fuzzy rule-based system. We then investigate, as in the supervised context, the integration of the feature weighting task into the clustering process to design our proposed approach based on the weighted fuzzy rule concept. An extensive experimental study is then performed on artificial and real-world problems to prove its effectiveness. However, it is worth noting that, despite its interesting properties, this approach has been found unable to cope with high-dimensional data.

6.1 Iterative membership function updating

Unlike the supervised case, the mapping step in the unsupervised framework is performed iteratively, based on online learning. At each iteration the membership functions are updated by the information brought by a new pattern, according to each feature type, as follows:

6.1.1 Quantitative type features

The various membership functions used in the supervised case can be adapted to quantitative features in the unsupervised case, such as:

a. Gaussian-like membership function

$\mu_k^i(x_i) = e^{-\frac{1}{2}(x_i - \varphi_k^i)^2 / \sigma_i^2}$   (6.1)

b.
Binomial membership function

$\mu_k^i(x_i) = (\varphi_k^i)^{x_i} (1 - \varphi_k^i)^{1 - x_i}$   (6.2)

However, the major difference with the supervised case is that the parameter $\varphi_k^i$, representing the mean of the ith feature over the $N_k$ patterns clustered in class $C_k$, is updated iteratively by online learning as follows:

$\varphi_k^i(N_k + 1) = \varphi_k^i(N_k) + \frac{1}{N_k + 1}\left(x_i^j - \varphi_k^i(N_k)\right)$   (6.3)

$\sigma_i$ represents an approximation of the standard deviation, which converges to the true one when a large number of samples is considered, and is updated iteratively by the following expression:

$\sigma_k^{i2}(N_k + 1) = \sigma_k^{i2}(N_k) + \frac{\left(x_i^j - \varphi_k^i(N_k)\right)^2 - \sigma_k^{i2}(N_k)}{N_k + 1}$   (6.4)

6.1.2 Interval type features

The membership function for interval type features is taken as the similarity, described by equations (5.6), (5.7) or (5.8), between the symbolic interval value of the ith feature $x_i$ and the interval $\rho_k^i = [\rho_k^{i-}, \rho_k^{i+}]$ representing cluster $C_k$:

$\mu_k^i(x_i) = S(x_i, \rho_k^i)$   (6.5)

For $N_k$ patterns assigned to cluster $C_k$, the cluster prototype is a vector whose components are the intervals obtained from the mean bounds, also updated iteratively by online learning as follows:

$\rho_k^{i-}(N_k + 1) = \rho_k^{i-}(N_k) + \frac{1}{N_k + 1}\left(x_i^{j-} - \rho_k^{i-}(N_k)\right)$
$\rho_k^{i+}(N_k + 1) = \rho_k^{i+}(N_k) + \frac{1}{N_k + 1}\left(x_i^{j+} - \rho_k^{i+}(N_k)\right)$   (6.6)

where $x_i^{j-}$ is the ith feature lower bound for the jth sample and $x_i^{j+}$ is its upper bound.
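The incremental updates (6.3)-(6.4) can be sketched numerically for one quantitative feature. This is a minimal illustration, not the thesis code; seeding the cluster statistics from its first pattern is an assumption made for the example:

```python
# Sketch of the online updates (6.3)-(6.4) for a quantitative feature of a
# cluster C_k: "mean" plays the role of phi_k^i, "var" of sigma_k^{i2},
# and n_k is the number of patterns already assigned to the cluster.
def update_cluster_stats(mean, var, n_k, x_i):
    new_mean = mean + (x_i - mean) / (n_k + 1)               # eq. (6.3)
    new_var = var + ((x_i - mean) ** 2 - var) / (n_k + 1)    # eq. (6.4)
    return new_mean, new_var, n_k + 1

# Seed the cluster with its first pattern (an assumption of this sketch),
# then absorb two more patterns one at a time, without revisiting the past.
mean, var, n = 2.0, 0.0, 1
for x in [4.0, 6.0]:
    mean, var, n = update_cluster_stats(mean, var, n, x)
# the mean tracks the exact running average of the received values
```

Note that (6.4) uses the previous mean $\varphi_k^i(N_k)$, so the variance is only an approximation that converges for large $N_k$, as stated above.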
Consequently, the resulting cluster prototype at each iteration for the r interval features is given by the vector of intervals $\rho_k = [\rho_k^1, \rho_k^2, \ldots, \rho_k^r]^T$.

6.1.3 Qualitative type features

The membership function for the ith qualitative feature is specified as:

$\mu_k^i(x_i) = (\Phi_k^{i1})^{q_i^1} * \cdots * (\Phi_k^{iM_i})^{q_i^{M_i}}$   (6.7)

where $\Phi_k^{ij}$ is the frequency of modality $Q_i^j$ in cluster $C_k$, updated iteratively by online learning as:

$\Phi_k^{ij}(N_k + 1) = \Phi_k^{ij}(N_k) + \frac{1}{N_k + 1}\left(q_i^j - \Phi_k^{ij}(N_k)\right)$   (6.8)

with

$q_i^j = \begin{cases} 1 & \text{if } x_i = Q_i^j \\ 0 & \text{if } x_i \neq Q_i^j \end{cases}$

Therefore, the cluster prototypes at each iteration are represented by $\Omega_k^i = [\Phi_k^{i1}, \ldots, \Phi_k^{ij}, \ldots, \Phi_k^{iM_i}]$.

Unlike the supervised case, the mapping step of the SMSP principle is performed here online, for the different types of features, in the membership space. At each learning iteration (i.e. each time a new pattern is received), a Membership Degree Vector (MDV) of dimension m is associated with a given pattern $x^{(n)}$ for each cluster as follows:

$U_{nc_k} = \left[\mu_k^1(x_1^{(n)}), \mu_k^2(x_2^{(n)}), \ldots, \mu_k^m(x_m^{(n)})\right]^T, \quad k = 1, 2, \ldots, l$   (6.9)

where $\mu_k^i(x_i^{(n)})$ is the membership function of cluster $C_k$ at the current iteration, evaluated for the value $x_i^{(n)}$ of the ith feature of pattern $x^{(n)}$. As in supervised learning, once all features are simultaneously mapped into a common space, they can henceforth be processed in the same way for clustering.

6.2 Online fuzzy clustering for mixed-type data

We describe here the approach used to cluster online a set of patterns, possibly represented by mixed types of features, based on a simple fuzzy reasoning mechanism. Contrary to the supervised case, no predefined partition is available; for that reason, an adaptive fuzzy reasoning mechanism based on incremental online learning is adopted.
When a new pattern is received, the reasoning mechanism should place it in the already established cluster to which it has the highest degree of adequacy. To ensure that each pattern satisfies a minimal threshold of adequacy to each cluster, a Virtual Cluster (VC) is assumed to be always present in the space of clusters. This cluster receives any pattern whose adequacy degree is not sufficient to place it in one of the previously created clusters. Whenever a pattern has a higher membership to VC, a new cluster must be created to account for the new information brought by this pattern. Then, the representation of the cluster which receives this pattern (VC or one of the pre-established clusters) is updated to take into account the information brought by this element. As in the supervised context, the fuzzy inference mechanism proposed here for the clustering of heterogeneous data is rule-based.
At the beginning of a clustering task, the rule base is initialized with a single if-then rule representing the virtual cluster VC, given as:

Rvc: If x1 is A1,vc and x2 is A2,vc ... and xm is Am,vc then x belongs to cluster VC

where the antecedent fuzzy sets Ai,vc are predefined membership functions specified according to the ith feature type as follows:

(i) Quantitative type feature: $\mu_{VC}^i(x_i) = (\varphi_{VC}^i)^{x_i}(1 - \varphi_{VC}^i)^{1 - x_i}$ with $\varphi_{VC}^i = 1/2$

(ii) Interval type feature: $\mu_{VC}^i(x_i) = S(x_i, \rho_{VC}^i)$ with $\rho_{VC}^i = [0, 1]$

(iii) Qualitative type feature: $\mu_{VC}^i(x_i) = 1 / \mathrm{cardinal}(D_i)$, with $D_i$ the set of possible modalities of the ith feature

Then, whenever the creation of a new cluster Ck is deemed necessary, a single adaptive fuzzy if-then rule is generated and associated with this cluster in the clustering rule base:

Rk: If x1 is A1 and x2 is A2 ... and xm is Am then x belongs to cluster Ck

When a new pattern has to be allocated to a cluster, the following adaptive fuzzy inference engine can be used, taking into account its membership to the cluster VC:

$R^* = \arg\max_{R_k} \left\{ \alpha\, \gamma\left[\mu_k^1(x_1), \ldots, \mu_k^m(x_m)\right] + (1 - \alpha)\, \beta\left[\mu_k^1(x_1), \ldots, \mu_k^m(x_m)\right];\; k = VC, 1, \ldots, l \right\}$

where $\gamma$ and $\beta$ are dual fuzzy aggregation functions that combine the memberships (given by the components of the MDV $U_{nc_k}$) of the feature values of a pattern $x^{(n)} = [x_1^{(n)}, x_2^{(n)}, \ldots, x_m^{(n)}]$ to a cluster $C_k$, and $\alpha$ is the "exigency" parameter, playing the same role as in the supervised context. If the new pattern is placed in one of the existing clusters, the antecedent fuzzy sets of the rule corresponding to this cluster are updated by the information brought by this element, as described in section 6.1. Otherwise, a new cluster containing this single element is created from VC, and its corresponding fuzzy if-then rule is generated and added to the established rule base.
Consequently, the word adaptive refers here to the fact that the reasoning mechanism is able either to update the membership functions corresponding to the antecedent fuzzy sets of the winner rule, or to generate new rules, according to the fuzzy reasoning decision made about the currently processed pattern. Without the unification of the feature spaces, this simple inference mechanism could not be applied, and the influence of the different types of features would not be equal. This simple online fuzzy clustering approach can be described by the following algorithm.

Algorithm
1. Initialize the space of clusters with the cluster VC.
2. For t = 1...T (T: number of input patterns)
   a) Input a new pattern x(n).
   b) Obtain the membership degree vectors (MDVs) for sample x(n) through the antecedent fuzzy sets of the if-then rules.
   c) Perform the fuzzy inference and assign x(n) to a cluster based on the maximum membership rule.
   d) Update the parameters of the antecedent fuzzy sets of the winner rule with the information brought by x(n).

As reported for the supervised case, not all of the features are important for the clustering task, and there is therefore a need to discard the irrelevant ones. We next describe an online feature weighting approach for the clustering of heterogeneous data, based on the clustering approach presented above.

6.3 Online fuzzy feature weighting for heterogeneous data clustering

Online learning was considered above to describe the clustering process. With the aim of improving the clustering performance, we investigate here the integration of a feature weighting task into the clustering process, based on the weighted fuzzy rule concept. In the literature, a first attempt to use a similar concept, denoted Weighted Fuzzy Production Rules (WFPR), was made by (Chen, 1994).
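The algorithm above can be sketched for a purely quantitative toy problem. Every concrete choice below is an illustrative assumption, not the thesis implementation: Gaussian-like memberships with unit spread, the (min, max) pair for the dual aggregations (gamma, beta), the exigency value, and a constant adequacy level standing in for the virtual cluster VC.

```python
# Toy sketch of the online fuzzy clustering loop with a virtual cluster.
import math

ALPHA = 0.9      # "exigency" parameter (illustrative value)
VC_LEVEL = 0.3   # constant adequacy of the virtual cluster VC (an assumption)

def membership(x, cluster):
    # Gaussian-like memberships (6.1) with unit spread, one per feature
    return [math.exp(-0.5 * (xi - mi) ** 2) for xi, mi in zip(x, cluster["mean"])]

def adequacy(mdv):
    # alpha*gamma + (1 - alpha)*beta with (gamma, beta) = (min, max)
    return ALPHA * min(mdv) + (1 - ALPHA) * max(mdv)

def cluster_online(patterns):
    clusters = []
    for x in patterns:
        scores = [adequacy(membership(x, c)) for c in clusters]
        if not scores or max(scores) < VC_LEVEL:
            # VC wins: create a new cluster seeded with this pattern
            clusters.append({"mean": list(x), "n": 1})
        else:
            # update the winner cluster's prototype, as in eq. (6.3)
            c = clusters[scores.index(max(scores))]
            c["n"] += 1
            c["mean"] = [m + (xi - m) / c["n"] for m, xi in zip(c["mean"], x)]
    return clusters

# Two well-separated groups arriving online should yield two clusters.
clusters = cluster_online([(0.0, 0.0), (0.1, -0.1), (5.0, 5.0), (5.2, 4.9)])
```

The constant VC_LEVEL replaces the VC membership functions (i)-(iii) above for brevity; in the full method the VC adequacy is computed through its own if-then rule.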
In addition to the assignment of a weight to each proposition in the antecedent part, WFPR may contain fuzzy quantifiers (such as "strong", "weak", ...) and introduce a certainty factor to characterize the belief in the rule (Ishibuchi and Yamamoto, 2005; Ishibuchi and Tomoharu, 2001). In (Chen, 1994), weighted fuzzy rules were used to perform medical diagnosis, but under the assumption that the rules and their corresponding weights were known or fixed a priori by the expert. An alternative approach is reported in (Rasmani and Shen, 2004), which uses the subsethood concept to promote certain linguistic terms as part of the antecedent of a fuzzy rule. However, all the previously cited works consider only a supervised learning problem (classification task). Moreover, although the weight assignment in these works borrows the idea of fuzzy classification, it is actually not explicitly related to feature selection. We propose here to assign to each proposition in the antecedent of the fuzzy clustering rule a weight that represents, at each iteration, the degree of importance of its corresponding feature in the membership space. Similarly to the supervised case, the clustering is performed by weighting the antecedents of all if-then rules according to the currently estimated importance (fuzzy weight) of each feature in the membership space. Thus the clustering rule associated with each cluster becomes:

Rk: If x1 is A1 and x2 is A2 ... and xm is Am then x belongs to cluster Ck, with weight vector w

which can be written equivalently

Rk: If x1 is (w1) A1 and x2 is (w2) A2 ... and xm is (wm) Am then x belongs to cluster Ck

Each of the antecedent fuzzy sets Ai is modeled by a membership function according to the ith feature type. The remaining issue is therefore how to appropriately evaluate the weight of each feature, knowing that the weights must be used to modify the membership to the antecedent fuzzy sets of the clustering rules.
Similarly to the supervised case, it is natural to estimate these weights in the membership space in order to justify such an operation. To this end, we take advantage of the SMSP principle to define a membership-margin based objective function that evaluates the importance of each feature in the membership space. The main difference with the supervised framework is that the weights have to be estimated and updated iteratively to guide the clustering task. At each iteration, the weights are computed, using an optimization approach, such that the discriminative power between all existing clusters is maximized. Therefore, only a single weight vector is needed to weight the antecedent fuzzy sets of all clustering rules, such that it reflects the relevance of each feature simultaneously to all rule consequences. Furthermore, the integration of feature weighting into the clustering process becomes straightforward thanks to the unification of the feature spaces described in chapter 4.

Weighted Fuzzy Rule-based Clustering Algorithm

Our final aim is to combine clustering and feature weighting by introducing the feature weighting method into the online clustering algorithm described previously. The fuzzy feature weighting approach proposed in chapter 5 is based on online learning; we show next that this approach can be extended to the clustering task. For ease of presentation, let us consider for a moment that, after some iterations of weighted fuzzy clustering, the resulting partition of the data is described by the dataset $D_t = \{x^{(n)}, C_k\}_{n=1}^{N_t} \subset X \times C$, where $x^{(n)}$ is the nth pattern (item) and $N_t$ is the number of already clustered patterns. $C_k$ is the class label assigned to each pattern among the clusters generated by the weighted fuzzy clustering task, k = 1, 2, ..., l. Let us also consider that $w_t$ is the fuzzy weight vector computed during the fuzzy clustering task.
The margin concept proposed for feature weighting in the supervised case can be extended here to perform an iterative feature weighting task for clustering. When a new pattern $x^{(n)}$ has to be processed, our clustering system, based on the weights $w_t$ estimated at the previous iteration, assigns it either to one of the existing clusters or to the VC cluster (i.e. creation of a new cluster). For simplicity, let us consider that after the clustering of the pattern $x^{(n)}$ the data exhibit only two clusters, namely the cluster to which $x^{(n)}$ has been assigned, noted $c = C_1$, and an alternative cluster, noted $\tilde{c} = C_2$. In a next step we seek to take into account the information brought by $x^{(n)}$ to update the feature weights, which are then used for the clustering of future patterns. Once the pattern $x^{(n)}$ is clustered, its membership margin can be defined based on the SMSP principle as:

$\beta_n = \psi(U_{nc}) - \psi(U_{n\tilde{c}})$   (6.10)

where $c$ is the cluster in which $x^{(n)}$ has just been placed and $\tilde{c}$ is the alternative cluster. As in the supervised context, by scaling the features in the membership space, a weighted version of the membership margin can be defined. A similar margin-based objective function can therefore be designed for the same purpose, and the feature weighting problem can be solved using the same optimization approach. Consequently, the weight vector can be updated at each iteration using the analytical solution $w_f^* = s^+ / \|s^+\|$, through the update of the vector $s$ obtained at the previous iteration with the information brought by $x^{(n)}$. The extension of this approach to multiclass problems is also straightforward, using the definition employed in the supervised framework. The proposed approach can be summarized in two alternating steps:

1. Cluster a new pattern based on the weighted fuzzy clustering rules.
In this step, the clustering is performed by weighting the antecedents of all if-then rules according to the previously estimated importance (fuzzy weight) of each feature in the membership space. At the initialization step, the fuzzy weights should be set to one, which means that initially all features, and thereby their associated propositions in the fuzzy if-then rules, are considered of equal importance. The clustering rule associated with each cluster is given by:

Rk: If x1 is (w1) A1 and x2 is (w2) A2 ... and xm is (wm) Am then x belongs to cluster Ck (k = 1, ..., l)

where Ai (i = 1, ..., m) is the antecedent fuzzy set defined according to the type of feature xi. When a new pattern $x^{(n)} = [x_1^{(n)}, x_2^{(n)}, \ldots, x_m^{(n)}]$ has to be clustered, its membership degree vector for a cluster $C_k$ is $U_{nc_k} = [\mu_k^1(x_1^{(n)}), \mu_k^2(x_2^{(n)}), \ldots, \mu_k^m(x_m^{(n)})]^T$, obtained by evaluating the antecedent fuzzy sets of the rule Rk. If we characterize the operation of weighting the antecedent fuzzy sets as a scalar multiplication, the weighted membership degree vector of $x^{(n)}$ can be computed as:

$\hat{U}_{nc_k} = w_f .* U_{nc_k} = \left[\hat{\mu}_k^1(x_1^{(n)}), \hat{\mu}_k^2(x_2^{(n)}), \ldots, \hat{\mu}_k^m(x_m^{(n)})\right]^T, \quad k = VC, 1, 2, \ldots, l$   (6.11)

This is equivalent to writing $\hat{\mu}_k^i(x_i^{(n)}) = w_i\, \mu_k^i(x_i^{(n)})$. Then, an adaptive fuzzy inference engine can be used to cluster $x^{(n)}$ using its previously computed weighted membership degree vector as follows:

$R^* = \arg\max_{R_k} \left\{ \alpha\, \gamma\left[\hat{\mu}_k^1(x_1), \ldots, \hat{\mu}_k^m(x_m)\right] + (1 - \alpha)\, \beta\left[\hat{\mu}_k^1(x_1), \ldots, \hat{\mu}_k^m(x_m)\right];\; k = VC, 1, 2, \ldots, l \right\}$

2. Update the fuzzy weights with the information brought by this new pattern.

The second step pertains to updating the fuzzy weights with the information brought by the new pattern $x^{(n)}$ clustered in step 1. To ensure that no feature is ever definitively excluded from the clustering process at a given iteration, its weight $w_i$ is updated through the vector $s$ calculated at the previous iteration.
When a new pattern brings new information attesting to the increasing importance of a feature which was deemed irrelevant up to the previous iteration, our fuzzy system should take this information into account online and update the confidence the clustering task places in this feature, and therefore in its associated proposition in the if-then rule. To do so, we first update the vector $s$ given by (5.9):

$s = s + \min_{\tilde{c} \in C,\, \tilde{c} \neq c(x_n)} \left\{ U_{nc} - U_{n\tilde{c}} \right\}$   (6.12)

Thus, the fuzzy weight vector $w_f$ at this iteration can be computed from $s$ as

$w_f^* = \frac{s^+}{\|s^+\|}$   (6.13)

where $s^+ = [\max(s_1, 0), \ldots, \max(s_m, 0)]^T$.

Interpretation: The weights are estimated in the membership space, so it is natural to take advantage of this available information to change iteratively the importance of each fuzzy set in the if-then rules, in order to guide the clustering task.

Algorithm
The weighted fuzzy clustering algorithm can be described as follows:
1. Initialize the fuzzy weight vector to one; initialize the space of clusters with the cluster VC.
2. For t = 1...T
   a) Input a new pattern x(n).
   b) Calculate the membership degree vectors (MDVs) of sample x(n) to each cluster through the antecedent fuzzy sets of the if-then rules.
   c) Perform the weighted fuzzy inference and assign x(n) to the cluster corresponding to the winner rule.
   d) Update the parameters of the antecedent fuzzy sets of the winner rule with the information brought by x(n).
   e) Update the vector s as $s = s + \{U_{nc} - U_{n\tilde{c}}\}$.
   f) Calculate the new optimal fuzzy weight vector at iteration t as $w_f^* = s^+ / \|s^+\|$, with $s^+ = [\max(s_1, 0), \ldots, \max(s_m, 0)]^T$.

6.4 Experimental results

The performance of the proposed weighted fuzzy clustering approach, referred to as WFCA, was evaluated using the artificial and real-world datasets described in Table 6.1. The patterns are already grouped into a priori known classes of unequal size corresponding to their class labels.
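Steps (e) and (f) of the algorithm can be sketched numerically. This is an illustrative sketch: the list-based vectors and the sample membership values are made up, and taking the minimum in (6.12) componentwise over the alternative clusters is one possible reading of that equation.

```python
# Sketch of the weight update (6.12)-(6.13): the accumulator s is moved by
# the per-feature membership margin, then the fuzzy weight vector is
# re-normalized from the positive part of s.
def update_weights(s, u_win, u_alts):
    """u_win: MDV for the winner cluster; u_alts: MDVs of the other clusters."""
    # margin against the nearest alternative cluster, per feature (eq. 6.12)
    margin = [min(uw - ua[i] for ua in u_alts) for i, uw in enumerate(u_win)]
    s = [si + mi for si, mi in zip(s, margin)]
    # keep only the positive part and normalize (eq. 6.13)
    s_plus = [max(si, 0.0) for si in s]
    norm = sum(v * v for v in s_plus) ** 0.5
    w = [v / norm for v in s_plus] if norm > 0 else [1.0] * len(s)
    return s, w

s = [0.0, 0.0, 0.0]
u_win = [0.9, 0.8, 0.5]      # memberships to the winner cluster (made up)
u_alts = [[0.2, 0.7, 0.6]]   # one alternative cluster
s, w = update_weights(s, u_win, u_alts)
# feature 0 has the largest margin, so it receives the largest weight;
# feature 2 has a negative margin and is clipped to zero weight
```

A feature with a temporarily negative accumulated margin gets weight zero but keeps its entry in s, so later patterns can still rehabilitate it, as discussed above.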
An independent clustering of all datasets using the weighted fuzzy reasoning tool is performed and the obtained cluster partitions are compared with the a priori known classes.

6.4.1 Synthetic Data
The synthetic dataset used here consists of 800 data points drawn from a mixture of four equiprobable Gaussians (200 patterns per class) described by 20 relevant quantitative features. Moreover, 30 independently normally distributed irrelevant features with zero mean and unit variance were appended to the 20 relevant features to assess the robustness of the newly proposed method against irrelevant features, yielding a set of 800 50-dimensional patterns. We ran the proposed algorithm 10 times using a Gaussian-like membership function and a feature weight vector initialized to one. For better visualization, each weight vector is normalized so that its maximum value is 1. In all 10 runs, the four clusters were correctly identified. Figure 6.1 shows (a) the obtained classification results and (b) the fuzzy weights of all 50 features. It can be observed that the algorithm successfully clusters all the patterns and correctly separates the last 30 irrelevant features from the relevant ones by assigning them weights close to zero. However, we found empirically that when the number of irrelevant features becomes very large (about ten times the number of relevant features) this approach fails completely to locate the correct clusters. This may explain why it becomes impractical on high-dimensional data: the algorithm was applied without success on microarray data, which are characterized by a huge number (thousands) of irrelevant features.

Fig. 6.1. (a) Clustering results; (b) fuzzy feature weights.

6.4.2 Real data
We tested our algorithm on several datasets with different characteristics (Table 6.1).
Since all datasets were collected for supervised classification (i.e. a previous partition of the dataset was available), the class labels were used only to evaluate the clustering performance. An independent clustering was carried out on all datasets and the overall accuracy is shown in Table 6.2. Clustering accuracy is calculated by comparing the obtained clusters with the real partition provided in the dataset.

Tab. 6.1. Summary of the used datasets

Dataset                   No. features  Quant.  Qual.  Interv.  Classes  No. patterns
Iris*                     4             4       0      0        3        150
Ljubljana                 9             0       6      3        2        286
Thyroid                   5             5       0      0        3        215
WDBC                      30            30      0      0        2        699
Liver                     6             6       0      0        2        345
Australian credit card*   15            6       9      0        2        690
Hepatitis*                17            4       13     0        2        155
Diabetics                 8             8       0      0        2        768
Heart                     13            7       6      0        2        270
Wine                      13            13      0      0        2        178
Car data                  8             0       0      8        4        33
Fish data                 13            0       0      13       4        12
Barcelona water           48            0       0      48       5        316
Temperature cities        11            0       0      11       4        37
(*) Missing data excluded

To further evaluate the performance of the proposed clustering methodology, we compared it with the well-known Fuzzy c-means clustering method (FCM) (Bezdek, 1981) using all features. We simply set the number of clusters equal to the number of original labels provided in each dataset; note that various clustering validity indices (Wang and Zhang, 2007) could be used to select the optimal number of clusters whenever the number of original clusters is unavailable. It must be noted that FCM handles neither qualitative nor interval data. However, qualitative features and regular interval features (interval features taking their values from a countable set of interval values) can be transformed into quantitative values so that FCM can handle them. Note also that four of the fourteen used datasets (“Car”, “Temperature cities”, “Fish” and “Barcelona water”) are characterized by overlapping interval features, so FCM could not be applied to them. Results obtained with FCM are also shown in Table 6.2.
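The text does not specify how an unsupervised partition is matched against the a priori classes when computing the clustering accuracy; a common convention, assumed here, is to map each obtained cluster to the known class it most overlaps with (majority vote) and count the remaining patterns as errors.

```python
import numpy as np

def clustering_error(clusters, labels):
    """Clustering error (%) when each cluster is mapped to the
    a priori class it most overlaps with (majority vote)."""
    clusters, labels = np.asarray(clusters), np.asarray(labels)
    correct = 0
    for c in np.unique(clusters):
        members = labels[clusters == c]
        # count of the dominant true class inside cluster c
        correct += np.bincount(members).max()
    return 100.0 * (1 - correct / len(labels))
```

For example, a 4-pattern partition that splits one class across two clusters would be penalized accordingly.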
It can be observed that the proposed approach outperforms FCM on almost all datasets (12 among 14). One possible explanation is the incorporation of the importance of each feature to guide the clustering process. Moreover, thanks to the SMSP principle, the proposed approach appropriately handles qualitative and interval data, unlike FCM, which requires a transformation of the qualitative and interval spaces into a quantitative space. Figure 6.2 shows the fuzzy weights obtained at the end of the clustering task for each dataset. The WFCA approach can thus provide valuable online information about the importance of each feature for the ongoing clustering task. Moreover, in some specific cases the user can fix a weight threshold to discard the features deemed irrelevant during the clustering task.

Tab. 6.2. Clustering error (%) of the proposed approach and FCM

Dataset              WFCA*   FCM
Iris                 1.33    3.74
Ljubljana            26.71   28.88
Thyroid              3.72    51.16
WDBC                 15.47   7.21
Liver                11.01   51.30
Credit card          13.63   17.15
Hepatitis            10.85   33.33
Heart                16.67   26.76
Diabetics            28.26   33.33
Wine                 28.65   5.06
Car data             36.36   -
Temperature cities   16.22   -
Fish data            41.67   -
Barcelona water      35.76   -
Wins (of 14)         12      2
(*) Weighted Fuzzy Clustering Approach

Fig. 6.2. Fuzzy feature weights obtained by WFCA for each dataset.

6.5 Conclusion
In this chapter we proposed a novel approach based on online feature weighting for the clustering of heterogeneous data. The proposed algorithm is an extension of our previously developed supervised feature weighting algorithm. To cope with the problem of data heterogeneity, the SMSP principle is extended here to reason in a unified way about heterogeneous data in an unsupervised framework.
However, we have shown that the mapping step should be performed incrementally so as to take a new pattern into account at each iteration of the learning process. To this end, an iterative version of the mapping function introduced in the supervised case has been provided for each feature type. First, we described the online incremental clustering algorithm based on a fuzzy rule-based system. Then, as in the supervised context, we investigated the integration of the feature weighting task into the clustering process to design our proposed approach based on the fuzzy weighted rule concept. An extensive experimental study was then performed on artificial and real-world problems to demonstrate the effectiveness of the proposed approach. The algorithm fails, however, to scale to high-dimensional data (e.g. microarray data) characterized by a huge number of irrelevant features.

CHAPTER 7 - Summary: Breast Cancer Applications

Before the microarray era, cancer management was guided only by the clinical and histo-pathological knowledge acquired over several decades of cancer research. However, the high mortality of breast cancer pushed researchers to seek new, more accurate cancer prognosis tools that help physicians take the necessary treatment decisions and thereby reduce medical costs. During the last decade, microarray analysis has attracted great interest in cancer management, for tasks such as diagnosis (Ramaswamy et al., 2001), prognosis (Van't Veer et al., 2002), and treatment response prediction (Straver et al., 2009). However, the introduction of this technology has brought with it new challenges, such as the high dimensionality in terms of number of markers and a high noise-to-signal ratio.
In this chapter, some applications to the breast cancer problem using the approaches proposed in the previous chapters are presented. We focused mainly on breast cancer prognosis and treatment response prediction, as essential tasks for improving the lives of cancer patients, based on clinical data and/or microarray data. The applications are supported by various statistical analyses and by biological interpretations based on current knowledge. First, a cancer prognosis application based only on heterogeneous clinical data was carried out. Through this application, we showed that the fuzzy feature weighting approach selects significant clinical factors. Two other feature selection approaches were tested on the same problem in order to benchmark the performance of the method we developed. In the second application, cancer prognosis is based only on microarray data, from which a prognostic signature consisting of 20 genes is extracted. The results obtained using several comparison criteria show that the predictive value of this prognostic signature can be superior to that of other existing prognostic signatures and of the classical clinical factors. In particular, the 20-gene signature significantly improves the specificity of one of the well-known genetic approaches (the 70-gene "Amsterdam" signature). The third application was devoted to studying the integration of clinical data with microarray data. In such applications, the problems of data heterogeneity and high dimensionality must be faced jointly.
We took advantage of an interesting property of the proposed approach, which handles both problems simultaneously, to extract a hybrid prognostic signature. We then showed through several analyses that this integration can improve breast cancer prognosis. In particular, the hybrid signature improves the sensitivity of the 20-gene signature while maintaining a comparable specificity. To tackle the problem of the low signal-to-noise ratio in microarray data for cancer prognosis, a symbolic approach was considered to extract a more robust prognostic signature, called here GenSym. We first described the generation of the interval database by replacing the expression of each gene by an interval, incorporating Gaussian white noise with a specific signal-to-noise ratio. We showed through several experiments and statistical analyses that the GenSym signature can outperform the other existing approaches. In particular, it preserves the good sensitivity brought by the hybrid signature while improving on the good specificity of the 20-gene signature. Moreover, the gene list of this signature contains significant genes related to invasion, the cell cycle and proliferation. We believe that this first attempt in this direction also opens the door for the machine learning community to develop other approaches to solve this problem. The last application concerns the problem of predicting the response to neoadjuvant treatment for breast cancer patients with overexpressed HER2.
Thanks to the proposed approach, a signature consisting of four markers (PTEN, HER2, eIF4E, EGFR) was extracted; it significantly improves the discrimination between the two groups of positive and negative responders compared with that obtained with the usual 2-marker signature (PTEN, HER2). In particular, the 4-marker combination significantly improves the specificity of the 2-marker combination. This underlines the importance of the two new predictive factors (eIF4E, EGFR) for improving the accuracy of the prediction of the response to neoadjuvant treatment for breast cancer patients with overexpressed HER2.

CHAPTER 7: Breast Cancer Applications

During the pre-microarray era, cancer management was guided only by the clinical and histo-pathological knowledge gained from many decades of cancer research. However, the high mortality from breast cancer has pushed researchers to seek accurate cancer prognosis tools that help physicians take the necessary treatment decisions and thereby reduce the related expensive medical costs. In the past decade, microarray analysis has attracted great interest in cancer management, for tasks such as diagnosis (Ramaswamy et al., 2001), prognosis (Van't Veer et al., 2002), and treatment response prediction (Straver et al., 2009). However, the introduction of this technology has brought with it new challenges, such as high feature-to-sample and noise-to-signal ratios. In this chapter we present some breast cancer applications based on the approaches proposed in previous chapters. We focus particularly on breast cancer prognosis and treatment benefit prediction based on clinical and/or microarray data. The applications are supported by various statistical analyses and biological interpretations based on current knowledge.
7.1 Cancer prognosis based on clinical and/or microarray data

7.1.1 Cancer prognosis application based on clinical information

a- Ljubljana Prognosis Dataset
The dataset used here is the Ljubljana breast cancer prognosis dataset; it contains a total of 286 patients, of whom 201 did not relapse within five years and 85 relapsed (Blake and Merz, 1998). Patients with missing data (9 patients) were excluded from this study. All patients are described by 9 features (6 qualitative and 3 of interval type):
(a) Menopause: >40, <40, pre-menopause
(b) Ablation Ganglia: yes, no
(c) Malignancy Degree (Grade): I, II, III
(d) Breast: right, left
(e) Quadrant: sup. left, inf. left, sup. right, inf. right, center
(f) Irradiation: yes, no
(g) Age: 10-19, 20-29, 30-39, 40-49, 50-59, 60-69, 70-79, 80-89, 90-99
(h) Tumor Size: 0-4, 5-9, 10-14, 15-19, 20-24, 25-29, 30-34, 35-39, 40-44, 45-49, 50-54, 55-59
(i) Involved Nodes: 0-2, 3-5, 6-8, 9-11, 12-14, 15-17, 18-20, 21-23, 24-26, 27-29, 30-32, 33-35, 36-39

b- Experimental setup and results
1. Factor selection for cancer prognosis within a supervised context
Accurately choosing the most powerful prognostic factors among the nine features is of great interest, as it can help the physician predict, based only on those factors, whether a patient will relapse. To this end, the reasoning tool proposed in chapter 5, referred to as Membas, is used to identify the set of important factors for this problem. We then compared the proposed approach with some existing feature selection approaches: Neighborhood Rough Set (NRS) (Hu et al., 2008) and Simba (Gilad-Bachrach et al., 2004). NRS is a heterogeneous feature subset selection method based on the neighborhood rough set concept, and Simba is based on the 1-NN rule. To assess the robustness of each method against irrelevant features, 50 random quantitative features were also added.
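The addition of random probe features can be sketched as follows (a minimal sketch: the text says the 50 probes are random quantitative features, and a standard normal distribution is assumed here; the function name is illustrative).

```python
import numpy as np

def add_random_probes(X, n_probes=50, seed=0):
    """Append n_probes standard-normal 'probe' features to X.

    A selection method that is robust to irrelevant features should
    assign these probes near-zero weights or low ranks.
    """
    rng = np.random.default_rng(seed)
    probes = rng.normal(size=(X.shape[0], n_probes))
    return np.hstack([X, probes])
```

Applied to the nine Ljubljana features, this yields the 59-dimensional patterns on which the three selection methods are compared.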
To analyze the importance of each selected feature, the weights obtained by Membas and Simba in one random realization are plotted for each feature in Figure 7.1; NRS ranks the features based on a dependency measure, which is also shown in Figure 7.1. It can be observed that for Membas only four of the nine features have significant weights, and the others seem to be weakly relevant. The order of the most relevant features found by Membas is: 1- “Involved Nodes” (interval feature), 2- “Ablation ganglia” (qualitative feature), 3- “Grade” (qualitative feature), 4- “Irradiation” (qualitative feature). In addition, the proposed mechanism succeeds in identifying the 50 added irrelevant features by assigning them approximately zero weights (they correspond to the last 50 features in Figure 7.1 (left)). Furthermore, two of the features selected by Membas (“Involved Nodes” and “Grade”) are still considered important prognostic factors in day-to-day clinical practice (Deepa et al., 2005). The selection of the two additional factors (“Ablation ganglia” and “Irradiation”) suggests that these treatments have influenced the breast cancer evolution and therefore the prognosis outcome. On the other hand, the optimal set of features selected by Simba contains many irrelevant features (the first top-ranked feature is irrelevant). Moreover, Figure 7.1 (center) shows that only two among the five top-ranked features are relevant, namely “Involved nodes” and “Quadrant”. Finally, only one feature among the nine presumably useful features was deemed important by NRS, as shown in Figure 7.1 (right). In order to assess the relevance of the selected factors for improving the cancer prognosis task, we compared the three feature selection methods using two classifiers: the fuzzy reasoning tool LAMDA and k-NN.
The same cross-validation and statistical-variation elimination procedure as in section 5 of the fifth chapter is adopted here. Figures 7.2 and 7.3 show the classification error obtained with the LAMDA and k-NN approaches, respectively, as a function of the top-ranked features by Membas, NRS and Simba.

Fig. 7.1. (left) Feature weights by Membas; (center) feature weights by Simba; (right) dependency measure by NRS.

Fig. 7.2. Classification error by LAMDA as a function of the top-ranked features using Membas, NRS and Simba.

Fig. 7.3. Classification error by k-NN as a function of the top-ranked features using Membas, NRS and Simba.

It can be observed that Membas performs best on this dataset regardless of the classifier used (LAMDA or k-NN), with Simba second and NRS worst. Interestingly, the minimal classification error with both classifiers corresponds to the four top-ranked features obtained by Membas (Figure 7.1), given that the error difference on k-NN between five and four features is insignificant (Figures 7.2 and 7.3). Table 7.1 summarizes the optimal averaged classification errors obtained with the two classifiers and the corresponding optimal number of features selected by each feature selection method. It can be observed that the best pair (classification performance, number of selected features) is obtained by Membas with both classifiers. Furthermore, a Student's t-test was performed to assess whether the classification errors come from the same distribution. At a level of p = 0.05, Membas wins against NRS and Simba whatever the classifier used (Table 7.1).
It must also be noted that Figures 7.2 and 7.3 show that LAMDA outperforms k-NN on this heterogeneous dataset over almost the whole range of feature subsets.

Table 7.1. Classification error (%) and corresponding optimal number of factors on the Ljubljana dataset

Method   Membas     NRS        Simba      p-value (Membas-NRS)   p-value (Membas-Simba)
LAMDA    24.64 (4)  27.82 (9)  28.27 (7)  1.79e-005              1.97e-004
k-NN     25.55 (5)  29.27 (7)  28.18 (4)  1.34e-005              4.43e-004

2. Unsupervised learning
An independent clustering using the fuzzy reasoning tool proposed in section 6.2 of the sixth chapter was also performed on this dataset, and the obtained 2-cluster partition is compared with the 2 classes known a priori. The clustering error obtained with the nine features is given in Table 7.2. To further demonstrate the performance of the proposed clustering methodology, we compared it with the well-known Fuzzy c-means (FCM) clustering method (Bezdek, 1981). It must be noted that FCM handles neither qualitative nor interval data. However, as the interval features in this dataset are regular (interval features taking their values from a countable set of interval values), their transformation into quantitative values, enabling FCM to handle them, is straightforward. A similar procedure is adopted for the transformation of the qualitative data. Results obtained with FCM are also reported in Table 7.2. It can be observed that the proposed approach outperforms FCM on this specific cancer prognosis problem. One possible explanation is that the transformation of the qualitative and interval spaces into a quantitative space required by FCM probably leads to information loss, whereas the proposed approach based on the SMSP principle appropriately handles qualitative and interval data.

Table 7.2.
Clustering error for the Ljubljana dataset

Method          Number of clusters   Error
LAMDA-cluster   2                    22.74%
FCM             2                    28.88%

Furthermore, to show the effectiveness of the proposed approach, we analyzed the obtained prototypes of each class, which correspond to the parameters of the fuzzy feature partitions resulting at the end of the clustering task. The class parameters obtained for the three interval features are shown in Figure 7.4. It can be observed that the interval feature “Involved Nodes” has the highest discriminatory power between the two classes, which may be considered a confirmation of its selection as an important predictive factor in the previous section (top ranked in Figure 7.1). Nonetheless, this does not mean that the other two interval features are not useful, but their relevance is weaker for this specific problem, as can be seen in Figure 7.1. The prototypes of the three qualitative features Ablation ganglia, Irradiation and Malignancy Degree are shown in Figures 7.5, 7.6 and 7.7 respectively. Interestingly, these three features also exhibit an important discriminatory power between classes. These results are in complete agreement with the selection of the three qualitative features by the fuzzy mechanism proposed in chapter 5 (see Figure 7.1).

Fig. 7.4. Class prototypes obtained by clustering for the interval features “Age, Tumour size, Invaded Nodes”.

Fig. 7.5. Class prototypes obtained by clustering for the qualitative feature “Ablation ganglia”.

Fig. 7.6.
Class prototypes obtained by clustering for the qualitative feature “Irradiation”.

Fig. 7.7. Class prototypes obtained by clustering for the qualitative feature “Malignancy degree”.

Let us now compare these class prototypes with those obtained when supervised learning is considered. Using the fuzzy rule-based classifier LAMDA, we obtained the prototypes shown in Figures 7.8 to 7.11 for the same features. It can be observed that the profiles in both cases are quite similar.

Fig. 7.8. Class prototypes obtained by classification for the interval features “Age, Tumor size, Involved Nodes”.

Fig. 7.9. Class prototypes obtained by classification for the qualitative feature “Ablation ganglia”.

Fig. 7.10. Class prototypes obtained by classification for the qualitative feature “Irradiation”.

Fig. 7.11. Class prototypes obtained by classification for the qualitative feature “Malignancy degree”.

7.1.2 Cancer prognosis application based on microarray data
Recent studies have demonstrated the potential value of gene expression signatures in assessing the risk of post-surgical disease. In this study we focus on the use of our proposed approaches to derive a gene signature for cancer prognosis.

a- Dataset and experimental setup
The study is performed using the well-known Van't Veer dataset (Van't Veer et al., 2002).
Van't Veer and colleagues used a dataset containing 78 sporadic lymph-node-negative patients, younger than 55 years of age and with tumour size under 5 cm, to derive a prognostic signature from their gene expression profiles. Forty-four patients remained disease-free for at least 5 years after their initial diagnosis (good prognosis group), and 34 patients developed distant metastases within 5 years (poor prognosis group). We use the same group of patients with the aim of deriving a gene prognostic signature. Patients with missing data (1 poor-prognosis patient) were excluded from our study. We use our feature selection approach described in chapter 5, referred to as Membas, to build a computational model that accurately predicts the risk of distant recurrence within the 5-year period following breast cancer diagnosis. Owing to the small sample size, we performed a leave-one-out cross-validation (LOOCV) to estimate the optimal classification parameters. At each iteration of this procedure, one sample is held out for testing and the remaining samples are used for training. The training data are used to estimate the optimal parameters of the classifier and to perform the feature selection task. The resulting model is then employed to classify the held-out sample. This experiment is repeated until each sample has been used for testing. In this study we used the LAMDA classifier, for which only one parameter needs to be specified in the training phase (the exigency index). It is worth noting that in the study performed by Van't Veer and colleagues, a 70-gene signature was derived from the same dataset using a feature selection method based on the correlation coefficient. We demonstrate the predictive value of the gene signature derived using Membas on this microarray dataset by comparing its performance with those of the clinical markers, the 70-gene signature, and the St Gallen and NIH criteria.
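The LOOCV procedure described above, with the feature selection repeated inside every fold, can be sketched as follows. The callables are placeholders standing in for Membas and LAMDA, which are not reproduced here; only the cross-validation skeleton is shown.

```python
import numpy as np

def loocv_error(X, y, select_features, train, predict, n_genes=20):
    """Leave-one-out cross-validation with feature selection
    performed inside each fold, to avoid selection bias.

    select_features(X, y, k) -> indices of the k top-ranked features
    train(X, y)              -> fitted model
    predict(model, x)        -> predicted label for one pattern
    """
    errors, n = 0, len(y)
    for i in range(n):
        tr = np.arange(n) != i                    # hold out sample i
        idx = select_features(X[tr], y[tr], n_genes)
        model = train(X[tr][:, idx], y[tr])
        errors += predict(model, X[i, idx]) != y[i]
    return errors / n
```

The key design point, as stressed in the text, is that the held-out sample plays no role in either parameter estimation or feature selection within its fold.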
The performances are also estimated through a LOOCV procedure.

b- Results
A 20-gene signature was derived with the Membas approach, corresponding to the optimal classification performance using the LAMDA classifier with the Gaussian-like membership function. The classification performance obtained with this signature and LAMDA is reported in Table 7.3. For comparison, the classification performance of the LAMDA classifier using the 70-gene signature, the clinical markers, the St Gallen consensus and the NIH criteria is also reported in Table 7.3. We observe that the 20-gene signature outperforms the 70-gene signature, the clinical markers and the classical clinical criteria (St Gallen, NIH). In particular, the 20-gene signature significantly improves the specificity of the 70-gene signature while ensuring a comparable sensitivity.

Tab. 7.3. Classification performance using the 20-gene signature, the 70-gene signature, all clinical markers, the St Gallen consensus and the NIH criteria

Method     TP     FP     FN    TN     Sensitivity  Specificity  Accuracy
20-gene    28/33  5/44   6/33  38/44  82.35        88.37        85.71
70-gene    27/33  9/44   6/33  35/44  81.82        79.55        80.52
Clinical   26/33  14/44  7/33  30/44  78.79        68.18        72.73
St-Gallen  33/33  39/44  0/33  6/44   100          6.49         50.65
NIH        33/33  44/44  0/33  0/33   100          0            42.86

Classification performance is not always a sufficient criterion for comparing the predictive value of marker signatures. Performance measurements can also depend strongly on a decision threshold when only a limited number of patients is available. Varying this decision threshold makes it possible to visualize the performance of a given classifier over all sensitivity and specificity levels through a Receiver Operating Characteristic (ROC) curve (see Appendix 4). To further demonstrate the superiority of the 20-gene signature, Figure 7.12 also plots the ROC curves of the three models based respectively on the 20-gene signature, the 70-gene signature and the clinical markers.
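The sensitivity, specificity and accuracy columns of Table 7.3 follow directly from the confusion counts; as an illustration, the 70-gene row can be recomputed as:

```python
def performance(tp, fp, fn, tn):
    """Sensitivity, specificity and accuracy (%) from the confusion
    counts TP, FP, FN, TN reported in Table 7.3."""
    sens = 100.0 * tp / (tp + fn)                  # true positive rate
    spec = 100.0 * tn / (tn + fp)                  # true negative rate
    acc = 100.0 * (tp + tn) / (tp + fp + fn + tn)  # overall accuracy
    return round(sens, 2), round(spec, 2), round(acc, 2)

# 70-gene row of Table 7.3: TP=27, FP=9, FN=6, TN=35
print(performance(27, 9, 6, 35))   # (81.82, 79.55, 80.52)
```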
The obtained ROC curves confirm the superior performance of the 20-gene signature over the other signatures.

Fig. 7.12. ROC curves of the clinical, 20-gene and 70-gene signatures.

We also performed a survival data analysis of four approaches, the 20-gene signature, the 70-gene signature, the clinical markers and the St Gallen criterion, to further demonstrate the prognostic value of the 20-gene signature. The NIH criterion is not shown here since its good prognosis group contains very few patients. The Kaplan-Meier curves, with 95% confidence intervals, of the four approaches are shown in Figure 7.13. In particular, the 20-gene signature induces a significant difference between the probability of remaining metastasis-free for patients with a good prognostic signature and for patients with a poor prognostic signature (p < 0.001). The hazard ratio of distant metastasis within five years estimated by the Mantel-Cox approach for the 20-gene signature is 7.6 (95% CI: 3.86-15.06), which is superior to those of the 70-gene signature, the St Gallen consensus and the clinical markers.
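The metastasis-free probabilities of Figure 7.13 are Kaplan-Meier estimates. A minimal sketch of the estimator is given below (the log-rank test and the Mantel-Cox hazard ratio are omitted; this is an illustration, not the analysis code used for the figure):

```python
import numpy as np

def kaplan_meier(times, events):
    """Kaplan-Meier estimate of the metastasis-free probability.

    times  : follow-up time of each patient (years)
    events : 1 if distant metastasis observed, 0 if censored
    Returns the distinct event times and the survival curve S(t).
    """
    times, events = np.asarray(times, float), np.asarray(events, int)
    surv, t_out, s = 1.0, [], []
    for t in np.unique(times):
        d = int(np.sum((times == t) & (events == 1)))   # events at time t
        n = int(np.sum(times >= t))                     # patients still at risk
        if d > 0:
            surv *= 1 - d / n                           # product-limit step
            t_out.append(float(t))
            s.append(surv)
    return t_out, s
```

Censored patients (events = 0) leave the risk set without lowering the curve, which is what produces the tick marks on the plotted curves.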
Fig. 7.13. Kaplan-Meier estimation of the probabilities of remaining metastasis-free for the good and poor prognosis groups, with p-values computed using the log-rank test. 20-gene: p < 0.001, HR = 7.6 (95% CI: 3.86-15.06); 70-gene: p < 0.001, HR = 5.6 (95% CI: 3.1-10.2); clinical: p = 0.0019, HR = 2.32 (95% CI: 1.36-3.95); St Gallen: p = 0.73, HR = 1.17 (95% CI: 0.46-2.92).

c- Analysis of the twenty-gene signature
Among the 20-gene signature, given in Table 7.4, eight genes are listed in the 70-gene signature, and both gene signatures share the first gene (AL080059). Note that the number of genes derived is significantly smaller than the number required to perform the cancer prognosis task using the Amsterdam 70-gene signature. A brief description of the biological implication of each gene is provided in Table 7.4 according to the National Center for Biotechnology Information (NCBI) databases.

Tab. 7.4. Notation and description of the 20-gene signature (■: listed in the 70-gene signature; □: not listed in the 70-gene signature).

1. AL080059 ■ TSPYL5 — A subsequent analysis has revealed a significant homology with human protein factors, including NAPs, which play a role in DNA replication and thereby proliferation (Schnieders et al., 1996). It is thought that NAPs act as histone chaperones, shuttling histone proteins involved in regulating chromatin structure and accessibility, and can therefore impact gene expression.
2. NM_003748 ■ ALDH4A1 — This protein belongs to the aldehyde dehydrogenase family of proteins. This enzyme is a mitochondrial matrix NAD-dependent dehydrogenase which catalyzes the second step of the proline degradation pathway, converting pyrroline-5-carboxylate to glutamate. Deficiency of this enzyme is associated with type II hyperprolinemia, an autosomal recessive disorder characterized by accumulation of delta-1-pyrroline-5-carboxylate (P5C) and proline. Alternatively spliced transcript variants encoding different isoforms have been identified for this gene.
3. NM_020974 ■ SCUBE2 — SCUBE2 signal peptide, CUB domain, EGF-like 2 [Homo sapiens]. SCUBE2 (also known as CEGP1) is located on human chromosome 11p15 and has homology to the achaete-scute complex (ASC) of genes in the basic helix-loop-helix (bHLH) family of transcription factors.
4. D42044 □ KIAA0090 — Protein binding.
5. NM_006681 ■ NMU — Neuromedin U [Homo sapiens].
6. NM_006544 □ EXOC5 — Exocyst complex component 5 [Homo sapiens]. The protein encoded by this gene is a component of the exocyst complex, a multiple protein complex essential for targeting exocytic vesicles to specific docking sites on the plasma membrane. Though best characterized in yeast, the component proteins and functions of the exocyst complex have been demonstrated to be highly conserved in higher eukaryotes. At least eight components of the exocyst complex, including this protein, are found to interact with the actin cytoskeletal remodeling and vesicle transport machinery. The complex is also essential for the biogenesis of epithelial cell surface polarity.
7. Contig14882_RC □ N/A
8. Contig20217_RC ■ N/A
9. Contig37063_RC □ N/A
10. NM_019028 □ ZDHHC13 — Zinc finger, DHHC-type containing 13 [Homo sapiens].
11. NM_003450 □ ZNF174 — Zinc finger protein 174 [Homo sapiens].
12. Contig54742_RC □ N/A
13. Contig63649_RC ■ N/A
14. Contig42933_RC □ N/A
15. NM_004994 ■ MMP9 — Matrix metallopeptidase 9 (gelatinase B, 92kDa gelatinase, 92kDa type IV collagenase) [Homo sapiens]. Proteins of the matrix metalloproteinase (MMP) family are involved in the breakdown of extracellular matrix in normal physiological processes, such as embryonic development, reproduction, and tissue remodeling, as well as in disease processes, such as arthritis and metastasis. Most MMPs are secreted as inactive proproteins which are activated when cleaved by extracellular proteinases. The enzyme encoded by this gene degrades type IV and V collagens. Studies in rhesus monkeys suggest that the enzyme is involved in IL-8-induced mobilization of hematopoietic progenitor cells from bone marrow, and murine studies suggest a role in tumor-associated tissue remodeling.
16. NM_000286 □ PEX12 — Peroxisomal biogenesis factor 12 [Homo sapiens]. This gene belongs to the peroxin-12 family. Peroxins (PEXs) are proteins that are essential for the assembly of functional peroxisomes. The peroxisome biogenesis disorders (PBDs) are a group of genetically heterogeneous autosomal recessive, lethal diseases characterized by multiple defects in peroxisome function; they form a heterogeneous group with at least 14 complementation groups and with more than one phenotype being observed in cases falling into particular complementation groups. Although the clinical features of PBD patients vary, cells from all PBD patients exhibit a defect in the import of one or more classes of peroxisomal matrix proteins into the organelle. Defects in this gene are a cause of Zellweger syndrome (ZWS).
17. Contig6238_RC □ N/A
18. NM_014489 □ PGAP2 — Post-GPI attachment to proteins 2 [Homo sapiens].
19. NM_002779 □ PSD — Pleckstrin and Sec7 domain containing [Homo sapiens].
20. Contig32185_RC ■ N/A

The functional annotation of the genes should provide insight into the underlying biological mechanisms leading to rapid metastases. Among the 20-gene signature, genes involved in proliferation, invasion and metastasis are significantly upregulated in the metastasis group. For instance, we find SCUBE2, which has been revealed to play important roles in development, inflammation and perhaps carcinogenesis (Yang et al., 2002). The expression of the SCUBE2 gene has been found to be associated with ER status in a recent SAGE-based study of breast cancer specimens (Abba et al., 2005). It has recently been reported that SCUBE2 suppresses breast tumor cell proliferation and confers a favorable prognosis in invasive breast cancer (Cheng et al., 2009). TSPYL5 is involved in the modulation of cell growth and cellular response, probably via regulation of the Akt signaling pathway. It has been reported that TSPYL5 is a poor prognosis marker that reduces p53 protein levels and inhibits the activation of p53-target genes. The EXOC5 gene is known to be related to cell mobility and invasion. MMP-9 is related to tumor invasion and metastasis through its capacity for tissue remodeling via extracellular matrix and basement membrane degradation and the induction of angiogenesis. Evaluation of MMP-9 expression seems to add valuable information on breast cancer prognosis. KIAA0090 is one of the breast cancer markers identified in (Dettling et al., 2005).

7.1.3 Hybrid signature derivation by integrating clinical and microarray data for cancer prognosis
In the past decade, microarray analysis has attracted great interest in cancer management. Meanwhile, clinical and histo-pathological factors are still considered a valuable tool for day-to-day cancer management decisions.
However, it has recently been established that the integration of both types of information may improve cancer management (Sun et al., 2007a; Gevaert et al., 2006). In (Sun et al., 2007a), a feature selection method (I-Relief) was used to perform marker selection. However, this method works under the assumption that all the data are of quantitative type, and a transformation of the symbolic data into quantitative data was therefore performed to cope with data heterogeneity. This transformation can be a source of distortion and information loss, as it introduces a distance which was not present in the original data. In (Gevaert et al., 2006), a Bayesian network was used to perform breast cancer prognosis. The obtained results show only that their approach performs similarly to the 70-gene signature established by Van't Veer and colleagues (Van't Veer et al., 2002); the authors claim that a feature selection is implicitly performed, based on the (in)dependencies captured through the Markov blanket concept. These results do not necessarily mean that the clinical data contain no information additional to the genetic data; they only tell us that this approach does not fit the data well (Sun et al., 2007a). In the present study, we use our hybrid feature selection method, referred to as MEMBAS, to assess the usefulness of integrating both types of data by addressing both challenges simultaneously: high dimensionality and heterogeneity of the data (Hedjazi et al., 2011d).

a- Dataset and experimental setup

We use again the Van't Veer data set of 78 patients (Van't Veer et al., 2002) to derive a hybrid signature by integrating clinical and microarray data.
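For intuition, the nearest-hit/nearest-miss weighting principle underlying Relief-type methods such as I-Relief can be sketched as follows; this is a generic, minimal illustration of the idea on purely quantitative data, not the I-Relief or MEMBAS algorithm itself:

```python
import math

def relief_weights(X, y):
    """Relief-style feature weighting: a feature's weight grows when it
    separates each sample from its nearest miss (closest sample of the
    other class) more than from its nearest hit (same class)."""
    n, m = len(X), len(X[0])
    w = [0.0] * m

    def dist(a, b):
        return math.sqrt(sum((ai - bi) ** 2 for ai, bi in zip(a, b)))

    for i in range(n):
        hits = [j for j in range(n) if j != i and y[j] == y[i]]
        misses = [j for j in range(n) if y[j] != y[i]]
        h = min(hits, key=lambda j: dist(X[i], X[j]))
        ms = min(misses, key=lambda j: dist(X[i], X[j]))
        for f in range(m):
            w[f] += abs(X[i][f] - X[ms][f]) - abs(X[i][f] - X[h][f])
    return [wf / n for wf in w]
```

On a toy dataset where only the first feature discriminates the classes, the first weight dominates, which is the behavior the transformation-free MEMBAS method seeks on heterogeneous data as well.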
The clinical data contain eight features:
a) Age (quantitative)
b) Tumour grade (interval: [3,5] ~ Grade I; [6,7] ~ Grade II; [8,9] ~ Grade III)
c) Tumour size (quantitative: mm)
d) Estrogen receptor expression (quantitative: intensity)
e) Progesterone receptor expression (quantitative: intensity)
f) Angioinvasion (qualitative: 'yes' or 'no')
g) Lymphocytic infiltrate (qualitative: 'yes' or 'no')
h) BRCA1 mutation (qualitative: 'yes' or 'no')

The same LOOCV procedure employed in the previous study was adopted here: feature selection and classifier parameters are learned on the training folds, and the performance is then tested on a held-out sample not used for training. The classification task was performed using the fuzzy classifier LAMDA. MEMBAS, based on the binomial membership function, is used here to derive a hybrid prognostic marker without resorting to any data transformation. To demonstrate the predictive power of the hybrid prognostic signature derived from the genetic and clinical markers, its performance was also compared with those of the clinical markers and of the well-known Amsterdam 70-gene signature (Van't Veer et al., 2002). Another comparison with purely clinical indices (NIH, St-Gallen) was also performed.

b- Results

Table 7.5 shows the comparative results obtained between the hybrid marker approach and the other approaches. It can be observed that the best overall prediction accuracy is obtained by the proposed approach, which achieves more than 87%. In particular, the hybrid signature provides an improved specificity compared to the 70-gene signature while maintaining a relatively high sensitivity (~88%). Compared to the 20-gene signature, the hybrid signature maintains an improved overall accuracy while gaining in sensitivity (2 more poor-prognosis patients are correctly identified). A comparison with conventional clinical prognostic factors (St-Gallen and NIH) is also reported in Table 7.5.
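The fold structure of this LOOCV protocol can be sketched in Python. The top-k mean-difference filter and the nearest-centroid classifier below are deliberately simple, hypothetical stand-ins for MEMBAS and LAMDA; the point is only that feature selection is re-run inside every fold, so the held-out patient never influences the selected features:

```python
def loocv_accuracy(X, y, k=2):
    """Leave-one-out CV with in-fold feature selection (avoids
    selection bias): rank features on the training folds only, then
    classify the held-out sample with a nearest-centroid rule."""
    def mean(vals):
        return sum(vals) / len(vals)

    n, m, correct = len(X), len(X[0]), 0
    for i in range(n):
        Xtr = [x for j, x in enumerate(X) if j != i]
        ytr = [c for j, c in enumerate(y) if j != i]
        # rank features by absolute difference of the class means
        scores = []
        for f in range(m):
            m0 = mean([x[f] for x, c in zip(Xtr, ytr) if c == 0])
            m1 = mean([x[f] for x, c in zip(Xtr, ytr) if c == 1])
            scores.append(abs(m0 - m1))
        selected = sorted(range(m), key=lambda f: -scores[f])[:k]
        # nearest-centroid classifier on the selected features only
        cent = {}
        for c in (0, 1):
            pts = [[x[f] for f in selected] for x, cc in zip(Xtr, ytr) if cc == c]
            cent[c] = [mean(col) for col in zip(*pts)]
        xi = [X[i][f] for f in selected]
        dist = {c: sum((a - b) ** 2 for a, b in zip(xi, cent[c])) for c in cent}
        if min(dist, key=dist.get) == y[i]:
            correct += 1
    return correct / n
```

Each of the n folds thus produces one prediction for one patient, and the confusion counts accumulated over the folds are what the comparison tables report.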
Both indices have a very high sensitivity but an intolerably low specificity, which would lead to giving unnecessary systemic adjuvant treatment to almost all patients. The hybrid signature thus also outperforms the purely clinical indices. Note that MEMBAS selects only 15 hybrid markers, among which three are mixed-type clinical markers (Angioinvasion, qualitative; Grade, interval; Age, quantitative), together with 12 genes, as listed in Table 7.6.

Tab. 7.5: Comparative results between hybrid, clinical and genetic signatures.

Method    | TP    | FP    | FN   | TN    | Sensitivity | Specificity | Accuracy
Hybrid    | 29/33 | 6/44  | 4/33 | 38/44 | 87.88       | 86.36       | 87.01
70-gene   | 27/33 | 9/44  | 6/33 | 35/44 | 81.82       | 79.55       | 80.52
20-gene   | 28/33 | 5/44  | 6/33 | 38/44 | 82.35       | 88.37       | 85.71
Clinical  | 26/33 | 14/44 | 7/33 | 30/44 | 78.79       | 68.18       | 72.73
St-Gallen | 33/33 | 39/44 | 0/33 | 5/44  | 100         | 6.49        | 50.65
NIH       | 33/33 | 44/44 | 0/33 | 0/44  | 100         | 0           | 42.86

Fig. 7.14. ROC curves of the hybrid, clinical and 70-gene signatures.

For further comparison, we plotted the ROC curves for the hybrid signature, the 70-gene signature and the clinical markers. Figure 7.14 shows that the hybrid signature outperforms both the 70-gene signature and the clinical markers. To further demonstrate the prognostic value of the hybrid signature, we performed survival analysis using four approaches (hybrid signature, 70-gene signature, clinical markers and St-Gallen criterion). The Kaplan-Meier curves with 95% confidence intervals for the four approaches are shown in Figure 7.15. In particular, we can see that the hybrid signature induces a significant difference in the probability of remaining metastases-free between patients with a good signature and patients with a poor prognostic signature (p-value < 0.001).
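The Kaplan-Meier curves and log-rank p-values used in this survival comparison can be reproduced with the product-limit estimator and the two-group log-rank statistic; the following is a minimal pure-Python sketch of these two standard computations, not the statistical software actually used (times in years, event = 1 for a distant metastasis, 0 for censoring):

```python
def kaplan_meier(times, events):
    """Product-limit (Kaplan-Meier) estimate of the metastasis-free
    probability; returns (time, survival) points at each event time."""
    order = sorted(range(len(times)), key=lambda i: times[i])
    at_risk, surv, curve = len(times), 1.0, []
    for i in order:
        if events[i] == 1:                  # event: step the curve down
            surv *= (at_risk - 1) / at_risk
            curve.append((times[i], surv))
        at_risk -= 1                        # censored patients simply leave

    return curve

def logrank_statistic(t1, e1, t2, e2):
    """Two-group log-rank chi-square statistic (1 degree of freedom),
    the test behind the Kaplan-Meier p-values reported here."""
    data = [(t, e, 0) for t, e in zip(t1, e1)] + [(t, e, 1) for t, e in zip(t2, e2)]
    observed = expected = variance = 0.0
    for t in sorted({t for t, e, _ in data if e == 1}):
        n1 = sum(1 for tt, _, g in data if tt >= t and g == 0)
        n = sum(1 for tt, _, _ in data if tt >= t)
        d = sum(1 for tt, e, _ in data if tt == t and e == 1)
        observed += sum(1 for tt, e, g in data if tt == t and e == 1 and g == 0)
        expected += d * n1 / n
        if n > 1:
            variance += d * (n1 / n) * (1 - n1 / n) * (n - d) / (n - 1)
    return (observed - expected) ** 2 / variance if variance > 0 else 0.0
```

A large statistic (above 3.84 for one degree of freedom) corresponds to a p-value below 0.05, i.e. a significant separation between the good- and poor-prognosis curves.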
The hazard ratio of distant metastases within five years, estimated by the Mantel-Cox approach, is 6.1 for the hybrid signature (95% CI: 3.22-11.48), which is higher than those of both the 70-gene signature and the clinical markers.

c- Analysis of the hybrid signature

Among the 12 genes of the hybrid signature, reported in Table 7.6, 4 genes are listed in the 70-gene signature and 4 in the 20-gene signature (with 2 in common). Note that the number of derived genetic markers is also significantly small compared to the number required to perform the cancer prognosis task using the 70-gene Amsterdam signature (12 versus 70 genes). A brief description of each marker is provided in Table 7.6 according to the National Center for Biotechnology Information (NCBI) databases.

Fig. 7.15. Kaplan-Meier estimation of the probabilities of remaining metastases-free for the good and poor prognosis groups (hybrid signature: p-value < 0.001, HR = 6.1, 95% CI: 3.22-11.48; 70-gene signature: p-value < 0.001, HR = 5.6, 95% CI: 3.1-10.2; clinical markers: p-value = 0.0019, HR = 2.32, 95% CI: 1.36-3.95; St-Gallen: p-value = 0.73, HR = 1.17, 95% CI: 0.46-2.92). The p-value is computed using the log-rank test.

Tab. 7.6: Notation and description of the hybrid signature.
1. Angioinvasion (-) N/A
2. Grade (-) N/A
3. Contig63649_RC (■) N/A
4. AL080059 (■, x) TSPYL5 — See Table 7.4.
5. NM_006544 (□, x) EXOC5 — See Table 7.4.
6. Contig55725_RC (■) N/A
7. NM_020974 (■, x) SCUBE2 — See Table 7.4.
8. Age (-) N/A
9. NM_019028 (□, x) ZDHHC13 — See Table 7.4.
10. NM_001787 (□) LOC87720 — Protein coding.
11. AJ011306 (□) EIF2B4 — Eukaryotic translation initiation factor 2B, subunit 4 delta, 67kDa [Homo sapiens]. Eukaryotic initiation factor 2B (EIF2B), which is necessary for protein synthesis, is a GTP exchange factor composed of five different subunits. The protein encoded by this gene is the fourth, or delta, subunit. Defects in this gene are a cause of leukoencephalopathy with vanishing white matter (VWM) and ovarioleukodystrophy. Multiple transcript variants encoding different isoforms have been found for this gene.
12. NM_012429 (□) SEC14L2 — SEC14-like 2 (S. cerevisiae) [Homo sapiens]. This gene encodes a cytosolic protein which belongs to a family of lipid-binding proteins including Sec14p, alpha-tocopherol transfer protein, and cellular retinol-binding protein. The encoded protein stimulates squalene monooxygenase, a downstream enzyme in the cholesterol biosynthetic pathway. Alternatively spliced transcript variants encoding different isoforms have been identified for this gene.
13. Contig14882_RC (□) N/A
14. Contig47042 (□) N/A
15. NM_005176 (□) ATP5G2 — ATP synthase, H+ transporting, mitochondrial Fo complex, subunit C2 (subunit 9) [Homo sapiens]. This gene encodes a subunit of mitochondrial ATP synthase. Mitochondrial ATP synthase catalyzes ATP synthesis, utilizing an electrochemical gradient of protons across the inner membrane during oxidative phosphorylation. ATP synthase is composed of two linked multi-subunit complexes: the soluble catalytic core, F1, and the membrane-spanning component, Fo, comprising the proton channel. The catalytic portion of mitochondrial ATP synthase consists of 5 different subunits (alpha, beta, gamma, delta, and epsilon) assembled with a stoichiometry of 3 alpha, 3 beta, and single representatives of the gamma, delta, and epsilon subunits. The proton channel likely has nine subunits (a, b, c, d, e, f, g, F6 and 8). There are three separate genes which encode subunit c of the proton channel; they specify precursors with different import sequences but identical mature proteins. The protein encoded by this gene is one of the three precursors of subunit c. Alternatively spliced transcript variants encoding different isoforms have been identified. This gene has multiple pseudogenes.

(■: listed in the 70-gene signature; □: not listed in the 70-gene signature; x: listed in the 20-gene signature; -: clinical marker; N/A: not available.)

The clinical markers included in the derived hybrid signature are Angioinvasion, Grade and Age. Interestingly, the first two markers have also been identified as important factors by similar studies (Sun et al., 2007a; Gevaert et al., 2006). Age has also been identified by (Gevaert et al., 2006) as a supplementary clinical marker, and it is still used in day-to-day clinical practice. Regarding the genetic markers, we find some of the genes included in the previously reported 20-gene signature (SCUBE2, TSPYL5, EXOC5, ZDHHC13) as well as new genes such as the eukaryotic translation initiation factor EIF2B4, which is necessary for protein synthesis, ATP5G2 and the cytosolic protein SEC14L2.

7.1.4 Symbolic gene selection to defy low signal-to-noise ratio for cancer prognosis

It has been reported recently that the major difficulties in deciphering high-throughput gene expression experiments come from the noisy nature of the data (Stolovitzky et al., 2002).
Data issued from high-throughput technology is indeed not only characterized by the dimensionality problem but also presents another challenging aspect related to its low signal-to-noise ratio. The noise in this type of data has multiple sources: biological and measurement noise, slide manufacturing errors, hybridization errors, and scanning errors of the hybridized slide (see section 2.4.3, chapter 2, for more details). Existing feature selection and classification approaches generally assume that microarray data is perfect, without questioning its reliability. The lack of appropriate methods does not mean that machine learning approaches are unable to tackle such problems. An interesting approach, for instance, is to use symbolic data analysis (SDA) (Bock and Diday, 2000) to model the uncertainty and noise inherent to gene expression measurements by an interval representation (Billard, 2008). Symbolic interval features are extensions of pure real data types, in the sense that each feature may take an interval of values instead of a single value (Gowda and Diday, 1992). In this framework, the value of a quantity x (e.g. a gene expression value) is expressed as a closed interval [x-, x+] whenever x is noisy or uncertain, representing the information that x- ≤ x ≤ x+. What is really needed, therefore, is an approach able to efficiently process high-dimensional interval datasets. We take advantage here of our proposed approaches, which support such requirements, to derive a more robust gene signature for cancer prognosis from microarray datasets.

a- Dataset and experimental setup

We use again the Van't Veer data set of 78 patients (Van't Veer et al., 2002) to derive a signature for cancer prognosis. In order to take into account the uncertainty in gene expression measurements under the form of symbolic intervals, an appropriate setup should be followed. The m gene expression levels are initially represented in a matrix X = [x1, x2, ..., xm] where m is the number of genes.
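The interval representation just described, where a noisy value x becomes [x-, x+], can be sketched as follows; the function name and the use of a linear power signal-to-noise ratio to set the noise level are illustrative assumptions, not the thesis implementation:

```python
import math
import random

def to_interval_dataset(X, snr=3.0, seed=0):
    """Wrap each crisp expression value x into a symbolic interval
    [x - b, x + b], where b is the absolute value of a white Gaussian
    noise draw whose power is fixed by the requested signal-to-noise
    ratio (a linear power SNR is assumed here)."""
    rng = random.Random(seed)
    values = [v for row in X for v in row]
    signal_power = sum(v * v for v in values) / len(values)
    sigma = math.sqrt(signal_power / snr)    # noise std. dev. from the SNR
    Y = []
    for row in X:
        new_row = []
        for x in row:
            b = abs(rng.gauss(0.0, sigma))   # per-measurement amplitude
            new_row.append((x - b, x + b))   # interval [x-, x+]
        Y.append(new_row)
    return Y
```

By construction every interval is centered on the original crisp value and contains it, so the information x- ≤ x ≤ x+ holds for each gene.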
The microarray interval dataset is generated by adding white Gaussian noise with a specific signal-to-noise ratio (SNR = 3). Considering that the added white Gaussian noise has absolute value b, the jth interval feature yj = [yj-, yj+] corresponding to the jth gene with expression xj is generated as follows:

yj- = xj - b
yj+ = xj + b

It results that yj = [yj-, yj+] = [xj - b, xj + b]. At the end of this step the m gene expression levels are represented in a matrix Y = [y1, y2, ..., ym] where yj is an interval vector. Once the microarray interval dataset is obtained, our proposed approaches can be used to derive a genetic signature. To do so, we adopted a LOOCV procedure similar to the previous one to assess the predictive value of this symbolic gene signature, referred to here as GenSym.

b- Results

The GenSym signature was derived with the MEMBAS approach, corresponding to the optimal classification performance using the LAMDA classifier. Note that both MEMBAS and LAMDA can appropriately handle interval data for classification and feature selection (see previous chapters for more details). Table 7.7 shows the classification performance obtained with LAMDA using the GenSym signature. For comparison, the classification performances using the 70-gene signature, clinical markers, the St-Gallen consensus and the NIH criterion are also reported in Table 7.7. We observe that the GenSym signature significantly outperforms the 70-gene signature, the clinical markers and the classical clinical criteria (St-Gallen, NIH). GenSym indeed achieves a high accuracy (~90%) while significantly improving the specificity and sensitivity of the 70-gene signature (by more than 10% and 5% respectively). Moreover, GenSym improves the sensitivity of the previously derived 20-gene signature and improves the specificity of the hybrid signature while maintaining the high sensitivity of the latter. Tab.
7.7: Comparative results between GenSym, clinical and genetic signatures.

Method    | TP    | FP    | FN   | TN    | Sensitivity | Specificity | Accuracy
GenSym    | 29/33 | 4/44  | 4/33 | 40/44 | 87.88       | 90.91       | 89.61
70-gene   | 27/33 | 9/44  | 6/33 | 35/44 | 81.82       | 79.55       | 80.52
20-gene   | 28/33 | 5/44  | 6/33 | 38/44 | 82.35       | 88.37       | 85.71
Clinical  | 26/33 | 14/44 | 7/33 | 30/44 | 78.79       | 68.18       | 72.73
St-Gallen | 33/33 | 39/44 | 0/33 | 5/44  | 100         | 6.49        | 50.65
NIH       | 33/33 | 44/44 | 0/33 | 0/44  | 100         | 0           | 42.86

For further comparison of the different approaches, we plotted in Figure 7.16 the ROC curves for the GenSym, 20-gene, 70-gene and clinical approaches. It can be observed that the GenSym signature significantly outperforms the 20-gene and 70-gene signatures as well as the clinical markers.

Fig. 7.16. ROC curves of the GenSym, 20-gene, 70-gene and clinical approaches.

For a more rigorous comparison, survival analysis for the four approaches was also performed to further demonstrate the predictive value of the GenSym signature. The Kaplan-Meier curve with 95% confidence intervals of the GenSym signature, plotted in Figure 7.17, exhibits a significant difference in the probability of remaining free of distant metastases between patients with a good signature and patients with a poor prognostic signature (p-value < 0.001). The hazard ratio of distant metastases within five years, estimated by the Mantel-Cox approach, is 8.20 for the GenSym-23 signature (95% CI: 4.16-16.2), which is higher than those of both the 70-gene signature and the clinical markers.

c- Analysis of the GenSym signature

The GenSym signature is composed of 23 genes, given in Table 7.8, among which 12 genes are listed in the 70-gene signature. A brief description of each gene is provided in Table 7.8 according to the National Center for Biotechnology Information (NCBI) databases.
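For reference, ROC curves like those compared in Figure 7.16 can be traced directly from classifier scores by lowering the decision threshold one sample at a time; a minimal sketch (it assumes higher scores mean poorer predicted prognosis, with label 1 for the poor-prognosis class):

```python
def roc_points(scores, labels):
    """ROC curve as (1 - specificity, sensitivity) pairs obtained by
    visiting the samples from highest to lowest score, i.e. sweeping
    the decision threshold over all classifier outputs."""
    pos = sum(labels)
    neg = len(labels) - pos
    tp = fp = 0
    pts = [(0.0, 0.0)]
    for _, y in sorted(zip(scores, labels), reverse=True):
        if y == 1:
            tp += 1     # one more poor-prognosis patient detected
        else:
            fp += 1     # one more good-prognosis patient misclassified
        pts.append((fp / neg, tp / pos))
    return pts
```

A curve hugging the upper-left corner (the point (0, 1)) corresponds to a signature that detects poor-prognosis patients without sacrificing specificity.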
In addition to the few genes identified in the previous signatures (TSPYL5, MMP9, NMU), the GenSym signature contains many new meaningful genes (such as FBP1, IGFBP5, FGF18, SSX1, NUSAP1, C1GALT1, BTG2, PEX12). The importance of FBP1 and IGFBP5 can be highlighted by the suspected relation between insulin and tumor growth, although neither gene has been evaluated independently in human cancers. FBP1 has, however, also been found strongly associated with disease outcome among the 231 top-ranked genes in (Van't Veer et al., 2002). FGF18 has been clearly shown to be involved in the carcinogenesis of ~10% of breast cancers. NUSAP1 has also been found to be related to proliferation and cell division. SSX1 is involved in certain sarcomas; it controls the cell cycle and is considered an important transcription factor. C1GALT1 is a protein that plays an important role in cell adhesion, whereas BTG2 is considered a tumor suppressor.

Fig. 7.17. Kaplan-Meier estimation of the probabilities of remaining metastases-free for the good and poor prognosis groups (GenSym-23: p-value < 0.001, HR = 8.20, 95% CI: 4.16-16.2; 70-gene signature: p-value < 0.001, HR = 5.6, 95% CI: 3.1-10.2; clinical markers: p-value = 0.0019, HR = 2.32, 95% CI: 1.36-3.95; St-Gallen: p-value = 0.73, HR = 1.17, 95% CI: 0.46-2.92). The p-value is computed using the log-rank test. Tab.
7.8: Notation and description of the GenSym signature (■: listed in the 70-gene signature; □: not listed in the 70-gene signature; N/A: not available).

1. Contig37063_RC (□) N/A
2. Contig26388_RC (□) N/A
3. NM_003748 (■) ALDH4A1 — See Table 7.4.
4. NM_006681 (■) NMU — See Table 7.4.
5. NM_000507 (□) FBP1 — Fructose-1,6-bisphosphatase 1 [Homo sapiens]. The protein encoded by this gene, a gluconeogenesis regulatory enzyme, catalyzes the hydrolysis of fructose 1,6-bisphosphate to fructose 6-phosphate and inorganic phosphate. Fructose-1,6-diphosphatase deficiency is associated with hypoglycemia and metabolic acidosis.
6. AF055033 (■) IGFBP5 — Insulin-like growth factor binding protein 5 [Homo sapiens].
7. NM_000286 (□) PEX12 — See Table 7.4.
8. AL080059 (■) TSPYL5 — See Table 7.4.
9. Contig33814_RC (□) N/A
10. NM_012429 (□) SEC14L2 — See Table 7.6.
11. NM_000599 (■) IGFBP5 — Insulin-like growth factor binding protein 5 [Homo sapiens].
12. NM_003862 (■) FGF18 — Fibroblast growth factor 18 [Homo sapiens]. The protein encoded by this gene is a member of the fibroblast growth factor (FGF) family. FGF family members possess broad mitogenic and cell survival activities and are involved in a variety of biological processes, including embryonic development, cell growth, morphogenesis, tissue repair, tumor growth, and invasion. It has been shown in vitro that this protein is able to induce neurite outgrowth in PC12 cells. Studies of the similar proteins in mouse and chick suggest that this protein is a pleiotropic growth factor that stimulates proliferation in a number of tissues, most notably the liver and small intestine. Knockout studies of the similar gene in mice implied a role for this protein in regulating the proliferation and differentiation of midline cerebellar structures.
13. Contig63649_RC (■) N/A
14. NM_004994 (■) MMP9 — See Table 7.4.
15. Contig11065_RC (■) N/A
16. Contig32185_RC (■) N/A
17. NM_016359 (■) NUSAP1 — Nucleolar and spindle associated protein 1, a nucleolar-spindle-associated protein that plays a role in spindle microtubule organization (Raemaekers et al., 2003).
18. Contig15954_RC (□) N/A
19. NM_005635 (□) SSX1 — Synovial sarcoma, X breakpoint 1. The product of this gene belongs to the family of highly homologous synovial sarcoma X (SSX) breakpoint proteins. These proteins may function as transcriptional repressors. They are also capable of spontaneously eliciting humoral and cellular immune responses in cancer patients, and are potentially useful targets in cancer vaccine-based immunotherapy. The SSX1, SSX2 and SSX4 genes have been involved in the t(X;18) translocation characteristically found in all synovial sarcomas. This translocation results in the fusion of the synovial sarcoma translocation gene on chromosome 18 to one of the SSX genes on chromosome X. The encoded hybrid proteins are probably responsible for transforming activity.
20. Contig49388_RC (■) N/A
21. Contig52554_RC (□) N/A
22. NM_020156 (□) C1GALT1 — Core 1 synthase, glycoprotein-N-acetylgalactosamine 3-beta-galactosyltransferase, 1 [Homo sapiens]. The protein encoded by this gene generates the common core 1 O-glycan structure, Gal-beta-1-3GalNAc-R, by the transfer of Gal from UDP-Gal to GalNAc-alpha-1-R. Core 1 is a precursor for many extended mucin-type O-glycans on cell surface and secreted glycoproteins. Studies in mice suggest that this gene plays a key role in thrombopoiesis and kidney homeostasis.
23. NM_006763 (□) BTG2 — BTG family, member 2 [Homo sapiens]. The protein encoded by this gene is a member of the BTG/Tob family, whose structurally related proteins appear to have antiproliferative properties. The encoded protein is involved in the regulation of the G1/S transition of the cell cycle.

7.2 Systemic responsiveness prediction to neoadjuvant treatment in breast cancer patients

Accurate prediction of treatment response in breast cancer can significantly decrease the number of patients receiving unnecessary systemic treatment and reduce its high medical costs. Currently, the selection of patients eligible for a treatment is generally based on classical factors such as tumor grade, age and lymph node status. However, the high heterogeneity of breast cancer highlights the need to design treatment regimens tailored specifically to each molecular subtype. HER2-overexpressing breast cancer, for instance, has an aggressive biological behavior and a poor prognosis, requiring the design of specific treatment regimens. Although trastuzumab (Herceptin) has been shown to be a remarkably valuable therapeutic in certain HER2-overexpressing breast cancer patients, its overall response rate is still limited and its mechanism of action is not yet well understood. Indeed, fewer than 35% of patients with HER2-overexpressing metastatic breast cancer respond to trastuzumab as a single therapy, whereas ~5% of patients suffer from severe side effects (e.g. cardiac dysfunction) and 40% of patients experience other adverse effects (Fujita et al., 2006). Therefore, the identification of new predictive markers for trastuzumab is urgently required to reduce the number of patients undergoing side effects and unnecessary costs. The present study aims to identify new predictors of therapeutic responsiveness, among both the available proteomic and clinical marker information, in HER2-overexpressing invasive breast cancer receiving neoadjuvant treatment. We used our proposed feature selection approach, which handles mixed-type data, to derive a set of predictive factors.
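Handling quantitative, qualitative and interval features without forcing them all into a single numeric representation is the key ingredient of such mixed-type feature selection; the per-type similarity idea can be sketched as below (an illustration of the principle only, not the MEMBAS membership functions):

```python
def similarity(value, prototype, kind):
    """Per-type similarity between a feature value and a class
    prototype, so that heterogeneous features can be compared without
    a lossy transformation to a common numeric scale."""
    if kind == "quantitative":           # values assumed scaled to [0, 1]
        return 1.0 - abs(value - prototype)
    if kind == "qualitative":            # e.g. 'yes'/'no' modalities
        return 1.0 if value == prototype else 0.0
    if kind == "interval":               # overlap over the union (Jaccard)
        (a, b), (c, d) = value, prototype
        inter = max(0.0, min(b, d) - max(a, c))
        union = max(b, d) - min(a, c)
        return inter / union if union > 0 else 1.0
    raise ValueError("unknown feature type: " + kind)
```

Each feature type is thus mapped to a comparable membership-like value in [0, 1], which is what allows qualitative markers (e.g. Angioinvasion), interval markers (e.g. Grade) and quantitative markers (e.g. Age) to be weighted and ranked together.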
a- Material and methods

Fifty-three patients with HER2-overexpressing invasive breast carcinoma received trastuzumab-based neoadjuvant treatment at the cancer institute of Toulouse (ICR). The pathological response was evaluated on surgical specimens and categorized as complete response (pCR) (no residual or minimal invasive carcinoma) or incomplete response (pIR) (residual invasive carcinoma) according to the Sataloff criteria (Zindy et al., 2011). In the present study, among the 53 HER2-positive invasive breast cancer patients that received specific neoadjuvant treatment, 20 (37.73%) achieved a pCR and 33 (62.26%) did not. Each patient is characterized by 14 features (proteomic and clinicopathological factors, see Table 7.9 for more details) and the outcome (pCR), listed below:
1. ER
2. PgR
3. Involved lymph nodes
4. HER3
5. PTEN
6. CMYC
7. 4-EBP1
8. eI4E
9. HER2
10. HER4
11. PAX2
12. EGFR
13. Age (qualitative: <40, 40-50, >50)
14. Grade (interval: [3-5]; [6-7]; [8-9])
15. pCR: outcome (positive vs negative)

b- Results and discussion

In order to select important predictive factors, we applied MEMBAS to this dataset and obtained the weights shown in Figure 7.18 for each factor. The marker ranking is reported in Table 7.9 with a brief description of each marker's biological role in breast cancer evolution. It can be observed that, among the 14 markers, only six have relatively significant weights; the remaining factors seem to be weakly relevant to the prediction task.

Fig. 7.18. Marker weights obtained by MEMBAS.

Tab. 7.9. List of ranked predictive factors obtained by MEMBAS.

1. PTEN — Phosphatase and tensin homologue deleted on chromosome 10: a protein that helps control many cell functions, including cell division and cell death. Mutations (changes) in the gene that makes PTEN are found in many types of cancer and other diseases. It is a type of tumor suppressor protein. Also called PTEN tyrosine phosphatase.
2. HER2 — Human epidermal growth factor receptor-2 status: see Glossary.
3. eI4E — Eukaryotic translation initiation factor.
4. EGFR/HER1 — Epidermal growth factor receptor: the protein found on the surface of some cells and to which epidermal growth factor binds, causing the cells to divide. It is found at abnormally high levels on the surface of many types of cancer cells, so these cells may divide excessively in the presence of epidermal growth factor. Also called epidermal growth factor receptor, ErbB1, and HER1.
5. HER4 — Human epidermal growth factor receptor-4 status: tumor gene repressor.
6. PgR — Progesterone receptor: see Glossary.
7. 4-EBP1 — Translation repressor.
8. Pax2 — Paired box gene 2: PAX2 encodes paired box gene 2, one of many human homologues of the Drosophila melanogaster gene prd. The central feature of this transcription factor gene family is the conserved DNA-binding paired box domain. PAX2 is believed to be a target of transcriptional suppression by the tumor suppressor gene WT1.
9. CMYC — C-MYC: codes for a transcription factor that is located on chromosome 8 in humans and is believed to regulate expression of 15% of all genes through binding on Enhancer Box sequences (E-boxes) and recruiting histone acetyltransferases (HATs). This means that in addition to its role as a classical transcription factor, Myc also functions to regulate global chromatin structure by regulating histone acetylation, both in gene-rich regions and at sites far from any known gene.
10. Involved lymph nodes — see Glossary.
11. HER3 — Human epidermal growth factor receptor-3 status: tumor gene repressor.
12. ER — Estrogen receptor: see Glossary.
13. Age.
14. Grade.

The LAMDA classifier was then used to assess the importance of the selected factors by retaining only the set of markers optimizing its classification performance. It was found that the four top-ranked markers by MEMBAS (PTEN, HER2, eI4E, EGFR) provide the optimal classification performance. Interestingly, the relation of both PTEN and HER2 to the response to trastuzumab is well established in the cancer research literature (Fujita et al., 2006; Vogel et al., 2002), and they are recognized as powerful predictive factors. Until recently, patients with metastatic disease were selected for trastuzumab therapy if the primary tumor overexpressed the HER2 protein or showed HER2 gene amplification. However, in spite of the importance of the HER2 marker, less than 30% of patients respond to trastuzumab. This highlights the fact that HER2 gene amplification is a necessary biomarker but is not sufficient to predict the efficacy of trastuzumab (Fujita et al., 2006). Recently, PTEN has been found to be one of the most common targets of mutation in human cancer, and a decreased PTEN expression is associated with invasive breast cancer and poor prognosis (Fujita et al., 2006). It has also been reported therein that PTEN is a powerful predictive marker for the efficacy of trastuzumab in drug-resistant and parental HER2-overexpressing breast cancer cells. The eukaryotic translation initiation factor eIF4E is one of the most prominent downstream effectors of mTOR (mammalian Target Of Rapamycin) signaling. It has been reported that a high level of eIF4E is often associated with poor prognosis (Byrnes et al., 2006; Zhou et al., 2004).
In a recent study using the same group of patients, it was found that ectopic expression of eIF4E in breast cancer tumors led to a loss of the trastuzumab-dependent decrease in both eIF4F formation and cell proliferation (Zindy et al., 2011). This highlights the possible association between the expression of eIF4E and the pathological response to trastuzumab. A validation of this finding is underway on an independent multicenter cohort of patients. The epidermal growth factor receptor (EGFR) is observed in 19-67% of malignant breast tumors and also appears to correlate with an adverse prognosis (Hudelist et al., 2005). The receptors EGFR and HER2, both from the EGF family, are linked to each other in an interdependent signaling network of considerable complexity (Hudelist et al., 2005). It has been reported that EGFR kinase activity largely depends upon the integrity of the HER2 kinase domain. Likewise, it has been found that the inhibition of EGFR kinase activity may be attenuated by HER2 overexpression. Conversely, HER2 activation is also strongly influenced by the presence and activation of EGFR (Hudelist et al., 2005). It is therefore not surprising to consider the EGFR marker in predicting the course of disease in patients receiving trastuzumab-based therapy. Classification performance using these four markers is reported in Table 7.10. To show the effectiveness of the four-marker combination, we compared this result with the performance of two different predictors: 1) when only the two classical markers (PTEN and HER2) are used; and 2) when all available data (proteomic and clinical) are considered. It can be observed that the 4-marker combination outperforms both the 2-marker approach (HER2+PTEN) and the all-data approach. In particular, the 4-marker combination (HER2+PTEN+eI4E+EGFR) significantly improves the specificity (above 80%) compared to the 2-marker combination.
Figure 7.19 shows the obtained class profile for each marker.

Tab. 7.10. Comparative results between the 4-marker, 2-marker and all-data approaches.

Method      TP     FP     FN     TN     Sensitivity (%)  Specificity (%)  Accuracy (%)
4-markers*  15/20  6/33   5/20   27/33  75               81.82            79.25
HER2+PTEN   15/20  11/33  5/20   22/33  75               66.67            69.81
All data    9/20   7/33   11/20  26/33  45               46.48            66.04
(*): PTEN+HER2+eI4E+EGFR

Fig. 7.19 Profile of the negative and positive classes for the four markers (PTEN, eI4E, HER2, EGFR).

To further demonstrate the predictive value of the 4-marker combination, we plotted in Figure 7.20 the ROC curve of each of the three predictors. It can be observed that the 4-marker combination significantly outperforms the other approaches.

Fig. 7.20 ROC curves of the three approaches (4-markers, all data, HER2+PTEN).

7.3 Conclusion

In this chapter we presented some applications of our proposed approaches to breast cancer. We focused throughout this chapter on two main breast cancer management tasks: prognosis and treatment responsiveness prediction. First, an application of cancer prognosis based only on heterogeneous clinical data was shown. Through this application we have shown that the feature weighting approach selects meaningful clinical factors. Two other feature selection approaches were tested on the same problem in order to compare their performance with that of our proposal. In the second application, cancer prognosis is based only on microarray data, through the derivation of a prognostic signature. Results obtained using several criteria have shown that the predictive value of the 20-gene prognostic signature can be superior to that of other existing prognostic signatures and classical clinical guidelines. In particular, the 20-gene signature significantly improves the specificity of one of the well-known genetic approaches (the 70-gene signature).
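The metrics in Tab. 7.10 follow directly from the confusion-matrix counts. As a quick illustration (a minimal sketch, not code from this work; the function name is ours), the 4-marker row can be recomputed as:

```python
# Recompute the Tab. 7.10 metrics from raw confusion-matrix counts
# (4-marker predictor: 20 positive and 33 negative responders).
def confusion_metrics(tp, fp, fn, tn):
    """Return (sensitivity, specificity, accuracy) in percent."""
    sensitivity = 100.0 * tp / (tp + fn)                # positives correctly detected
    specificity = 100.0 * tn / (tn + fp)                # negatives correctly detected
    accuracy = 100.0 * (tp + tn) / (tp + fp + fn + tn)  # all correct decisions
    return sensitivity, specificity, accuracy

sens, spec, acc = confusion_metrics(tp=15, fp=6, fn=5, tn=27)
print(round(sens, 2), round(spec, 2), round(acc, 2))  # 75.0 81.82 79.25
```

The same three lines reproduce the HER2+PTEN and all-data rows when fed their respective counts.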
The third application was devoted to investigating the integration of both clinical and microarray data. In such applications, the problems of data heterogeneity and high dimensionality must be faced jointly. We have taken advantage of the interesting property of the proposed approach, which handles both problems simultaneously, to derive a hybrid prognostic signature. We have then shown through some analyses that the integration of both data types may improve breast cancer prognosis. The hybrid signature improves the sensitivity of the 20-gene signature while maintaining comparable specificity. To counter the low signal-to-noise ratio of microarray data in cancer prognosis, a symbolic approach has been considered to derive a more robust prognostic signature, referred to as GenSym. We first described the generation of the microarray interval dataset by incorporating white Gaussian noise with a specific signal-to-noise ratio. We have shown through experiments and analyses that the GenSym signature can outperform other existing approaches. In particular, it keeps the good sensitivity achieved by the hybrid signature while further improving on the good specificity of the 20-gene signature. Moreover, the gene list of this signature holds meaningful genes related to invasion, cell cycle and proliferation. We believe that this first attempt in that direction also opens the door for the machine learning community to investigate other approaches to this problem. The last application concerns the problem of responsiveness prediction to neoadjuvant treatment in HER2-overexpressing breast cancer patients. Using our proposed approach, we derived a signature constituted of four markers (PTEN, HER2, eI4E, EGFR) that significantly improves the discrimination between positive and negative responders compared to the usually used 2-marker approach (PTEN, HER2).
In particular, the 4-marker combination significantly improves the specificity of the 2-marker combination. This highlights the importance of two new predictive factors (eI4E, EGFR) for accurately predicting the responsiveness of HER2-overexpressing breast cancer patients to neoadjuvant treatment.

Conclusion and Perspectives - Summary

Our objective in this work was to develop new tools for a more accurate management of breast cancer. We present here an attempt to propose suitable approaches within the machine learning framework, making it possible to overcome the main recent challenges encountered in the cancer field, such as the high dimensionality of the information to be processed, measurement noise, the uncertainty of a patient's membership in the different cancer subtypes, and the heterogeneity of the data (quantitative or symbolic). In a first piece of work, an embedded feature selection approach based on ℓ1 learning, able to handle high-dimensional data, was proposed. In particular, this approach proposes a new algorithm to solve the ℓ1SVM problem in the primal domain. However, with the recent trend towards an integrative bioinformatics that aims to integrate different data sources, the joint occurrence of three challenges is possible in some applications. To face these three challenges simultaneously, a second approach was proposed. First, a unified principle for dealing with the problem of data heterogeneity was established. Then, a supervised fuzzy feature weighting approach was proposed on the basis of this principle. The weighting process is mainly based on the optimization of an objective function incorporating the notion of membership margin.
Based on the same principle, the weighting method was then extended to the unsupervised case in order to develop a new fuzzy rule-based weighting algorithm for the clustering task. The effectiveness of all these approaches was validated in an extensive experimental study and compared with that of well-known methods from the literature. Finally, some applications in the breast cancer field were carried out using the proposed approaches. These applications mainly concerned the development of prognostic and predictive models from the analysis of DNA microarray data and/or clinical data. We have shown through a comparative study the effectiveness of these models in terms of accuracy and survival determination. We also examined the interpretation of these signatures from a biological point of view. Finally, perspectives of this work, both methodological and applicative, were presented.

Conclusion and Future Work

Our aim in this work was to develop new tools for breast cancer management to help physicians in their decision-making practices. To this end, an attempt to propose suitable approaches has been made within the machine learning framework, to enable handling of the main recent challenges encountered in the breast cancer management field. Some challenges are due to the intrinsic complexity of data issued from the high-throughput technologies recently introduced in cancer management, such as microarrays. Gene expression profiling, through microarray technology, has indeed brought the hope of gaining new insights into cancer biology, but meanwhile requires smart approaches capable of handling high-dimensional data and uncertainties. Uncertainties can take the form of either measurement noise or the membership uncertainty of a patient with respect to different cancer subtype groups.
Another challenge is related to the use of traditional clinical factors, which are characterized by their heterogeneity; the data can be of quantitative or symbolic type. In a first piece of work, an embedded feature selection approach based on ℓ1 learning, able to deal with high-dimensional data, has been proposed. This approach proposes a new algorithm to solve the ℓ1SVM problem in the primal domain. The basic idea is the transformation of the initial convex optimization problem into an unconstrained one, for which reaching a globally optimal solution via a gradient descent method is guaranteed. The non-differentiability of the hinge loss function has been overcome by using its Huber loss approximation. It has been shown through large-scale numerical experiments that the proposed approach is computationally more efficient than the few existing methods solving the same problem. However, with the recent trend towards an integrative bioinformatics that aims to integrate different data sources, the simultaneous occurrence of three challenges is possible in some cancer applications. To deal simultaneously with these three challenges, namely data dimensionality, heterogeneity and uncertainties, a second approach has been proposed. First of all, a unified principle to deal with the data heterogeneity problem has been established. To take membership uncertainty into account and increase model interpretability, this principle has been proposed within a fuzzy logic framework. Besides, in order to alleviate the problem of high-level noise, a symbolic approach has been developed, suggesting the use of an interval representation to model the noisy measurements. This principle is based on the mapping of different types of data from initially heterogeneous spaces into a common space through an adequacy measure. This then allows reasoning about the data in the new space in a unified way, whatever its initial type, for different data analysis purposes.
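The Huber smoothing step can be sketched as follows (a minimal illustration with our own naming, not the thesis implementation): the hinge loss max(0, 1 - z) is replaced by a surrogate that is quadratic around the hinge point and therefore differentiable everywhere, which is what makes plain gradient descent applicable.

```python
import numpy as np

def huber_hinge(z, h=0.5):
    """Huberized hinge loss: differentiable surrogate of max(0, 1 - z).

    Zero for z > 1 + h, linear (hinge-like) for z < 1 - h, and quadratic
    on [1 - h, 1 + h]; the pieces join continuously, so h > 0 controls
    how closely the smooth loss tracks the original hinge.
    """
    z = np.asarray(z, dtype=float)
    return np.where(z > 1 + h, 0.0,
                    np.where(z < 1 - h, 1.0 - z,
                             (1 + h - z) ** 2 / (4 * h)))

# Far on the correct side the loss vanishes; deep on the wrong side it is
# hinge-like; exactly at the margin it takes a small quadratic value.
print(float(huber_hinge(2.0)), float(huber_hinge(0.0)), float(huber_hinge(1.0)))
# 0.0 1.0 0.125
```

Here z stands for the classifier margin y·f(x); as h shrinks, the surrogate converges to the original hinge loss.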
In particular, a supervised fuzzy feature weighting approach has been proposed based on this principle. This approach has been integrated, through a fuzzy weighted rule concept, into a fuzzy rule-based classifier with the aim of improving its performance. In addition to its ability to handle the problems of data heterogeneity and uncertainties, the proposed approach scales to high data dimensionality. The weighting process is mainly based on the definition of a membership margin for each sample. It then optimizes a membership-margin objective function using a classical optimization approach, avoiding combinatorial search. The effectiveness of this approach has been assessed through an extensive experimental study and compared with well-known feature selection methods. Based on the same principle, the weighting approach has then been extended to the unsupervised case in order to develop a new weighted fuzzy rule-based clustering algorithm. An extensive study has also been performed to compare this algorithm with one of the state-of-the-art clustering algorithms. Finally, some breast cancer applications have been presented. These applications mainly concerned cancer prognosis and the prediction of treatment benefit. Predictive and prognostic models were derived based on microarray and/or clinical data. We have shown through a comparison the effectiveness of these models in terms of accuracy and survival prediction. Since the aim of developing new predictive tools for breast cancer prognostication from microarray data is twofold, i.e. to yield good prediction performance and to gain new biological insights into cancer biology, we have also shown that the derived models were interpretable from a biological point of view.
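The notion of membership margin underlying the weighting process can be illustrated with a small sketch (our own illustrative formulation, not the thesis code): a sample's margin is its membership degree to its own class minus its largest membership degree to any other class, so a positive margin means the fuzzy classifier labels the sample correctly, and the weighting objective pushes margins to be large.

```python
import numpy as np

def membership_margin(memberships, label):
    """Membership margin of one sample.

    `memberships` holds the sample's fuzzy membership degree to each class;
    `label` is the index of its true class.  The margin is the membership
    to the true class minus the best membership among the other classes.
    """
    memberships = np.asarray(memberships, dtype=float)
    others = np.delete(memberships, label)   # membership degrees to the other classes
    return float(memberships[label] - others.max())

# A correctly classified sample (positive margin) vs. a misclassified one:
print(round(membership_margin([0.7, 0.2, 0.1], 0), 6),
      round(membership_margin([0.3, 0.6, 0.1], 0), 6))  # 0.5 -0.3
```

In the supervised weighting scheme described above, these per-sample margins are aggregated into the objective function that the feature weights optimize.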
In particular, the applications have concerned: 1) cancer prognosis based only on clinical data; 2) derivation of a 20-gene signature for cancer prognosis based on microarray data; 3) derivation of a hybrid signature for cancer prognosis based on the integration of clinical and microarray data; 4) derivation of a more robust prognostic signature (referred to as GenSym) based on a symbolic approach, by modeling the noisy microarray measurements as symbolic intervals; 5) derivation of a 4-marker signature for the prediction of neoadjuvant treatment benefit in HER2-overexpressing breast cancer patients. Future work within the framework of the present study is twofold:

a) Methodological
- The implementation of the proposed approach in one package with a suitable interface intended for medical applications is underway, to facilitate its use by clinicians. This package should also enable comparison with existing predictive models.
- In the present work we consider only three types of data (quantitative, qualitative and interval data). Although these are the most commonly used types of data, there are other types which could be encountered in many real-world applications (such as histograms, fuzzy numbers, …). It will therefore be interesting to investigate the extension of this approach to further types of data.
- Another interesting direction would be to propose other shapes of membership function for the different types of data and investigate their use.

b) Cancer applications
- With the recent trend towards an integrative bioinformatics, it will be interesting to use the proposed approach to integrate other sources of data beyond microarray and clinical data alone. High-dimensional data will be generated by new high-throughput technologies, e.g. single nucleotide polymorphism (SNP) or comparative genomic hybridization (CGH), at a continuously growing rate.
Therefore, with this huge quantity of data, an increased need by physicians for more effective tools is expected, enabling them to extract useful biological knowledge and gain insights into cancer biology.
- We considered here only breast cancer. However, this approach can be applied to the derivation of molecular signatures for other types of cancer.
- It has recently been reported that breast cancer is a very heterogeneous disease and can be divided into several molecular subtypes. It is possible to investigate the application of this approach to derive a molecular signature for each subtype.
- The factor of time is to date still neglected in the design of predictive and prognostic tools. Cancer progression is strongly related to time, and we believe that taking this factor into account, jointly with molecular cancer subtyping, can play a central role in improving cancer management tools. This direction can also be investigated using the proposed approach.

Moreover, it must also be noted here that the proposed approaches have been applied with success in other fields related to dynamical system diagnosis, such as the diagnosis of chemical reactors (Hedjazi et al., 2010c; Hedjazi et al., 2011e; Hedjazi et al., 2011f). Finally, it is worthwhile to note that the results presented in this work are now being used in a recent ANR project (INNODIAG: Innovation in molecular diagnostics in health using the latest developments in nanotechnology, applied to breast cancer prognosis). This project is concerned first of all with the selection of a set of genetic and clinical markers for breast cancer treatment, derived from the signatures obtained during the present work. A large number of public datasets, issued from different medical centers and using different technologies, is being used for signature extraction and validation.
This set of biomarkers will then be tested and compared to other signatures on a pool of patients from the Institut Claudius Regaud. In parallel, a new biochip generation relying on soft lithography and optical detection will be developed. A first prototype based on the optimal signature derived in the first part will then be designed. This new type of biochip will enable a move toward personalized medicine and will help clinicians and oncologists to select the optimal cancer treatment.

Glossary of Cancer Terms

Estrogen receptor: A protein found inside the cells of the female reproductive tissue, some other types of tissue, and some cancer cells. The hormone estrogen will bind to the receptors inside the cells and may cause the cells to grow. Also called ER.

Estrogen receptor negative: Describes cells that do not have a protein to which the hormone estrogen will bind. Cancer cells that are estrogen receptor negative do not need estrogen to grow, and usually do not stop growing when treated with hormones that block estrogen from binding. Also called ER-.

Estrogen receptor positive: Describes cells that have a receptor protein that binds the hormone estrogen. Cancer cells that are estrogen receptor positive may need estrogen to grow, and may stop growing or die when treated with substances that block the binding and actions of estrogen. Also called ER+.

Estrogen receptor test: A lab test to find out if cancer cells have estrogen receptors (proteins to which estrogen will bind). If the cells have estrogen receptors, they may need estrogen to grow, and this may affect how the cancer is treated.

Progesterone receptor: A protein found inside the cells of the female reproductive tissue, some other types of tissue, and some cancer cells. The hormone progesterone will bind to the receptors inside the cells and may cause the cells to grow. Also called PR or PgR.
Progesterone receptor negative: Describes cells that do not have a protein to which the hormone progesterone will bind. Cancer cells that are progesterone receptor negative do not need progesterone to grow, and usually do not stop growing when treated with hormones that block progesterone from binding. Also called PR-.

Progesterone receptor positive: Describes cells that have a protein to which the hormone progesterone will bind. Cancer cells that are progesterone receptor positive need progesterone to grow and will usually stop growing when treated with hormones that block progesterone from binding. Also called PR+.

Progesterone receptor test: A lab test to find out if cancer cells have progesterone receptors (proteins to which the hormone progesterone will bind). If the cells have progesterone receptors, they may need progesterone to grow, and this can affect how the cancer is treated.

HER2/neu: A protein involved in normal cell growth. It is found on some types of cancer cells, including breast, ovarian and other cancer types. Cancer cells removed from the body may be tested for the presence of HER2/neu to help decide the best type of treatment. HER2/neu is a type of receptor tyrosine kinase. Also called c-erbB-2, human EGF receptor 2, and human epidermal growth factor receptor 2.

uPA: An enzyme that is made in the kidney and found in the urine. A form of this enzyme is made in the laboratory and used to dissolve blood clots or to prevent them from forming. Also called u-plasminogen activator, urokinase, and urokinase-plasminogen activator.

Biopsy: The removal of cells or tissues for examination by a pathologist. The pathologist may study the tissue under a microscope or perform other tests on the cells or tissue. There are many different types of biopsy procedures.
The most common types include: (1) incisional biopsy, in which only a sample of tissue is removed; (2) excisional biopsy, in which an entire lump or suspicious area is removed; and (3) needle biopsy, in which a sample of tissue or fluid is removed with a needle. When a wide needle is used, the procedure is called a core biopsy. When a thin needle is used, the procedure is called a fine-needle aspiration biopsy.

Adjuvant therapy: Additional cancer treatment given after the primary treatment to lower the risk that the cancer will come back. Adjuvant therapy may include chemotherapy, radiation therapy, hormone therapy, or targeted therapy.

Neoadjuvant therapy: Treatment given as a first step to shrink a tumor before the main treatment, which is usually surgery, is given. Examples of neoadjuvant therapy include chemotherapy, radiation therapy, and hormone therapy. It is a type of induction therapy.

Mastectomy: Surgery to remove the breast (or as much of the breast tissue as possible).

Docetaxel: A drug used together with other drugs to treat certain types of breast cancer, stomach cancer, prostate cancer, and certain types of head and neck cancer. It is also being studied in the treatment of other types of cancer. Docetaxel is a type of mitotic inhibitor. Also called Taxotere.

Tamoxifen: A drug used to treat certain types of breast cancer in women and men. It is also used to prevent breast cancer in women who have had ductal carcinoma in situ (abnormal cells in the ducts of the breast) and in women who are at a high risk of developing breast cancer (in the US). Tamoxifen is also being studied in the treatment of other types of cancer. It blocks the effects of the hormone estrogen in the breast. Tamoxifen is a type of antiestrogen. Also called tamoxifen citrate.

Immunohistochemistry: A technique used to identify specific molecules in different kinds of tissue. The tissue is treated with antibodies that bind the specific molecule.
These are made visible under a microscope by using a color reaction, a radioisotope, colloidal gold, or a fluorescent dye. Immunohistochemistry is used to help diagnose diseases, such as cancer, and to detect the presence of microorganisms. It is also used in basic research to understand how cells grow and differentiate (become more specialized).

Chemotherapy: Treatment with drugs that kill cancer cells. It usually relies on docetaxel and anthracyclines.

Adjuvant! Online: A tool that helps health professionals make estimates of the risk of a poor outcome (cancer-related mortality or relapse) without systemic adjuvant therapy, estimates of the reduction of these risks afforded by therapy, and risks of side effects of the therapy. These estimates are based on information entered about individual patients and their tumors (e.g. patient age, tumor size, nodal involvement, or histological grade). These estimates are then provided on printed sheets in simple graphical and text formats to be used in consultations.

Radiation therapy: The use of high-energy radiation from x-rays, gamma rays, neutrons, protons, and other sources to kill cancer cells and shrink tumors. Radiation may come from a machine outside the body (external-beam radiation therapy), or it may come from radioactive material placed in the body near cancer cells (internal radiation therapy). Systemic radiation therapy uses a radioactive substance, such as a radiolabeled monoclonal antibody, that travels in the blood to tissues throughout the body. Also called irradiation and radiotherapy.

Aromatase inhibitor: A drug that prevents the formation of estradiol, a female hormone, by interfering with an aromatase enzyme. Aromatase inhibitors are used as a type of hormone therapy for postmenopausal women who have hormone-dependent breast cancer.

Hormone therapy: Treatment that adds, blocks, or removes hormones. For certain conditions (such as diabetes or menopause), hormones are given to adjust low hormone levels.
To slow or stop the growth of certain cancers (such as prostate and breast cancer), synthetic hormones or other drugs may be given to block the body's natural hormones. Sometimes surgery is needed to remove the gland that makes a certain hormone. Also called endocrine therapy, hormonal therapy, and hormone treatment. It usually relies on tamoxifen and anti-aromatase drugs (aromatase inhibitors).

Axillary lymph node: A lymph node in the armpit region that drains lymph from the breast and nearby areas.

Overall survival rate: The percentage of people in a study or treatment group who are alive for a certain period of time after they were diagnosed with or treated for a disease, such as cancer. The overall survival rate is often stated as a five-year survival rate, which is the percentage of people in a study or treatment group who are alive five years after diagnosis or treatment. Also called survival rate.

Disease-free survival: The length of time after treatment for a specific disease during which a patient survives with no sign of the disease. Disease-free survival may be used in a clinical study or trial to help measure how well a new treatment works.

Fine-needle aspiration biopsy: The removal of tissue or fluid with a thin needle for examination under a microscope. Also called FNA biopsy.

TNM staging system: A system developed by the American Joint Committee on Cancer (AJCC) that uses TNM to describe the extent of cancer in a patient's body. T describes the size of the tumor and whether it has invaded nearby tissue, N describes whether cancer has spread to nearby lymph nodes, and M describes whether cancer has metastasized (spread to distant parts of the body). The TNM staging system is used to describe most types of cancer. Also called AJCC staging system.
Level Of Evidence (LOE): Three levels of evidence can be distinguished:
• LOE III: Low level of evidence
• LOE II: Intermediate level of evidence
• LOE I: High level of evidence

Appendixes

Appendix 1

A.1 Prognostic and predictive factors in breast cancer

It is crucial to have a clear understanding of the definitions of prognostic factors and predictive factors, and of their roles in guiding patient care, before embarking on a discussion of their utility in breast cancer.

A.1.1 Prognostic factors

Prognostic factors determine the natural history of disease progression in the absence of systemic therapies. Such factors often reflect the intrinsic biologic characteristics of tumors, such as their ability to proliferate and metastasize. Putative tumor markers or factors are ideally evaluated for their prognostic ability prospectively in systemically untreated patients, in order to eliminate the confounding effects of treatment. Unfortunately, much of the data about prognostic factors is obtained from retrospective analyses of banked tumor samples. As a result, published studies often include only small sample sizes, have different lengths of follow-up, lack complete data on conventional prognostic factors, do not control for confounding variables, and report a variety of different endpoints, including overall survival (OS) and disease-free survival (DFS). All of the stated factors make it difficult to compare results from different studies, and diminish the strength of the evidence obtained.

A.1.2 Predictive factors

Predictive factors are cues that a particular tumor might respond (or not) to a specific therapy. A purely predictive factor separates treated patients into good and poor outcome groups, but does not predict outcome in untreated patients. In practice, factors are often both prognostic and predictive, rather than purely prognostic or purely predictive. A classic example is estrogen-receptor (ER) status.
Not only does ER-negativity confer a less favorable prognosis but, more significantly, it predicts the category of patients who do not derive benefit from anti-estrogen therapy. We next review the important prognostic and predictive factors and their role in breast cancer care.

a- Classical prognostic factors

- Axillary lymph node status: Characterized as N-positive or N-negative according to whether or not the patient presents invaded nodes. It has been reported that 20 to 30% of N-negative patients are likely to present a recurrence within 10 years, compared to 70% for N-positive patients. The number of invaded nodes is also an important prognostic factor: patients with 4 or more invaded lymph nodes will likely have a poorer prognosis than patients with fewer than 4 invaded lymph nodes (Carter et al., 1989). To date, the single most powerful prognostic factor in primary breast cancer remains the status of the axillary lymph nodes.

- Tumor size: It has been reported that patients with tumors less than 1 cm had a 5-year relative OS of close to 99%, compared to 89% for those with tumors of 1 to 3 cm, and 86% for those with tumors of 3 to 5 cm (Carter et al., 1989).

- Histologic subtype: The most common histological subtypes of breast cancer are infiltrating ductal and lobular carcinomas. Infrequent histologies such as pure tubular, mucinous, or medullary subtypes are associated with a particularly favorable prognosis, with long-term recurrence rates of less than 10% (Diab et al., 1999). Wong et al. (2002) examined the rate of axillary lymph node involvement in more than 3300 women with breast cancer. Axillary lymph node involvement was identified in 35% of women with infiltrating ductal carcinoma, but in only 11% of those with favorable subtypes. In addition, women with inflammatory breast cancer have an extremely poor prognosis.

- Hormone receptor status: This concerns estrogen receptor (ER) status and progesterone receptor (PgR) status.
While ER status has been reported to be a relatively weak prognostic factor, it strongly predicts response to adjuvant hormonal therapy (Smith et al., 2003). More specifically, ER-negative status appears to predict a lack of responsiveness to hormonal therapy. Thus, ER status should be used primarily in making recommendations regarding the use of hormonal therapy in the adjuvant setting. The impact of progesterone receptor (PR) status as a prognostic and predictive marker was recently analyzed in a retrospective study of a large dataset of early-stage breast cancer patients who were randomized to either no adjuvant systemic therapy or adjuvant tamoxifen alone. Progesterone receptor status was found to add little further prognostic information over and above ER status. However, it appeared to further predict responsiveness to tamoxifen. Patients with ER+/PR+ tumors treated with tamoxifen had a 53% reduction in their risk of recurrence, compared to a 25% reduction in risk noted in those with ER+/PR- tumors, relative to the risk of recurrence in ER-/PR- tumors (Bardou et al., 2003).

- Tumor grade: The most widely accepted grading system is the semiquantitative Elston and Ellis modification of the Scarff-Bloom-Richardson (SBR) classification (Bloom et al., 1957). Investigators using the SBR classification observed a statistically significant correlation between histological grade and 5-year DFS for both node-negative and node-positive patients. Women with tumors with an SBR score of grade 3 had a relative risk of recurrence of 4.4 when compared with those with an SBR of grade 1. However, tumor grade as a prognostic factor is limited by the high degree of inter-observer variability and the lack of a consistent methodology for objective and quantitative grading. Comparisons between studies are difficult because of varying grading systems.
Moreover, studies examining the prognostic significance of tumor grade are inconsistent in their groupings of tumor grades. Whereas studies typically compare grade 1 versus grade 3, the position of grade 2 is variable: in some studies, grade 2 is clustered with grade 1, and in others, with grade 3. As a result of these inconsistencies, the most recent revision of the American Joint Committee on Cancer Staging (AJCC) chose not to include histological grade in the TNM-staging criteria for breast cancer (Singletary et al., 2002).

- Human epidermal growth factor receptor-2 (HER-2) status: HER-2 amplification (and overexpression of the receptor by the tumor) is associated with a poor prognosis and may be predictive of response to certain treatments. Studies suggest that tumors with HER-2 overexpression or amplification may have differential sensitivities to chemotherapeutic agents and to hormonal agents. Knowledge of HER-2 status is required in all clinical situations. HER-2 levels can be measured in several ways, including IHC, using a variety of antibodies, to determine protein expression, and fluorescence in situ hybridization (FISH) or chromogenic in situ hybridization (CISH) to determine gene amplification.

b- Newer prognostic factors

- Urokinase Plasminogen Activator system: Research in the past decade has provided increasingly compelling evidence that the urokinase plasminogen activator (uPA) system plays a critical role in cancer invasion and metastasis. Urokinase plasminogen activator proteolytically converts plasminogen to plasmin. Plasmin activates matrix metalloproteases that degrade the extracellular matrix and modulate cellular adhesion, proliferation and migration. Both uPA and its physiologic inhibitor, plasminogen activator inhibitor-1 (PAI-1), have been shown to be upregulated in multiple cancer types, especially breast cancer (Duffy et al., 2004).
Based on large, well-controlled, retrospective studies and data from a prospective randomized trial, high levels of uPA/PAI-1 have been demonstrated to provide independent prognostic value. In addition, mounting data suggest that these factors may also predict tumor response to chemotherapy. The determination of uPA/PAI-1 levels must be performed by enzyme-linked immunosorbent assay (ELISA), which requires fresh frozen tissue; this limits its routine integration in clinical practice. Studies are currently underway to develop reproducible assays from smaller amounts of tissue obtained from core needle biopsy material.

- Markers of proliferation (S-phase fraction, thymidine-labeling index, Ki-67): The role of markers of proliferation as prognostic factors has been extensively investigated. Different methodologies exist to assess the rate of proliferation, including the thymidine labeling index (TLI), DNA-flow cytometry, S-phase fraction (SPF), mitotic index, bromodeoxyuridine (BrDu) incorporation, and IHC techniques with antibodies directed at antigens present during cell proliferation, such as Ki-67 (MIB-1) and PCNA. There is abundant literature on this topic, with over 200 publications examining the role of SPF as a prognostic marker alone. This literature is complex to interpret because of the variability of methodologies and assay systems and the different cut-offs for high versus low rates of proliferation. Nonetheless, the majority of the studies that included large numbers of women with long follow-up, and that controlled for the classical prognostic factors, suggest that the proliferative rate is an independent predictor of patient outcome.

- Gene expression profile by cDNA microarray: Recently, microarray technology has made it possible to measure the expression of thousands of genes simultaneously.
By analyzing expression differences, genetic markers can be derived for either prognostic or predictive purposes, and these have been shown to outperform the classical factors in several retrospective studies (Van't Veer et al., 2002). However, this field still presents various challenges related to the inconsistency among the obtained results (variability observed according to the platform used, the data samples, ...) and the lack of prospective studies to validate its use in clinical routine. We review below the practical aspects of microarray technology and the different existing platforms.

A.2 Microarray technology:

Microarray technology is based on the central dogma of molecular biology, namely the production of proteins from DNA, as illustrated in Figure A.1. Briefly, this operation relies mainly on two steps (Figure A.1): transcription and translation. In the first step, DNA (a gene) is transcribed into pre-mRNA; once this pre-mRNA is processed, in the second step the resulting mRNA message is translated by the ribosome to produce proteins. The detailed biological operation can be summarized as follows:

A.2.1 Transcription

Transcription is the process by which the information contained in a section of DNA is transferred to a newly assembled piece of messenger RNA (mRNA). It is facilitated by RNA polymerase and transcription factors. In eukaryotic cells the primary transcript (pre-mRNA) is often processed further via alternative splicing. In this process, blocks of mRNA are cut out and rearranged to produce different arrangements of the original sequence.

A.2.2 Translation

Eventually, this mature mRNA finds its way to a ribosome, where it is translated. In prokaryotic cells, which have no nuclear compartment, the processes of transcription and translation may be linked together.
In eukaryotic cells, the site of transcription (the cell nucleus) is usually separated from the site of translation (the cytoplasm), so the mRNA must be transported out of the nucleus into the cytoplasm, where it can be bound by ribosomes. The mRNA is read by the ribosome as triplet codons, usually beginning with an AUG, or initiator methionine codon, downstream of the ribosome binding site. Complexes of initiation factors and elongation factors bring aminoacylated transfer RNAs (tRNAs) into the ribosome-mRNA complex, matching the codon in the mRNA to the anti-codon in the tRNA, thereby adding the correct amino acid to the sequence encoded by the gene. As the amino acids are linked into the growing peptide chain, they begin folding into the correct conformation. Translation ends with a UAA, UGA, or UAG stop codon, and the nascent polypeptide chain is then released from the ribosome as a mature protein. In some cases the new polypeptide chain requires additional processing to become a mature protein. The correct folding process is quite complex and may require other proteins, called chaperone proteins. Occasionally, proteins themselves can be further spliced; when this happens, the inside "discarded" section is known as an intein.

Fig. A.1 The central dogma of biology, from the DNA (gene) to the protein. Image from Wikipedia.

The concept behind DNA chip, or microarray, technology relies on the accurate binding, or hybridization, of strands of DNA with their precise complementary copies, in experimental conditions where one sequence is bound onto a solid substrate (glass). RNA is extracted from frozen breast tumour samples collected either at surgery or before treatment, labeled with a detectable marker (a fluorescent dye), and hybridized to the array containing individual gene-specific probes. Gene-expression levels are estimated by measuring the fluorescent intensity for each gene probe (Figure A.2).
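In practice, the scanned intensities are log-transformed and normalized before any cross-array comparison. As a minimal illustration (the intensity matrix below is synthetic, and per-array median-centering is just one simple normalization choice among many):

```python
import numpy as np

# Synthetic probe-intensity matrix: rows = genes, columns = arrays (samples).
intensities = np.array([
    [1200.0,  900.0, 1500.0],
    [ 300.0,  450.0,  250.0],
    [5000.0, 4000.0, 6000.0],
    [ 100.0,  120.0,   80.0],
])

# Log2-transform, the usual first step for fluorescence intensities.
log_expr = np.log2(intensities)

# Median-centering per array: subtracting each column's median makes arrays
# with globally brighter labeling comparable to the others.
normalized = log_expr - np.median(log_expr, axis=0)

print(np.median(normalized, axis=0))  # every array now has median 0
```

Real pipelines use more elaborate schemes (lowess, quantile normalization), but the goal is the same: remove array-wide intensity effects so that only gene-specific variation remains.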
A gene-expression vector is then obtained by summarizing the expression levels of each gene in the sample. To facilitate comparison between different experiments and to compensate for differences in labeling, hybridization and detection methods, a normalization step is usually performed (Figure A.2). Gene-expression prognostic classifiers are usually built by correlating gene-expression patterns, generated from tumour surgical specimens, with clinical outcome (development of metastases during follow-up). Gene-expression predictive classifiers of response to treatment are generated by correlating gene-expression data, derived from biopsies taken before preoperative systemic therapy, with the clinical and/or pathological response to the given treatment.

Fig. A.2 Microarray experiment schema. Image from (Duggan et al., 1999).

Appendix 2

Proof of Lemma 3.1: For notational convenience let us denote [N] = {1, ..., N}. We first prove that the minima of the objective functions of both problems are identical given the same data set. By the triangle inequality,

‖w*(1) − w*(2)‖₁ ≤ ‖w*(1)‖₁ + ‖w*(2)‖₁ = ‖w*‖₁.

Also by construction, ∀n ∈ [N], L(y_n, (w*(1) − w*(2))ᵀx⁽ⁿ⁾ + b*) = L(y_n, w*ᵀx⁽ⁿ⁾ + b*). It follows that if (w*, b*) is an optimal solution to (3.2), we have

min f₁ ≤ f₁(w*(1) − w*(2), b*) ≤ f₂(w*, b*) = min f₂.

On the other hand, let (w*, b*) be an optimal solution to (3.1). We construct two vectors w°(1) and w°(2):

w°(1)_j = w*_j if w*_j ≥ 0, and 0 if w*_j < 0;   w°(2)_j = 0 if w*_j ≥ 0, and −w*_j if w*_j < 0.   (A2.1)

By construction, we thus have

min f₂ ≤ f₂([(w°(1))ᵀ, (w°(2))ᵀ]ᵀ, b*) = f₁(w*, b*) = min f₁.

Hence min f₁ = min f₂, and (w*(1) − w*(2), b*) (or (w*, b*)) and (w*, b*) (or ([(w°(1))ᵀ, (w°(2))ᵀ]ᵀ, b*)) are optimal solutions to (3.1) and (3.2), respectively.

Proof of Lemma 3.2: Let w* = w*(1) − w*(2). By Lemma 3.1, (w*, b*) is an optimal solution to (3.1). Using Eq. (A2.1), we construct a vector w°. Suppose that there exists an element j such that w*(1)_j ≠ 0 and w*(2)_j ≠ 0. Then, by the triangle inequality, it follows that ‖w*‖₁ > ‖w°‖₁. Hence f₂(w*, b*) > f₂(w°, b*), which contradicts the fact that (w*, b*) is an optimal solution. Therefore, ∀j ∈ [J], either w*(1)_j or w*(2)_j or both equal zero.

Proof of Theorem 3.1: For simplicity, we use ∂G/∂v* to denote ∂G/∂v evaluated at v = v*. Also, we write A > 0 and A ≥ 0 to denote that a matrix A is positive definite or positive semi-definite, respectively. We examine the properties of the Hessian matrix of G(v), denoted H. Let v⁺ be a stationary point of G(v) satisfying

∂G/∂v⁺ = [2v⁺₁ ∂f/∂w⁺₁, ..., 2v⁺_J ∂f/∂w⁺_J, ∂f/∂b⁺]ᵀ = 0,   (A2.2)

where w⁺ = [w⁺₁, ..., w⁺_J]ᵀ = [(v⁺₁)², ..., (v⁺_J)²]ᵀ and b⁺ = v⁺_{J+1}.

Note that some elements of v⁺ may be equal to zero. For simplicity and without loss of generality, assume that the first M elements of v⁺ belong to S₀ = {v⁺_j : v⁺_j = 0, 1 ≤ j ≤ J}, while the remaining J − M elements belong to S≠₀ = {v⁺_j : v⁺_j ≠ 0, 1 ≤ j ≤ J}. From Eq. (A2.2), we have ∂f/∂w⁺_j = 0 for v⁺_j ∈ S≠₀. The Hessian matrix of G(v), evaluated at v⁺, is then block-diagonal:

H(v⁺) = [ A₁  0
          0   A₂ ],   (A2.3)

where

A₁ = diag( 2 ∂f/∂w⁺₁, ..., 2 ∂f/∂w⁺_M ),

A₂ = [ A₃   z
       zᵀ   ∂²f/∂(b⁺)² ],

A₃ = [ 4(v⁺_{M+1})² ∂²f/∂(w⁺_{M+1})²        ⋯  4 v⁺_{M+1} v⁺_J ∂²f/∂w⁺_{M+1}∂w⁺_J
       ⋮                                    ⋱  ⋮
       4 v⁺_{M+1} v⁺_J ∂²f/∂w⁺_{M+1}∂w⁺_J   ⋯  4(v⁺_J)² ∂²f/∂(w⁺_J)² ],

z = [ 2 v⁺_{M+1} ∂²f/∂w⁺_{M+1}∂b⁺, ..., 2 v⁺_J ∂²f/∂w⁺_J∂b⁺ ]ᵀ.

Here we have used the fact that ∂f/∂w⁺_j = 0 for v⁺_j ∈ S≠₀.
Since f(w, b) is a convex function of w and b, the matrix

B = [ ∂²f/∂w²_{M+1}       ⋯  ∂²f/∂w_{M+1}∂w_J  ∂²f/∂w_{M+1}∂b
      ⋮                   ⋱  ⋮                 ⋮
      ∂²f/∂w_{M+1}∂w_J    ⋯  ∂²f/∂w²_J         ∂²f/∂w_J∂b
      ∂²f/∂w_{M+1}∂b      ⋯  ∂²f/∂w_J∂b        ∂²f/∂b² ] ≥ 0.

It is also easy to prove that

C = [ 4v²_{M+1}     ⋯  4v_{M+1}v_J  2v_{M+1}
      ⋮             ⋱  ⋮            ⋮
      4v_{M+1}v_J   ⋯  4v²_J        2v_J
      2v_{M+1}      ⋯  2v_J         1 ] ≥ 0,

since C = ccᵀ with c = [2v_{M+1}, ..., 2v_J, 1]ᵀ. Hence, by the Schur product theorem (Horn et al., 1985), A₂ = (B ∘ C)|_{v=v⁺} is a positive semi-definite matrix, where ∘ denotes the element-wise (Hadamard) product. Since A₂ is always positive semi-definite, it follows that H(v⁺) ≥ 0 if and only if A₁ ≥ 0.

If H(v⁺) is not positive semi-definite, then v⁺ is a saddle point. (Note that it cannot be a maximizer, because G(v) is convex with respect to v_{J+1}, so H(v⁺) cannot be negative semi-definite.) If H(v⁺) ≥ 0, v⁺ can be a saddle point, or a local or global minimizer. We now prove that if H(v⁺) ≥ 0, v⁺ must be a global minimizer. Consider first the following optimization problem:

min f(w, b), subject to w ≥ 0.   (A2.4)

Since both the objective function and the constraints are convex, the Karush–Kuhn–Tucker (KKT) conditions are sufficient conditions for a global optimal solution. It can be shown that (w⁺, b⁺) is a global minimizer of (A2.4) if, for all j ∈ [J], the following KKT conditions hold simultaneously:

1) ∂f/∂b⁺ = 0;
2) ∂f/∂w⁺_j = 0, or w⁺_j = 0 and ∂f/∂w⁺_j ≥ 0.

Since v⁺ is a stationary point, by Eq. (A2.2), ∂f/∂w⁺_j = 0 for all j with v⁺_j ∈ S≠₀, and ∂f/∂b = 0 at b = v⁺_{J+1}. Moreover, H(v⁺) ≥ 0 implies that A₁ ≥ 0; since A₁ is a diagonal matrix, it holds that ∂f/∂w⁺_j ≥ 0 for all j with v⁺_j ∈ S₀. Hence, by the KKT conditions, (w⁺, v⁺_{J+1}) is a global minimizer of f(w, b) and v⁺ is a global minimizer of G(v).

Proof of Theorem 3.2: Suppose that ∂G/∂v* = 0 and v* is a saddle point. Again, we assume that the first M elements of v* belong to S₀, while the remaining J − M elements belong to S≠₀. There exists an element j ∈ S₀ such that ∂f/∂w*_j < 0 (otherwise H(v*) ≥ 0 and v* is a global minimizer).
By continuity, there exists ξ > 0 such that ∂f/∂w_j < 0 for every w_j ∈ Ω = {w : |w − w*_j| < ξ}. It follows that ∂G/∂v_j = 2v_j (∂f/∂w_j) < 0 for v_j = √w_j, and ∂G/∂v_j > 0 for v_j = −√w_j. That is, a gradient descent method given by

v_j^(k+1) ← v_j^(k) − η (∂G/∂v_j^(k))

drives the solution out of the neighborhood of a saddle point, except when (1) the component v_j^(k) is set to exactly zero, or (2) v_j^(k) is outside Ω and a line search hits v* exactly. The latter event cannot happen, since the gradient g(v*) equals zero at v* and thus the descending condition does not hold (see Definition 1). Instead, a line search will find a solution around v*, and in the subsequent steps the solution will move away from v*. On the contrary, if v* is a global optimum, v^(k+1) will approach it continuously with improved solution quality.

We go on to prove that if v_j^(0) ≠ 0, v_j^(k) will be set to exactly zero at a non-stationary point with zero probability. Let v^(k) be the solution obtained in the k-th iteration, −d^(k) the descending direction, v^(k)⁻ = v^(k) − η d^(k) a point in the line-search path at which some elements are zero, and g^(k)⁻ the gradient at v^(k)⁻. Since v^(k)⁻ is not a stationary point, ‖g^(k)⁻‖ > 0.

1. If d^(k)ᵀ g^(k)⁻ ≤ 0, the line search will not reach v^(k)⁻.

2. On the other hand, if d^(k)ᵀ g^(k)⁻ > 0, then −d^(k) is also a descending direction at v^(k)⁻. Without loss of generality, we assume ‖d^(k)‖ = 1. By continuity, there exists ξ₁ > 0 such that d^(k)ᵀ g(v) > 0 for all v ∈ Ω₂ = {v : ‖v − v^(k)⁻‖ < ξ₁}. This means that for any α ∈ (0, 1),

G(v^(k)⁻ − αξ₁ d^(k)) < G(v^(k)⁻).

a) If the chosen interval length ε^(k) < ξ₁, the line search has at least one candidate solution corresponding to an α > 0 as above; hence it goes past v^(k)⁻ and moves away from it.
b) On the other hand, if ε^(k) ≥ ξ₁, the line search has only one or two candidate solutions within Ω₂. In this case, the line search has no prior knowledge about the region under search, so it degenerates to randomly selecting one or two values α ∈ (−1, 1) and setting v^(k+1) = v^(k) − αξ₁ d^(k). Given an arbitrary bounded probability density distribution, however, the probability that the single point α = 0 is chosen is zero.

Moreover, by continuity there also exists ξ₂ > 0 such that g(v)ᵀ g^(k)⁻ > 0 for all v ∈ Ω₃ = {v : ‖v − v^(k)⁻‖ < ξ₂}. This means that a gradient-based search starting in Ω₃ will go past v^(k)⁻, following case (2a) stated above; hence v^(k)⁻ is not a point of attraction, and after a few iterations cases (1) and (2b) either change to case (2a), or change to the case where the descending direction does not drive any element towards zero. This completes the proof that the saddle points cannot be reached with the designated gradient descent method.

Appendix 3

To solve the stated optimization problem, the well-known Lagrangian optimization method is applied. The Lagrangian of (5.8) is given by

L = −(w_f)ᵀ s + λ(‖w_f‖² − 1) + Σᵢ₌₁..ₘ ξᵢ (−w_fᵢ),   (A3.1)

where λ and ξ ≥ 0 are the Lagrange and Kuhn–Tucker multipliers. Taking the derivative of (A3.1) with respect to w_f and setting it to zero, a closed-form solution for w_f can be derived:

∂L/∂w_f = −s + 2λ w_f − ξ = 0  ⇒  w_f = (1/2λ)(s + ξ).   (A3.2)

Under assumption (5.4.1), the positivity of λ is proved by contradiction. Suppose λ < 0; with the assumption sᵢ > 0 we have

sᵢ + ξᵢ > 0  ⇒  w_fᵢ = (sᵢ + ξᵢ)/(2λ) < 0.

This result contradicts the constraint w_f ≥ 0; thus λ > 0.
By application of the Kuhn–Tucker complementarity condition, namely ξᵢ w_fᵢ = 0 for each i, the following three cases can be verified:

1) sᵢ = 0 ⇒ ξᵢ = 0 and w_fᵢ = 0;
2) sᵢ > 0 ⇒ sᵢ + ξᵢ > 0 ⇒ w_fᵢ > 0 ⇒ ξᵢ = 0;
3) sᵢ < 0 ⇒ ξᵢ > 0 ⇒ w_fᵢ = 0 ⇒ sᵢ = −ξᵢ.

The optimum solution w_f can then be calculated in the following closed form:

w_fᵢ = 0 if sᵢ ≤ 0,  and  w_fᵢ = sᵢ/(2λ) if sᵢ > 0.

Therefore, the normalized values are

w*_f = s⁺ / ‖s⁺‖, with s⁺ = [max(s₁, 0), ..., max(s_m, 0)]ᵀ.   (A3.4)

Appendix 4

A4.1 Receiver operating characteristic

In signal detection theory, a receiver operating characteristic (ROC), or simply ROC curve, is a graphical plot of the sensitivity, or true positive rate, versus the false positive rate (1 − specificity, or 1 − true negative rate) for a binary classifier system as its discrimination threshold is varied. Equivalently, the ROC can be represented by plotting the fraction of true positives out of the positives (TPR = true positive rate) versus the fraction of false positives out of the negatives (FPR = false positive rate). It is also known as a Relative Operating Characteristic curve, because it is a comparison of two operating characteristics (TPR and FPR) as the criterion changes (Swets, 1996).

Fig. A4.1 An example of ROC curve. Image taken from Wikipedia.

ROC analysis provides tools to select possibly optimal models and to discard suboptimal ones independently of (and prior to specifying) the cost context or the class distribution. ROC analysis is related in a direct and natural way to cost/benefit analysis of diagnostic decision making. The ROC curve was first developed by electrical and radar engineers during World War II for detecting enemy objects on battlefields (hence the link with signal detection theory), and was soon introduced in psychology to account for the perceptual detection of stimuli. ROC analysis has since been used in medicine, radiology, and other areas for many decades, and it has been introduced relatively recently in areas such as machine learning and data mining.
A classification model (classifier or diagnosis) is a mapping of instances onto a certain class/group. The classifier or diagnosis result can be a real value (continuous output), in which case the boundary between classes must be determined by a threshold value (for instance, to determine whether a person has hypertension based on a blood pressure measurement), or it can be a discrete class label indicating one of the classes. Let us consider a two-class prediction problem (binary classification), in which the outcomes are labeled either as positive (p) or negative (n). There are four possible outcomes from a binary classifier. If the outcome from a prediction is p and the actual value is also p, then it is called a true positive (TP); however, if the actual value is n, then it is said to be a false positive (FP). Conversely, a true negative (TN) has occurred when both the prediction outcome and the actual value are n, and a false negative (FN) is when the prediction outcome is n while the actual value is p. As an example of a real-world problem, consider a diagnostic test that seeks to determine whether a person has a certain disease. A false positive in this case occurs when the person tests positive but does not actually have the disease. A false negative, on the other hand, occurs when the person tests negative, suggesting they are healthy, when they actually do have the disease.

Let us define an experiment with P positive instances and N negative instances. The four outcomes can be formulated in a 2×2 contingency table, or confusion matrix, as follows:

                                  Actual value
                              p                 n            total
Prediction     p'       True Positive     False Positive      P'
outcome        n'       False Negative    True Negative       N'
               total          P                 N

Tab. A4.1

Several evaluation "metrics" can be derived from the contingency table. To draw an ROC curve, only the true positive rate (TPR) and false positive rate (FPR) are needed.
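The four outcomes and the two rates can be computed directly from paired label lists; the labels below are synthetic, purely for illustration:

```python
def confusion_counts(actual, predicted, positive="p"):
    """Tally TP, FP, FN, TN as in the contingency table above."""
    tp = sum(1 for a, q in zip(actual, predicted) if a == positive and q == positive)
    fp = sum(1 for a, q in zip(actual, predicted) if a != positive and q == positive)
    fn = sum(1 for a, q in zip(actual, predicted) if a == positive and q != positive)
    tn = sum(1 for a, q in zip(actual, predicted) if a != positive and q != positive)
    return tp, fp, fn, tn

def tpr_fpr(actual, predicted):
    tp, fp, fn, tn = confusion_counts(actual, predicted)
    tpr = tp / (tp + fn)   # sensitivity
    fpr = fp / (fp + tn)   # 1 - specificity
    return tpr, fpr

# Tiny synthetic example: 4 positives, 4 negatives.
actual    = ["p", "p", "p", "p", "n", "n", "n", "n"]
predicted = ["p", "p", "p", "n", "p", "n", "n", "n"]
print(tpr_fpr(actual, predicted))  # -> (0.75, 0.25)
```

Sweeping the decision threshold of a continuous-output classifier and recomputing this (FPR, TPR) pair at each threshold traces out the ROC curve.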
TPR determines a classifier's or a diagnostic test's performance in classifying positive instances correctly among all positive samples available during the test. FPR, on the other hand, defines how many incorrect positive results occur among all negative samples available during the test. The ROC space is defined by FPR and TPR as x and y axes, respectively, and depicts the relative trade-off between true positives (benefits) and false positives (costs). Since TPR is equivalent to sensitivity and FPR is equal to 1 − specificity, the ROC graph is sometimes called the sensitivity vs. (1 − specificity) plot. Each prediction result, or one instance of a confusion matrix, represents one point in the ROC space.

Fig. A4.2 An example of ROC curve. Image taken from Wikipedia.

The best possible prediction method would yield a point in the upper left corner, at coordinate (0,1) of the ROC space, representing 100% sensitivity (no false negatives) and 100% specificity (no false positives). The (0,1) point is also called a perfect classification. A completely random guess would give a point along the diagonal line (the so-called line of no-discrimination) from the bottom left to the top right corner. An intuitive example of random guessing is a decision made by flipping a coin (heads or tails). The diagonal divides the ROC space: points above the diagonal represent good classification results, points below the line poor results. Note that the output of a poor predictor could simply be inverted to obtain points above the line (see Figure A4.2).

A4.2 Kaplan-Meier Curve

The Kaplan–Meier estimator (Kaplan and Meier, 1958; Kaplan and Meier, 1983), also known as the product-limit estimator, is an estimator of the survival function from lifetime data. In medical research, it is often used to measure the fraction of patients living for a certain amount of time after treatment. The estimator is named after Edward L. Kaplan and Paul Meier.

Fig. A4.3 An example of Kaplan-Meier curve. Image taken from Wikipedia.

A plot of the Kaplan–Meier estimate of the survival function is a series of horizontal steps of declining magnitude which, when a large enough sample is taken, approaches the true survival function for that population. The value of the survival function between successive distinct sampled observations ("clicks") is assumed to be constant. An important advantage of the Kaplan–Meier curve is that the method can take into account some types of censored data, particularly right-censoring, which occurs if a patient withdraws from a study, i.e. is lost from the sample before the final outcome is observed. On the plot, small vertical tick-marks indicate losses, where a patient's survival time has been right-censored. When no truncation or censoring occurs, the Kaplan–Meier curve is equivalent to the empirical distribution function.

In medical statistics, a typical application might involve grouping patients into categories, for instance, those with a Gene A profile and those with a Gene B profile. In the graph, patients with the Gene B profile die much more quickly than those with Gene A: after two years, about 80% of the Gene A patients survive, but less than half of the Gene B patients do.

Let S(t) be the probability that an item from a given population will have a lifetime exceeding t. For a sample of size N from this population, let the observed times until death of the N sample members be t1 ≤ t2 ≤ t3 ≤ ... ≤ tN. Corresponding to each ti is ni, the number "at risk" just prior to time ti, and di, the number of deaths at time ti. Note that the intervals between successive times are typically not uniform. For example, a small data set might begin with 10 cases, have a death at Day 3, a loss (censored case) at Day 9, and two further deaths at Day 11. Then we have (t1 = 3, t2 = 11), (n1 = 10, n2 = 8), and (d1 = 1, d2 = 2). The Kaplan–Meier estimator is the nonparametric maximum likelihood estimate of S(t).
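The product-limit computation can be sketched on the small data set above; the only assumption added here is a common cut-off time (day 20) at which the remaining six cases are censored, which the text leaves unspecified:

```python
from itertools import groupby

def kaplan_meier(times, events):
    """Product-limit estimate of S(t).
    times: observation times; events: 1 = death observed, 0 = right-censored.
    Returns (t_i, S_hat) pairs at each distinct death time."""
    data = sorted(zip(times, events))
    at_risk = len(data)
    s = 1.0
    curve = []
    for t, group in groupby(data, key=lambda d: d[0]):
        group = list(group)
        deaths = sum(e for _, e in group)
        if deaths:
            s *= (at_risk - deaths) / at_risk
            curve.append((t, s))
        at_risk -= len(group)  # deaths and censorings both leave the risk set
    return curve

# 10 cases: one death at day 3, one censoring at day 9, two deaths at day 11,
# the remaining six censored at an assumed cut-off of day 20.
times = [3, 9, 11, 11] + [20] * 6
events = [1, 0, 1, 1] + [0] * 6
print(kaplan_meier(times, events))  # -> [(3, 0.9), (11, 0.675)]
```

Note how the censoring at day 9 contributes no factor to the product but still shrinks the risk set, which is exactly how the estimator accommodates right-censored data.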
The estimator is a product of the form

Ŝ(t) = ∏_{tᵢ < t} (nᵢ − dᵢ)/nᵢ.   (A4.1)

When there is no censoring, nᵢ is just the number of survivors just prior to time tᵢ. With censoring, nᵢ is the number of survivors minus the number of losses (censored cases): only those surviving cases that are still being observed (have not yet been censored) are "at risk" of an (observed) death (Costella, 2010). There is an alternative definition that is sometimes used, namely

Ŝ(t) = ∏_{tᵢ ≤ t} (nᵢ − dᵢ)/nᵢ.   (A4.2)

The two definitions differ only at the observed event times. The latter definition is right-continuous, whereas the former is left-continuous. Let T be the random variable that measures the time of failure and let F(t) be its cumulative distribution function. Note that

S(t) = P[T > t] = 1 − P[T ≤ t] = 1 − F(t).   (A4.3)

Consequently, the right-continuous definition of Ŝ(t) may be preferred in order to make the estimate compatible with a right-continuous estimate of F(t).

References

Abba M., Hu Y., Sun H., et al., Gene expression signature of estrogen receptor α status in breast cancer, BMC Genomics, 6, pp. 74-81, 2005.

Abe S. and Thawonmas R., A fuzzy classifier with ellipsoidal regions, IEEE Trans. Fuzzy Syst., 5, pp. 358-368, 1997.

Aguado J.C., Aguilar-Martin J., A mixed qualitative-quantitative self-learning classification technique applied to diagnosis, The Thirteenth Int'l Workshop on Qualitative Reasoning (C. Price, ed.), pp. 124-128, 1999.

Aguilar J., Lopez De Mantaras R., The process of classification and learning the meaning of linguistic descriptors of concepts, Approximate Reasoning in Decision Analysis, pp. 165-175, 1982.

Aha D.W., Incremental, instance-based learning of independent and graded concept descriptions, in Proc. of the 6th Int'l Machine Learning Workshop, pp. 387-391, 1989.

Aha D.W., Tolerating noisy, irrelevant and novel attributes in instance-based learning algorithms, Int'l J. Man-Machine Studies, 36, pp. 267-287, 1992.
Alizadeh A.A., Eisen M.B., Davis R.E., et al., Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling, Nature, 403, pp. 503-511, 2000.

Andrews R., Mah R., Jeffery S., Guerrero M., Papasin R., Reed C., The NASA smart probe project for real-time multiple microsensor tissue recognition, Int'l Congress Series, 1256, pp. 547-554, 2003.

Antman K., Shea S., Screening mammography under age 50, The Journal of the American Medical Association, 2, pp. 281-1470, 1999.

Association pour la recherche sur le cancer, http://www.arc-cancer.net/

Atkeson C.G., Moore A.W., Schaal S., Locally weighted learning, Artificial Intelligence Rev., 11 (1-5), pp. 11-73, 1997.

Ayers M., Symmans W.F., Stec J., et al., Gene expression profiles predict complete pathologic response to neoadjuvant paclitaxel and fluorouracil, doxorubicin, and cyclophosphamide chemotherapy in breast cancer, J Clin Oncol, 22, pp. 2284-2293, 2004.

Baldi P., Brunak S., Bioinformatics: The Machine Learning Approach, MIT Press, Cambridge, 2001.

Baraldi A., Blonda P., A survey of fuzzy clustering algorithms for pattern recognition, IEEE Trans. on Systems, Man, and Cybernetics, Part B, 29 (6), pp. 778-785, 1999.

Bardou V.J., Arpino G., Elledge R.M., et al., Progesterone receptor status significantly improves outcome prediction over estrogen receptor status alone for adjuvant endocrine therapy in two large breast cancer databases, J Clin Oncol, 21, pp. 1973-1979, 2003.

Belle V.V., Calster B.V., Brouckaert O., et al., Qualitative assessment of the progesterone receptor and HER2 improves the Nottingham Prognostic Index up to 5 years after breast cancer diagnosis, J Clin Oncol, 28 (27), pp. 4129-4134, 2010.

Bellman R., Adaptive Control Processes: A Guided Tour, Princeton University Press, 1961.
Bertucci F., Nasser V., Granjeaud S., et al., Gene expression profiles of poor-prognosis primary breast cancer correlate with survival, Human Molecular Genetics, 11 (8), pp. 863-872, 2002.

Bezdek J.C., Pattern Recognition with Fuzzy Objective Function Algorithms, Plenum Press, New York, 1981.

Bhattacharjee A., Richards W., Staunton J., et al., Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses, Proc. Nat'l Acad. Sc. USA, 98 (24), pp. 13790-13795, 2001.

Billard L., Some analyses of interval data, J of Comp and Info Tech, 16, pp. 225-233, 2008.

Blake C., Merz C., UCI Repository of Machine Learning Databases, 1998.

Blake W.J., Kærn M., Cantor C.R., Collins J.J., Noise in eukaryotic gene expression, Nature, 422 (6932), pp. 633-637, 2003.

Blanco R., Larrañaga P., Inza I., Sierra B., Gene selection for cancer classification using wrapper approaches, Int'l J of Pattern Recognition and Artificial Intelligence (IJPRAI), 18 (8), pp. 1373-1390, 2004.

Bloom H.J., Richardson W.W., Histological grading and prognosis in breast cancer; a study of 1409 cases of which 359 have been followed for 15 years, Br J Cancer, 11, pp. 359-377, 1957.

Bock H.H., Diday E., Analysis of Symbolic Data: Exploratory Methods for Extracting Statistical Information from Complex Data, Springer-Verlag, Heidelberg, 2000.

Bradley P.S. and Mangasarian O.L., Feature selection via concave minimization and support vector machines, in J. Shavlik (ed.), Proc. of the 15th Int'l Conf. on Machine Learning, pp. 82-90, 1998.

Breiman L., Friedman J.H., Olshen R.A., Stone C.J., Classification and Regression Trees, Monterey, CA: Wadsworth & Brooks/Cole Advanced Books & Software, 1984.

Buchholz T.A., Stivers D.N., Stec J., et al., Global gene expression changes during neoadjuvant chemotherapy for human breast cancer, Cancer J, 8, pp. 461-468, 2002.
Buness A., Ruschhaupt M., Kuner R., Tresch A., Classification across gene expression microarray studies, BMC Bioinformatics, pp. 410-453, 2009.

Buyse M., Loi S., van't Veer L., Viale G., et al., Validation and clinical utility of a 70-gene prognostic signature for women with node-negative breast cancer, J. Natl. Cancer Inst., 98 (17), pp. 1183-1192, 2006.

Byrnes K., White S., Chu Q., et al., High eIF4E, VEGF, and microvessel density in stage I to III breast cancer, Ann Surg, 243, pp. 684-690, 2006.

Cai Y., Sun Y., Li J., Goodison S., Online feature selection algorithm with Bayesian ℓ1 regularization, in T. Theeramunkong et al. (eds.), PAKDD, LNAI 5476, pp. 401-413, 2009.

Cai Y., Sun Y., Cheng Y., Li J., Goodison S., Fast implementation of ℓ1 regularized learning algorithms using gradient descent methods, SDM, pp. 862-871, 2010a.

Cai Y., Hedjazi L., Sun Y., Goodison S., Fast implementation of ℓ1 regularized learning algorithms using gradient descent methods, submitted to IEEE Trans. on Pattern Analysis and Machine Intelligence, 2010b.

Carter C.L., Allen C., Henson D.E., Relation of tumor size, lymph node status, and survival in 24740 breast cancer cases, Cancer, 63, pp. 181-187, 1989.

Chang J.C., Wooten E.C., Tsimelzon A., et al., Gene expression profiling predicts therapeutic response to docetaxel (Taxotere) in breast cancer patients, Lancet, 362, pp. 280-287, 2003a.

Chang J.C., Hilsenbeck S.G., Fuqua S.A.W., Genomic approaches in the management and treatment of breast cancer, British Journal of Cancer, 92, pp. 618-624, 2005.

Chang R-F., Wu W-J., Moon W.K., et al., Support vector machines for diagnosis of breast tumors on US images, Academic Radiology, 10 (2), pp. 189-197, 2003b.

Chapelle O., Training a support vector machine in the primal, Neural Comput., 19, pp. 1155-1178, 2007.

Chen S-M., A weighted fuzzy reasoning algorithm for medical diagnosis, Decision Support Systems, 11 (1), pp. 37-43, 1994.
Cheng C-J., Lin Y-C., Tsai M-T., et al., SCUBE2 Suppresses Breast Tumor Cell Proliferation and Confers a Favorable Prognosis in Invasive Breast Cancer, Cancer Res, 69, pp. 3634-3641, 2009.
Cheng Y., Church G.M., Biclustering of expression data, Proc Int Conf Intell Syst Mol Biol, 8, pp. 93-103, 2000.
Cheung K.L., Graves C.R., Robertson J.F.R., Tumour marker measurements in the diagnosis and monitoring of breast cancer, Cancer Treatment Reviews, 26, pp. 91-102, 2000.
Chiu S.L., Extracting fuzzy rules from data for function approximation and pattern classification, In Fuzzy Information Engineering: A Guided Tour of Applications, ed. D. Dubois, H. Prade, and R. Yager, John Wiley & Sons, 1997.
Cicchetti D.V., Neural networks and diagnosis in the clinical laboratory: state of the art, Clin Chem, 38, pp. 9-10, 1992.
Cleator S.J., Makris A., Ashley S.E., Lal R., Powles T.J., Good clinical response of breast cancers to neoadjuvant chemoendocrine therapy is associated with improved overall survival, Annals of Oncology, 16 (2), pp. 267-272, 2004.
Cochran A.J., Prediction of outcome for patients with cutaneous melanoma, Pigment Cell Res, 10, pp. 162-167, 1997.
Colozza M., Azambuja E., Cardoso F., et al., Proliferative markers as prognostic and predictive tools in early breast cancer: where are we now?, Ann Oncol, 16 (11), pp. 1723-1739, 2005.
Costella J.P., A simple alternative to Kaplan-Meier for survival curves, 2010.
Covell D.G., Wallqvist A., Rabow A.A., Thanki N., Molecular Classification of Cancer: Unsupervised Self-Organizing Map Analysis of Gene Expression Microarray Data, Mol Cancer Ther, 2, pp. 317-332, 2003.
Cover T., Hart P., Nearest neighbor pattern classification, IEEE Trans. Inf. Theory, 13, pp. 21-27, 1967.
Cross V.V., Sudkamp T.A., Similarity and Compatibility in Fuzzy Set Theory: Assessment and Applications, Physica-Verlag, New York, 2002.
Cruz J.A., Wishart D.S., Application of Machine Learning in Cancer Prediction and Prognosis, Cancer Informatics, 2, pp. 59-77, 2006.
Das S., Filters, Wrappers, and a Boosting-Based Hybrid for Feature Selection, Proc. 18th Int'l Conf. Machine Learning, pp. 74-81, 2001.
Dash M., Liu H., Consistency-based search in feature selection, Artif. Intell., 151, pp. 155-176, 2003.
De Carvalho F.A.T., De Souza R.M.C.R., Unsupervised Pattern Recognition Models for Mixed Feature-Type Symbolic Data, Pattern Recognition Letters, 31, pp. 430-443, 2010.
Deepa S., Claudine I., Utilizing Prognostic and Predictive Factors in Breast Cancer, Current Treatment Options in Oncology, 6, pp. 147-159, 2005.
Delen D., Walker G., Kadam A., Predicting breast cancer survivability: a comparison of three data mining methods, Artificial Intelligence in Medicine, 34 (2), pp. 113-127, 2005.
De Souto M., Costa I., De Araujo D., et al., Clustering cancer gene expression data: a comparative study, BMC Bioinformatics, 9 (1), 497, 2008.
Dettling M., Gabrielson E., Parmigiani G., Searching for differentially expressed gene combinations, Genome Biology, 6:R88, 2005.
Diab S.G., Clark G.M., Osborne C.K., et al., Tumor characteristics and clinical outcome of tubular and mucinous breast carcinomas, J Clin Oncol, 17, pp. 1442-1448, 1999.
Dietterich T.G., Machine Learning Research: Four Current Directions, AI Magazine, 18 (4), pp. 97-136, 1997.
Dubois D., Prade H., Testemale C., Weighted fuzzy pattern matching, Fuzzy Sets and Systems, 28 (3), pp. 313-331, 1988.
Dubois D., Prade H., The three semantics of fuzzy sets, Fuzzy Sets and Syst., 90, pp. 141-150, 1997.
Duchi J., Shalev-Shwartz S., Singer Y., Chandra T., Efficient projections onto the ℓ1-ball for learning in high dimensions, Proc. 25th Intl. Conf. Mach. Learn., pp. 272-279, 2008.
Duda R.O., Hart P.E., Stork D.G., Pattern Classification, Wiley-Interscience, 2001.
Dudoit S., Fridlyand J., Speed T.P., Comparison of discrimination methods for the classification of tumors using gene expression data, J. Am. Stat. Assoc., 97, pp. 77-87, 2002.
Duffy M.J., Duggan C., The urokinase plasminogen activator system: a rich source of tumor markers for the individualised management of patients with cancer, Clin Biochem, 37, pp. 541-548, 2004.
Duggan D.J., Bittner M., Chen Y., et al., Expression profiling using cDNA microarrays, Nature Genetics, 21, pp. 10-37, 1999.
Dy J.G., Brodley C.E., Feature Selection for Unsupervised Learning, JMLR, 5, pp. 845-889, 2004.
Eifel P., Axelson J.A., Costa J., et al., National Institutes of Health Consensus Development Conference Statement: Adjuvant therapy for breast cancer, Journal of the National Cancer Institute, 93 (13), pp. 979-989, 2001.
Evtushenko Y.G., Zhadan V.G., Space-transformation technique: the state of the art, In Nonlinear Optimization and Applications, pp. 101-123, 1996.
Fisher R.A., The use of multiple measurements in taxonomic problems, Annals of Eugenics, 7, pp. 179-188, 1936.
Faybusovich L., Dynamical systems which solve optimization problems with linear constraints, IMA J. Math. Ctrl. and Info., 8, pp. 135-149, 1991.
Filippone M., Camastra F., Masulli F., Rovetta S., A survey of kernel and spectral methods for clustering, Pattern Recognition, 41, pp. 176-190, 2008.
Fisher B., Bryant J., Wolmark N., et al., Effect of preoperative chemotherapy on the outcome of women with operable breast cancer, J of Clin Onco, 16, pp. 2672-2685, 1998.
Fletcher R., Practical Methods of Optimization, John Wiley, New York, 1997.
Freund Y., Schapire R.E., A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting, J. Computer Systems Science, 55 (1), pp. 119-139, 1997.
Fujita T., Doihara H., Kawasaki K., et al., PTEN activity could be a predictive marker of trastuzumab efficacy in the treatment of ErbB2-overexpressing breast cancer, British Journal of Cancer, 94, pp. 247-252, 2006.
Fukunaga K., Introduction to Statistical Pattern Recognition, 2nd ed., New York: Academic, 1990.
Fung G., Mangasarian O.L., A feature selection Newton method for support vector machine classification, Computational Optimization and Applications, 28 (2), pp. 185-202, 2004.
Galea M.H., Blamey R.W., Elston C.E., Ellis I.O., The Nottingham prognostic index in primary breast cancer, Breast Cancer Research and Treatment, 22 (3), pp. 207-219, 1992.
Gallardo-Caballero R., García-Orellana C.J., González-Velasco H.M., Macías-Macías M., Independent component analysis applied to detection of early breast cancer signs, in Proceedings of the 9th International Work-Conference on Artificial Neural Networks, pp. 988-995, 2007.
Gevaert O., De Smet F., Timmerman D., Moreau Y., De Moor B., Predicting the prognosis of breast cancer by integrating clinical and microarray data with Bayesian networks, Bioinformatics, 22 (14), pp. 184-190, 2006.
Gilad-Bachrach R., Navot A., Tishby N., Margin Based Feature Selection: Theory and Algorithms, In Proc. 21st Int'l Conf. on Machine Learning, ACM Press, pp. 43-50, 2004.
Goldhirsch A., Wood W.C., Gelber R.D., et al., Meeting highlights: Updated international expert consensus on the primary therapy of early breast cancer, Journal of Clinical Oncology, 21 (17), pp. 3357-3365, 2003.
Golub T., Slonim D., Tamayo P., et al., Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring, Science, 286 (5439), pp. 531-537, 1999.
Gómez-Ruiz J.A., Jerez-Aragonés J.M., Muñoz-Pérez J., Alba-Conejo E., A Neural Network Based Model for Prognosis of Early Breast Cancer, Applied Intelligence, 20, pp. 231-238, 2004.
Gowda K.C., Diday E., Symbolic clustering using a new similarity measure,
IEEE Trans. Systems Man Cybernet., 22, pp. 368-378, 1992.
Crammer K., Gilad-Bachrach R., Navot A., Tishby N., Margin analysis of the LVQ algorithm, In Proceedings of the 17th Int'l Conf. on Neural Information Processing Systems, 2002.
Guyon I., Weston J., Barnhill S., Vapnik V., Gene selection for cancer classification using support vector machines, Mach. Learning, 46 (1-3), pp. 389-422, 2002.
Guyon I., Elisseeff A., An introduction to variable and feature selection, J. Mach. Learn. Res., 3, pp. 1157-1182, 2003.
Guyon I., Gunn S.R., Ben-Hur A., Dror G., Result analysis of the NIPS 2003 feature selection challenge, 17th Adv. Neu. Info. Proc. Sys., Vancouver, Canada, pp. 545-552, 2005.
Haffty B.G., Kornguth P., Fisher D., Beinfield M., McKhann C., Mammographically detected breast cancer: Results with conservative surgery and radiation therapy, Cancer, 67, pp. 2801-2804, 1991.
Haibe-Kains B., Identification and Assessment of Gene Signatures in Human Breast Cancer, Thesis, Université Libre de Bruxelles, 2009.
Haibe-Kains B., Desmedt C., Rothé F., et al., A fuzzy gene expression-based computational approach improves breast cancer prognostication, Genome Biology, 11:R18, 2010.
Hall M.A., Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning, Int. Conf. Mach. Learning ICML, pp. 359-366, 2000.
Hayat M.A., Methods of Cancer Diagnosis, Therapy and Prognosis: Breast Carcinoma, 1, Springer, 2008.
Hedjazi L., Kempowsky T., Le Lann M.V., Aguilar-Martin J., Classification floue des données intervallaires : Application au pronostic du cancer, Actes des XVIèmes Rencontres de la Société Francophone de Classification SFC'09, pp. 165-168, 2009.
Hedjazi L., Aguilar-Martin J., Le Lann M.V., Kempowsky T., Fuzzy mechanisms for unified reasoning about heterogeneous data, 24th Int'l Workshop on Qualitative Reasoning, Portland, 2010a.
Hedjazi L., Kempowsky T., Le Lann M.V., Aguilar-Martin J., Prognosis of breast cancer based on a fuzzy classification method, 4th Int'l Joint Conf Biomed Eng Syst and Tech, 1st Int'l Conf. on Bioinformatics, pp. 123-130, 2010b.
Hedjazi L., Kempowsky T., Despenes L., Elgue S., Le Lann M.V., Aguilar-Martin J., Sensor placement and fault detection using an efficient fuzzy feature selection approach, 49th IEEE Int'l Conf. on Decision and Control CDC, pp. 6827-6832, 2010c.
Hedjazi L., Aguilar-Martin J., Le Lann M.V., Kempowsky T., Towards a Unified Principle for Reasoning about Heterogeneous Data: A Fuzzy Logic Framework, submitted for publication in International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems (under revision), 2011a.
Hedjazi L., Aguilar-Martin J., Le Lann M.V., Similarity-margin based feature selection for symbolic interval data, Pattern Recognition Letters, 32 (4), pp. 578-585, 2011b.
Hedjazi L., Aguilar-Martin J., Le Lann M.V., Kempowsky T., Membership-Margin based Feature Selection for Mixed-Type and High-Dimensional Data, submitted for publication in IEEE SMC, 2011c.
Hedjazi L., Kempowsky T., Le Lann M.V., Dalenc F., Favre G., Improved Breast Cancer Prognosis based on a Hybrid Marker Selection Approach, 5th Int'l Joint Conf Biomed Eng Syst and Tech, 2nd Int'l Conf. on Bioinformatics, pp. 159-164, 2011d.
Hedjazi L., Kempowsky T., Le Lann M.V., Aguilar-Martin J., Une approche floue pour le placement des capteurs et la détection des fautes, QUALITA, 2011e.
Hedjazi L., Kempowsky T., Le Lann M.V., Aguilar-Martin J., Dalenc F., Favre G., Despenes L., Elgue S., From chemical process diagnosis to cancer prognosis: an integrated approach for diagnosis and sensor/marker selection, In: Pistikopoulos E.N., Georgiadis M.C., Kokossis A.C. (eds.), 21st European Symposium on Computer Aided Process Engineering (ESCAPE-21), Computer Aided Chemical Engineering, vol. 29, Amsterdam, Elsevier, pp. 1510-1514, 2011f.
Horn R., Johnson C., Matrix Analysis, Cambridge University Press, Cambridge, 1985.
Hu Q.H., Xie Z.X., Yu D.R., Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation, Pattern Recognition, 40, pp. 3509-3521, 2007.
Hu Q., Yu D., Liu J., Wu C., Neighborhood rough set based heterogeneous feature subset selection, Information Sciences, 178, pp. 3577-3594, 2008.
Huber W., von Heydebreck A., Sültmann H., Poustka A., Vingron M., Variance stabilization applied to microarray data calibration and to the quantification of differential expression, Bioinformatics, 18, pp. 96-104, 2002.
Hudelist G., Köstler W.J., Czerwenka K., et al., Her-2/neu and EGFR tyrosine kinase activation predict the efficacy of trastuzumab-based therapy in patients with metastatic breast cancer, Int'l J of Cancer, 118 (5), pp. 1126-1134, 2005.
Institut de Veille Sanitaire, http://www.invs.sante.fr/
Institut National du Cancer, http://www.e-cancer.fr
Institut National du Cancer, Rapport 2009 sur l'état des connaissances relatives aux biomarqueurs uPA/PAI-1, Oncotype DXTM et Mammaprint® dans la prise en charge du cancer du sein, 2009.
Inza I., Larrañaga P., Blanco R., Cerrolaza A.J., Filter versus wrapper gene selection approaches in DNA microarray domains, Artificial Intelligence in Medicine, 31 (2), pp. 91-103, 2004.
Isaza C., Kempowsky T., Aguilar J., Gauthier A., Qualitative data Classification Using LAMDA and other Soft-Computing Methods, Recent Advances in Artificial Intelligence Research and Development, IOS Press, 2004.
Ishibuchi H., Nozaki K., Tanaka H., Distributed representation of fuzzy rules and its application to pattern classification, Fuzzy Sets and Syst., 52, pp. 21-32, 1992.
Ishibuchi H., Nakashima T., Effect of rule weights in fuzzy rule-based classification systems, IEEE Trans. Fuzzy Syst., 9, pp. 506-515, 2001.
Ishibuchi H., Yamamoto T., Rule weight specification in fuzzy rule-based classification systems, IEEE Trans. Fuzzy Syst., 13, pp.
428-435, 2005.
Ivshina A.V., George J., Senko O., et al., Genetic reclassification of histologic grade delineates new clinical subtypes of breast cancer, Cancer Res., 66 (21), pp. 10292-10301, 2006.
Jaccard P., Nouvelles recherches sur la distribution florale, Bulletin de la Société Vaudoise des Sciences Naturelles, 44, pp. 223-270, 1908.
Jahromi M.Z., Taheri M., A proposed method for learning rule weights in fuzzy rule-based classification systems, Fuzzy Sets and Syst., 159, pp. 449-459, 2008.
Jain A.K., Dubes R.C., Algorithms for Clustering Data, Prentice-Hall, 1988.
Jain A.K., Murty M.N., Flynn P.J., Data clustering: a review, ACM Computing Surveys, 31 (3), 1999.
Janicke F., Prechtl A., Thomssen C., et al., Randomized adjuvant chemotherapy trial in high-risk, lymph node-negative breast cancer patients identified by urokinase-type plasminogen activator and plasminogen activator inhibitor type 1, J Natl Cancer Inst, 93, pp. 913-920, 2001.
Jenssen T., Hovig E., Gene-Expression Profiling in Breast Cancer, Lancet, 365, pp. 634-635, 2005.
Jerez J.M., Peláez J.I., Condoretty A., Alba E., A Neuro-fuzzy Decision Model for Prognosis of Breast Cancer Relapse, In R. Conejo et al. (Eds.), LNAI 3040, pp. 638-645, 2004.
Kaplan E.L., In a retrospective on the seminal paper in "This week's citation classic", Current Contents, 24 (14), 1983.
Kaplan E.L., Meier P., Nonparametric estimation from incomplete observations, J. Amer. Statist. Assn., 53, pp. 457-481, 1958.
Khan J., Wei J., Ringner M., et al., Classification and Diagnostic Prediction of Cancers Using Gene Expression Profiling and Artificial Neural Networks, Nat. Med., 7 (6), pp. 673-679, 2001.
Khan M.U., Choi J.P., Shin H., Kim M., Predicting breast cancer survivability using fuzzy decision trees for personalized healthcare, Conf Proc IEEE Eng Med Biol Soc., pp. 5148-5151, 2008.
Kiefer J., Sequential minimax search for a maximum, Proc. Amer. Math. Soc., 4, pp. 502-506, 1953.
Kira K., Rendell L., A practical approach to feature selection, In Proc. 9th Int'l Workshop on Machine Learning, pp. 249-256, 1992a.
Kira K., Rendell L., The Feature Selection Problem: Traditional Methods and a New Algorithm, Proc. AAAI, pp. 129-134, 1992b.
Koh K., Kim S.J., Boyd S., An interior-point method for large-scale ℓ1-regularized logistic regression, J. Mach. Learn. Res., 8, pp. 1519-1555, 2007.
Kohavi R., John G.H., Wrappers for feature subset selection, Artificial Intelligence, 97, pp. 273-324, 1997.
Kohonen T., Self-organized formation of topologically correct feature maps, Biol Cybernetics, 43, pp. 59-69, 1982.
Konecny G.E., Meng Y.G., Untch M., et al., Association between HER-2/neu and vascular endothelial growth factor expression predicts clinical outcome in primary breast cancer patients, Clin Cancer Res, 10 (5), pp. 1706-1716, 2004.
Kononenko I., Estimating Attributes: Analysis and Extensions of RELIEF, Proc. European Conf. Mach. Learning ECML, pp. 171-182, 1994.
Koscielny S., Critical review of microarray-based prognostic tests and trials in breast cancer, Current Opinion in Obstetrics and Gynecology, 20, pp. 47-50, 2008.
Kuipers B., Qualitative Reasoning: Modeling and Simulation with Incomplete Knowledge, The MIT Press, Cambridge, Massachusetts, London, 1994.
Lafferty J., Wasserman L., Challenges in statistical machine learning, Stat. Sinica, 16, pp. 307-322, 2006.
Land W.H., Verheggen E.A., Multiclass primal support vector machines for breast density classification, Int J Comput Biol Drug Des., 2 (1), pp. 21-57, 2009.
Lee C.C., Fuzzy logic in control systems: Fuzzy logic controller, Part I and Part II, IEEE Trans. Syst., Man, Cybern., 20 (2), pp. 404-435, 1990.
Lee J.K., Havaleshko D.M., Cho H-J., et al., A strategy for predicting the chemosensitivity of human cancers and its application to drug discovery, Proceedings of the National Academy of Sciences of the United States of America, 104 (32), pp. 13086-13091, 2007.
Lee M-L.T., Kuo F.C., Whitmore G.A., Sklar J., Importance of replication in microarray gene expression studies: statistical methods and evidence from repetitive cDNA hybridizations, Proc. Natl. Acad. Sci. USA, 97, pp. 9834-9839, 2000.
Lee M-Y., Yang C-S., Entropy-based feature extraction and decision tree induction for breast cancer diagnosis with standardized thermograph images, Computer Methods and Programs in Biomedicine, 100, pp. 269-282, 2010a.
Lee S.I., Lee H., Abbeel P., Ng A.Y., Efficient ℓ1 regularized logistic regression, Proc. 21st AAAI Conf. Artif. Intell., Boston, MA, USA, pp. 1-9, 2006.
Lee Y., Support vector machines for classification: a statistical portrait, Methods Mol Biol., 620, pp. 347-368, 2010b.
Li J., Zhang Z., Rosenzweig J., Wang Y.Y., Chan D.W., Proteomics and bioinformatics approaches for identification of serum biomarkers to detect breast cancer, Clinical Chemistry, 48 (8), pp. 1296-1304, 2002.
Li T., Zhang C., Ogihara M., A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression, Bioinformatics, 20 (15), pp. 2429-2437, 2004.
Li Y., Wu Z-F., Fuzzy feature selection based on min-max learning rule and extension matrix, Pattern Recognition, 41, pp. 217-226, 2008.
Li Y., Lu B-L., Feature selection based on loss-margin of nearest neighbor classification, Pattern Recognition, 42, pp. 1914-1921, 2009.
Liu H., Hussain F., Tan C.L., Dash M., Discretization: an enabling technique, J. Data Mining and Knowledge Discovery, 6 (4), pp. 393-423, 2002.
Liu H., Yu L., Toward Integrating Feature Selection Algorithms for Classification and Clustering, IEEE Transactions on Knowledge and Data Engineering, 17 (4), pp. 491-502, 2005.
Liu H.X., Zhang R.S., Luan F., et al., Diagnosing Breast Cancer Based on Support Vector Machines, J. Chem. Inf. Comput. Sci., 43 (3), pp. 900-907, 2003.
Lucas P.J.F., Model-based diagnosis in medicine, Artif. Intell. Med., 10, pp. 201-208, 1997.
Ma X., Wang J.Z., Ryan P.D., et al., A two-gene expression ratio predicts clinical outcome in breast cancer patients treated with tamoxifen, Cancer Cell, 5, pp. 607-616, 2004.
Maclin P.S., Dempsey J., Brooks J., Rand J., Using neural networks to diagnose cancer, Journal of Medical Systems, 15 (1), pp. 11-19, 1991.
MacQueen J., Some methods for classification and analysis of multivariate observations, Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, 1, University of California, Berkeley, USA, pp. 281-297, 1967.
Makris A., Powles T.J., Ashley S.E., et al., A reduction in the requirements for mastectomy in a randomized trial of neoadjuvant chemoendocrine therapy in primary breast cancer, Ann Oncol, 9 (11), pp. 1179-1184, 1998.
Mangasarian O.L., Exact 1-Norm support vector machines via unconstrained convex differentiable minimization, J. Mach. Learn. Res., 7, pp. 1517-1530, 2006.
Matsumura Y., Tarin D., Significance of CD44 gene products for cancer diagnosis and disease evaluation, The Lancet, 340 (8827), pp. 1053-1058, 1992.
Mauri D., Pavlidis N., Ioannidis J.P.A., Neoadjuvant Versus Adjuvant Systemic Treatment in Breast Cancer: A Meta-Analysis, J. Natl. Cancer Inst., 97 (3), pp. 188-194, 2005.
Medasani S., Kim J., An overview of membership function generation techniques for pattern recognition, Int'l J. of Approximate Reasoning, 19, pp. 391-417, 1998.
Mian S., Ugurel S., Parkinson E., Serum proteomic fingerprinting discriminates between clinical stages and predicts disease progression in melanoma patients, J Clin Oncol, 23, pp. 5088-5093, 2005.
Michalski R.S., Stepp R.E., Automated construction of classifications: Conceptual clustering versus numerical taxonomy, IEEE Trans. Pattern Anal. Machine Intell., 5 (4), pp. 396-410, 1980.
Michiels S., Koscielny S., Hill C., Prediction of cancer outcome with microarrays: a multiple random validation strategy, The Lancet, 365 (9458), pp. 488-492, 2005.
Mika S., Ratsch G., Weston J., Scholkopf B., Mullers K.R., Fisher discriminant analysis with kernels, Workshop Neural Networks for Signal Processing IX, Proceedings IEEE Signal Processing Society, 1999.
Miller L.D., Smeds J., George J., et al., An expression signature for p53 status in human breast cancer predicts mutation status, transcriptional effects, and patient survival, PNAS, 102 (38), pp. 13550-13555, 2005.
Mitchell T.M., Machine Learning, McGraw-Hill, 1997.
Mitra P., Murthy C.A., Pal S.K., Unsupervised Feature Selection Using Feature Similarity, IEEE Transactions on Pattern Analysis and Machine Intelligence, 24 (3), pp. 301-312, 2002.
Mohri T., Hidehiko T., An optimal weighting criterion of case indexing for both numeric and symbolic attributes, in D.W. Aha (Ed.), Case-Based Reasoning Workshop, Menlo Park, CA: AAAI Press, pp. 123-127, 1994.
Ng A.Y., Feature selection, L1 vs. L2 regularization, and rotational invariance, Proc. 21st Intl. Conf. Mach. Learn., Canada, pp. 78-86, 2004.
Nocedal J., Wright S.J., Numerical Optimization, Springer Verlag, New York, NY, 1999.
Novak J.P., Sladek R., Hudson T.J., Characterization of variability in large-scale gene expression data: implications for study design, Genomics, 79, pp. 104-113, 2002.
Nykter M., Aho T., Ahdesmäki M., Ruusuvuori P., et al., Simulation of microarray data with realistic characteristics, BMC Bioinformatics, 7 (1), pp. 332-349, 2006.
Olivotto I.A., Bajdik C.D., Ravdin P.M., et al., Population-based validation of the prognostic model Adjuvant! for early breast cancer, J of Clin Onco, 23 (12), pp. 2716-2725, 2005.
Olshen A.B., Jain A.N., Deriving quantitative conclusions from microarray expression data, Bioinformatics, 18 (7), pp. 961-970, 2002.
Paik S., Shak S., Tang G., et al., A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer, New England Journal of Medicine, 351 (27), pp. 2817-2826, 2004.
Parry R.M., Jones W., Stokes T.H., et al., k-Nearest neighbor models for microarray gene expression analysis and clinical outcome prediction, Bioinformatics, 18 (7), pp. 961-970, 2002.
Perez E.A., Suman V.J., Davidson N.E., et al., HER2 Testing by Local, Central, and Reference Laboratories in Specimens From the North Central Cancer Treatment Group N9831 Intergroup Adjuvant Trial, J Clin Oncol, 24 (19), pp. 3032-3038, 2006.
Perou C.M., Sorlie T., Eisen M.B., et al., Molecular portraits of human breast tumours, Nature, 406, pp. 747-752, 2000.
Piera N., Aguilar-Martin J., Controlling selectivity in non-standard pattern recognition algorithms, IEEE Trans. on Systems, Man and Cybernetics, 21 (1), 1991.
Quinlan J.R., Induction of decision trees, Machine Learning, 1, pp. 81-106, 1986.
Raemaekers T., Ribbeck K., Beaudouin J., et al., NuSAP, a novel microtubule-associated protein involved in mitotic spindle organization, J Cell Biol, 162 (6), pp. 1017-1029, 2003.
Ramaswamy S., Tamayo P., Rifkin R., et al., Multiclass Cancer Diagnosis Using Tumor Gene Expression Signatures, Proc. Nat'l Acad. Sc. USA, 98 (26), pp. 15149-15154, 2001.
Rasmani K.A., Shen Q., Modifying Weighted Fuzzy Subsethood-based Rule Models with Fuzzy Quantifiers, In Proceedings of the IEEE International Conference on Fuzzy Systems, pp. 1679-1684, 2004.
Reid J.F., Lusa L., De Cecco L., et al., Limits of Predictive Models Using Microarray Data for Breast Cancer Clinical Treatment Outcome, JNCI J Natl Cancer Inst, 97 (12), pp. 927-930, 2005.
Reis-Filho J.S., Westbury C., Pierga J-Y., The impact of expression profiling on prognostic and predictive testing in breast cancer, J. Clin. Pathol., 59, pp. 225-231, 2006.
Rennie J.D., Srebro N., Fast maximum margin matrix factorization for collaborative prediction, Proc. 22nd Intl. Conf. Mach. Learn., Bonn, Germany, pp. 713-719, 2005.
Ressom H., Reynolds R., Varghese R.S., Increasing the efficiency of fuzzy logic-based gene expression data analysis, Physiol Genomics, 13, pp. 107-117, 2003.
Ripley R.M., Harris A.L., Tarassenko L., Non-linear survival analysis using neural networks, Stat. Med., 23, pp. 825-842, 2004.
Robnik-Šikonja M., Kononenko I., Theoretical and empirical analysis of ReliefF and RReliefF, Machine Learning, 53, pp. 23-69, 2003.
Rodvold D.M., McLeod D.G., Brandt J.M., Snow P.B., Murphy G.P., Introduction to artificial neural networks for physicians: taking the lid off the black box, Prostate, 46, pp. 39-44, 2001.
Rosenblatt F., Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms, Spartan Books, 7 (3), pp. 218-219, 1962.
Rosset S., Zhu J., Hastie T., Boosting as a regularized path to a maximum margin classifier, J. Mach. Learn. Res., 5, pp. 941-973, 2004.
Rouzier R., Perou C.M., Symmans W.F., et al., Breast Cancer Molecular Subtypes Respond Differently to Preoperative Chemotherapy, Clin Cancer Res, 11, pp. 5678-5685, 2005.
Rumelhart D.E., Hinton G.E., Williams R.J., Learning representations by back-propagating errors, Nature, 323, pp. 533-536, 1986.
Saeys Y., Inza I., Larrañaga P., A review of feature selection techniques in bioinformatics, Bioinformatics, 23 (19), pp. 2507-2517, 2007.
Sanz J.A., Fernández A., Bustince H., Herrera F., Improving the performance of fuzzy rule-based classification systems with interval-valued fuzzy sets and genetic amplitude tuning, Inform. Sci., 180, pp. 3674-3685, 2010.
Schmidt M., Fung G., Rosales R., Fast optimization methods for L1 regularization: a comparative study and two new approaches, Proc. 18th Euro. Conf. Mach. Learn., pp. 286-297, 2007.
Schnieders F., Dörk T., Arnemann J., et al., Testis-specific protein, Y-encoded (TSPY) expression in testicular tissues, Hum. Mol. Genet., 5, pp. 1801-1807, 1996.
Sheng Q., Moreau Y., De Moor B., Biclustering microarray data by Gibbs sampling,
Bioinformatics, 19 (2), pp. 196-205, 2003.
Shipp M., Ross K., Tamayo P., et al., Diffuse Large B-Cell Lymphoma Outcome Prediction by Gene-Expression Profiling and Supervised Machine Learning, Nat. Med., 8 (1), pp. 68-74, 2002.
Simes R.J., Treatment selection for cancer patients: application of statistical decision theory to the treatment of advanced ovarian cancer, J Chronic Dis, 38, pp. 171-186, 1985.
Singh D., Febbo P., Ross K., et al., Gene Expression Correlates of Clinical Prostate Cancer Behavior, Cancer Cell, 1 (2), pp. 203-209, 2002.
Singletary S.E., Allred C., Ashley P., et al., Revision of the American Joint Committee on Cancer staging system for breast cancer, J Clin Oncol, 20, pp. 3628-3636, 2002.
Smith R.E., A review of selective estrogen receptor modulators and National Surgical Adjuvant Breast and Bowel Project clinical trials, Semin Oncol, 30, pp. 4-13, 2003.
Sotiriou C., Neo S-Y., McShane L.M., et al., Breast cancer classification and prognosis based on gene expression profiles from a population-based study, PNAS, 100 (18), pp. 10393-10398, 2003.
Stephenson A.J., Smith A., Kattan M.W., et al., Integration of gene expression profiling and clinical variables to predict prostate carcinoma recurrence after radical prostatectomy, Cancer, 104, pp. 290-298, 2005.
Straver M.E., Glas A.M., Hannemann J., et al., The 70-gene signature as a response predictor for neoadjuvant chemotherapy in breast cancer, Breast Cancer Res. Treat., 119 (3), pp. 551-558, 2009.
Sugeno M., An introductory survey of fuzzy control, Inf. Sci., 36 (1/2), pp. 59-83, 1985.
Sun Y., Todorovic S., Li J., Wu D., Unifying Error-Correcting and Output-Code AdaBoost through the Margin Concepts, In Proceedings of the 22nd Int'l Conf. on Machine Learning, pp. 872-879, 2005.
Sun Y., Goodison S., Li J., Liu L., Farmerie W., Improved breast cancer prognosis through the combination of clinical and genetic markers, Bioinformatics, 23 (1), pp. 30-37, 2007a.
Sun Y., Iterative RELIEF for Feature Weighting: Algorithms, Theories, and Applications, IEEE TPAMI, 29 (6), pp. 1035-1051, 2007b.
Swets J.A., Signal Detection Theory and ROC Analysis in Psychology and Diagnostics: Collected Papers, Scientific Psychology Series, Hillsdale, NJ: Lawrence Erlbaum Associates, 1996.
Tamayo P., Slonim D., Mesirov J., et al., Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation, Proc Natl Acad Sci USA, 96 (6), pp. 2907-2912, 1999.
Tibshirani R., Regression shrinkage and selection via the LASSO, J. R. Stat. Soc. Ser. B, 58, pp. 267-288, 1996.
Tu Y., Stolovitzky G., Klein U., Quantitative noise analysis for gene expression microarray experiments, PNAS, 99 (22), pp. 14031-14036, 2002.
Van de Vijver M.J., He Y.D., Van't Veer L.J., et al., A gene expression signature as a predictor of survival in breast cancer, N Engl J Med, 347 (25), pp. 1999-2009, 2002.
Van't Veer L.J., Dai H., van de Vijver M.J., Gene expression profiling predicts clinical outcome of breast cancer, Nature, 415, pp. 530-536, 2002.
Vapnik V.N., Statistical Learning Theory, John Wiley & Sons, 1998.
Vellido A., Lisboa P.J.G., Neural and Other Machine Learning Methods in Cancer Research, In F. Sandoval et al. (Eds.), LNCS 4507, pp. 964-971, 2007.
Vogel C.L., Cobleigh M.A., Tripathy D., et al., Efficacy and safety of trastuzumab as a single agent in first-line treatment of HER2-overexpressing metastatic breast cancer, J of Cl Onco, 20 (3), pp. 719-726, 2002.
Wang J., Bø T.H., Jonassen I., et al., Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data, BMC Bioinformatics, 4 (60), 2003.
Wang W., Zhang Y., On fuzzy cluster validity indices, Fuzzy Sets and Syst., 158 (19), pp. 2095-2117, 2007.
Wang X., Wang Y., Wang L., Improving fuzzy c-means clustering based on feature-weight learning, Pattern Recognition Letters, 25, pp.
1123-1132, 2004.
Wang Y., Klijn J.G., Zhang Y., et al., Gene-expression profiles to predict distant metastasis of lymph-node-negative primary breast cancer, Lancet, 365, pp. 671-679, 2005a.
Wang Y., Tetko I.V., Hall M.A., et al., Gene selection from microarray data for cancer classification: a machine learning approach, Computational Biology and Chemistry, 29 (1), pp. 37-46, 2005b.
Wei H-L., Billings S.A., Feature Subset Selection and Ranking for Data Dimensionality Reduction, IEEE Transactions on Pattern Analysis and Machine Intelligence, 29 (1), pp. 162-166, 2007.
Weston J., Mukherjee S., Chapelle O., Pontil M., Vapnik V., Feature Selection for SVMs, Advances in Neural Information Processing Systems, pp. 668-674, 2001.
Wettschereck D., Aha D.W., Weighting features, Case-Based Reasoning Research and Development, Lecture Notes in Computer Science, 1010, pp. 347-358, 1995.
Wettschereck D., Aha D.W., Mohri T., A Review and Empirical Evaluation of Feature Weighting Methods for a Class of Lazy Learning Algorithms, Artificial Intelligence Review, 11 (1-5), pp. 273-314, 1997.
Wiseman S.M., Makretsov N., Nielsen T.O., et al., Coexpression of the Type 1 Growth Factor Receptor Family Members HER-1, HER-2, and HER-3 Has a Synergistic Negative Prognostic Effect on Breast Carcinoma, Cancer, 103 (9), pp. 1770-1777, 2005.
Wolberg W.H., Street W.N., Mangasarian O.L., Machine learning techniques to diagnose breast cancer from fine-needle aspirates, Cancer Letters, 77, pp. 163-171, 1994.
Wong S.L., Chao C., Edwards M.J., et al., Frequency of sentinel lymph node metastases in patients with favorable breast cancer histologic subtypes, Am J Surg, 184, pp. 492-498, 2002.
Wygralak M., An axiomatic approach to scalar cardinalities of fuzzy sets, Fuzzy Sets and Systems, 110, pp. 175-179, 2000.
Xing E., Jordan M., Karp R., Feature selection for high dimensional genomic microarray data, Proc. 18th Int'l Conf. Machine Learning, pp. 601-608, 2001.
Xu R., Wunsch D. II, Survey of clustering algorithms, IEEE Trans. Neural Networks, 16 (3), pp. 645-678, 2005.
Yang R-B., Ng C.K.D., Wasserman S.M., et al., Identification of a novel family of cell-surface proteins expressed in human vascular endothelium, J. Biol. Chem., 277, pp. 46364-46373, 2002.
Yao J., Dash M., Tan S.T., Liu H., Entropy-based fuzzy clustering and fuzzy modeling, Fuzzy Sets and Syst., 113, pp. 381-388, 2000.
Zadeh L.A., Fuzzy sets and systems theory, In: Fox J. (ed.), Polytechnic Press, pp. 29-37, 1965.
Zadeh L.A., Biological applications of the theory of fuzzy sets and systems, In Proc. of an International Symposium on Biocybernetics of the Central Nervous System, pp. 199-206, 1969.
Zhang L., Zhou W., Velculescu V.E., et al., Gene Expression Profiles in Normal and Cancer Cells, Science, 276 (5316), pp. 1268-1272, 1997.
Zheng B., Wang X., Lederman D., Tan J., Gur D., Computer-aided detection: the effect of training databases on detection of subtle breast masses, Pharmacogenomics J., 10 (4), pp. 292-309, 2010.
Zhou X., Tan M., Stone Hawthorne V., et al., Activation of the Akt/mammalian target of rapamycin/4E-BP1 pathway by ErbB2 overexpression predicts tumor progression in breast cancers, Clin Cancer Res, 10, pp. 6779-6788, 2004.
Zhu J., Rosset S., Hastie T., Tibshirani R., 1-norm support vector machines, Proc. 16th Adv. Neu. Info. Proc. Sys., pp. 49-56, 2003.
Zindy P., Bergé Y., Allal B.C., et al., Formation of the eIF4F translation initiation complex determines sensitivity to anti-cancer drugs targeting the EGFR and HER2 receptors, Cancer Research, doi: 10.1158/0008-5472.CAN-11-0420, 2011.
Zwick R., Carlstein E., Budescu D.V., Measures of similarity among fuzzy concepts: A comparative analysis, Int'l J. Approx. Reason., 1 (2), pp. 221-242, 1987.