A Predictive Based Regression Algorithm for Gene Network Selection St´ ephane Guerrier

Téléchargement

A Predictive Based Regression Algorithm

for Gene Network Selection

St´

ephane Guerrier1, Nabil Mili2& Samuel Orso2

1Department of Statistics, University of Illinois at Urbana-Champaign, USA

2Research Center for Statistics, University of Geneva, Switzerland

joint work with

Marco Avella Medina (U. Geneva), Yanyuan Ma (USC),

Roberto Molinari (U. Geneva)

June 6, 2016

S. Guerrier, N. Mili & S. Orso Panning Algorithm for Gene Selection June 6, 2016 1 / 32

Introduction Motivation

Introduction

Gene Selection Problems:

Selection of relevant genes is a common task in most gene expression

studies. Researchers try to identify the smallest possible set of genes

that can still achieve good predictive performance (D

ıaz-Uriarte &

Alvarez de Andr´

es, 2006).

How statisticians (typically) understand this deﬁnition:

We are looking for a single model.

For a given candidate model, picking the most likely parameters

given the data is optimal.

Predictive performance can be measured by the likelihood function

(typically out-of-sample).

The order in which the variables enter the model is unimportant

(implying: Model ABis equivalent to Model BA).

S. Guerrier, N. Mili & S. Orso Panning Algorithm for Gene Selection June 6, 2016 2 / 32

Introduction Motivation

Equivalence of outcomes according likelihood function

S. Guerrier, N. Mili & S. Orso Panning Algorithm for Gene Selection June 6, 2016 3 / 32

Introduction Potential drawbacks

Introduction

Is this a good idea?

According to our understanding of the problem (i.e. single model based on

likelihood methods): YES! However:

Focusing on a single model suggests a level of conﬁdence in our ﬁnal

result that is not justiﬁed by the data as other models generally exist

with similar good ﬁt (Whittingham et al., 2006).

Maximizing the likelihood function does not guarantee ﬁnding the

best model(s) (and parameters) according to a given out-of-sample

(medically chosen) objective function (e.g. classiﬁcation error, quality

of life, mortality, ... ).

The unimportance of the order of variable can causes

interpretation issues.

These methods are prone to overﬁtting (due to the asymmetric

eﬀects of “under” vs “over” ﬁtting).

S. Guerrier, N. Mili & S. Orso Panning Algorithm for Gene Selection June 6, 2016 4 / 32

Introduction Random Medical News

This can lead to...

S. Guerrier, N. Mili & S. Orso Panning Algorithm for Gene Selection June 6, 2016 5 / 32

1 / 32 100%

Documents connexes

The Marseille SIRIC, which IPC is part of together with AP

Robert Vanden Eynde

AN ALTERNATIVE TO VIRAL VECTORS (VVs)

Open access

Profil N° (à remplir par VAS) FINANCEMENT

Symposium Satellite HER2+ Early Breast Cancer Therapy:

2015/2016 Marketing Models Use of languages Contact

Influenza A gutless vector: new approach against lung cancer

Poulakidas Angela

Table of Contents

devoirs - cloudfront.net

Merci pour votre participation!

Faire une suggestion

Avez-vous trouvé des erreurs dans l'interface ou les textes ? Ou savez-vous comment améliorer l'interface utilisateur de StudyLib ? N'hésitez pas à envoyer vos suggestions. C'est très important pour nous!

GDPR Confidentialité Conditions d'utilisation

A Predictive Based Regression Algorithm for Gene Network Selection St´ ephane Guerrier

Documents connexes

Faire une suggestion

Produits

Assistance

Produits

Assistance

A Predictive Based Regression Algorithm for Gene Network Selection St´ ephane Guerrier

Documents connexes

Faire une suggestion

Produits

Assistance

Ajouter ce document à la (aux) collections

Ajouter ce document à enregistré

Suggérez-nous comment améliorer StudyLib