Vitenskapelige artikler m/referee

Analysis of genetic marker – phenotype relationships by jack-knifed partial least squares regression (PLSR).

Dette er artikkel 1 i en serie på to artikler. En ny og relativt enkel mate å analysere genmarkør-data presenters. Metoden er basert på den kjemometriske PLS Regresjons-teknikken for multivariat data-analyse, kombinert med nye grafiske metoder. Fordelen med metoden er at den er relativt forutsetningsfri, og kan derfor brukes på datasett der klassisk maximum-likelihood-basert genmarkør-analyse får problemer på grunn av for få observasjoner. En annen fordel er at genmarkør-dataene analyseres på nøyaktig samme måte som man analyserer andre typer multikanals data; derfor er metoden velegnet også for ikke-statistikere.

Nøkkeldata

Årstall 2004
Abstract The utility of a relatively new multivariate method, Bi-Linear Modelling by cross-validated Partial Least Squares Regression (PLSR), was investigated in the analysis of QTL. The distinguishing feature of PLSR is to reveal reliable covariance structures in data of different types with regard to the same set objects. Two matrices X (here: genetic markers) and Y (here: phenotypes) are interactively decomposed into latent variables (’ PLS Components, or PCs) in a way, which facilitates statistically reliable and graphically interpretable model building. Natural collinearities between input variables are utilized actively to stabilise the modelling, instead of being treated as a statistical problem. The importance of cross-validation/jack-knifing as an intuitively appealing way to avoid overfitting, is emphasized. Two datasets from chromosomal mapping studies of different complexity were chosen for illustration (QTLs for tomato yield and for oat heading data). Results from PLSR analysis were compared to published results and to results using the package PLABQTL in these data sets. In all cases PLSR gave at least similar explained validation variances as the reported studies. An attractive feature is that PLSR allows the analysis of several traits/replicates in one analysis, and the direct visual identification of individuals with desirable marker genotypes. It is suggested that PLSR may be useful in structural and functional genomics and in marker assisted selection, particularly in cases with limited number of objects. The second paper in this series utilises the new features of this data analytical approach for QTL studies of a more complex phenotypic trait in barley.
Referanse Bjørnstad, Å., Westad, F., Martens, H. 2004. Analysis of genetic marker – phenotype relationships by jack-knifed partial least squares regression (PLSR). Hereditas, Vol 141, pp 149-165.
Utgiver Hereditas,

Relaterte personer

  • Harald Martens

    Seniorforsker

    Tlf: +47 64970291

    Mobil: +47 950 75 025