Title: | Loadings for Principal Component Analysis and Partial Least Squares |
---|---|
Description: | Computing statistical hypothesis testing for loading in principal component analysis (PCA) (Yamamoto, H. et al. (2014) <doi:10.1186/1471-2105-15-51>), orthogonal smoothed PCA (OS-PCA) (Yamamoto, H. et al. (2021) <doi:10.3390/metabo11030149>), one-sided kernel PCA (Yamamoto, H. (2023) <doi:10.51094/jxiv.262>), partial least squares (PLS) and PLS discriminant analysis (PLS-DA) (Yamamoto, H. et al. (2009) <doi:10.1016/j.chemolab.2009.05.006>), PLS with rank order of groups (PLS-ROG) (Yamamoto, H. (2017) <doi:10.1002/cem.2883>), regularized canonical correlation analysis discriminant analysis (RCCA-DA) (Yamamoto, H. et al. (2008) <doi:10.1016/j.bej.2007.12.009>), multiset PLS and PLS-ROG (Yamamoto, H. (2022) <doi:10.1101/2022.08.30.505949>). |
Authors: | Hiroyuki Yamamoto |
Maintainer: | Hiroyuki Yamamoto <[email protected]> |
License: | LGPL-3 |
Version: | 0.5.1 |
Built: | 2025-01-26 02:39:01 UTC |
Source: | https://github.com/hiroyukiyamamoto/loadings |
This is metagenome and metabolome data of antibiotic-treated mouse fecal samples [Y. Wakita, et al, BMC Microbiology, 18:188 (2018)]. This dataset includes six groups, comprising three types (Clea, CR, and Slc) of control mice and their antibiotic-treated counterparts.
data(antibiotic)
data(antibiotic)
The list object "antibiotic" contains the following elements:
data$X$microbiome$dist: Weighted UniFrac distance matrix for the metagenome.
data$X$microbiome$comp: Composition data for the metagenome.
data$X$metabolome : Metabolome data.
data$Y: A dummy matrix that includes group information with 0 or 1 in each column.
annotation$metabolome : Compound annotation for metabolome data.
Y. Wakita, et al, Taxonomic classification for microbiome analysis, which correlates well with the metabolite milieu of the gut, BMC microbiology, 18:188 (2018)
This is proteome and metabolome data of COVID-19 serum samples [B. Shen et al, Cell. 182 (2020) 59-72.e15.]. This dataset includes three groups of healthy control, mild and severe COVD-19.
data(covid19)
data(covid19)
The list object "covid19" contains the following elements:
X : List of data matrix that include samples in each rows and protein or metabolite in each columns for proteomics and metabolomics.
Y : Dummy matrix that include group information 0,1 in each columns.
D : Example of differential matrix used in PLS-ROG or multiset PLS-ROG.
tau : Example of strength parameter matrix used in mutliset PLS or multiset PLS-ROG. All weights were set as the same value of 0.1.
K : List of kernel matrix by hyperbolic tangent kernel (scale=1, offset=1).
B. Shen, et al, Proteomic and Metabolomic Characterization of COVID-19 Patient Sera, Cell. 182 (2020) 59-72.e15.
This data includes metabolome data of liver samples from normal and 12 hour fasted mice measured by capillary electrophoresis time-of-flight mass spectrometry. The data matrix contains 10 samples (5 samples in each groups) and 282 metabolites.
data(fasting)
data(fasting)
The list object "fasting" contains the following elements:
X : Data matrix that include metabolites in each columns.
Y : Dummy matrix that include group information 0,1 in each columns.
Yamamoto H., Fujimori T., Sato H., Ishikawa G., Kami K., Ohashi Y. (2014). "Statistical hypothesis testing of factor loading in principal component analysis and its application to metabolite set enrichment analysis". BMC Bioinformatics, (2014) 15(1):51.
This includes metabolome data of Japanese green tea measured by gas chromatgraphy mass spectrometry. The data matrix contains 30 samples and 225 metabolites.
data(greentea)
data(greentea)
The list object "greentea" contains the following elements:
X : Data matrix that include metabolites in each columns.
Y : Dummy matrix that include group information 0,1 in each columns.
D : Differential matrix (Second order).
M : Averaging matrix for repeated data.
y : Ranking of taste in competitive exhibition.
Statistics in Microsoft Excel ( http://prime.psc.riken.jp/compms/others/main.html#Statistics )
Pongsuwan W., Fukusaki E., Bamba T., Yonetani T., Yamahara A.T., Kobayashi A. Prediction of Japanese Green Tea Ranking by Gas Chromatography/Mass Spectrometry-Based Hydrophilic Metabolite Fingerprinting. J. Agric. Food Chem. 2007;55:231-236.
This function performs Multiset partial least squares (Multiset PLS). In this function, data matrix is automatically scaled to zero mean and unit variance (i.e. autoscaling) for each variables.
multipls_geigen(X,Y,tau)
multipls_geigen(X,Y,tau)
X |
List of data matrix that include variables in each columns. |
Y |
Dummy matrix that include group information 0,1 in each columns. |
tau |
Matrix for strength parameter of the connection between omics datasets or between omics dataset and group information. |
Diagonal elements of matrix tau must be 0.
The return value is a list object that contains the following elements:
P : A list of matrix with Multiset PLS coefficients for the explanatory variables in each column for each dataset
T : A list of matrix with Multiset PLS scores for the explanatory variables in each column for each dataset
Q : A matrix with Multiset PLS coefficients for the response variable in each column
U : A matrix with Multiset PLS scores for the response variable in each column
tau : Matrix for strength parameter of the connection between omics datasets or between omics dataset and group information (same as input value).
Hiroyuki Yamamoto
Yamamoto H. (2022) Multiset partial least squares with rank order of groups for integrating multi-omics data, bioRxiv.
data(whhl) X <- whhl$X Y <- whhl$Y tau <- whhl$tau multipls <- multipls_geigen(X,Y,tau)
data(whhl) X <- whhl$X Y <- whhl$Y tau <- whhl$tau multipls <- multipls_geigen(X,Y,tau)
This function computes multiset partial least squares (PLS) loading from the result of the "multipls_geigen" function.
multipls_loading(multipls)
multipls_loading(multipls)
multipls |
The following variables (P,T,Q and U) are included in the multipls object. P : A list of matrix with Multiset PLS coefficients for the explanatory variables in each column for each dataset T : A list of matrix with Multiset PLS scores for the explanatory variables in each column for each dataset Q : A matrix with Multiset PLS coefficients for the response variable in each column U : A matrix with Multiset PLS scores for the response variable in each column tau : Matrix for strength parameter of the connection between omics datasets or between omics dataset and group information. |
The multiset PLS loading and the p-value by statistical hypothesis testing is added to the multipls object returned by the "multipls_geigen" function in this package.
The return value is a list object that contains the following elements:
Score : A list of matrix with Multiset PLS score for auxiliary variable in each column for each dataset
R : A list of matrix with Multiset PLS loadings in each column for each dataset
p.value : A list of matrix with p-value of multiset PLS loading in each column for each dataset
Hiroyuki Yamamoto
Yamamoto H. (2022) Multiset partial least squares with rank order of groups for integrating multi-omics data, bioRxiv.
data(whhl) X <- whhl$X Y <- whhl$Y tau <- whhl$tau multipls <- multipls_geigen(X,Y,tau) multipls <- multipls_loading(multipls) multipls$loading$R multipls$loading$p.value
data(whhl) X <- whhl$X Y <- whhl$Y tau <- whhl$tau multipls <- multipls_geigen(X,Y,tau) multipls <- multipls_loading(multipls) multipls$loading$R multipls$loading$p.value
This function performs multiset partial least squares with rank order of groups (Multiset PLS-ROG). In this function, data matrix is automatically scaled to zero mean and unit variance (i.e. autoscaling) for each variables.
multipls_rog(X,Y,tau,D,kappa)
multipls_rog(X,Y,tau,D,kappa)
X |
List of data matrix that include variables in each columns. |
Y |
Dummy matrix that include group information 0,1 in each columns. |
tau |
Matrix for strength parameter of the connection between omics datasets or between omics dataset and group information. |
D |
Differential matrix. |
kappa |
The smoothing parameter (default : 0.999). |
Diagonal elements of matrix tau must be 0.
The return value is a list object that contains the following elements:
P : A list of matrix with Multiset PLS-ROG coefficients for the explanatory variables in each column for each dataset
T : A list of matrix with Multiset PLS-ROG scores for the explanatory variables in each column for each dataset
Q : A matrix with Multiset PLS-ROG coefficients for the response variable in each column
U : A matrix with Multiset PLS-ROG scores for the response variable in each column
tau : Matrix for strength parameter of the connection between omics datasets or between omics dataset and group information (same as input value).
Hiroyuki Yamamoto
Yamamoto H. (2022) Multiset partial least squares with rank order of groups for integrating multi-omics data, bioRxiv.
data(whhl) X <- whhl$X Y <- whhl$Y D <- whhl$D tau <- whhl$tau multiplsrog <- multipls_rog(X,Y,tau,D) # multiplsrog <- multipls_rog(X,Y,tau,D, kappa=0.999)
data(whhl) X <- whhl$X Y <- whhl$Y D <- whhl$D tau <- whhl$tau multiplsrog <- multipls_rog(X,Y,tau,D) # multiplsrog <- multipls_rog(X,Y,tau,D, kappa=0.999)
This function computes multiset partial least squares rank order of groups (PLS-ROG) loading from the result of the "multipls_rog" function.
multiplsrog_loading(multiplsrog)
multiplsrog_loading(multiplsrog)
multiplsrog |
The following variables (P,T,Q and U) are included in the multiplsrog object. P : A list of matrix with Multiset PLS-ROG coefficients for the explanatory variables in each column for each dataset T : A list of matrix with Multiset PLS-ROG scores for the explanatory variables in each column for each dataset Q : A matrix with Multiset PLS-ROG coefficients for the response variable in each column U : A matrix with Multiset PLS-ROG scores for the response variable in each column tau : Matrix for strength parameter of the connection between omics datasets or between omics dataset and group information. |
The multiset PLS-ROG loading and the p-value by statistical hypothesis testing is added to the multiplsrog object returned by the "multipls_rog" function in this package.
The return value is a list object that contains the following elements:
Score : A list of matrix with Multiset PLS score for auxiliary variable in each column for each dataset
R : A list of matrix with Multiset PLS-ROG loadings in each column for each dataset
p.value : A list of matrix with p-value of multiset PLS-ROG loading in each column for each dataset
Hiroyuki Yamamoto
Yamamoto H. (2022) Multiset partial least squares with rank order of groups for integrating multi-omics data, bioRxiv.
data(whhl) X <- whhl$X Y <- whhl$Y D <- whhl$D tau <- whhl$tau multiplsrog <- multipls_rog(X,Y,tau,D) # multiplsrog <- multipls_rog(X,Y,tau,D, kappa=0.999) multiplsrog <- multiplsrog_loading(multiplsrog) multiplsrog$loading$R multiplsrog$loading$p.value
data(whhl) X <- whhl$X Y <- whhl$Y D <- whhl$D tau <- whhl$tau multiplsrog <- multipls_rog(X,Y,tau,D) # multiplsrog <- multipls_rog(X,Y,tau,D, kappa=0.999) multiplsrog <- multiplsrog_loading(multiplsrog) multiplsrog$loading$R multiplsrog$loading$p.value
This function performs one-sided kernel principal component analysis (one-sided KPCA). In this function, data matrix is automatically scaled to zero mean and unit variance (i.e. autoscaling) for each variables.
one_kpca(X,K)
one_kpca(X,K)
X |
Data matrix that include variables in each columns. |
K |
Kernel matrix computed from the data matrix X. |
The kernel matrix K, which is the argument of the one_kpca function, must be centered.
The return value is a list object that contains the following elements:
P : A matrix with one-sided KPCA loading in each column
T : A matrix with one-sided KPCA score for linear side in each column
U : A matrix with one-sied KPCA score for nonlinear side in each column
Hiroyuki Yamamoto
Yamamoto H. (2023) One-sided Kernel Principal Component Analysis, Jxiv, <doi:10.51094/jxiv.262>.
# COVID-19 metabolome data data(covid19) X <- covid19$X$metabolomics K <- covid19$K$metabolomics onekpca <- one_kpca(X,K)
# COVID-19 metabolome data data(covid19) X <- covid19$X$metabolomics K <- covid19$K$metabolomics onekpca <- one_kpca(X,K)
This function computes one-sided kernel principal component (one-sided KPC) loading from the result of the "one_kpca" function.
onekpca_loading(onekpca)
onekpca_loading(onekpca)
onekpca |
The following variables (P,T and U) are included in the onekpca object. P : A matrix with one-sided KPCA loading in each column T : A matrix with one-sided KPCA score for linear side in each column U : A matrix with one-sied KPCA score for nonlinear side in each column |
The one-sided KPC loading and the p-value by statistical hypothesis testing is added to the onekpca object returned by the "one_kpca" function.
The return value is a list object that contains the following elements:
R : one-sided KPC loading (Correlation coefficient between one-sided KPC score of nonlinear variable (kernel matrix) and each variables in data matrix.)
p.value : p-value of one-sided KPC loading
Hiroyuki Yamamoto
Yamamoto H. (2023) One-sided Kernel Principal Component Analysis, Jxiv, <doi:10.51094/jxiv.262>.
# COVID-19 metabolome data data(covid19) X <- covid19$X$metabolomics K <- covid19$K$metabolomics onekpca <- one_kpca(X,K) onekpca <- onekpca_loading(onekpca) onekpca$loading$R onekpca$loading$p.value
# COVID-19 metabolome data data(covid19) X <- covid19$X$metabolomics K <- covid19$K$metabolomics onekpca <- one_kpca(X,K) onekpca <- onekpca_loading(onekpca) onekpca$loading$R onekpca$loading$p.value
This function performs orthogonal smoothed principal component analysis (OS-PCA). In this function, data matrix is automatically scaled to zero mean and unit variance (i.e. autoscaling) for each variables.
os_pca(X,D,kappa,M)
os_pca(X,D,kappa,M)
X |
Data matrix that include variables in each columns. |
D |
Differential matrix. |
kappa |
The smoothing parameter (default : 0.999). |
M |
Averaging matrix for repeated data (default : Identity matrix). |
The kappa represents the degree of smoothing. The value of kappa increases, the strength of the smoothing increases.
The return value is a list object that contains the following elements:
P : A matrix with OS-PC loading in each column
T : A matrix with OS-PC score in each column
MT : A matrix with averaging OS-PC score for repeated data in each column (If not for repeated data, the matrix MT equals to the matrix T)
Q : A matrix with OS-PC loading for auxiliary variable in each column
U : A matrix with OS-PC score for auxiliary variable in each column
Hiroyuki Yamamoto
Yamamoto H., Nakayama Y., Tsugawa H. (2021) OS-PCA: Orthogonal Smoothed Principal Component Analysis Applied to Metabolome Data, Metabolites, 11(3):149.
# metabolic turnover data data(turnover) X <- turnover$X D <- turnover$D ospca <- os_pca(X,D,0.999) # metabolome data data(greentea) X <- greentea$X D <- greentea$D M <- greentea$M ospca <- os_pca(X,D,0.1,M)
# metabolic turnover data data(turnover) X <- turnover$X D <- turnover$D ospca <- os_pca(X,D,0.999) # metabolome data data(greentea) X <- greentea$X D <- greentea$D M <- greentea$M ospca <- os_pca(X,D,0.1,M)
This function computes orthogonals smoothed principal component (OS-PC) loading from the result of the "os_pca" function. This is also wrapper function of pls_loading function.
ospca_loading(ospca)
ospca_loading(ospca)
ospca |
The following variables (P,T,Q and U) are included in the ospca object. P : A matrix with OS-PC loading in each column T : A matrix with OS-PC score in each column MT : A matrix with averaging OS-PC score for repeated data in each column (If not for repeated data, the matrix MT equals to the matrix T) Q : A matrix with OS-PC loading for auxiliary variable in each column U : A matrix with OS-PC score for auxiliary variable in each column |
The OS-PC loading and the p-value by statistical hypothesis testing is added to the ospca object returned by the "os-pca" function.
The return value is a list object that contains the following elements:
R : OS-PCA loading (Correlation coefficient between OS-PC score of auxiliary variable and each variables in data matrix.)
p.value : p-value of OS-PCA loading
Hiroyuki Yamamoto
Yamamoto H., Nakayama Y., Tsugawa H. (2021) OS-PCA: Orthogonal Smoothed Principal Component Analysis Applied to Metabolome Data, Metabolites, 11(3):149.
# metabolic turnover data data(turnover) X <- turnover$X D <- turnover$D ospca <- os_pca(X,D,0.999) ospca <- ospca_loading(ospca) ospca$loading$R ospca$loading$p.value # metabolome data data(greentea) X <- greentea$X D <- greentea$D M <- greentea$M ospca <- os_pca(X,D,0.1,M) ospca <- ospca_loading(ospca) ospca$loading$R ospca$loading$p.value
# metabolic turnover data data(turnover) X <- turnover$X D <- turnover$D ospca <- os_pca(X,D,0.999) ospca <- ospca_loading(ospca) ospca$loading$R ospca$loading$p.value # metabolome data data(greentea) X <- greentea$X D <- greentea$D M <- greentea$M ospca <- os_pca(X,D,0.1,M) ospca <- ospca_loading(ospca) ospca$loading$R ospca$loading$p.value
This function computes principal component (PC) loading from the result of the "prcomp" function. (The "princomp" function is not supported. For "princomp" function, the "loadings" function in stats package should be used.) In this function, data matrix is should be scaled to zero mean and unit variance (i.e. autoscaling) for each variables.
pca_loading(pca)
pca_loading(pca)
pca |
The return object of the "prcomp" function. |
The PC loading and the p-value by statistical hypothesis testing is added to the pca object returned by the "prcomp" function.
The return value is a list object that contains the following elements:
R : PC loading (Correlation coefficient between PC score and each variables in data matrix.)
p.value : p-value of PC loading
Hiroyuki Yamamoto
Yamamoto H., Fujimori T., Sato H., Ishikawa G., Kami K., Ohashi Y. (2014). "Statistical hypothesis testing of factor loading in principal component analysis and its application to metabolite set enrichment analysis". BMC Bioinformatics, (2014) 15(1):51.
data(fasting) X <- fasting$X pca <- prcomp(X, scale=TRUE) pca <- pca_loading(pca) pca$loading$R # PC loading pca$loading$p.value # p-value
data(fasting) X <- fasting$X pca <- prcomp(X, scale=TRUE) pca <- pca_loading(pca) pca$loading$R # PC loading pca$loading$p.value # p-value
This function performs partial least squares discriminant analysis (PLS-DA). In this function, data matrix for explanatory variable is automatically scaled to zero mean and unit variance (i.e. autoscaling) for each variables.
pls_da(X,Y,k)
pls_da(X,Y,k)
X |
Data matrix of explanatory variables that include variables in each columns. |
Y |
Dummy matrix that include group information 0,1 in each columns. |
k |
Number of components. |
This function calculates PLS-DA. For PLS, use the 'pls_svd' function for PLS.
The return value is a list object that contains the following elements:
P: A matrix containing the PLS-DA loadings for each explanatory variable in the columns, before transformation.
T : A matrix with PLS-DA score for explanatory variable in each column
Hiroyuki Yamamoto
Yamamoto, H. et al., Dimensionality reduction for metabolome data using PCA, PLS, OPLS, and RFDA with differential penalties to latent variables", Chemom. Intell. Lab. Syst., 98 (2009)
data(whhl) X <- whhl$X$liver Y <- whhl$Y plsda <- pls_da(X,Y,2)
data(whhl) X <- whhl$X$liver Y <- whhl$Y plsda <- pls_da(X,Y,2)
This function computes partial least squares (PLS) loading from the result of the "pls_svd" function. PLS loading can also be computed from the result of the "pls_eigen" function in chemometrics package.
pls_loading(pls)
pls_loading(pls)
pls |
The following variables (P,T,Q and U) are included in the pls object. P : A matrix with PLS loading for explanatory variable in each column T : A matrix with PLS score for explanatory variable in each column Q : A matrix with PLS loading for response variable in each column U : A matrix with PLS score for response variable in each column |
The PLS loading and the p-value by statistical hypothesis testing is added to the pls object returned by the "pls_svd" function in this package or "pls_eigen" function in chemometrics package.
The return value is a list object that contains the following elements:
R : PLS loading (Correlation coefficient between PLS score of response variable and each variables in data matrix.)
p.value : p-value of PLS loading
Hiroyuki Yamamoto
Yamamoto, H. (2017) PLS-ROG: Partial least squares with rank order of groups., Journal of Chemometrics, 31(3) (2017) e2883.
data(whhl) X <- whhl$X$liver Y <- whhl$Y pls <- pls_svd(X,Y) pls <- pls_loading(pls) pls$loading$R pls$loading$p.value
data(whhl) X <- whhl$X$liver Y <- whhl$Y pls <- pls_svd(X,Y) pls <- pls_loading(pls) pls$loading$R pls$loading$p.value
This function performs partial least squares with rank order of groups (PLS-ROG). In this function, data matrix is automatically scaled to zero mean and unit variance (i.e. autoscaling) for each variables.
pls_rog(X,Y,D,kappa)
pls_rog(X,Y,D,kappa)
X |
Data matrix of explanatory variables that include variables in each columns. |
Y |
Dummy matrix that include group information 0,1 in each columns. |
D |
Differential matrix. |
kappa |
The smoothing parameter (default : kappa = 0.999). |
The kappa represents the degree of smoothing. The value of kappa increases, the strength of the smoothing increases.
The return value is a list object that contains the following elements:
P : A matrix with PLS loading for explanatory variable in each column
T : A matrix with PLS score for explanatory variable in each column
Q : A matrix with PLS loading for response variable in each column
U : A matrix with PLS score for response variable in each column
Hiroyuki Yamamoto
Yamamoto, H. (2017) PLS-ROG: Partial least squares with rank order of groups., Journal of Chemometrics, 31(3) (2017) e2883.
data(whhl) X <- whhl$X$liver Y <- whhl$Y D <- whhl$D plsrog <- pls_rog(X,Y,D)
data(whhl) X <- whhl$X$liver Y <- whhl$Y D <- whhl$D plsrog <- pls_rog(X,Y,D)
This function performs partial least squares. In this function, data matrix for explanatory variable is automatically scaled to zero mean and unit variance (i.e. autoscaling) for each variables.
pls_svd(X,Y)
pls_svd(X,Y)
X |
Data matrix of explanatory variables that include variables in each columns. |
Y |
Dummy matrix that include group information 0,1 in each columns. |
This function is wrapper function of "pls_rog" fucntion that the smoothing parameter kappa=0.
The return value is a list object that contains the following elements:
P : A matrix with PLS loading for explanatory variable in each column
T : A matrix with PLS score for explanatory variable in each column
Q : A matrix with PLS loading for response variable in each column
U : A matrix with PLS score for response variable in each column
Hiroyuki Yamamoto
Barker, M. and Rayens, W. (2003) Partial Least Squares for Discrimination. Journal of Chemometrics, 17, 166-173.
data(whhl) X <- whhl$X$liver Y <- whhl$Y pls <- pls_svd(X,Y)
data(whhl) X <- whhl$X$liver Y <- whhl$Y pls <- pls_svd(X,Y)
This function computes partial least squares discriminant analysis (PLS-DA) loading from the result of the 'pls_da' function.
plsda_loading(plsda)
plsda_loading(plsda)
plsda |
The following variables (P,T) are included in the plsda object. P: A matrix containing the PLS-DA loadings for each explanatory variable in the columns, before transformation. T : A matrix with PLS-DA score for explanatory variable in each column |
The PLS-DA loading and the p-value by statistical hypothesis testing is added to the pls object returned by the 'pls_da' function in this package.
The return value is a list object that contains the following elements:
R : PLS-DA loading (Correlation coefficient between PLS-DA score of explanatory variable and each variables in data matrix.)
p.value : p-value of PLS-DA loading
Hiroyuki Yamamoto
Yamamoto, H. et al., Dimensionality reduction for metabolome data using PCA, PLS, OPLS, and RFDA with differential penalties to latent variables", Chemom. Intell. Lab. Syst., 98 (2009)
data(whhl) X <- whhl$X$liver Y <- whhl$Y plsda <- pls_da(X,Y,2) plsda <- plsda_loading(plsda) plsda$loading$R plsda$loading$p.value
data(whhl) X <- whhl$X$liver Y <- whhl$Y plsda <- pls_da(X,Y,2) plsda <- plsda_loading(plsda) plsda$loading$R plsda$loading$p.value
This function computes partial least squares rank rder of groups (PLS-ROG) loading from the result of the "pls_rog" function. PLS-ROG loading can also be computed from the result of the "pls_rog" function.
plsrog_loading(plsrog)
plsrog_loading(plsrog)
plsrog |
The following variables (P,T,Q and U) are included in the plsrog object. P : A matrix with PLS-ROG loading for explanatory variable in each column T : A matrix with PLS-ROG score for explanatory variable in each column Q : A matrix with PLS-ROG loading for response variable in each column U : A matrix with PLS-ROG score for response variable in each column |
The PLS-ROG loading and the p-value by statistical hypothesis testing is added to the pls object returned by the "pls_rog" function in this package.
The return value is a list object that contains the following elements:
R : PLS-ROG loading (Correlation coefficient between PL-ROG score of response variable and each variables in data matrix.)
p.value : p-value of PLS-ROG loading
Hiroyuki Yamamoto
Yamamoto, H. (2017) PLS-ROG: Partial least squares with rank order of groups., Journal of Chemometrics, 31(3) (2017) e2883.
data(whhl) X <- whhl$X$liver Y <- whhl$Y D <- whhl$D plsrog <- pls_rog(X,Y,D) plsrog <- plsrog_loading(plsrog) plsrog$loading$R plsrog$loading$p.value
data(whhl) X <- whhl$X$liver Y <- whhl$Y D <- whhl$D plsrog <- pls_rog(X,Y,D) plsrog <- plsrog_loading(plsrog) plsrog$loading$R plsrog$loading$p.value
This function performs regularized canonical correlation analysis for discriminant analysis (RCCA-DA). In this function, data matrix for explanatory variable is automatically scaled to zero mean and unit variance (i.e. autoscaling) for each variables.
rcca_da(X,Y,lambda,k)
rcca_da(X,Y,lambda,k)
X |
Data matrix of explanatory variables that include variables in each columns. |
Y |
Dummy matrix that include group information 0,1 in each columns. |
lambda |
The regularized parameter has a value in the range [0, 1), meaning it can be 0 but is less than 1." |
k |
Number of components. |
RCCA-DA is equivalent to Regularized Fisher discriminant analysis, theoretically.
The return value is a list object that contains the following elements:
P: A matrix containing the RCCA-DA loadings for each explanatory variable in the columns, before transformation.
T : A matrix with RCCA-DA score for explanatory variable in each column
Hiroyuki Yamamoto
Yamamoto, H. et al., Canonical correlation analysis for multivariate regression and its application to metabolic fingerprinting", Biochem. Eng. Journal, 40 (2008) 199-204.
Yamamoto, H. et al., Dimensionality reduction for metabolome data using PCA, PLS, OPLS, and RFDA with differential penalties to latent variables", Chemom. Intell. Lab. Syst., 98 (2009)
data(whhl) X <- whhl$X$liver Y <- whhl$Y rcca <- rcca_da(X,Y,0.5,2)
data(whhl) X <- whhl$X$liver Y <- whhl$Y rcca <- rcca_da(X,Y,0.5,2)
This function computes regularized canonical correlation analysis for discriminant analysis (RCCA-DA) loading from the result of the 'rcca_da' function.
rcca_loading(rcca)
rcca_loading(rcca)
rcca |
The following variables (Wx,P,T,Wy and S) are included in the rcca object. Wx : A matrix with RCCA-DA weight for explanatory variable in each column P: A matrix containing the RCCA-DA loadings for each explanatory variable in the columns, before transformation. T : A matrix with RCCA-DA score for explanatory variable in each column Wy : A matrix with RCCA-DA weight for explanatory variable in each column S : A matrix with RCCA-DA score for response variable in each column |
The RCCA-DA loading and the p-value by statistical hypothesis testing is added to the pls object returned by the 'rcca_da' function in this package.
The return value is a list object that contains the following elements:
R : RCCA-DA loading (Correlation coefficient between PLS-DA score of explanatory variable and each variables in data matrix.)
p.value : p-value of RCCA-DA loading
Hiroyuki Yamamoto
Yamamoto, H. et al., Canonical correlation analysis for multivariate regression and its application to metabolic fingerprinting", Biochem. Eng. Journal, 40 (2008) 199-204.
data(whhl) X <- whhl$X$liver Y <- whhl$Y rcca <- rcca_da(X,Y,0.5,2) rcca <- rcca_loading(rcca) rcca$loading$R rcca$loading$p.value
data(whhl) X <- whhl$X$liver Y <- whhl$Y rcca <- rcca_da(X,Y,0.5,2) rcca <- rcca_loading(rcca) rcca$loading$R rcca$loading$p.value
This includes differential isotopomer ratio from metabolomic turnover analysis of yeast cultures by gas chromatgraphy mass spectrometry. The data matrix contains 11 samping time for three groups of different yeast strains and culture condition (33 samples total) and 60 peaks.
data(turnover)
data(turnover)
The list object "turnover" contains the following elements:
X : Data matrix of explanatory variables that include variables in each columns.
Y : Dummy matrix that include group information 0,1 in each columns.
D : Differential matrix (Second order).
annotation : Compound annotation for metabolome data.
Nakayama Y., Tamada Y., Tsugawa H., Bamba T., Fukusaki E. Novel Strategy for Non-Targeted Isotope-Assisted Metabolomics by Means of Metabolic Turnover and Multivariate Analysis. Metabolites. 2014;4:722-739.
data(turnover)
data(turnover)
This function performs Unsupervised multiset partial least squares. In this function, data matrix is automatically scaled to zero mean and unit variance (i.e. autoscaling) for each variables.
unsv_multipls(X,tau)
unsv_multipls(X,tau)
X |
List of data matrix that include variables in each columns. |
tau |
Matrix for strength parameter of the connection between omics datasets or between omics dataset and group information. |
Diagonal elements of matrix tau must be 0.
The return value is a list object that contains the following elements:
P : A list of matrix with Multiset PLS coefficients in each column for each dataset
T : A list of matrix with Multiset PLS scores in each column for each dataset
Hiroyuki Yamamoto
Yamamoto H. (2022) Multiset partial least squares with rank order of groups for integrating multi-omics data, bioRxiv.
data(whhl) X <- whhl$X tau <- rbind(c(0,1/4,1/4,1/4),c(1/4,0,1/4,1/4),c(1/4,1/4,0,1/4),c(1/4,1/4,1/4,0)) unsvmpls <- unsv_multipls(X,tau)
data(whhl) X <- whhl$X tau <- rbind(c(0,1/4,1/4,1/4),c(1/4,0,1/4,1/4),c(1/4,1/4,0,1/4),c(1/4,1/4,1/4,0)) unsvmpls <- unsv_multipls(X,tau)
This function computes unsupervised multiset partial least squares loading from the result of the "unsv_multipls" function.
unsvmultipls_loading(unsvmpls)
unsvmultipls_loading(unsvmpls)
unsvmpls |
The following variables (P,T) are included in the unsvmpls object. P : A list of matrix with Multiset PLS coefficients in each column for each dataset T : A list of matrix with Multiset PLS scores in each column for each dataset |
The unsupervised multiset PLS loading and the p-value by statistical hypothesis testing is added to the unsvmpls object returned by the "unsv_multipls" function in this package.
The return value is a list object that contains the following elements:
Score : A list of matrix with Multiset PLS score for auxiliary variable in each column for each dataset
R : A list of matrix with Unsupervised multiset PLS loadings in each column for each dataset
p.value : A list of matrix with p-value of Unsupervised multiset PLS loading in each column for each dataset
Hiroyuki Yamamoto
Yamamoto H. (2022) Multiset partial least squares with rank order of groups for integrating multi-omics data, bioRxiv.
data(whhl) X <- whhl$X tau <- rbind(c(0,1/4,1/4,1/4),c(1/4,0,1/4,1/4),c(1/4,1/4,0,1/4),c(1/4,1/4,1/4,0)) unsvmpls <- unsv_multipls(X,tau) unsvmpls <- unsvmultipls_loading(unsvmpls) unsvmpls$loading$R unsvmpls$loading$p.value
data(whhl) X <- whhl$X tau <- rbind(c(0,1/4,1/4,1/4),c(1/4,0,1/4,1/4),c(1/4,1/4,0,1/4),c(1/4,1/4,1/4,0)) unsvmpls <- unsv_multipls(X,tau) unsvmpls <- unsvmultipls_loading(unsvmpls) unsvmpls$loading$R unsvmpls$loading$p.value
This is metabolome data from plasma, liver, heart and brain samples of normal, hyperlipidemic, and statin-treated rabbits measured by capillary electrophoresis time-of-flight mass spectrometry. The data matrix contains 9 samples (3 samples in each groups).
data(whhl)
data(whhl)
The list object "whhl" contains the following elements:
X : List of data matrix that include samples in each rows and metabolites in each columns for plasma, liver, heart and brain samples.
Y : Dummy matrix that include group information 0,1 in each columns.
D : Example of differential matrix used in PLS-ROG or multiset PLS-ROG.
tau : Example of strength parameter matrix used in mutliset PLS or multiset PLS-ROG. All weights were set as the same value of 0.1.
Ooga, T., Sato, H., Nagashima, A., Sasaki, K., Tomita, M., Soga T. and Ohashi, Y. (2011) Metabolomic Anatomy of Animal Model Revealing Homeostatic Imbalances in Dyslipidemia. Y. Mol. BioSyst.,7, 1217-23.