https://github.com/mlesnoff/rchemo
- pcaeigen Eigen decomposition
- pcaeigenk Eigen for wide matrices (kernel form)
- pcasvd SVD decomposition
- pcanipals Nipals
- pcanipalsna Nipals allowing missing data
- kpca Non Linear Kernel PCA (KPCA) (Scholkopf et al. 2002)
- scordis Score distances (SD) for a score (T) space
- odis Orthogonal distances (OD) for a score (T) space
- xfit Matrix fitting from score (T) and loadings (P) matrices
- lmr Multiple linear regression
- cglsr: CGLSR algorithm for ill-conditionned systems (Björck 1996)
- plskernel "Improved Kernel #1" (Dayal & McGregor 1997)
- plsnipals Nipals
- plsrannar Kernel version for wide matrices (Rannar et al. 1994)
- kplsr Non linear kernel PLSR (KPLSR) (Rosipal & Trejo 2001)
- dkplsr Direct KPLSR (Bennett & Embrechts 2003)
Auxiliary
- dfplsr_cg, dfplsr_cov, dfplsr_div: Model complexity estiamtion for PLSR
- aicplsr: Cp and AIC for PLSR models
- rr Ridge Regression (RR)
- krr Non linear kernel ridge regression (KRR) = Least-square SVM regression (LS-SVMR)
- dkrr Direct KRR
- svmr SVM regression (SVMR)
- knnr KNN regression (KNNR)
- lwplsr KNN Locally weighted PLSR (KNN-LWPLSR)
Averaging PLSR models with different numbers of LVs
- plsr_agg PLSR-AGG
- lwplsr_agg KNN-LWPLSR-AGG
- fda Eigen decomposition of the compromise "inter/intra"
- fdasvd Weighted SVD decomposition of the class centers
- lmrda DA on LMR prediction (LMR-DA)
- plsrda DA on PLSR prediction (PLSR-DA = common "PLSDA")
- kplsrda DA on KPLSR prediction (KPLSR-DA)
- rrda DA on RR prediction (RR-DA)
- krrda DA on KRR prediction (KRR-DA)
- lda Linear discriminant analysis (LDA)
- qda Quadratic discriminant analysis (QDA)
- plslda LDA on PLS latent variables (LVs) (PLS-LDA)
- plsqda QDA on PLS LVs (PLS-QDA)
- svmda SVMDA (= SVMC)
- knnda KNN-DA
- lwplsrda KNN Locally weighted PLSR-DA (KNN-LWPLSR-DA)
- lwplslda KNN Locally weighted PLS-LDA/QDA (KNN-LWPLS-LDA/QDA)
Averaging PLSR models with different numbers of LVs
- plsrda_agg PLSRDA-AGG
- lwplsrda_agg KNN-LWPLSR-DA-AGG
- lwplslda_agg KNN-LWPLS-LDA-AGG
- lwplslda_agg KNN-LWPLS-QDA-AGG
- gridscore Any model
- gridscorelv Models with LVs (faster)
- gridscorelb Models with ridge parameter (faster)
- gridcv Any model
- gridcvlv Models with LVs (faster)
- gridcvlb Models with ridge parameter (faster)
Auxiliary
- segmkf Building segments for K-fold CV
- segmts Building segments for test-set CV
- msep MSEP
- rmsep RMSEP
- sep SEP
- bias Bias
- r2 R2
- cor2 Squared correlation
- rpd, rpdr Ratio of performance to deviation
- mse Summary for regression
- err Classification error rate
- selwold Wold's criterion for models with LVs
- covsel COVSEL algorithm (Roger et al. 2011)
- detrend Polynomial detrend transformation
- snv Standard-normal-deviation transformation
- mavg Smoothing by moving average
- savgol Savitsky-Golay filtering (derivation)
- dderiv Derivation by finite difference
- xinterp Resampling of spectra by interpolation methods
- *rmgap Remove vertical gaps in spectra (e.g. for ASD)
- eposvd Pre-processing data by external parameter orthogonalization (EPO; Roger et al 2003)
- sampks Kennard-Stone sampling
- sampdp Duplex sampling
- sampcla Within-class stratified sampling
- checkna Find and count NA values in a data set
- plotxna Plotting missing data in a matrix
- checkdupl Find duplicated row observations between two data sets
- rmdupl Remove duplicated row observations between two data sets
- aggmean Centers of classes
- dtagg Summary statistics with data subsets
- summ Summary of the quantitative variables of a data set
- mblocks Makes a list of blocks
- hconcat Horizontal block concatenation
- blockscal Block autoscaling
- *asdgap ASD spectra with vertical gaps
- cassav Tropical shrubs
- forages Tropical forages
- octane Gazoline "octane" dataset
- ozone Los Angeles "ozone" pollution (1976) dataset
- plotsp Plotting spectra, loadings, or more generally row observations of a data set
- plostsp1 Same as plotsp but one-by-one row
- plotxy 2-d scatter plot
- plotjit Jittered plot
- plotscore Plotting error rates of prediction models
- dmnorm Multivariate normal probability density
- dummy Dummy table
- euclsq, *euclsq_mu Euclidean distance matrices
- mahsq, *mahsq_mu Mahalanobis distance matrices
- getknn KNN selection
- krbf, kpol, ktanh Gram matrices for different kernels
- headm Print the first part of a matrix or data frame
- locw Working function for locally weighted models
- matB, matW Between and within covariance matrices
- pinv Moore-Penrose pseudo-inverse
- sourcedir Source every R functions in a directory
- wdist Weights for distances
- Additional working functions in file zfunctions.R
Matthieu Lesnoff
-
Cirad, UMR Selmet, Montpellier, France
-
ChemHouse, Montpellier
Lesnoff, M. 2021. R package rchemo: Dimension reduction, Regression and Discrimination for Chemometrics. https://github.com/mlesnoff/rchemo. CIRAD, UMR SELMET, Montpellier, France