Data Editing Scaling and Preprocessing: Difference between revisions

From Eigenvector Research Documentation Wiki
Jump to navigation Jump to search
imported>Jeremy
(Importing text file)
 
No edit summary
 
(16 intermediate revisions by 2 users not shown)
Line 1: Line 1:
===Data Editing, Scaling, and Preprocessing===
:[[alignmat]] - Alignment of matrices and N-way arrays.
:[[preprocess]] - Selection and application of standard preprocessing structures.
:[[alignpeaks]] - Calibrates wavelength scale using standard peaks.
:[[preprouser]] - User-defined preprocessing methods.
:[[alignspectra]] - Calibrates wavelength scale using standard spectrum.
:[[editds]] - Editor for DataSet Objects.
:[[arithmetic]] - Apply simple arithmetic operations to all or part of dataset.
:[[asinhx]] - Arcsinh transform.
:[[asinsqrt]] - Arcsin square root transformation.
====Baselining and Offset Correction====
:[[auto]] - Autoscales matrix to mean zero unit variance.
:[[baseline]] - Subtracts a polynomial baseline offset from spectra.
:[[baseline]] - Subtracts a polynomial baseline offset from spectra.
:[[baselineds]] - Wrapper for baselining functions (wlsbaseline, baseline, whittaker, datafit).
:[[baselinew]] - Baseline using windowed polynomial filter.
:[[baselinew]] - Baseline using windowed polynomial filter.
:[[wlsbaseline]] - Weighted least squares baseline function.
:[[batchalign]] - Convert data columns based on matching ref col to target vector.
:[[mscorr]] - Multiplicative scatter/signal correction (MSC).
:[[batchdigester]] - Parse wafer or batch data into MPCA or Summary PCA form.
:[[savgol]] - Savitzky-Golay smoothing and differentiation.
:[[batchfold]] - Transform batch data into dataset for analysis.
:[[batchmaturity]] - Batch process model and monitoring.
:[[classcenter]] - Centers classes in data to the mean of each class.
:[[coadd]] - Reduce resolution through combination of adjacent variables or samples.
 
:[[datafit_engine]] - Asymmetric least squares with smoothing, baselining & robust fitting.
 
:[[delsamps]] - Deletes samples (rows) or variables (columns) from data matrices.
:[[deresolv]] - Changes high resolution spectra to low resolution.
:[[editds]] - Editor for DataSet Objects.
:[[excludemissing]] - Automatically exclude too-much missing data in a matrix.
:[[glog]] - Generalized log transform.
:[[glsw]] - Generalized least-squares weighting/preprocessing.
:[[gscale]] - Group/block scaling for a single or multiple blocks.
:[[gscaler]] - Applies group/block scaling to submatrices of a single matrix.
:[[lamsel]] - Determines indices of wavelength axes in specified ranges.
:[[logdecay]] - Mean centers and variance scales a matrix using the log decay of the variable axis.
:[[lsq2top]] - Fits a polynomial to the top/(bottom) of data.
:[[lsq2top]] - Fits a polynomial to the top/(bottom) of data.
:[[mdcheck]] - Missing Data Checker and infiller.
:[[med2top]] - Fits a constant to top/(bottom) of data.
:[[med2top]] - Fits a constant to top/(bottom) of data.
====Derivatives and Smoothing====
:[[medcn]] - Median center scales matrix to median zero.
:[[savgol]] - Savitzky-Golay smoothing and differentiation.
:[[minmax]] - Scale rows or columns of a matrix to range from 0 to 1.  
:[[polyinterp]] - Polynomial interpolation, smoothing, and differentiation.
:[[mncn]] - Scale matrix to mean zero.
====Sample-wise scaling corrections (Normalizing)====
:[[mscorr]] - Multiplicative scatter/signal correction (MSC).
:[[mscorr]] - Multiplicative scatter/signal correction (MSC).
:[[normaliz]] - Normalize rows of matrix.
:[[normaliz]] - Normalize rows of matrix.
:[[snv]] - Standard normal variate scaling.
:[[npreprocess]] - Preprocessing of multi-way arrays.
====Variable-wise scaling and offset corrections====
:[[oscapp]] - Applies OSC model to new data.
:[[auto]] - Autoscales matrix to mean zero unit variance.
:[[osccalc]] - Calculates orthogonal signal correction (OSC).
:[[mncn]] - Scale matrix to mean zero.
:[[poissonscale]] - Perform Poisson scaling with scaling offset.
:[[medcn]] - Median center scales matrix to median zero.
:[[polyinterp]] - Polynomial interpolation, smoothing, and differentiation.
:[[logdecay]] - Mean centers and variance scales a matrix using the log decay of the variable axis.
:[[pqnorm]] - Probabilistic Quotient Normalization for rows of a matrix.
:[[pr_entropy]] - Pattern recognition entropy (PRE), Shannon entropy, transform.
:[[preprocess]] - Selection and application of standard preprocessing structures.
:[[preprocessiterator]] - Create array of preprocessing combinations.
:[[preprouser]] - User-defined preprocessing methods.
:[[registerspec]] - Shift spectra based on expected peak locations.
:[[rescale]] - Scales data back to original scaling.
:[[savgol]] - Savitzky-Golay smoothing and differentiation.
:[[savgolcv]] - Cross-validation for Savitzky-Golay smoothing and differentiation.
:[[scale]] - Scales data using specified means and std. devs.
:[[scale]] - Scales data using specified means and std. devs.
:[[rescale]] - Scales data back to original scaling.
:[[gscale]] - Group/block scaling for a single or multiple blocks.
:[[gscaler]] - Applies group/block scaling to submatrices of a single matrix.
====Interference Removal and Missing Data====
:[[glsw]] - Generalized least-squares weighting/preprocessing.
:[[mdcheck]] - Missing Data Checker and infiller.
:[[osccalc]] - Calculates orthogonal signal correction (OSC).
====Sample and Variable handling====
:[[coadd]] - Reduce resolution through combination of adjacent variables or samples.
:[[delsamps]] - Deletes samples (rows) or variables (columns) from data matrices.
:[[excludemissing]] - Automatically exclude too-much missing data in a matrix.
:[[lamsel]] - Determines indices of wavelength axes in specified ranges.
:[[shuffle]] - Randomly re-orders matrix and multiple blocks rows.
:[[shuffle]] - Randomly re-orders matrix and multiple blocks rows.
:[[snv]] - Standard normal variate scaling.
:[[specedit]] - GUI for selecting spectral regions on a plot.
:[[specedit]] - GUI for selecting spectral regions on a plot.
:[[super_reduce]] - Eliminates highly correlated variables.
:[[unfoldm]] - Rearranges (unfolds) an augmented matrix to row vectors.
:[[unfoldm]] - Rearranges (unfolds) an augmented matrix to row vectors.
:[[unfoldmw]] - Unfolds multiway arrays along specified order.
:[[unfoldmw]] - Unfolds multiway arrays along specified order.
:[[batchdigester]] - Parse wafer or batch data into MPCA or Summary PCA form.
:[[windowfilter]] - Spectral filtering.
:[[wlsbaseline]] - Weighted least squares baseline function.
(Sub topic of [[Categorical_Index|Categorical_Index]])

Latest revision as of 10:47, 14 October 2022

alignmat - Alignment of matrices and N-way arrays.
alignpeaks - Calibrates wavelength scale using standard peaks.
alignspectra - Calibrates wavelength scale using standard spectrum.
arithmetic - Apply simple arithmetic operations to all or part of dataset.
asinhx - Arcsinh transform.
asinsqrt - Arcsin square root transformation.
auto - Autoscales matrix to mean zero unit variance.
baseline - Subtracts a polynomial baseline offset from spectra.
baselineds - Wrapper for baselining functions (wlsbaseline, baseline, whittaker, datafit).
baselinew - Baseline using windowed polynomial filter.
batchalign - Convert data columns based on matching ref col to target vector.
batchdigester - Parse wafer or batch data into MPCA or Summary PCA form.
batchfold - Transform batch data into dataset for analysis.
batchmaturity - Batch process model and monitoring.
classcenter - Centers classes in data to the mean of each class.
coadd - Reduce resolution through combination of adjacent variables or samples.
datafit_engine - Asymmetric least squares with smoothing, baselining & robust fitting.
delsamps - Deletes samples (rows) or variables (columns) from data matrices.
deresolv - Changes high resolution spectra to low resolution.
editds - Editor for DataSet Objects.
excludemissing - Automatically exclude too-much missing data in a matrix.
glog - Generalized log transform.
glsw - Generalized least-squares weighting/preprocessing.
gscale - Group/block scaling for a single or multiple blocks.
gscaler - Applies group/block scaling to submatrices of a single matrix.
lamsel - Determines indices of wavelength axes in specified ranges.
logdecay - Mean centers and variance scales a matrix using the log decay of the variable axis.
lsq2top - Fits a polynomial to the top/(bottom) of data.
mdcheck - Missing Data Checker and infiller.
med2top - Fits a constant to top/(bottom) of data.
medcn - Median center scales matrix to median zero.
minmax - Scale rows or columns of a matrix to range from 0 to 1.
mncn - Scale matrix to mean zero.
mscorr - Multiplicative scatter/signal correction (MSC).
normaliz - Normalize rows of matrix.
npreprocess - Preprocessing of multi-way arrays.
oscapp - Applies OSC model to new data.
osccalc - Calculates orthogonal signal correction (OSC).
poissonscale - Perform Poisson scaling with scaling offset.
polyinterp - Polynomial interpolation, smoothing, and differentiation.
pqnorm - Probabilistic Quotient Normalization for rows of a matrix.
pr_entropy - Pattern recognition entropy (PRE), Shannon entropy, transform.
preprocess - Selection and application of standard preprocessing structures.
preprocessiterator - Create array of preprocessing combinations.
preprouser - User-defined preprocessing methods.
registerspec - Shift spectra based on expected peak locations.
rescale - Scales data back to original scaling.
savgol - Savitzky-Golay smoothing and differentiation.
savgolcv - Cross-validation for Savitzky-Golay smoothing and differentiation.
scale - Scales data using specified means and std. devs.
shuffle - Randomly re-orders matrix and multiple blocks rows.
snv - Standard normal variate scaling.
specedit - GUI for selecting spectral regions on a plot.
super_reduce - Eliminates highly correlated variables.
unfoldm - Rearranges (unfolds) an augmented matrix to row vectors.
unfoldmw - Unfolds multiway arrays along specified order.
windowfilter - Spectral filtering.
wlsbaseline - Weighted least squares baseline function.

(Sub topic of Categorical_Index)