Data Editing, Scaling, and Preprocessing

From Eigenvector Research Documentation Wiki
Revision as of 08:57, 2 September 2008 by imported>Jeremy (Importing text file)

preprocess - Selection and application of standard preprocessing structures.
preprouser - User-defined preprocessing methods.
editds - Editor for DataSet Objects.


Baselining and Offset Correction

baseline - Subtracts a polynomial baseline offset from spectra.
baselinew - Baseline using windowed polynomial filter.
wlsbaseline - Weighted least squares baseline function.
mscorr - Multiplicative scatter/signal correction (MSC).
savgol - Savitzky-Golay smoothing and differentiation.
lsq2top - Fits a polynomial to the top (or bottom) of data.
med2top - Fits a constant to the top (or bottom) of data.
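
As a rough illustration of the polynomial baseline subtraction listed above, the sketch below fits a first-order polynomial (a straight line) by least squares to user-chosen baseline points and subtracts it from the whole spectrum. It is plain Python, not the toolbox code; the function name `subtract_linear_baseline` and its arguments are hypothetical, and the choice of a first-order fit is an assumption made to keep the example short.

```python
def subtract_linear_baseline(axis, spectrum, baseline_idx):
    """Fit y = intercept + slope * x on the baseline points, then subtract it.

    Hypothetical helper for illustration only (not a toolbox function).
    """
    xs = [axis[i] for i in baseline_idx]
    ys = [spectrum[i] for i in baseline_idx]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    # Closed-form least-squares estimates for a straight line
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    # Remove the fitted baseline from every point of the spectrum
    return [y - (intercept + slope * x) for x, y in zip(axis, spectrum)]


axis = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
# Synthetic data: a peak centered at index 3 on a sloped baseline 2 + 0.5 * x
peak = [0.0, 0.0, 1.0, 4.0, 1.0, 0.0, 0.0]
spectrum = [2.0 + 0.5 * x + p for x, p in zip(axis, peak)]

# Points 0, 1, 5, and 6 contain no peak signal, so they define the baseline
corrected = subtract_linear_baseline(axis, spectrum, baseline_idx=[0, 1, 5, 6])
```

Because the chosen baseline points lie exactly on the synthetic baseline, `corrected` recovers the peak shape up to floating-point error. With real spectra the result depends strongly on which points are treated as baseline.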

Derivatives and Smoothing

savgol - Savitzky-Golay smoothing and differentiation.
polyinterp - Polynomial interpolation, smoothing, and differentiation.
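
To make the idea behind Savitzky-Golay filtering concrete, the sketch below applies the standard 5-point, second-order coefficients for smoothing and for the first derivative. It is plain Python, not the toolbox implementation; the helper name `apply_window` is hypothetical, unit spacing between points is assumed, and the two points at each end are simply dropped rather than handled with an edge rule.

```python
# Standard 5-point, second-order Savitzky-Golay coefficients
SMOOTH_5 = [-3 / 35, 12 / 35, 17 / 35, 12 / 35, -3 / 35]
DERIV_5 = [-2 / 10, -1 / 10, 0.0, 1 / 10, 2 / 10]  # assumes unit point spacing


def apply_window(y, coeffs):
    """Apply a centered window of coefficients; returns interior points only.

    Hypothetical helper for illustration only (not a toolbox function).
    """
    half = len(coeffs) // 2
    return [
        sum(c * y[i + k - half] for k, c in enumerate(coeffs))
        for i in range(half, len(y) - half)
    ]


# A quadratic signal y = x^2 sampled at x = 0..6
y = [float(x * x) for x in range(7)]

smoothed = apply_window(y, SMOOTH_5)    # values at x = 2, 3, 4
derivative = apply_window(y, DERIV_5)   # dy/dx at x = 2, 3, 4
```

A second-order filter reproduces a quadratic exactly, so here `smoothed` matches x^2 and `derivative` matches 2x at the interior points, up to floating-point error.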

Sample-wise Scaling Corrections (Normalizing)

mscorr - Multiplicative scatter/signal correction (MSC).
normaliz - Normalize rows of matrix.
snv - Standard normal variate scaling.
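
The sample-wise corrections above all operate on one row (sample) at a time. The sketch below shows two of them in plain Python: standard normal variate scaling, which centers each row and divides by its standard deviation, and normalization of each row to unit length. The function names are hypothetical, and the use of the sample standard deviation (n - 1) and of the 2-norm are assumptions rather than statements about the toolbox defaults.

```python
import math
import statistics


def snv_row(row):
    """Center a row to mean zero and scale it to unit standard deviation."""
    m = statistics.mean(row)
    s = statistics.stdev(row)  # sample standard deviation (n - 1), an assumption
    return [(v - m) / s for v in row]


def unit_length_row(row):
    """Scale a row so that its 2-norm (Euclidean length) equals one."""
    length = math.sqrt(sum(v * v for v in row))
    return [v / length for v in row]


# Two samples with the same shape but different overall intensity
spectra = [[1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0]]

snv_spectra = [snv_row(r) for r in spectra]
unit_spectra = [unit_length_row(r) for r in spectra]
```

Since the second sample is just the first multiplied by two, both corrections map the two rows to the same values, which is the purpose of this family of methods: removing sample-to-sample intensity differences.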

Variable-wise Scaling and Offset Corrections

auto - Autoscales matrix to mean zero and unit variance.
mncn - Mean centers matrix (scales columns to mean zero).
medcn - Median centers matrix (scales columns to median zero).
logdecay - Mean centers and variance scales a matrix using the log decay of the variable axis.
scale - Scales data using specified means and std. devs.
rescale - Scales data back to original scaling.
gscale - Group/block scaling for a single or multiple blocks.
gscaler - Applies group/block scaling to submatrices of a single matrix.
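
The variable-wise methods above work column by column, and the stored means and standard deviations can be reused to scale new data or to undo the scaling. The sketch below illustrates that round trip in plain Python. The function names `autoscale_columns`, `scale_with`, and `rescale_with` are hypothetical and are not the toolbox signatures; the sample standard deviation (n - 1) is assumed.

```python
import statistics


def autoscale_columns(X):
    """Scale each column to mean zero and unit variance; also return the stats."""
    cols = list(zip(*X))
    means = [statistics.mean(c) for c in cols]
    stds = [statistics.stdev(c) for c in cols]  # sample std (n - 1), an assumption
    return scale_with(X, means, stds), means, stds


def scale_with(X, means, stds):
    """Apply previously computed means and standard deviations to a matrix."""
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in X]


def rescale_with(Xs, means, stds):
    """Undo the scaling, returning data in its original units."""
    return [[v * s + m for v, m, s in zip(row, means, stds)] for row in Xs]


X = [[1.0, 10.0], [2.0, 30.0], [3.0, 50.0]]

X_auto, means, stds = autoscale_columns(X)       # calibration data
X_new = scale_with([[4.0, 70.0]], means, stds)   # new sample, same scaling
X_back = rescale_with(X_auto, means, stds)       # back to original units
```

Keeping the calibration means and standard deviations separate from the scaled data is what allows new samples to be treated identically to the calibration set.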

Interference Removal and Missing Data

glsw - Generalized least-squares weighting/preprocessing.
mdcheck - Missing data checker and infiller.
osccalc - Calculates orthogonal signal correction (OSC).

Sample and Variable Handling

coadd - Reduce resolution through combination of adjacent variables or samples.
delsamps - Deletes samples (rows) or variables (columns) from data matrices.
excludemissing - Automatically excludes samples or variables with too much missing data from a matrix.
lamsel - Determines indices of wavelength axes in specified ranges.
shuffle - Randomly reorders the rows of a matrix and of multiple blocks.
specedit - GUI for selecting spectral regions on a plot.
unfoldm - Rearranges (unfolds) an augmented matrix to row vectors.
unfoldmw - Unfolds multiway arrays along specified order.
batchdigester - Parse wafer or batch data into MPCA or Summary PCA form.
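
Unfolding, as used by the functions above, turns data that holds a small matrix per sample into one row vector per sample. The sketch below shows the idea in plain Python for an augmented matrix in which the per-sample matrices are stacked vertically. The function name `unfold_stacked` and its `rows_per_sample` argument are hypothetical, and the element ordering within each output row (row by row) is an assumption; implementations differ on this convention.

```python
def unfold_stacked(stacked, rows_per_sample):
    """Turn vertically stacked per-sample matrices into one row per sample.

    Hypothetical helper for illustration only (not a toolbox function).
    """
    if len(stacked) % rows_per_sample != 0:
        raise ValueError("row count must be a multiple of rows_per_sample")
    unfolded = []
    for start in range(0, len(stacked), rows_per_sample):
        block = stacked[start:start + rows_per_sample]
        # Concatenate the rows of this sample's matrix into a single row vector
        unfolded.append([value for row in block for value in row])
    return unfolded


# Two samples, each a 2 x 2 matrix, stacked into a 4 x 2 augmented matrix
stacked = [
    [1, 2],
    [3, 4],
    [5, 6],
    [7, 8],
]

unfolded = unfold_stacked(stacked, rows_per_sample=2)
```

The result has one row per sample and `rows_per_sample * columns` variables, which is the layout that two-way methods such as PCA expect.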