Excludemissing: Difference between revisions

From Eigenvector Research Documentation Wiki
imported>Jeremy
(Importing text file)
imported>Bob
No edit summary
 
Latest revision as of 06:38, 9 October 2008

Purpose

Automatically exclude elements of a matrix that contain too much missing data.

Synopsis

[newx,bad] = excludemissing(x,threshold)

Description

Excludes rows, columns, or n-dimensional elements of the input x that contain too much missing data. The input threshold sets the limit, expressed as the allowed fraction of missing data. If threshold is omitted, it is set to the default max_missing value of the function MDCHECK (typically 0.40).

The outputs are newx, a dataset object with the offending elements excluded, and bad, a cell holding the indices of the bad elements for each mode of the data.
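
The thresholding idea described above can be sketched in base MATLAB for the two-way case. This is an illustration only, not the source of excludemissing. It assumes that missing values are stored as NaN, that an element is flagged when its missing fraction is strictly greater than the threshold, and that rows and columns are evaluated independently on the original matrix. The variable names miss, fracRow, and fracCol are hypothetical.

```matlab
% Illustrative sketch only (not the excludemissing implementation).
% Assumptions: missing values are NaN, and an element is "bad" when its
% fraction of missing entries is strictly greater than the threshold.
x = [  1 NaN   3   4;
     NaN NaN NaN   8;
       9  10  11  12];
threshold = 0.40;                      % allowed fraction of missing data

miss    = isnan(x);                    % logical map of missing entries
fracRow = sum(miss,2) / size(x,2);     % fraction missing in each row
fracCol = sum(miss,1) / size(x,1);     % fraction missing in each column

bad    = cell(1,2);                    % one cell per mode of x
bad{1} = find(fracRow > threshold)';   % indices of bad rows
bad{2} = find(fracCol > threshold);    % indices of bad columns
```

Here row 2 is missing 3 of 4 values and column 2 is missing 2 of 3 values, so both exceed the 0.40 threshold, while every other row and column stays at or below it.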

Inputs

  • x = input data set.

Optional Inputs

  • threshold = fraction of allowed missing data; if omitted, the default value for max_missing in the MDCHECK function will be used (typically 0.40).

Outputs

  • newx = dataset object with excluded elements.
  • bad = cell array that holds the indices of the bad elements for each mode of x (one cell per mode).
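
A call following the synopsis might look like the sketch below. It assumes that missing values are marked with NaN and that excludemissing accepts a plain numeric matrix; if it does not, the matrix would first need to be wrapped in a dataset object. The guard around the call is only there so the snippet does not fail when the function is not on the MATLAB path.

```matlab
% Usage sketch. Assumptions: NaN marks missing data, and a plain numeric
% matrix is an acceptable input x (otherwise wrap it in a dataset object).
x = magic(5);
x(2,:) = NaN;                  % row 2 is entirely missing
x(4,1) = NaN;                  % a single missing value in row 4

if exist('excludemissing','file') == 2
    [newx,bad] = excludemissing(x,0.40);   % signature from the Synopsis
    disp(bad{1});              % bad elements along mode 1 (rows)
    disp(bad{2});              % bad elements along mode 2 (columns)
else
    disp('excludemissing is not on the path; call skipped.');
end
```

Calling excludemissing(x) without the second argument would use the default threshold taken from MDCHECK, as described under Optional Inputs.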

See Also

mdcheck, replace