Using the DataSet Include Field: Difference between revisions

From Eigenvector Research Documentation Wiki
Jump to navigation Jump to search
imported>Jeremy
No edit summary
imported>Jeremy
 
(2 intermediate revisions by the same user not shown)
Line 1: Line 1:
The DataSet "Include" property allows portions of the data to be ignored ("excluded") from analysis. This can be used on samples (rows of the data) or variables (columns of the data). Below are step-by-step directions for excluding variables from analysis using a plot of the data. Similar steps can be used to exclude samples.
The DataSet "Include" property allows portions of the data to be ignored ("excluded") from analysis. This can be used on samples (rows of the data) or variables (columns of the data). Any samples or variables you have excluded will NOT be used in any analyses.


To select only a portion of variables to analyze (e.g. a subset of the wavelength range of the calibration spectra) in the Analysis window, do the following:
There are several ways to edit the Include field. Each has different advantages:
 
==Using a Plot==
 
===Graphically Selecting and Excluding===
 
These step-by-step directions explain how to use a plot of the data to excluding variables from analysis. Similar steps can be used to exclude samples.


# Click on the "X" button in the Analysis Status Pane and select "Plot" from the context menu. This will bring up a plot of your data along with a "[[Plot Controls]]" window.
# Click on the "X" button in the Analysis Status Pane and select "Plot" from the context menu. This will bring up a plot of your data along with a "[[Plot Controls]]" window.
Line 8: Line 14:
# Using the "Select" button in the Plot Controls, select the range of variables you want to include in the analysis.
# Using the "Select" button in the Plot Controls, select the range of variables you want to include in the analysis.
#:'''HINT:''' if you want more than one non-consecutive range, select one range first, then, click "select" again and hold down the "shift" key before selecting the next range.
#:'''HINT:''' if you want more than one non-consecutive range, select one range first, then, click "select" again and hold down the "shift" key before selecting the next range.
# From the "Edit" menu on the Plot Controls, select "Exclude Unselected". This will exclude all of the points you have NOT selected.
# From the "Edit" menu on the Plot Controls, select "Include Only Selected". This will exclude all of the points you have NOT selected.
#:'''HINT:''' You can also do these last two steps "inversely": select variables you want to ''exclude'' then use the menu option "Exclude Selected".
#:'''HINT:''' If you want to re-include variables which you have previously excluded, first select "View/Excluded Data" on the Plot Controls, then, use the Select button along with the "Edit/Include Selection" menu to re-include the selected variables.
#:'''HINT:''' If you want to re-include variables which you have previously excluded, first select "View/Excluded Data" on the Plot Controls, then, use the Select button along with the "Edit/Include Selection" menu to re-include the selected variables.


Any variables you have excluded will NOT be used in any analyses.
===Using Plot Search===
 
When viewing a plot of a DataSet, the Plot Controls menu: "Edit > Search Bar" will enable the [[PlotControlsWindow_Layout_2#Search_Bar|selection search bar.]] from which the user can enter search criteria for variables or samples to select, and then use the include/exclude of selection as described above.
 
==From the DataSet Editor==
 
The DataSet Editor allows bulk or fine control over included and excluded variables. These features are available from the Row Labels and Column Labels tabs of the DataSet Editor.
 
===Include Checkboxes===
 
The "Incl." column defines whether each row or column is included. These checkboxes can be used individually, including or excluding each item, or multiple rows/columns can be "selected" (by clicking on the labels to the left of the table) and a single click of any selected row's include checkbox will include or exclude all items.
 
===Bulk Include Change===
 
Right-clicking the Incl. column header will offer the option "Bulk Include Change". If selected, the user will be presented a listbox from which all items to be included should be selected. All unselected items will be excluded.
 
===Editing the Include Field===
 
Right-clicking the Incl. column header will offer the option "Edit Incl.". This presents the user with an edit box in which they can enter one or more vectors of items to be included using indices for the items. Vectors are entered as <tt>start:end</tt> (in which a colon separates the start and end indices, and an optional "step" index can be added in-between with an additional colon:  <tt>start:step:end</tt>)
 
===Copy/Paste===
 
If a list of samples or variables exists (as a list of numeric indices to include) in another program, copying this list and pasting it into the Incl. column of the DataSet editor (right click the Incl. column header and choose paste) will force the include field to match the pasted list.

Latest revision as of 16:08, 12 December 2013

The DataSet "Include" property allows portions of the data to be ignored ("excluded") from analysis. This can be used on samples (rows of the data) or variables (columns of the data). Any samples or variables you have excluded will NOT be used in any analyses.

There are several ways to edit the Include field. Each has different advantages:

Using a Plot

Graphically Selecting and Excluding

These step-by-step directions explain how to use a plot of the data to excluding variables from analysis. Similar steps can be used to exclude samples.

  1. Click on the "X" button in the Analysis Status Pane and select "Plot" from the context menu. This will bring up a plot of your data along with a "Plot Controls" window.
  2. In the Plot Controls, select "Variables" for the x-axis. You should now see a plot of the mean sample (e.g. a mean spectrum).
    HINT: if you want to exclude samples from an analysis, choose "Samples" for the x-axis here and use the rest of these instructions as-is.
  3. Using the "Select" button in the Plot Controls, select the range of variables you want to include in the analysis.
    HINT: if you want more than one non-consecutive range, select one range first, then, click "select" again and hold down the "shift" key before selecting the next range.
  4. From the "Edit" menu on the Plot Controls, select "Include Only Selected". This will exclude all of the points you have NOT selected.
    HINT: You can also do these last two steps "inversely": select variables you want to exclude then use the menu option "Exclude Selected".
    HINT: If you want to re-include variables which you have previously excluded, first select "View/Excluded Data" on the Plot Controls, then, use the Select button along with the "Edit/Include Selection" menu to re-include the selected variables.

Using Plot Search

When viewing a plot of a DataSet, the Plot Controls menu: "Edit > Search Bar" will enable the selection search bar. from which the user can enter search criteria for variables or samples to select, and then use the include/exclude of selection as described above.

From the DataSet Editor

The DataSet Editor allows bulk or fine control over included and excluded variables. These features are available from the Row Labels and Column Labels tabs of the DataSet Editor.

Include Checkboxes

The "Incl." column defines whether each row or column is included. These checkboxes can be used individually, including or excluding each item, or multiple rows/columns can be "selected" (by clicking on the labels to the left of the table) and a single click of any selected row's include checkbox will include or exclude all items.

Bulk Include Change

Right-clicking the Incl. column header will offer the option "Bulk Include Change". If selected, the user will be presented a listbox from which all items to be included should be selected. All unselected items will be excluded.

Editing the Include Field

Right-clicking the Incl. column header will offer the option "Edit Incl.". This presents the user with an edit box in which they can enter one or more vectors of items to be included using indices for the items. Vectors are entered as start:end (in which a colon separates the start and end indices, and an optional "step" index can be added in-between with an additional colon: start:step:end)

Copy/Paste

If a list of samples or variables exists (as a list of numeric indices to include) in another program, copying this list and pasting it into the Incl. column of the DataSet editor (right click the Incl. column header and choose paste) will force the include field to match the pasted list.