DataSet Editor Window: Layout: Difference between revisions

From Eigenvector Research Documentation Wiki
Jump to navigation Jump to search
imported>Jeremy
No edit summary
imported>Jeremy
No edit summary
Line 12: Line 12:
|
|
* Right-click on an item in the Workspace Browser window and on the context menu that opens, click Edit.
* Right-click on an item in the Workspace Browser window and on the context menu that opens, click Edit.
|}
{|


|-
|-
Line 17: Line 21:
|
|
* Drag an item in the Workspace Browser window onto the DataSet Editor icon.  
* Drag an item in the Workspace Browser window onto the DataSet Editor icon.  
|}
{|


|-
|-
Line 22: Line 30:
|
|
* Double-click the DataSet Editor icon in the Workspace Browser window, and on the window's main menu, click File > Load or File > Import From.  
* Double-click the DataSet Editor icon in the Workspace Browser window, and on the window's main menu, click File > Load or File > Import From.  
|}
{|


|-
|-
Line 27: Line 39:
|
|
* Right-click on a data component in an analysis window (for example, the X calibration control in the PCA analysis window), and on the context menu that opens, click Edit.
* Right-click on a data component in an analysis window (for example, the X calibration control in the PCA analysis window), and on the context menu that opens, click Edit.
|}
{|


|-
|-
Line 37: Line 53:
===DataSet Editor window layout===
===DataSet Editor window layout===


''DataSet Editor window''
::''DataSet Editor window''


[[Image:DataSetEditorWindow_Layout.10.1.1.jpg|543x299px]]
::[[Image:DataSetEditorWindow_Layout.10.1.1.jpg|543x299px]]
::
 
::
::


The DataSet Editor window has three major components-the title bar, the main menu, and the tabs.
The DataSet Editor window has three major components-the title bar, the main menu, and the tabs.
Line 60: Line 80:
|
|
* Main menu - The DataSet Editor main menu is set up in a standard Windows menu format with menu commands grouped into menus (File, Edit, View, and FigBrowser) across the menu bar. The Load and Import options on the File menu are identical to the options on the File menu on the Workspace Browser window and the Analysis window. You use these options to load or import data from the Workspace Browser or from a file. You use the File > Save Data option to save a DataSet to the Workspace Browser or to a file. You use the File > Export to export a DataSet to a .csv or .xml file.
* Main menu - The DataSet Editor main menu is set up in a standard Windows menu format with menu commands grouped into menus (File, Edit, View, and FigBrowser) across the menu bar. The Load and Import options on the File menu are identical to the options on the File menu on the Workspace Browser window and the Analysis window. You use these options to load or import data from the Workspace Browser or from a file. You use the File > Save Data option to save a DataSet to the Workspace Browser or to a file. You use the File > Export to export a DataSet to a .csv or .xml file.
|}
{|


|-
|-
Line 80: Line 104:
|
|
* To edit the DataSet name, author, or description, click the Edit button [[Image:Edit_button.png|22x22px]] next to the appropriate field.
* To edit the DataSet name, author, or description, click the Edit button [[Image:Edit_button.png|22x22px]] next to the appropriate field.
|}
{|


|-
|-
Line 85: Line 113:
|
|
* To plot the included data in the DataSet, click the Plot button [[Image:Plot_button.png|20x20px]].
* To plot the included data in the DataSet, click the Plot button [[Image:Plot_button.png|20x20px]].
|}
{|


|-
|-
Line 97: Line 129:
The Data tab displays the data in the DataSet in a spreadsheet format.  
The Data tab displays the data in the DataSet in a spreadsheet format.  


''DataSet Editor window, Data tab''
::''DataSet Editor window, Data tab''


[[Image:DE_window_Datatab_contextmenu.png|327x278px]]
::[[Image:DE_window_Datatab_contextmenu.png|327x278px]]
::


The tab is interactive. You can:
The tab is interactive. You can:
Line 109: Line 142:
|
|
* Edit the data directly on this tab.
* Edit the data directly on this tab.
|}
{|


|-
|-
Line 114: Line 151:
|
|
* Copy and paste rows and/or columns to and from other programs.
* Copy and paste rows and/or columns to and from other programs.
|}
{|


|-
|-
Line 119: Line 160:
|
|
* Include and exclude rows and/or columns of data.
* Include and exclude rows and/or columns of data.
|}
{|


|-
|-
Line 133: Line 178:
In a typical two-way DataSet, data mode 1 (the rows) represents the data samples and data mode 2 (the columns) represents the variables. The Row Labels tab and the Columns Labels tab-also known as the Mode Labels-provide access to the auxiliary "context" data for the Dataset, such as the labels for each sample, the axis scale, the data classes, and the Include status for data. (Multi-way data has additional Label tabs for each mode of the data.)
In a typical two-way DataSet, data mode 1 (the rows) represents the data samples and data mode 2 (the columns) represents the variables. The Row Labels tab and the Columns Labels tab-also known as the Mode Labels-provide access to the auxiliary "context" data for the Dataset, such as the labels for each sample, the axis scale, the data classes, and the Include status for data. (Multi-way data has additional Label tabs for each mode of the data.)


''DataSet Editor window, Row Labels tab''
::''DataSet Editor window, Row Labels tab''


[[Image:DE_window_RowLabelstab_Contextmenu.png|408x277px]]
::[[Image:DE_window_RowLabelstab_Contextmenu.png|408x277px]]
::
 
::
::


You use the information in these fields for:
You use the information in these fields for:
Line 145: Line 194:
|
|
* Managing the data. (For example, the Include field indicates whether a given row or column is to be included in an analysis.)
* Managing the data. (For example, the Include field indicates whether a given row or column is to be included in an analysis.)
|}
{|


|-
|-
Line 150: Line 203:
|
|
* Plotting the data (For example, some correction algorithms plot against the axis scale of the columns.)  
* Plotting the data (For example, some correction algorithms plot against the axis scale of the columns.)  
|}
{|


|-
|-
Line 166: Line 223:
|
|
* Manually enter the information in each field.
* Manually enter the information in each field.
|}
{|


|-
|-
Line 171: Line 232:
|
|
* Assign a name to a field set to assist in identifying content.
* Assign a name to a field set to assist in identifying content.
|}
{|


|-
|-
Line 176: Line 241:
|
|
* Create sets for loading multiple versions of a field into a single DataSet.
* Create sets for loading multiple versions of a field into a single DataSet.
|}
{|


|-
|-
Line 192: Line 261:
|
|
* Copy and paste information to and from the Label field, the Axis Scale field, the Classes field, and the Include field.  
* Copy and paste information to and from the Label field, the Axis Scale field, the Classes field, and the Include field.  
|}
{|


|-
|-
Line 212: Line 285:
|
|
* Transpose-Switches the rows and columns for a 2-way DataSet object.
* Transpose-Switches the rows and columns for a 2-way DataSet object.
|}
{|


|-
|-
Line 217: Line 294:
|
|
* Exclude Data-Mark rows or columns as "Excluded." (Also known as a "Soft Delete.")
* Exclude Data-Mark rows or columns as "Excluded." (Also known as a "Soft Delete.")
|}
{|


|-
|-
Line 222: Line 303:
|
|
* Hard Delete Excluded-Permanently removes excluded data from a DataSet object.
* Hard Delete Excluded-Permanently removes excluded data from a DataSet object.
|}
{|


|-
|-
Line 227: Line 312:
|
|
* Exclude Excessive Missing-Automatically excludes rows or columns or multi-dimensional indices in which the number of missing data values exceeds the allowable amounts, as defined by the missing data replacement algorithm "mdcheck." You you use this option to remove samples or variables which do not have enough information to be used in modeling.
* Exclude Excessive Missing-Automatically excludes rows or columns or multi-dimensional indices in which the number of missing data values exceeds the allowable amounts, as defined by the missing data replacement algorithm "mdcheck." You you use this option to remove samples or variables which do not have enough information to be used in modeling.
|}
{|


|-
|-
Line 232: Line 321:
|
|
* Permute modes-Changes the order of the data modes.
* Permute modes-Changes the order of the data modes.
|}
{|


|-
|-

Revision as of 10:51, 29 July 2010

Table of Contents | Previous | Next

DataSet Editor Window

The DataSet Editor window is the standard interface that you use for creating and managing a DataSet in Solo. Multiple options are available for opening the DataSet Editor window. You can:

  • Right-click on an item in the Workspace Browser window and on the context menu that opens, click Edit.
  • Drag an item in the Workspace Browser window onto the DataSet Editor icon.
  • Double-click the DataSet Editor icon in the Workspace Browser window, and on the window's main menu, click File > Load or File > Import From.
  • Right-click on a data component in an analysis window (for example, the X calibration control in the PCA analysis window), and on the context menu that opens, click Edit.
  • With data loaded in an analysis window, (for example, the X calibration control in the PCA analysis window or the X and Y calibration control in the PLS analysis window), on the Solo main menu, click Edit > Calibration > X-Block Data or Edit > Calibration > Y-Block Data.

DataSet Editor window layout

DataSet Editor window
DataSetEditorWindow Layout.10.1.1.jpg

The DataSet Editor window has three major components-the title bar, the main menu, and the tabs.

  • Title bar - The phrase "DataSet Editor" appears in the title bar at the top of the DataSet Editor window, as well as the standard Window Minimize, Maximize, and Close buttons. The name of the DataSet that is currently loaded in the DataSet Editor window also appears in the title bar. For example, in the figure above, the DataSet named "conc" is currently loaded in the window.
Note: If you launch the DataSet Editor window any other way than from an analysis window, and then modify the DataSet in any way, an asterisk (*) is displayed next to the DataSet name in the title bar. The asterisk indicates that modifications to the data are pending. Before you can close the DataSet Editor window, you must answer a prompt about saving the modified data. If you launch the DataSet Editor window from an analysis window, any modifications that you make to the data are immediate (no asterisk is displayed next to the DataSet name in the title bar) and you can close the window without having to answer a prompt about saving the data.
  • Main menu - The DataSet Editor main menu is set up in a standard Windows menu format with menu commands grouped into menus (File, Edit, View, and FigBrowser) across the menu bar. The Load and Import options on the File menu are identical to the options on the File menu on the Workspace Browser window and the Analysis window. You use these options to load or import data from the Workspace Browser or from a file. You use the File > Save Data option to save a DataSet to the Workspace Browser or to a file. You use the File > Export to export a DataSet to a .csv or .xml file.
  • Tabs-The DataSet Editor window has four tabs-Info, Data, Row Labels, and Column Labels-each of which provides access to different content in the DataSet.

Info tab

When the DataSet Editor window opens, the Info tab is the active tab. (See the figure above.) The Info tab provides a high-level overview of the DataSet, including the DataSet name, the DataSet author, the data type and size in the DataSet for both included and excluded data, the DataSet creation date and time, the DataSet modification date and time, and a description of the DataSet.

The Info tab is interactive:

  • To edit the DataSet name, author, or description, click the Edit button Edit button.png next to the appropriate field.
  • To plot the included data in the DataSet, click the Plot button Plot button.png.
  • To view the history of the DataSet, click the History button Edit button.png.

Data tab

The Data tab displays the data in the DataSet in a spreadsheet format.

DataSet Editor window, Data tab
DE window Datatab contextmenu.png

The tab is interactive. You can:

  • Edit the data directly on this tab.
  • Copy and paste rows and/or columns to and from other programs.
  • Include and exclude rows and/or columns of data.
  • Designate rows and/or columns as axis scales, classes, or the Include field. The results are reflected on either the Row Labels tab or the Column Labels tab.

All actions are available either from the Edit menu, or by right-clicking on a row or on a column header (as shown in the figure above) to open a context menu.

Row Labels tab/Column Labels tab

In a typical two-way DataSet, data mode 1 (the rows) represents the data samples and data mode 2 (the columns) represents the variables. The Row Labels tab and the Columns Labels tab-also known as the Mode Labels-provide access to the auxiliary "context" data for the Dataset, such as the labels for each sample, the axis scale, the data classes, and the Include status for data. (Multi-way data has additional Label tabs for each mode of the data.)

DataSet Editor window, Row Labels tab
DE window RowLabelstab Contextmenu.png

You use the information in these fields for:

  • Managing the data. (For example, the Include field indicates whether a given row or column is to be included in an analysis.)
  • Plotting the data (For example, some correction algorithms plot against the axis scale of the columns.)
  • Analyzing the data. (For example, classification algorithms use the information in the Class field to identify class assignments.)

A variety of options are available for specifying and working with the information for these fields. You can:

  • Manually enter the information in each field.
  • Assign a name to a field set to assist in identifying content.
  • Create sets for loading multiple versions of a field into a single DataSet.
  • Load the fields from files or variables in the base workspace (as long as the information that is being loaded is of the correct size.)

Note: When you load field information from one DataSet object (the source DataSet) into another DataSet (the target DataSet), the information is always loaded from the corresponding field and mode (row/column) of the source DataSet. To load information from a different mode, you must first Extract or Copy the contents from the source DataSet object, and then Load or Paste the content into the target DataSet object. You can always load or paste information from a non-DataSet object, even when the information is from an external program such as Microsoft Excel.

  • Copy and paste information to and from the Label field, the Axis Scale field, the Classes field, and the Include field.
  • Edit multiple fields in a single step.

All actions are available either from the Edit menu, or by right-clicking on a row or on a column header (as shown in the figure above) to open a context menu.

Edit menu

The DataSet Editor Edit menu has many powerful data manipulation options. Some of the more commonly used options include the following:

  • Transpose-Switches the rows and columns for a 2-way DataSet object.
  • Exclude Data-Mark rows or columns as "Excluded." (Also known as a "Soft Delete.")
  • Hard Delete Excluded-Permanently removes excluded data from a DataSet object.
  • Exclude Excessive Missing-Automatically excludes rows or columns or multi-dimensional indices in which the number of missing data values exceeds the allowable amounts, as defined by the missing data replacement algorithm "mdcheck." You you use this option to remove samples or variables which do not have enough information to be used in modeling.
  • Permute modes-Changes the order of the data modes.
  • Concatenate-Provides the option of concatenating "old" data (data already loaded in the DataSet Editor) with "new" data (data that are either loaded or imported using the Load or Import From options on the DataSet Editor File menu). The old data and new data are compared and if they match in size in at least one dimension, (either the same number of rows or the same number of columns), the data can be concatenated. You can use this option to build a larger DataSet from several smaller DataSets.