Variableselectiongui and Faq import three-way data: Difference between pages

From Eigenvector Research Documentation Wiki
(Difference between pages)
Jump to navigation Jump to search
imported>Scott
No edit summary
 
imported>Bob
No edit summary
 
Line 1: Line 1:
==Introduction==
===Issue:===


The Variable Selection panel contains an interface to several methods for performing variable selection. The goal is to find subsets of variables that improve predictions when compared to using all variables. This interface has several different methods available. Finding the best method and options settings will take some experimentation. Use links below for more information on particular methods.
How do I import three-way data into Solo or PLS_Toolbox?


==Methods==
===Possible Solutions:===


* Automatic (VIP or sRatio)
'''Solution 1) Built in EEM importers :'''
* GA - Genetic Algorithm
* iPLS - Interval PLS
* rPLS - Recursive PLS
* sRatio - Selectivity Ratio
* VIP - Variable Importance in Projection


==Work Flow==
If applicable to your file type, use one of the built in EEM importers. There are importers for EEM data from Hitachi, Shimazdu, Horiba and Jasco. Please see this wiki entry for more information on [[Data_Importing_Formats | Data Importing Formats]]


* <u>Select a Method</u> - Select a method from the drop-down menu. Options for the method will be displayed. If a previous calculation has been done, the results of it will be displayed.  
EEM data needs be configured in a specific way such that:
* <u>Adjust Options</u> - By default, a simplified set of options are displayed. If the "Show All Options" checkbox is selected then all available options will be displayed. Depending on the options set, a particular method can take an extended amount of time to complete. For example, decreasing the window width in GA will increase the amount of time it takes to complete. See documentation for more details on optional settings.
 
* <u>Run Variable Selection</u> - Clicking the "Execute" button will run the current variable selection method with values specified in the options. A waitbar will be displayed indicating the method is running. Some methods will display a waitbar with a message indicating it can be closed to cancel execution. NOTE: It can take some time for the method to finish a calculation loop and identify the user has canceled. If "Show Plots" is checked then any additional plots will be displayed in separate windows. This is useful for GA as it will show progress of the calculation.
* '''mode 1''' corresponds to '''samples'''
* <u>View Results</u> - When a calculation is complete the selected variables will be displayed under a plot of the data mean as green bars.
 
* <div>'''mode 2''' corresponds to '''emission'''</div>
 
* <div>'''mode 3''' corresponds to '''excitation'''</div>
 
The built-in EEM importers will handle this configuration automatically. When importing manually (see below), further manipulation will likely be necessary. Use the Transform &rarr; Permute modes and Transform &rarr; Reshape smenu items to modify your imported data as appropriate.
 
'''Solution 2) For three-way data with few slabs:'''
 
<ol style="list-style-type:lower-alpha">
  <li>Import the data slabs into the workspace (browser). The workspace browser is available from the main analysis user interface from the menu item FigBrowser.</li>
  <li>Each slab, i.e. each matrix of data is imported individually. Hence, if you have a '''10x8x3''' array, you will import three slabs each of size '''10x8'''.</li>
  <li>Use the mouse to drag slab two onto slab one. In the window that opens choose Augment and then choose augment in the Slabs direction.</li>
  <li>A two-slab three-way array has now replaced the first data matrix. More slabs can be added in the same fashion.</li>
</ol>
 
Alternatively, you may also open one slab in the dataset editor and then add additional slabs using File &rarr; Import. After selecting the next slab to import, answer the same questions as in step c above. Repeat for each slab.  
 
'''Solution 3) For larger three-way data:'''
 
In the DataSet editor, you can import a full three-way array if you have it organized as a two-way matrix. Upon importing the two-way data, you can reshape to a three-way array using the menu item: Transform &rarr; Fold into 3-way.  
 
For example, you have the above matrices (three slabs) in one table/matrix:
 
  [ Slab1;
  Slab2;
  Slab3 ]
 
hence have the three slabs below each other. Upon importing, use the menu option described above to "Fold into 3-way" and choose three as the number of slabs and the data will be rearranged accordingly. If you are familiar with the MATLAB function <code>reshape</code>, you may also use Transform &rarr; Reshape for other types of rearrangements.
 
Note: the result of this command will give you slabs in the 3rd mode of the DataSet. If these slabs are separate samples (such as with EEMs), you'll want to use the Transform &rarr; Permute menu to reorder the dimensions. For example, permuting to the order [3 2 1] would swap the order of the 1st and 3rd modes, putting slabs as the first mode.  
 
 
'''Still having problems? Please contact our helpdesk at [mailto:helpdesk@eigenvector.com helpdesk@eigenvector.com]'''
 
[[Category:FAQ]]

Revision as of 10:45, 20 June 2019

Issue:

How do I import three-way data into Solo or PLS_Toolbox?

Possible Solutions:

Solution 1) Built in EEM importers :

If applicable to your file type, use one of the built in EEM importers. There are importers for EEM data from Hitachi, Shimazdu, Horiba and Jasco. Please see this wiki entry for more information on Data Importing Formats

EEM data needs be configured in a specific way such that:

  • mode 1 corresponds to samples
  • mode 2 corresponds to emission
  • mode 3 corresponds to excitation

The built-in EEM importers will handle this configuration automatically. When importing manually (see below), further manipulation will likely be necessary. Use the Transform → Permute modes and Transform → Reshape smenu items to modify your imported data as appropriate.

Solution 2) For three-way data with few slabs:

  1. Import the data slabs into the workspace (browser). The workspace browser is available from the main analysis user interface from the menu item FigBrowser.
  2. Each slab, i.e. each matrix of data is imported individually. Hence, if you have a 10x8x3 array, you will import three slabs each of size 10x8.
  3. Use the mouse to drag slab two onto slab one. In the window that opens choose Augment and then choose augment in the Slabs direction.
  4. A two-slab three-way array has now replaced the first data matrix. More slabs can be added in the same fashion.

Alternatively, you may also open one slab in the dataset editor and then add additional slabs using File → Import. After selecting the next slab to import, answer the same questions as in step c above. Repeat for each slab.

Solution 3) For larger three-way data:

In the DataSet editor, you can import a full three-way array if you have it organized as a two-way matrix. Upon importing the two-way data, you can reshape to a three-way array using the menu item: Transform → Fold into 3-way.

For example, you have the above matrices (three slabs) in one table/matrix:

 [ Slab1;
 Slab2;
 Slab3 ]

hence have the three slabs below each other. Upon importing, use the menu option described above to "Fold into 3-way" and choose three as the number of slabs and the data will be rearranged accordingly. If you are familiar with the MATLAB function reshape, you may also use Transform → Reshape for other types of rearrangements.

Note: the result of this command will give you slabs in the 3rd mode of the DataSet. If these slabs are separate samples (such as with EEMs), you'll want to use the Transform → Permute menu to reorder the dimensions. For example, permuting to the order [3 2 1] would swap the order of the 1st and 3rd modes, putting slabs as the first mode.


Still having problems? Please contact our helpdesk at helpdesk@eigenvector.com