Diviner analysis: Difference between revisions

From Eigenvector Research Documentation Wiki
Jump to navigation Jump to search
No edit summary
 
(14 intermediate revisions by the same user not shown)
Line 89: Line 89:
If outlier detection is turned on then potential outliers will need to be reviewed. Please see this page for more information: [[diviner_review_outliers | reviewing outliers for Diviner]]
If outlier detection is turned on then potential outliers will need to be reviewed. Please see this page for more information: [[diviner_review_outliers | reviewing outliers for Diviner]]


===Model Refinement===
===Results===
 
'''This section is under construction'''


During a Diviner run the number of models to be built is based on the number of:
During a Diviner run the number of models to be built is based on the number of:
Line 100: Line 98:
** As of version 9.5, the model types are PLS models (by default) and MLR models (which can be set from the Options window)
** As of version 9.5, the model types are PLS models (by default) and MLR models (which can be set from the Options window)
* Variable selection methods
* Variable selection methods
** As of version 9.5, the variables selection methods are [[Interval_PLS_(IPLS)_for_Variable_Selection| iPLS]] and [[Selectvars | Automatic]]
** As of version 9.5, the variable selection methods are none and [[Selectvars | Automatic]]
 
Model refinement may be performed on the better performing of these initial models.
 
====Model Refinement====
 
Model refinement may be performed from the first plot of results to appear during a Diviner run. Use the '''Make Selection''' button to select the models to refine. It is recommended to refine the models that are in the lower left hand portion of the plot, as these models show good performance (low RMSECV) and are not over-fit (low RMSECV/RMSEC ratio). Click the '''Accept and close''' button (the green check) to accept the selection.
 
<gallery widths=500px heights=500px mode="nolines">
File: Diviner_Results_Plot.png
File: Diviner_Make_Selection_button.png
</gallery>
 
Use the '''Model Refinement Options''' window to select how to refine the models:
 
* Variable selection - perform [[Interval_PLS_(IPLS)_for_Variable_Selection| iPLS variables selection]] on the selected models
* Preprocessing - as of version 9.5 this option is unavailable
* Outlier reinclusion - if potential outliers were removed then reinclude and rebuild the models
 
[[File: Diviner_Model_Refinement.png | 300px]]
 
====Review Results====


===Review Results===
When the Diviner run has completed then the results plot will automatically appear. Here is an example of a results plot showing the '''RMSECV/RMSEC vs RMSECV''' for all the models calculated during the Diviner run. The legend is showing the models plotted by Number of Latent Variables. To review the results, change the class set shown on the results plot to identify any trends. See this page for more information: [[Diviner_review_results | diviner results]]


'''This section is under construction'''
<gallery widths=500px heights=500px mode="nolines">
File: Diviner_Results_Plot.png
File: Diviner_Results_Class_Sets.png
</gallery>


===Diviner Buttons===
===Diviner Buttons===
Line 124: Line 146:
[[File: Diviner_Output_Section_Buttons.png | 400 px]]
[[File: Diviner_Output_Section_Buttons.png | 400 px]]


# Variable Consensus - open the variable consensus plot
# Variable Consensus - open the [[Diviner_review_results#Variable_Consensus_Plot | variable consensus plot]]
# Results plot - open the results plot and preprocessing lookup table
# Results plot - open the [[Diviner_review_results#Results_Plot| results plot and preprocessing lookup table]]
# Results table - open the results table
# Results table - open the [[Diviner_review_results#Results_Table | results table]]
# Outlier plot - open the outlier plot. This button will only be enabled if outlier detection was turned on.
# Outlier plot - open the [[Diviner_review_outliers | outlier plot]]. This button will only be enabled if outlier detection was turned on.




'''This page is under construction'''
'''This page is under construction'''

Latest revision as of 09:31, 4 September 2024

Diviner Analysis Window

The Diviner Analysis window serves as the main interface to perform a diviner run.

Main Diviner Interface.png

Using the Diviner Analysis window

Loading Data

There are several methods to load data into the Diviner analysis window. As of version 9.5 the Diviner analysis window does not allow importing data directly from a spreadsheet or other file format. This must be done before hand and the resulting DataSet Object must be present in the Workspace Browser. Please see this wiki page for importing data into: Importing Data and this webinar: Importing Data Webinar

To Load Data

File Menu 1

  1. Go to File → Load Data → Calibration X (or Validation X)
  2. Choose data to load for X block
  3. Go to File → Load Data → Calibration Y (or Validation Y)
  4. Choose data to load for Y block

File Menu 2

  1. Go to File → Load Data → Cal X and Y (or Val X and Y)
  2. Choose data for X block
  3. Choose data for Y block

Load Data File Menu.png

Context Menu 1

  1. Right click on Calibration X block (or Validation X block)
  2. Select Load Data
  3. Choose data to load for X block
  4. Right click on Calibration Y block (or Validation Y block)
  5. Select Load Data
  6. Choose data to load for Y block

Context Menu 2

  1. Right click on Calibration X block or Calibration Y block (or Validation X block or Validation Y block)
  2. Select Load X and Y
  3. Choose data to load for X block
  4. Choose data to load for Y block


Validation Data

Important Note: It is recommended to load validation data, if available, before performing a Diviner run. Doing so will apply all the models to the validation data and report RMSEP values in the Diviner results. If validation data is loaded afterwards then the Diviner run will need to be recalculated to apply the models to the test data.

Select Calibration Y Column

If the Calibration Y block is multivariate then a single Y variable must be selected for modeling:

Method One

  1. Go to Edit → Calibration → Select Y-Column
  2. Select Y variable to use for modeling

Method Two

  1. Right click on Calibration Y block
  2. Choose Select Y Column
  3. Select Y variable to use for modeling

Set Preprocessing Recipes

Once data has been loaded, the next step is to set the Preprocessing recipes to be used by Diviner to build the models. Click on the Preprocessing button to open the Diviner Preprocessing window. Please see this page for more information about the Diviner Preprocessing interface: diviner preprocessing interface. The collection of recipes will be referred to as a library.

Cross-Validation

Review and modify the cross-validation settings using the Cross-Validation button. Please this page for more information on using cross-validation: cross-validation. The default is set to Venetian Blinds with 5 splits.

Outlier Detection

If the preprocessing library contains recipes to use for outlier detection then sse the Outlier Detection On/Off toggle to set if Outlier detection should be performed.

Starting a Diviner

To start a Diviner run, click on the Start Diviner Run button.

Diviner Interface Start Run button.png

Review Outliers

If outlier detection is turned on then potential outliers will need to be reviewed. Please see this page for more information: reviewing outliers for Diviner

Results

During a Diviner run the number of models to be built is based on the number of:

  • X block preprocessing recipes
  • Y block preprocessing recipes
  • Model types
    • As of version 9.5, the model types are PLS models (by default) and MLR models (which can be set from the Options window)
  • Variable selection methods
    • As of version 9.5, the variable selection methods are none and Automatic

Model refinement may be performed on the better performing of these initial models.

Model Refinement

Model refinement may be performed from the first plot of results to appear during a Diviner run. Use the Make Selection button to select the models to refine. It is recommended to refine the models that are in the lower left hand portion of the plot, as these models show good performance (low RMSECV) and are not over-fit (low RMSECV/RMSEC ratio). Click the Accept and close button (the green check) to accept the selection.

Use the Model Refinement Options window to select how to refine the models:

  • Variable selection - perform iPLS variables selection on the selected models
  • Preprocessing - as of version 9.5 this option is unavailable
  • Outlier reinclusion - if potential outliers were removed then reinclude and rebuild the models

Diviner Model Refinement.png

Review Results

When the Diviner run has completed then the results plot will automatically appear. Here is an example of a results plot showing the RMSECV/RMSEC vs RMSECV for all the models calculated during the Diviner run. The legend is showing the models plotted by Number of Latent Variables. To review the results, change the class set shown on the results plot to identify any trends. See this page for more information: diviner results

Diviner Buttons

Toolbar Buttons

Diviner Interface just top buttons.png

  1. Start Diviner - start a Diviner run. Becomes enabled once calibration data has been loaded and preprocessing recipes set.
  2. Send to PCA - quickly send data to PCA Analysis window to allow building a PCA model
  3. Options - open the Diviner options interface
  4. Save Results - save the results from a Diviner run. Becomes enabled after a Diviner run has completed.
  5. Load Results - load results from a Diviner run. Will clear any current results and data.
  6. Open Browser - open the Workspace Browser window
  7. Help - open Diviner documentation
Output Buttons

Diviner Output Section Buttons.png

  1. Variable Consensus - open the variable consensus plot
  2. Results plot - open the results plot and preprocessing lookup table
  3. Results table - open the results table
  4. Outlier plot - open the outlier plot. This button will only be enabled if outlier detection was turned on.


This page is under construction