Model Building: Calibration Phase: Difference between revisions
imported>Jeremy (Created page with '__TOC__ Table of Contents | Previous | Next | Index ==Building the M…') |
No edit summary |
||
(9 intermediate revisions by one other user not shown) | |||
Line 1: | Line 1: | ||
__TOC__ | __TOC__ | ||
[[TableOfContents|Table of Contents]] | [[ModelBuilding_AnalysisPhasesOverview|Previous]] | [[ModelBuilding_PlottingEigenValues|Next | |||
[[TableOfContents|Table of Contents]] | [[ModelBuilding_AnalysisPhasesOverview|Previous]] | [[ModelBuilding_PlottingEigenValues|Next]] | |||
==Building the Model in the Calibration Phase== | ==Building the Model in the Calibration Phase== | ||
Line 6: | Line 7: | ||
Regardless of the analysis method, building a model in the Calibration phase consists of a series of the same general steps, with the second and third steps being iterative, until you are satisfied with your model. These steps are: | Regardless of the analysis method, building a model in the Calibration phase consists of a series of the same general steps, with the second and third steps being iterative, until you are satisfied with your model. These steps are: | ||
1. | {| | ||
Loading the calibration data and building the initial model. See [[ModelBuilding_CalibrationPhase#Loading the calibration data and building the initial model|Loading the calibration data and building the initial model]]. | |||
|- valign="top" | |||
|1. | |||
|Loading the calibration data and building the initial model. See [[ModelBuilding_CalibrationPhase#Loading the calibration data and building the initial model|Loading the calibration data and building the initial model]]. | |||
|} | |||
{| | |||
|- valign="top" | |||
|2. | |||
|Changing the number of components or factors that are to be retained in the model and recalculating the model. See [[ModelBuilding_CalibrationPhase#Changing the number of components|Changing the number of components]]. | |||
|} | |||
{| | |||
|- valign="top" | |||
|3. | |||
|Examining the model and refining the model by excluding certain samples and/or variables to enhance the model performance. See [[ModelBuilding_CalibrationPhase#Examining and refining the model|Examining and refining the model]]. | |||
|} | |||
{| | |||
|- valign="top" | |||
|4. | |||
|After you are satisfied with the model, you can then do one of the following: | |||
|} | |||
{| style="margin-left:18pt" | |||
|- valign="top" | |||
| | |||
* Save the model and use it at a later date. | |||
|} | |||
{| style="margin-left:18pt" | |||
|- valign="top" | |||
| | |||
* Load validation and test data and apply the model immediately. | |||
|} | |||
Note: Decomposition and Clustering analysis methods require only x block data for model building in the Calibration phase. Regression analysis methods require both x block data and y block data. Classification analysis methods require x block data with classes in either X or Y. For simplicity and brevity, this section describes model building during the Calibration phase using default preprocessing methods for a simple PCA model; however, all of the general information in this section is applicable for all analysis methods. | '''Note:''' Decomposition and Clustering analysis methods require only x block data for model building in the Calibration phase. Regression analysis methods require both x block data and y block data. Classification analysis methods require x block data with classes in either X or Y. For simplicity and brevity, this section describes model building during the Calibration phase using default preprocessing methods for a simple PCA model; however, all of the general information in this section is applicable for all analysis methods. | ||
Note: Although this section describes model building using default preprocessing methods, remember, for most analyses, it is critical to select the appropriate preprocessing methods for the data that is being analyzed. To review detailed information about preprocessing, see [[ModelBuilding_PreProcessingMethods|Preprocessing Methods]]. | '''Note:''' Although this section describes model building using default preprocessing methods, remember, for most analyses, it is critical to select the appropriate preprocessing methods for the data that is being analyzed. To review detailed information about preprocessing, see [[ModelBuilding_PreProcessingMethods|Preprocessing Methods]]. | ||
Note: To review a detailed description of the Calibration phase, see [[ModelBuilding_AnalysisPhasesOverview| | '''Note:''' To review a detailed description of the Calibration phase, see [[ModelBuilding_AnalysisPhasesOverview| "Analysis Phases."]] | ||
===Loading the calibration data and building the initial model=== | ===Loading the calibration data and building the initial model=== | ||
Line 32: | Line 75: | ||
You have a variety of options for opening an Analysis window and loading data. Because these methods have been discussed in detail in other areas of the documentation, they are not repeated here. Instead, a brief summary is provided with a cross-reference to the detailed information. Simply choose the method that best fits your working needs. | You have a variety of options for opening an Analysis window and loading data. Because these methods have been discussed in detail in other areas of the documentation, they are not repeated here. Instead, a brief summary is provided with a cross-reference to the detailed information. Simply choose the method that best fits your working needs. | ||
{| | |||
|- valign="top" | |||
| | |||
* To open an Analysis window: | * To open an Analysis window: | ||
:* In the Workspace Browser, click the shortcut icon for the specific analysis that you are carrying out. | |} | ||
{| style="margin-left:18pt" | |||
|- valign="top" | |||
| | |||
* In the Workspace Browser, click the shortcut icon for the specific analysis that you are carrying out. | |||
|} | |||
{| style="margin-left:18pt" | |||
|- valign="top" | |||
| | |||
* In the Workspace Browser, click Other Analysis to open an Analysis window, and on the Analysis menu, select the specific analysis method that you are carrying out. | |||
|} | |||
: | {| style="margin-left:18pt" | ||
|- valign="top" | |||
| | |||
* In the Workspace Browser, drag a data icon to a shortcut icon to open the Analysis window and load the data in a single step. | |||
|} | |||
:'''Note:''' For information about working with icons in the Workspace Browser, see [[WorkspaceBrowser_DataIcons|Icons in the Workspace Browser]]. | |||
{| | |||
|- valign="top" | |||
| | |||
* To load data into an open Analysis window: | * To load data into an open Analysis window: | ||
: | |} | ||
{| style="margin-left:18pt" | |||
|- valign="top" | |||
| | |||
* Click File on the Analysis window main menu to open a menu with options for loading and importing calibration data. | |||
|} | |||
: | {| style="margin-left:18pt" | ||
:Note: For information about the data manipulation options on the context menu, see [[WorkspaceBrowser_DataIcons|Icons in the Workspace Browser]] or [[WorkspaceBrowser_ImportingData|Importing Data into the Workspace Browser]]. For information about loading items from the Model Cache pane, see [[AnalysisWindow_ModelCachepane|Analysis window Model Cache pane]]. | |- valign="top" | ||
| | |||
* Click the appropriate calibration control to open the Import dialog box and select a file type to import. | |||
|} | |||
{| style="margin-left:18pt" | |||
|- valign="top" | |||
| | |||
* Right-click the appropriate calibration control to open a context menu with options for loading and importing data. | |||
|} | |||
{| style="margin-left:18pt" | |||
|- valign="top" | |||
| | |||
* Right-click on an entry for a cached item Model Cache pane to open a context menu that contains options for loading the selected cached item into the Analysis window. | |||
|} | |||
:'''Note:''' For information about the data manipulation options on the context menu, see [[WorkspaceBrowser_DataIcons|Icons in the Workspace Browser]] or [[WorkspaceBrowser_ImportingData|Importing Data into the Workspace Browser]]. For information about loading items from the Model Cache pane, see [[AnalysisWindow_ModelCachepane|Analysis window Model Cache pane]]. | |||
Also, remember that after you load data into a calibration control, you can place your mouse pointer on the control to view not only information about the loaded data, but also, different instructions about working with the control. In the figure below, data has been loaded into the X calibration control for a PCA analysis. | Also, remember that after you load data into a calibration control, you can place your mouse pointer on the control to view not only information about the loaded data, but also, different instructions about working with the control. In the figure below, data has been loaded into the X calibration control for a PCA analysis. | ||
''Example of loaded data in the X calibration control for a PCA analysis'' | :''Example of loaded data in the X calibration control for a PCA analysis'' | ||
[[Image:PCA_analysis_xblock_data_loaded_Cal.png|269x129px]] | ::[[Image:PCA_analysis_xblock_data_loaded_Cal.png|269x129px]] | ||
:: | |||
:: | |||
:: | |||
After you have opened the Analysis window and loaded the calibration data, you then calculate the initial model. To calculate the initial calibration model, you can do one of the following: | After you have opened the Analysis window and loaded the calibration data, you then calculate the initial model. To calculate the initial calibration model, you can do one of the following: | ||
{| | |||
|- valign="top" | |||
| | |||
* On the Analysis window toolbar, click the Calculate/Apply model icon [[Image:Calculate_Apply_Model_icon.png|25x22px]]. | * On the Analysis window toolbar, click the Calculate/Apply model icon [[Image:Calculate_Apply_Model_icon.png|25x22px]]. | ||
|} | |||
{| | |||
|- valign="top" | |||
| | |||
* Click the Model control. | * Click the Model control. | ||
|} | |||
:''Clicking the Model control in the Analysis window'' | |||
[[Image:Clicking_to_calculate_model.png|343x78px]] | |||
:: | |||
:: | |||
::[[Image:Clicking_to_calculate_model.png|343x78px]] | |||
:: | |||
After the initial model is calculated, you can place your mouse pointer on the Model control to view general information about the model. To view detailed information the model, right-click on the Model control and on the context menu that opens, select Show Model Details. | After the initial model is calculated, you can place your mouse pointer on the Model control to view general information about the model. To view detailed information the model, right-click on the Model control and on the context menu that opens, select Show Model Details. | ||
''Showing model details in the Analysis window'' | :''Showing model details in the Analysis window'' | ||
[[Image:Information_initial_model.png|350x138px]] | ::[[Image:Information_initial_model.png|350x138px]] | ||
:: | |||
:: | |||
:: | |||
:: | |||
:: | |||
===Changing the number of components=== | ===Changing the number of components=== | ||
Line 84: | Line 215: | ||
For analysis methods which use factors or principal components, you can choose a different number of components or factors to retain in the model and then recalculate the model. To choose a different number of components or factors: | For analysis methods which use factors or principal components, you can choose a different number of components or factors to retain in the model and then recalculate the model. To choose a different number of components or factors: | ||
1 | {| | ||
|- valign="top" | |||
|1. | |||
|Click on the appropriate row in the Control panel. | |||
|} | |||
{| | |||
|- valign="top" | |||
|2. | |||
[[Image:Control_pane_PCA.png|359x353px]] | |Recalculate the model by doing one of the following: | ||
|} | |||
{| style="margin-left:18pt" | |||
|- valign="top" | |||
| | |||
* On the Analysis window toolbar, click the Calculate/Apply model icon [[Image:Calculate_Apply_Model_icon.png|25x22px]]. | |||
|} | |||
{| style="margin-left:18pt" | |||
|- valign="top" | |||
| | |||
* Click the Model control. | |||
|} | |||
'''Note:''' By default, the maximum number of principal components or factors that you can retain in a model is 20. You can change this value in the Analysis options settings for the Edit menu. For example, the figure below shows an initial model calculated for a PCA analysis with the suggested value for the number of components to retain set to three. | |||
:''Initial model calculated for a PCA analysis with number of suggested components = 3'' | |||
::[[Image:Control_pane_PCA.png|359x353px]] | |||
:: | |||
After you select a different number of components or factors to retain, the Model control is marked with an Exclamation icon indicating that you must recalculate the model. | After you select a different number of components or factors to retain, the Model control is marked with an Exclamation icon indicating that you must recalculate the model. | ||
''Model marked for recalculation'' | :''Model marked for recalculation'' | ||
[[Image:ModelBuilding_CalibrationPhase.23.1.07.jpg|580x254px]] | ::[[Image:ModelBuilding_CalibrationPhase.23.1.07.jpg|580x254px]] | ||
:: | |||
:: | |||
:: | |||
===Examining and refining the model=== | ===Examining and refining the model=== | ||
Line 111: | Line 274: | ||
After the model is calculated, the Control pane displays the percent variance captured and other statistical information for the model. For certain analyses, the application provides a suggested number of components or factors to retain for the model based on internal tests. For example, the figure below shows an initial model calculated for a PCA analysis with the suggested value for the number of components to retain set to three. | After the model is calculated, the Control pane displays the percent variance captured and other statistical information for the model. For certain analyses, the application provides a suggested number of components or factors to retain for the model based on internal tests. For example, the figure below shows an initial model calculated for a PCA analysis with the suggested value for the number of components to retain set to three. | ||
''Initial model calculated for a PCA analysis with number of suggested components = 3'' | :''Initial model calculated for a PCA analysis with number of suggested components = 3'' | ||
[[Image:Control_pane_PCA.png|359x353px]] | ::[[Image:Control_pane_PCA.png|359x353px]] | ||
:: | |||
The Analysis window toolbar is updated dynamically with other toolbar buttons based on the selected analysis method. All of these toolbar buttons create plots and other visual aids that assist you in examining and refining the model by excluding certain samples and/or variables to enhance the model performance. Common toolbar buttons include the following: | The Analysis window toolbar is updated dynamically with other toolbar buttons based on the selected analysis method. All of these toolbar buttons create plots and other visual aids that assist you in examining and refining the model by excluding certain samples and/or variables to enhance the model performance. Common toolbar buttons include the following: | ||
{| | |||
|- valign="top" | |||
| | |||
* The Plot Eigenvalues button [[Image:Plot_Eigenvalues_icon.png|19x20px]]. See [[ModelBuilding_PlottingEigenValues|Plotting Eigenvalues for a Calibration Model]]. | * The Plot Eigenvalues button [[Image:Plot_Eigenvalues_icon.png|19x20px]]. See [[ModelBuilding_PlottingEigenValues|Plotting Eigenvalues for a Calibration Model]]. | ||
|} | |||
{| | |||
|- valign="top" | |||
| | |||
* The Plot scores and sample statistics button [[Image:Plot_scores_sample_statistics_icon.png|16x19px]]. See [[ModelBuilding_PlottingScores|Plotting Scores and Statistical Values for a Calibration Model]]. | * The Plot scores and sample statistics button [[Image:Plot_scores_sample_statistics_icon.png|16x19px]]. See [[ModelBuilding_PlottingScores|Plotting Scores and Statistical Values for a Calibration Model]]. | ||
|} | |||
{| | |||
|- valign="top" | |||
| | |||
* The Plot loads and variable statistics button [[Image:Plot_loads_variable_statistics_icon.png|21x20px]]. See [[ModelBuilding_PlottingLoads|Plotting Loads and Variable Statistics for a Calibration Model]]. | * The Plot loads and variable statistics button [[Image:Plot_loads_variable_statistics_icon.png|21x20px]]. See [[ModelBuilding_PlottingLoads|Plotting Loads and Variable Statistics for a Calibration Model]]. | ||
Note: All other Analysis window toolbar buttons are specific to an analysis method and therefore, are not discussed in this guide. | |} | ||
{| | |||
|- valign="top" | |||
| | |||
* The Scores and loadings biplots button [[Image:Biplot_button.png|35x34px]]. See [[ModelBuilding_Biplot|Scores and Loadings Biplots for a Calibration Model]]. | |||
|} | |||
'''Note:''' All other Analysis window toolbar buttons are specific to an analysis method and therefore, are not discussed in this guide. | |||
:: | |||
:: | |||
:: | |||
:: |
Latest revision as of 08:58, 13 February 2020
Table of Contents | Previous | Next
Building the Model in the Calibration Phase
Regardless of the analysis method, building a model in the Calibration phase consists of a series of the same general steps, with the second and third steps being iterative, until you are satisfied with your model. These steps are:
1. | Loading the calibration data and building the initial model. See Loading the calibration data and building the initial model. |
2. | Changing the number of components or factors that are to be retained in the model and recalculating the model. See Changing the number of components. |
3. | Examining the model and refining the model by excluding certain samples and/or variables to enhance the model performance. See Examining and refining the model. |
4. | After you are satisfied with the model, you can then do one of the following: |
|
|
Note: Decomposition and Clustering analysis methods require only x block data for model building in the Calibration phase. Regression analysis methods require both x block data and y block data. Classification analysis methods require x block data with classes in either X or Y. For simplicity and brevity, this section describes model building during the Calibration phase using default preprocessing methods for a simple PCA model; however, all of the general information in this section is applicable for all analysis methods.
Note: Although this section describes model building using default preprocessing methods, remember, for most analyses, it is critical to select the appropriate preprocessing methods for the data that is being analyzed. To review detailed information about preprocessing, see Preprocessing Methods.
Note: To review a detailed description of the Calibration phase, see "Analysis Phases."
Loading the calibration data and building the initial model
You have a variety of options for opening an Analysis window and loading data. Because these methods have been discussed in detail in other areas of the documentation, they are not repeated here. Instead, a brief summary is provided with a cross-reference to the detailed information. Simply choose the method that best fits your working needs.
|
|
|
|
- Note: For information about working with icons in the Workspace Browser, see Icons in the Workspace Browser.
|
|
|
|
|
- Note: For information about the data manipulation options on the context menu, see Icons in the Workspace Browser or Importing Data into the Workspace Browser. For information about loading items from the Model Cache pane, see Analysis window Model Cache pane.
Also, remember that after you load data into a calibration control, you can place your mouse pointer on the control to view not only information about the loaded data, but also, different instructions about working with the control. In the figure below, data has been loaded into the X calibration control for a PCA analysis.
- Example of loaded data in the X calibration control for a PCA analysis
After you have opened the Analysis window and loaded the calibration data, you then calculate the initial model. To calculate the initial calibration model, you can do one of the following:
|
- Clicking the Model control in the Analysis window
After the initial model is calculated, you can place your mouse pointer on the Model control to view general information about the model. To view detailed information the model, right-click on the Model control and on the context menu that opens, select Show Model Details.
- Showing model details in the Analysis window
Changing the number of components
For analysis methods which use factors or principal components, you can choose a different number of components or factors to retain in the model and then recalculate the model. To choose a different number of components or factors:
1. | Click on the appropriate row in the Control panel. |
2. | Recalculate the model by doing one of the following: |
|
Note: By default, the maximum number of principal components or factors that you can retain in a model is 20. You can change this value in the Analysis options settings for the Edit menu. For example, the figure below shows an initial model calculated for a PCA analysis with the suggested value for the number of components to retain set to three.
- Initial model calculated for a PCA analysis with number of suggested components = 3
After you select a different number of components or factors to retain, the Model control is marked with an Exclamation icon indicating that you must recalculate the model.
- Model marked for recalculation
Examining and refining the model
After the model is calculated, the Control pane displays the percent variance captured and other statistical information for the model. For certain analyses, the application provides a suggested number of components or factors to retain for the model based on internal tests. For example, the figure below shows an initial model calculated for a PCA analysis with the suggested value for the number of components to retain set to three.
- Initial model calculated for a PCA analysis with number of suggested components = 3
The Analysis window toolbar is updated dynamically with other toolbar buttons based on the selected analysis method. All of these toolbar buttons create plots and other visual aids that assist you in examining and refining the model by excluding certain samples and/or variables to enhance the model performance. Common toolbar buttons include the following:
|
|
|
|
Note: All other Analysis window toolbar buttons are specific to an analysis method and therefore, are not discussed in this guide.