Tools Cross-Validation: Difference between revisions
imported>Jeremy No edit summary |
|||
(29 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
__TOC__ | __TOC__ | ||
[[TableOfContents|Table of Contents]] | [[ModelApplication_ValidationPhase|Previous]] | [[Tools_ModelRobustness|Next]] | [[TableOfContents|Table of Contents]] | [[ModelApplication_ValidationPhase|Previous]] | [[Tools_ModelRobustness|Next]] | ||
Line 6: | Line 7: | ||
You use the Cross-Validation tool to: | You use the Cross-Validation tool to: | ||
{| | {| | ||
|- | |- valign="top" | ||
| | | | ||
Line 15: | Line 16: | ||
|} | |} | ||
{| | {| | ||
|- | |- valign="top" | ||
| | | | ||
* Estimate the performance of a model when you apply the model to unknown data. | * Estimate the expected performance of a model when you apply the model to unknown data. | ||
|} | |} | ||
Line 26: | Line 27: | ||
For a given set of data, cross-validation involves a series of steps called subvalidation steps in which you remove a subset of objects from a set of data (the test set), build of a model using the remaining objects in the set of data (the model building set), and then apply the resulting model to the removed objects. You note how the errors accumulate as you leave out samples to determine the number of principal components/latent variables/factors to retain in the model. Cross-validation typically involves more than one subvalidation step, each of which in turn involves the selection of different subsets of samples for model building and model testing. In Solo, five different cross-validation methods are available, and these methods vary with respect to how the different sample subsets are selected for these subvalidation steps. | For a given set of data, cross-validation involves a series of steps called subvalidation steps in which you remove a subset of objects from a set of data (the test set), build of a model using the remaining objects in the set of data (the model building set), and then apply the resulting model to the removed objects. You note how the errors accumulate as you leave out samples to determine the number of principal components/latent variables/factors to retain in the model. Cross-validation typically involves more than one subvalidation step, each of which in turn involves the selection of different subsets of samples for model building and model testing. In Solo, five different cross-validation methods are available, and these methods vary with respect to how the different sample subsets are selected for these subvalidation steps. | ||
{| | {| | ||
|- | |- valign="top" | ||
|1. | |1. | ||
Line 36: | Line 37: | ||
|} | |} | ||
{| style="margin-left:18pt" | {| style="margin-left:18pt" | ||
|- | |- valign="top" | ||
| | | | ||
Line 45: | Line 46: | ||
|} | |} | ||
{| style="margin-left:18pt" | {| style="margin-left:18pt" | ||
|- | |- valign="top" | ||
| | | | ||
* Click the Cross-Validation icon in the Analysis window. | * Click the Cross-Validation icon in the Analysis window. | ||
'''Note:''' You must load data into the Analysis window before the Cross-Validation icon is available. | |||
|} | |} | ||
:''Cross-validation icon in the Analysis window'' | |||
::[[Image:Cross_validation_icon_Analysis_window.png|406x83px]] | |||
:: | |||
[[Image:Cross_validation_icon_Analysis_window.png|406x83px]] | |||
{| style="margin-left:18pt" | {| style="margin-left:18pt" | ||
|- | |- valign="top" | ||
| | | | ||
Line 69: | Line 68: | ||
|} | |} | ||
{| | {| | ||
|- | |- valign="top" | ||
|2. | |2. | ||
|In the Cross-Validation dialog box, select the method of cross-validation that you want to use. | |In the Cross-Validation dialog box, select the method of cross-validation that you want to use. You will notice a visual representation of the splits at the top of the window. Other relevant information and warnings will appear at the bottom of the window. | ||
|} | |} | ||
:''Cross-Validation dialog box'' | |||
[[Image: | ::[[Image:CrossValGUI.png|500px||CrossVal Window]] | ||
:: | |||
{| | {| | ||
|- | |- valign="top" | ||
|3. | |3. | ||
Line 93: | Line 93: | ||
|} | |} | ||
Note: Not all parameters are relevant for all cross-validation methods. The initial values that are specified for the available parameters are default values that are based on the dimensionality of the data. You can click Reset at any time to reset the parameters to their default settings. | :'''Note:''' Not all parameters are relevant for all cross-validation methods. The initial values that are specified for the available parameters are default values that are based on the dimensionality of the data. You can click Reset at any time to reset the parameters to their default settings. See the [[Using_Cross-Validation]] page for details on how to use the different methods and settings. | ||
[[ | |||
{| | {| | ||
|- | |- valign="top" | ||
|4. | |4. | ||
Line 376: | Line 105: | ||
|} | |} | ||
{| style="margin-left:18pt" | {| style="margin-left:18pt" | ||
|- | |- valign="top" | ||
| | | | ||
Line 385: | Line 114: | ||
|} | |} | ||
{| style="margin-left:18pt" | {| style="margin-left:18pt" | ||
|- | |- valign="top" | ||
| | | | ||
Line 393: | Line 122: | ||
|} | |} | ||
===Technical Details=== | |||
More technical details can be found on the [[Using_Cross-Validation]] page. |
Latest revision as of 12:16, 11 August 2020
Table of Contents | Previous | Next
Cross-Validation Tool
You use the Cross-Validation tool to:
|
|
For a given set of data, cross-validation involves a series of steps called subvalidation steps in which you remove a subset of objects from a set of data (the test set), build of a model using the remaining objects in the set of data (the model building set), and then apply the resulting model to the removed objects. You note how the errors accumulate as you leave out samples to determine the number of principal components/latent variables/factors to retain in the model. Cross-validation typically involves more than one subvalidation step, each of which in turn involves the selection of different subsets of samples for model building and model testing. In Solo, five different cross-validation methods are available, and these methods vary with respect to how the different sample subsets are selected for these subvalidation steps.
1. | To open the Cross-Validation tool, do one of the following: |
|
Note: You must load data into the Analysis window before the Cross-Validation icon is available. |
|
2. | In the Cross-Validation dialog box, select the method of cross-validation that you want to use. You will notice a visual representation of the splits at the top of the window. Other relevant information and warnings will appear at the bottom of the window. |
- Cross-Validation dialog box
3. | Use the slider bars to change the default values for the available parameters. |
- Note: Not all parameters are relevant for all cross-validation methods. The initial values that are specified for the available parameters are default values that are based on the dimensionality of the data. You can click Reset at any time to reset the parameters to their default settings. See the Using_Cross-Validation page for details on how to use the different methods and settings.
4. | Do one of the following: |
|
|
Technical Details
More technical details can be found on the Using_Cross-Validation page.