Togaware DATA MINING
Desktop Survival Guide
by Graham Williams
Google

V

Validation Set: A portion of a dataset that is used to test the performance of a model as it is being built. See also Training Set and Test Set.

Variance: The error in a model due to the fact that the model is built from only a sample of the data taken to represent the whole population. That is, those instances occasionally incorrectly classified by models built by the learning algorithm. Modelling error due to variance can be reduced using Bagging. Compare with bias.



Copyright © 2004-2006 Graham.Williams@togaware.com
Support further development through the purchase of the PDF version of the book.
Brought to you by Togaware.