Desktop Survival Guide
by Graham Williams
Data is fundamental to data mining, but quality data is fundamental to quality data mining. The data preparation step in a data mining project involves assessing and improving the data quality--transforming and cleaning and subsetting the data to suit to requirements of the data mining task. In this chapter we explore the process of transforming a data source into a dataset ready for mining.