DATA MINING
Desktop Survival Guide by Graham Williams
The Data tab is the starting point for Rattle, and is where we can load a specific dataset into Rattle.
Rattle is able to load data from various sources. Support is directly included in Rattle for comma separated data files (.csv files, as might be exported by a spreadsheet), tab separated files (.txt files, which are also commonly exported from spreadsheets), and ODBC connections (thus allowing connection to an enormous collection of data sources including MS/Excel, MS/Access, SQL Server, Oracle, IBM DB2, Teradata, MySQL, and PostgreSQL).
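To get a feel for the two file formats, the following is a minimal sketch in plain R of reading a comma separated file and a tab separated file. It is an illustration only and is not taken from Rattle itself: the file contents, the column names (id, age, income), and the object names csv_data and txt_data are made up for the example, and the files are written to temporary locations so that the sketch is self-contained.

```r
# Write two tiny example files so the sketch is self-contained.
# The columns and values here are hypothetical, for illustration only.
csv_path <- tempfile(fileext = ".csv")
txt_path <- tempfile(fileext = ".txt")
writeLines(c("id,age,income", "1,34,52000", "2,45,61000"), csv_path)
writeLines(c("id\tage\tincome", "1\t34\t52000", "2\t45\t61000"), txt_path)

# Comma separated file: read.csv() expects a header row and "," as separator.
csv_data <- read.csv(csv_path)

# Tab separated file: read.delim() expects a header row and a tab as separator.
txt_data <- read.delim(txt_path)

# Inspect the structure of the loaded data frame.
str(csv_data)
```

Both calls return an ordinary R data frame, so the same data is obtained whichever of the two formats the spreadsheet exported.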
Underneath, R is very flexible in where it obtains its data, and data from almost any source can be loaded. Consequently, Rattle is able to access this same variety of sources. Doing so does, however, require first loading the data in the R console and then, within Rattle, loading it as an R Dataset. Refer to Chapter for details on loading all kinds of data into R, including loading data directly from MS/Excel spreadsheets, MS/Access databases, SAS, SPSS, Minitab, Oracle, MySQL, and SQLite.
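As a rough sketch of the first half of that two-step route, we can create a data frame in the R console and then check which data frames exist in the workspace. The name survey is hypothetical, and the built-in iris data simply stands in for data obtained from some other source. That the R Dataset option offers the data frames found in the workspace is an assumption here, so treat the listing below as a way of checking what is available rather than as a description of Rattle's internals.

```r
# Step 1 (R console): load or create the data as a data frame.
# "survey" is a hypothetical name; iris stands in for data from any source.
survey <- iris

# List the objects in the current environment that are data frames.
# Assumption: these are the candidates to choose as an R Dataset in Rattle.
env <- environment()
is_df <- vapply(ls(envir = env),
                function(nm) is.data.frame(get(nm, envir = env)),
                logical(1))
data_frames <- names(is_df)[is_df]
print(data_frames)
```

Once the data frame exists in the console, the second step is carried out within Rattle's Data tab by choosing the R Dataset source.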