Data Mining: Principal Component Analysis (PCA) Feature Selection
Help

Upload a file to create features using principal component analysis:

Sample Data

The goal of the project is to determine the most relevant parameters for determining drought condition for all stream segments within the study area. The data set consists of 1000 records. The first column in the table represents the median flow in the driest month of the year and the rest of the columns are predictive variables. Columns 4 to 20 are the physiographical parameters; columns 21 to 44 are the stream flow parameters; and columns 45 to 69 are the weather related (more specifically precipitation) parameters.

Download example here
Select data columns by holding shift and clicking column names: