Visualizing Properties of the Rocks versus Mines Data Set
Visualizing with Parallel Coordinates Plots
Visualizing Interrelationships between Attributes and Labels
crossplot the attributes with the labels / scatter plots
- Figures 2-4 and 2-5 show the scatter plots for two pairs of attributes from the rocks versus mines data set
- correlation
- [JB: feature engineering: delta x, also Ableitung bilden?]
Pearson’s correlation coefficient
- Equation 2-2: Average values of the entries in u
- Equation 2-3: Subtract the average from each element in u
- Equation 2-4: Definition of Pearson’s correlation coefficient
Visualizing Attribute and Label Correlations Using a Heat Map
Perfect correlation (correlation = 1) between attributes means that you may have made a mistake and included the same thing twice.
Very high correlation between a set of attributes (pairwise correlations > 0.7) is known as multicol- linearity and can lead to unstable estimates.