Visualizing Properties of the Rocks versus Mines Data Set

Visualizing with Parallel Coordinates Plots

Visualizing Interrelationships between Attributes and Labels

crossplot the attributes with the labels / scatter plots

  • Figures 2-4 and 2-5 show the scatter plots for two pairs of attributes from the rocks versus mines data set
  • correlation
  • [JB: feature engineering: delta x, also Ableitung bilden?]

Pearson’s correlation coefficient

  • Equation 2-2: Average values of the entries in u
  • Equation 2-3: Subtract the average from each element in u
  • Equation 2-4: Definition of Pearson’s correlation coefficient

Visualizing Attribute and Label Correlations Using a Heat Map

Perfect correlation (correlation = 1) between attributes means that you may have made a mistake and included the same thing twice.

Very high correlation between a set of attributes (pairwise correlations > 0.7) is known as multicol- linearity and can lead to unstable estimates.