When you are trying to find the relationship between multiple variables, Parallel coordinates is a good choice.
Parallel coordinates can clearly observe the distribution of data. For example, we can run Parallel coordinates on the iris data set.
iris dataset with parallel_coordinates
import seaborniris_data = seaborn.load_dataset('iris')from pandas.plotting import parallel_coordinatesparallel_coordinates( iris_data,'species',color=('#55627080','#4ECDC480','#C7F46480'))import matplotlib.pyplot as pltplt.title('iris dataset with parallel_coordinates')# Mandatory run display interfaceplt.show()
After running Parallel coordinates on the iris data set, you can clearly observe the relationship between sepal_length, sepal_width, petal_length, petal_width and species.
For example, the petal_length of virginica is larger than the other two varieties. Then when you encounter an iris plant in the future, if its petal_length is larger, then it is likely to be a virginica.