What Questions Should I Use to Guide My Bivariate Analysis?

Bivariate analysis is a way of exploring the relationship between two variables in a dataset.

Here are some examples of bivariate analysis questions in exploratory data analysis (EDA):

  1. Is there a relationship between the two variables? For example, “Is there a relationship between height and weight?”
  2. How strong is the relationship? For example, “How strongly is the number of hours studied related to the grade received in the course?”
  3. What is the direction of the relationship? For example, “Is there a positive or negative relationship between income and education level?”
  4. Are there any outliers, that is, observations that depart from the overall pattern? For example, “Are there any individuals who have an unusually high income for their level of education?”
  5. Does the relationship between two variables differ across groups? For example, “Is the relationship between age and income different for men and women?”

Working through these questions tells you whether two variables are related, how strongly and in which direction, which observations break the pattern, and whether the pattern holds across groups.
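
As a rough illustration, the questions above can be turned into a few small checks. The sketch below uses only the Python standard library and an invented toy dataset of hours studied, grades, and a group label. The helper names (`pearson_r`, `flag_outliers`) and the cutoff of 2 residual standard deviations are assumptions made for this example, not a standard recipe. It also uses the Pearson correlation, which only measures linear association.

```python
from math import sqrt
from statistics import mean

# Hypothetical toy records: (hours_studied, grade, group). Invented for illustration only.
records = [
    (1, 52, "A"), (2, 58, "A"), (3, 61, "A"), (4, 70, "A"), (5, 74, "A"),
    (1, 60, "B"), (2, 63, "B"), (3, 65, "B"), (4, 66, "B"), (5, 95, "B"),
]
xs = [hours for hours, _, _ in records]
ys = [grade for _, grade, _ in records]


def pearson_r(xs, ys):
    """Pearson correlation: the sign gives direction, the magnitude gives strength."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)


def flag_outliers(xs, ys, threshold=2.0):
    """Indices of points far from the least-squares line.

    A point is flagged when its residual exceeds `threshold` times the
    root-mean-square residual. The default of 2.0 is an arbitrary choice.
    """
    mx, my = mean(xs), mean(ys)
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    intercept = my - slope * mx
    residuals = [y - (slope * x + intercept) for x, y in zip(xs, ys)]
    spread = sqrt(mean(r ** 2 for r in residuals))
    return [i for i, r in enumerate(residuals) if abs(r) > threshold * spread]


# Questions 1-3: is there a relationship, how strong is it, and in which direction?
r = pearson_r(xs, ys)
direction = "positive" if r > 0 else "negative" if r < 0 else "none"
print(f"r = {r:.2f} ({direction})")

# Question 4: which observations depart from the overall pattern?
outliers = flag_outliers(xs, ys)
print("possible outliers:", [records[i] for i in outliers])

# Question 5: does the relationship differ between groups?
by_group = {}
for group in sorted({g for _, _, g in records}):
    gx = [h for h, _, g in records if g == group]
    gy = [s for _, s, g in records if g == group]
    by_group[group] = pearson_r(gx, gy)
print("r by group:", {g: round(v, 2) for g, v in by_group.items()})
```

On this toy data the overall correlation is positive, the last record (5 hours, grade 95) is flagged as a possible outlier, and group A shows a tighter linear relationship than group B. A scatter plot is a natural companion to these numbers, since a single coefficient can hide curved or clustered patterns.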