Data visualizations include simple graphs like bar and pie charts, scatter plots, and line graphs, but they can also be more complicated, like network visualizations or 3D models. This guide will walk you through the process of deciding if a data visualization project is a good choice for your research.
We often think of data as numeric, but it can also be text, images, qualitative, time-based, spatial, and more. The best type of visualization for your research will depend on the data you're using and what you want to show on your visualization.
The most important thing to remember about data visualization is that they are meant to make data easier to understand. Do not inadvertently make the data harder to understand with a confusing or overly complicated visual. The more you create data visualizations, the better you'll become at clarifying the dataset with the visual. See Making an Effective Data Visualization for examples.
When choosing your visualization, consider the following:
A great place to research data visualizations and find ones suitable to your goals is the Data Visualization Catalogue. You can search by function to see those appropriate to your data type or general visualization method.
Data visualizations are meant to make a dataset easier to understand. Unfortunately, some data visualizations make it more difficult by obscuring the overall view of the dataset. Generally, the more visual elements you add to a data visualization (different colors, 3D views, a third axis), the more difficult the visualization is to understand. Simple visualizations are easier to understand for a general audience. Be intentional about what visual elements you choose to include in your graph.
Keep in the mind the following points while you're creating your visualization.
Bar and pie charts are useful for showing how much of any given category exists. An example might be a class survey of who has siblings and how many they have. The bar chart will show the differences in these categories, while the pie chart will show which percentage of the whole has 0 siblings, 1 sibling, and so on.
For bar charts:
For pie charts:
Line charts are an excellent choice for showing changes in data over time. Consider how your data points are connected by the line -- data visualization software will by default draw the shortest line between any two data points, which can create confusion if data is missing.
Things to consider:
Scatter plots show correlation between variables; each variable is mapped on the x or y axis, and a dot represents where they correlate. See the example below showing the correlation between the duration of Old Faithful eruptions and the time between eruptions.
"Old Faithful Eruptions" is by Maksim is licensed under CC-BY 2.0.
The plot suggests that eruptions that are longer in duration need more "warm-up time" because the time between eruptions is longer. In addition, the data points are clustered between short wait, short duration and long wait, long duration; this suggests there are generally two types.
When making a scatter plot, consider: