Data Visualization: Home
What is data visualization?
Data visualization is the process of representing information using visual means, such as a chart/graph, diagram, map, or picture. There are two fundamental types of data visualization: exploratory visualization and explanatory visualization.
Exploratory data visualization is an essential part of exploratory data analysis: the process of summarizing and analyzing data to understand how it is organized, make transformations where necessary, and explore potential underlying patterns in the data. Exploratory data visualization is an important step in trying to make sense of your data by visualizing trends and relationships that may be hidden by summary statistics (the classic example of this is Anscombe's Quartet). In other words: "Exploratory analysis is the process of turning over 100 rocks to find perhaps 1 or 2 precious gemstones." (Cole Nussbaumer Knaflic 2014, Storytelling with Data).
Explanatory data visualization, on the other hand, is the process of telling a story about a particular aspect of your data. Whereas an exploratory visualization has the primary purpose of helping you understand your own data, an explanatory visualization communicates what you have determined is important about your data to an audience. An explanatory visualization can, for example, take the form of a graph with only a subset of the data highlighted or an infographic that combines one or more graphs, tables, or images with text that frames your argument and tells a more complete story than a single chart alone.
Steps to Creating a Visualization
The following steps represent the general process you might take in order to create a visualization.*
- Know your data. It is helpful to start with data that is clean (read more about what it means for data to be "clean" in the article "Tidy Data" by Hadley Wickham, 2014). If you are not the original creator of the dataset you want to visualize, you should also start by making sure you understand what the variables in the dataset mean and ensure that the data has secure provenance (i.e. you know where it came from and how it was created).
- Determine your purpose. Decide whether you are in the exploratory or explanatory stage of visualizing your data. Are you trying to find patterns, or are you trying to tell a story or support an argument with your visualization? Understanding your purpose can help you decide what data to include in your final visualization.
- Choose a chart type. Different charts are useful for different kinds of data, representing different numbers of variables, or showing different patterns and relationships among those variables. Check out the Best Practices page on this guide for interactive resources that can help you decide which chart is most useful for your purpose (Step 2) and your particular data (Step 1).
- Refine your visualization. Once you have selected a tool and input your data, you can adjust some of the textual and visual aspects of your chart to make it more readable, by tweaking the chart axes, labels, or color palette. You should consider how your visualization will appear to people who have visual impairment or different kinds of colorblindness and try to make your visualization as understandable and accessible as possible, whether it will appear in print or on a screen. Some accessibility guidelines related to font and color can be found on the Best Practices page on this guide.
* Adapted from the Data Visualization guide at GMU University Libraries.
Digital Scholarship Events
by Karlen Chase Last Updated Nov 1, 2019 572 views this year
by Rachel Starry Last Updated Oct 2, 2019 747 views this year
by Rachel Starry Last Updated Nov 6, 2019 139 views this year
Digital Scholarship @ UB
by Rachel Starry Last Updated Oct 30, 2019 1544 views this year
Open Access Publishing
by Karlen Chase Last Updated Sep 11, 2019 365 views this year