Will Traditional Charts Survive?

by Jorge

No, traditional charts are useless  in our complex world

playfair-piechartOver the next 25 years, we will need new visualization tools to replace traditional charts.

As you know, line, bar and even pie charts first appeared 200 years ago, with William Playfair, and perhaps until 25 years ago, they were good enough helping us to make sense of our data. Before computers, they were crafted by graphic designers. Kids in schools drew them using millimetric paper.

Lotus 123 and Harvard Graphics were the most popular charting tools in the early days of personal computers. With those tools (and later, with Excel), the charting landscape changed forever. Some charts vanished, either because they weren’t simple enough and/or didn’t make it into the chart gallery (I miss trilinear plots – yes, Jon, I know how to create them in Excel, but still…), while others should never have been allowed into that gallery.

Today, only a small and shrinking fraction of all the data collected and stored is actually used in our decision making processes and only a fraction of that is actually displayed in charts. But we have a problem: these charts can hardly cope with our complex multidimensional world. Adding injury to insult, we are diving into visualization p0rn, defined by Robert as:

“pretty, flashy mash-ups of something or other, depicting somebody’s life, citing information graphics in a commercial, or growing flowers from twitter feeds.”

I find the expression highly descriptive, but I would associate it also with the corruption of traditional charts.  This is a bizarre path: the more data we collect, the fewer data points are displayed in our charts. After all, we need space for textures, 3D and other chartjunk, right?

Now that our best-known charts are dying of of old age and inadequacy to the needs of our time, it is clear to me that we must really escape Flatland and conquer the third dimension. In a complex world we don’t have the luxury of suppressing one of the only three dimensions that are available to us. That’s the future of charts. And even if we resist the temptation of prettifying our charts and add more data, they are doomed to fail. They belong to a simpler world.

Yes, traditional charts can now realize their full potential

Well, we do collect more data, we do live in a world of a much higher variability. But that’s exactly the reason why we need better charts. We can use data visualization techniques to instantly recognize patterns in very large data sets. We have a reasonably good understanding of how our perception works. The design of traditional charts can take advantage of color or grouping mechanisms.  We know how to manage short-term memory.

We don’t need new tools, we just need to know how to use the existing ones. Traditional charts can realize their full potential if we correct bad practices like:

  • One chart to rule them all: Our reality is infinitely more complex than the one we inherited from our fathers. A chart may be fully optimized for pattern discovery but we can’t expect to make sense of our complex data with a single chart. We need multiple charts to reveal multiple dimensions. Quoting Tufte, “show comparisons adjacent in spaces, not stacked in time”;
  • Large charts: we use charts to find patterns, and the chart size must be optimized to reveal those patterns.  I don’t know what the average chart size is, but I guess we could safely reduce it to one third (and use the available space to show more dimensions of our datasets);
  • Lack of prioritization: not all data is born equal. We must define what ir relevant and what is merely interesting. Focus+context techniques can help us. Ben Shneiderman’s Visual Information-Seeking Mantra (“Overview first, zoom and filter, then details-on-demand”) should be on every desk;
  • Lack of data-reduction techniques: it is often more insightful to plot a ratio (imports/exports, actual/budget) than the the original variables. This is clearly a loss aversion syndrome;
  • Chartjunk: this one goes without saying, right?
  • Static charts: Interaction is the cornerstone of today’s information visualization for data analysis. We can make a point using a single, static chart, but to fully understand a complex dataset the user must be able to manipulate the chart.

Well, maybe…

I stumbled upon a page about “statistical techniques for dimension reduction”. The author writes:

“This can be a problem, especially when some of the features are not discriminatory. In addition to the computational cost, irrelevant features may also cause a reduction in the accuracy of some algorithms. For example (Witten 1999), experiments with a decision tree classifier have shown that adding a random binary feature to standard datasets can deteriorate the classification performance by 5 – 10%.”

He’s right, of course, but look at the page header:

dimension-reduction

I find this very amusing. Can we seriously discuss data reduction techniques and, at the same time, be unable to recognize “irrelevant features” in other areas of our work? (Perhaps he’s been ironic.)

The nature of information visualization did change in the last 25 years, for better and for worse, and is likely to change as much over the next 25 years. New formats will emerge and become popular, specially in the area of complex networks, but basic perception principles will remain the same, and if a chart is optimized to take advantage of those principles it will survive.

Current divorce between data visualization and statistics is harmful for both sides. Those who know what “mode” or “percentile” mean don’t know how to make a good chart. Those who know how to create a chart don’t know how to read a boxplot. Data visualization is not the lighter side of data analysis and data is not just “something” we use to create a chart.

Will traditional charts survive? It depends on how we treat the data.

Well, this was almost a SWOT analysis for traditional charts… So, back to you, what are the strengths, weaknesses, opportunities and threats that our charts face today?

You may also be interested in:

  1. Charts Are for Sissies (Grumpy Old Man’s Guide to Charts)
  2. Excel Charts: If It Isn’t Broken…
  3. Can Edward Tufte Do Business Charts?
  4. 14 Misconceptions About Charts and Graphs
  5. Visualizing change with Stephen Few

{ 7 comments… read them below or add one }

Jon Peltier January 29, 2009 at 19:55

The biggest threat is the move towards “visualizations” which are more and more artistic, but less and less informative.

Case in point. Nathan over at FlowingData has hosted a visualization challenge on the forum attached to his blog. It provided some data on poverty among the different states of the US, and asked for the best visualization efforts of the blog visitors.

I and several other people provided pretty much standard charts to display the data. Nathan’s favorite visualization was not a standard display, however, but a Jackson-Pollock-esque oeuvre that exaggerated one artifact of the data while putting all 50 states into a nondescript blob of color.

I didn’t care for this graphic, because it didn’t show me any information. I also didn’t like a handful of animated displays where values were represented on a map, by making the thickness of each state (the third dimension, orthogonal to lattitude and longitude) proportional to the various quantities. You couldn’t compare adjacent states, because there was no scale, and you couldn’t compare the different measures of poverty, because you have to download and view each separately. Your Tufte quote (show comparisons adjacent in spaces, not stacked in time”) partly sums up my thoughts. The other part was also described by Tufte, who likes dense infographics that allow the reader to take his time while studying the different aspects of the data. Not at the speed of the video, but at the speed of one’s own mind.

admin January 29, 2009 at 23:12

@Jon: I usually follow these challenges, but not this one (as you said in another comment, we must pay the bills…). It looks interesting, from several points of view. A new battle between aesthetics and function…

I’m afraid this is wishful thinking, but I secretly hope that the present crisis will make people more aware of the real value of things, including, in our small niche, the value of some charts and charting tools.

Enrico Bertini February 1, 2009 at 15:31

Jorge,

I couldn’t agree more with you. Interaction and data reduction are great challenges of infovis. Some time ago I wrote a post titled “The Neglected Role of Interaction in Infovis“, which expresses some concerns on it.

In your post I see a potential mix however between different purposes: knowledge discovery/sense-making and communication. Most charts IMHO serve the purpose of communicating something to an audience that has already been digested by others. To convey a message. If we don’t keep in mind this difference we might end up giving the wrong advice for a just cause. I think you understand what I mean :-)

Anyway … the best work I have seen on data reduction for infovis in recent years is the work done by Matt Ward and colleagues at WPI. Everything is implemented and included in the XmdvTool, where you can also find a list of relevant research papers. If you don’t know this work already I am sure you can find great stuff there.

Cheers.

admin February 1, 2009 at 23:01

Enrico, thanks for stopping by and for the links. I am aware of the difference, but at the same time I believe that interaction may somehow blur that difference. There will not be a total overlap, but at least “making-sense” and “communication” must be redefined. A passive audience is an endangered specie, and communication charts/visualizations must take that into account.

paresh shah February 2, 2009 at 18:06

Hi Jorge,

Do me a favor please. Can you visit the website http://www.rediff.com and go to “stocks live”. It has this wonderful sparkline type graph on the movement of the stock market for the day. It also has an additional feature of a moving dot which gives you the data regarding the point. I think that is a great functionality – is that replicable in Excel. What I had in mind is a sparkline say in a dashboard giving monthly figures where the user can scroll along the line to see the underlying data.

Jon Peltier February 2, 2009 at 19:04

Hi Paresh -

I see the chart, but not the moving dot. However, I did see that the little textbox changes as you mouse along the line.

Both of these effects, the moving point and the dynamic textbox, are possible in Excel, using chart events to do the hard work.

Paolo Ciuccarelli February 3, 2009 at 00:11

Hi all. Talking again about Nathan’s challenge, the visualization posted by Luca Masud (Nathan’s favorite, or, if you prefer, the Pollock-esque one) was an experimental contribution: we asked to some of our master students to use the challenge as an entry point, to verify some of their early hypothesis, and to get as much feedbacks as possible. Luca is currently working – he just started – on the intersection between consistency of data and visualization potentialities. His work is part of a more general framework that we (visual design) cultivates together with a department of statistics. We both (statistics and designers) believe that approaching complexity entails new communication challenges (i.e. considering and visualizing uncertainty) and asks for new tools. There is definitely no room here for a battle between aesthetic and function, between visual design and statistics. This is the time to consolidate the marriage.

Leave a Comment

Previous post:

Next post: