From the category archives:

Better Charts

Misconception #1: A Better Chart Starts With… the Chart

Wrong. It starts by asking yourself if you really need one. Perhaps a statistical measure of some sort is good enough, perhaps you should use a table. If your job is to find patterns in a data set and build shared knowledge about it, what really matters is how efficiently the message is sent, and how efficiently it is received by the audience (two different things).

Misconception #2: You Should Master the (Technological) Tools of the Trade

No, you don’t. Just because you know how to create a chart in Excel it doesn’t mean that you know how to create a chart. If you use Microsoft Excel as your charting software then yes, you should learn more Excel (to spend more time with the kids). But you must go beyond technology, or else you end up creating some very stupid charts. Please note that a vast majority of Excel training courses will not teach you what it should (best practices). It will only tell you how to make “cool” graphs, like a 3D exploded pie chart.

Misconception #3: Defaults are good enough

They aren’t. Each chart must be tailored to the specific data set, audience and message. For instance, try to create a graph that clearly displays a large number of series and you’ll fail if you use the defaults (but can do it with clever color coding). And if you use recognizable defaults, like the Excel 2003 charts, you’ll look very, very, lazy (at best).

Misconception #4: Vendors obviously implement the very best templates

(I’ve heard this one recently, and I found it so incredibly naive that I had to write about it.) They don’t. About 90% of the Excel 2003 chart gallery is junk, and you must heavily reformat the remaining 10% to get something useful. Select other tools, like Crystal Xcelsius and the scenario is even worse. And I am unable to create in Cognos something that remotely resembles a chart (people tell me that version 8.4 is a little better).

Misconception #5: Better charts are just “prettier” charts

I hear this all the time. A good chart may look “prettier”, but that’s just an unintended consequence of a design that communicates better. In information visualization, prettiness must be a by-product of function. The very concept of a “better communicator” is sometimes difficult to comprehend, and trying to explain it is a waste of time, because people need to see it in action. You must take the user by the hand and guide him/her. You must force comparisons: “what can you learn about x using this chart?” and “what can you learn about x using that chart?” “how long did it take you to learn x using this and using that?”.

Misconception #6: It’s All About the Wow Factor

It is not. Many marketers and graphic designers fail to understand this. Marketers are hopeless in their relentless search for the wow factor and the eye-catching, “professional-looking” graphs, and graphic designers should know better, but they prefer to sacrifice data on the altar of Beauty (form is everything, data is a nuisance).

The dominant view among visualization experts (namely Tufte and Few) is that “form follows function“: every ornament in a graph should be eliminated, every object must serve a clear purpose, efficiency should be maximized (labeling series instead of using a legend, for instance). Given the extremely low graphic literacy levels among the general population, this may not always be the best approach.

Misconception #7: A good chart displays the actual values

No. If you label each data point you get a useless table over a useless chart. Labels are not only a distraction but often actually hide patterns in the data. Short labels and annotations can, and should, be used to identify or explain outliers or other interesting data points and circumstances. If your audience expects to see the underlying data then add a link to the table.

Misconception #8: Good Charts Should Be Read at a Glance

No, they don’t. The more complex, the longer it takes. It really doesn’t matter if it takes a second or an hour. What matter is how efficiently the graph  communicates. If a chart takes for ever to be read look for bottlenecks: the series are not easily identifiable, patterns are hidden, demands on the working memory are high, etc.

Misconception #9: The More Detail the Better

What we see as detail can be seen by someone else as clutter. Clutter is the natural child of loss aversion and is is very difficult to remove. If you have 12 competitors your audience will want to see the market share for each of them, even if it doesn’t make any sense. Tufte says “to clarify, add detail”, and yes, 12 competitors in a line chart can be made clear and useful, but you must know how to categorize them and provide a framework to help the user (you can use a large number of categories in a pie chart, for instance).

Misconception #10: It’s All About Selling Your Point, No Nuances

In The three laws of great graphs Seth Godin says that “there is no room for nuance [in a presentation]” and your charts should reflect that. Maybe it is just me, but I hate it when I am not allowed to draw my own conclusions because the data made available by the presenter is too biased towards his/her own points of view. Depending on the situation, a clear path that is supported by a lot of details is much better than a yes/no pie chart.

Misconception #11: You Have to Have Color, Lots of Color

Wrong. Color is a very difficult subject. Large surfaces of primary colors like we often see in presentations should be avoided because they are hard on the eyes and, because everything stands out, nothing stand out. A good option is to use grays for non-data elements like grid lines, and pale colors for color-coding. As a rule of thumb, color should always carry some meaning. Use primary colors to highlight a data point or some other small detail.

Misconception #12: A Single Chart is Enough

It is not. We live in an increasingly complex world, and traditional charts are very simple tools. While we wait for a new set of charts to be invented, we can use interaction (see below) and multiple charts to create a richer picture. That’s why scatter plot matrices, small multiples or trellis displays, and specially those multiple variations of executive dashboards are much more powerful than a simple chart.

Misconception #13: Charts Are Interchangeable

They aren’t. You can use a column chart or a line chart to display a time series, but while a line chart performs better than a column chart when reading trends, it is easier to compare data points using a column chart. Most visualization experts will tell you that you should use a bar chart instead of a pie chart (also because it is easier to compare data points), but a pie chart gives you the perception of a whole that is absent in a bar chart. Every graph has its own strengths, and you should select the one that suits your needs.

Misconception #14: Create It and Forget It

Don’t. Making sense of your data is a process of exploration and discovery. A pattern in a subset may be hidden by a noisy background. Different measures may lead to more complex insights. Creating a chart that the user can interact with should always be your primary goal. Unfortunately, that’s beyond the skills of an intermediate Excel user (if you what to learn about interactive charts my Excel dashboards may be a good starting point).

*

This post lists 14 widespread misconceptions about charts, but probably is a very incomplete list and you may not agree with all of them. What misconceptions would you add/remove?

*

[Update: Jon has been writing extensively about Excel 2003 and Excel 2007 (by the way, it's a great resource that helps us to see through the marketing noise). I said in the comments below that I prefer to use Excel 2007 charts to post images in this blog. He doesn't agree and he tries to prove in his last post that charts in Excel 2003 are actually better. He uses good examples to prove his point but I still believe that this (Excel 2007):

looks better than this (Excel 2003):

Yes, probably there is an "overaggressive anti-aliasing", but the line in Excel 2003 is too "crispy" for my taste. Again, it is just a matter of creating images for a blog, not exactly for serious work...]

{ 25 comments }

Do you type well? I mean, do your thoughts flow naturally through your fingers or, on the contrary, your (low) typing skills come between your thoughts and your writing?

I don’t type fast enough, I look at the keyboard from time to time and I don’t follow some simple typing rules that would save me months of my life.

My question is, if the benefits of adhering to those rules are so obvious, why don’t I spend invest some time learning how to use them properly?

The more I think about it, the more I feel that the reasons behind this are the ones that also slow down the adoption of best practices in information visualization.

Recognize the problem

The major reason is, of course, to recognize and accept that there is a problem. If you are happy with the way you do things today, why would you change it? After learning how a specific task should be performed, just close that box in your life and move on, no need to reopen it, thank you. You are a two-finger typist and a pie chart designer, and that’s more than enough.

How do you deal with this? Too often we are irrational creatures, so the benefits must be so obvious that not taking action would be almost embarrassing (remember that inertia is one of the most powerful forces in the known universe). Unfortunately, you can promise to double the typing speed, but can’t offer a similar quantitative measure to sell the benefits of better charts.

Imagine that a new company is selling a two-finger typing concept instead of all ten (“you’ll get sexier fingers!”). Too absurd? Well, I bet they would make a good living out of it. That’s exactly what Microsoft, Dundas or Business Objects do in information visualization. And unlike typing, peer pressure and marketing budgets actually work against you.

Break Old Habits. Start Today

As Stephen Few often emphasizes, the basic rules of information visualization are not difficult. Learn them and start practicing (here are a few). Just like typing.

That’s what I am doing right now. This was the first post I wrote with my eyes closed (well, the draft version, anyway). It was painfully slow. That this can help my writing or my typing remains to be seen, but it seems a good method to avoid distractions and remain focused.

So, how many fingers do you use to create a chart? :)

{ 0 comments }

Jigsaw ChartOne of the reasons why information visualization in business is so poor is because many managers are clueless about it.

They really believe that a simple training course on Office tools is more than enough to create some nice 3D charts for a Powerpoint presentation (and they are right: that’s all they need, because that’s all they know).

Let’s see how well those courses perform, from a business visualization point of view. Do they tell you:

  • How to select the best graph to visually display a data set? No, they don’t. They just tell you how to create some very obnoxious ones;
  • What you need to know to understand a scatterplot or box-and-whiskers graph? No, they don’t. After some pie charts, there will be no time left to discuss scatter plots. Box-and-whiskers? What’s that?
  • That you must know what our working memory is and how it impacts on your slide presentations? No, they don’t. Just put a huge chart in a slide and another chart in the next one – now go back and forth between slides to compare them;
  • That your charts and graphs should answer your readers’ questions? No, they don’t. A graph is just another neat little thing you can do in Excel;
  • How to validate your data? No, they don’t. Let’s assume the data is always correct, shall we?
  • Why you should create dynamic charts whenever possible/relevant? No, they don’t. You didn’t pay for that highly advanced subject; let’s stick to sums and pies, ok?
  • How to priortize information (focus/context, Schneiderman’s mantra)? No, they don’t. Do you have twelve series? Plot them all! Trends? Patterns? What do you mean?

To create insightful graphs that will really help you making sense of the data and communicate efficiently, you don’t have to be an Excel expert, a graphic designer, a psychologist and a statistician. But information visualization is a jigsaw puzzle, and a single piece will never be enough to see the whole picture.

I saw this brilliant quote in Robert Cialdini’s Influence:

- Joe Pine (60′s talk show host who sported a wooden leg) to Frank Zappa: “So, with your long hair, I guess that makes you a woman.”
- Frank Zappa’s response: “So, with your wooden leg, I guess that makes you a table.”

{ 3 comments }

Nathan asks us Can You Improve this Mediocre Statistical Graphic?

Since there are only two series (two parties) with a obvious mirror effect, I would say it doesn’t make sense (from a chart economy point of view) to display both series. And since the 50% mark is relevant in election results, why shouldn’t we just look at the trend of one of those parties around that mark? It would help to tell a more interesting story.

So, this is my radical suggestion, with Bonavista’s sparklines:

“The percentage of counties in California that have a Democrat majority of registered voters in Presidential election years droped sharply in the last four elections and now stays well below the 50% mark ( Microcharts). Loren ipsum….” (I have to work on a better integration of sparklines and the blog template, but you get the idea…)

It would be nicer to have more data points, but this small footprint chart conveys the essencial message. Of course you can follow the standard approaches (a line chart with both parties or a stacked bar chart). As always, it all depends on what you want to say and how you want to say it.

{ 3 comments }

In my latest post at Microcharts blog I discuss the role of information visualization under a recession and the use of “certain” charts…. Take a look!

{ 0 comments }

Wow is Beauty, Eureka is Knowledge.

Wow alone is a dumb blond, Eureka alone is a decrepit old wizard.

Great information visualization is 20% meaningful wow and 80% useful Eureka.

{ 2 comments }

New to information visualization? Let me give you some quick answers to frequently asked questions.

What is a chart?

Just open your eyes and an amazing amount of data is immediately funneled into your brain. This data is processed in real time and makes possible your interaction with the outside world. Shapes and patterns emerge and you’ll be able to tell a tree from a building.

A chart is a device that takes advantage of this power by plotting abstract data in space. Since similar data points will be plotted next to each other, you’ll see some patterns that would be very difficult to uncover just by looking at the data itself. When you are creating a chart your primary concern should be to simplify that pattern discovery. Everything else is wrong. Everything else is marketing.

Why should I use charts?

Because data is expensive and you should get the most out of it. Because charts allow you to process data efficiently. Because you get insights that you wouldn’t get using other methods. Because they save time.

How to I create professional-looking charts?

Don’t use the charts you see in magazines as a model, if that’s what you mean be “professional-looking charts”. Usually they are more form than function, eye-catching but irrelevant. If you are using charts to support a decision making process just make them clean, and let them tell their story.

But my manager loves flying 3D pie slices in PowerPoint…

I am a terrible player at impression management but I know this can be a problem. Play by the rules and you’ll be on the safe side (or not). Play against them and you’ll either be promoted or fired…

If you want to change the corporate culture regarding information visualization use visualization to do it. People usually are not stupid, they just don’t have the right information. Evangelize. Make them compare current practices with the ones you are promoting and let them judge the real benefits. If they are obvious people will see them. Be patient and persistent. People change. Slowly.

What tools should I use?

If you work in a corporate environment probably you can’t avoid Microsoft Excel, or other spreadsheet application (Powerpoint should be available also but you better avoid it). Don’t use defaults, keep away from some stupid options and Excel can really be a good starting point.

If you are more design oriented, you could use Illustrator or a good mixture of design and a programming language like Processing. Keep the programming language, remove the design and add statistical packages and you get SAS or R. Use Spotfire for interactive analysis of large datasets.

Give me a single tip to make better charts today

Make them smaller.

What?!

A smaller chart forces you to remove all the junk you once thought was essential (just like starting to live on a tight budget…). Then add more charts to that empty space and you end up with a more detailed picture of your data. Small charts are beautiful.

What about authors and books?

No book had more influence in the way we think about information visualization than Edward Tufte’s The Visual Display of Quantitative Information (Tufte’s books are a pleasure for the eyes so you can leave them in the living room…). Tufte combines a minimalistic approach with easily digestible concepts (like chartjunk, data-ink ratio, data density) to create a strong framework. Use that framework to validate your design. Discover other authors like Stephen Few, Jacques Bertin, William Cleveland, John Tukey, Stephen Kosslyn or Colin Ware.

What online resources are available?

Edward Tufte maintains a discussion forum. In Stephen Few’s site you’ll also find a discussion forum and some before/after examples of chart design. It is difficult to find great examples of information visualization in the media, but the NYT is a good reference. A list of online resources can be found here.

I’ll answer your questions…

… if I know the answers. Leave them in the comments.

{ 7 comments }

Always ask yourself: “what can I remove from this chart”? Remove Excel defaults, remove grid lines, make the chart smaller, use soft colors, remove irrelevant labels, remove the legend (by directly labeling series), remove series that you don’t really need, remove frames, remove decimal places, remove visual fluff.

Then ask yourself: “what can I add to this chart?” Add meaningful annotations, add other charts, add change, add interaction, add a story, add you.

{ 1 comment }

This is the time for scatter plots in the 10 x 10 charting tips series:

  1. A scatter plot is square by definition (I forget that sometimes…);
  2. In some cases, it makes more sense to use a scatter plot than two column charts: for example, instead of having a column chart to display product market share and another chart to display product growth, consider merging both into a scatter plot (market share on the x axis and growth on the y axis);
  3. If you are plotting several data series, color code them instead of using different markers…
  4. … but consider using several charts;
  5. In scatter plots, use empty circles as markers to let the reader see the overlapping points;
  6. Use a scatter plot matrix to analyze pairwise relationships between series;
  7. Use a scatter plot as an alternative to horizontal bar charts, like in a population pyramid;
  8. If needed, use a scatter plot instead of a line chart if you have an unevenly-spaced time series;
  9. You can use a scatter plot to create a basic map;
  10. An outline can ruin your scatter plot. If possible, remove it and explain it;

As you can see, you can use a scatter plot in Excel to create many other charts. Just use your imagination and share it in the comments.

{ 0 comments }

These are 10 basic tips for column and bar chart design:

  1. A column chart is not a skyline: if you can’t see the individual patterns, consider removing some series or create several smaller charts;
  2. If you are charting categorical data sort the columns; if there is more than one series, allow the user to sort the data herself;
  3. If you are displaying time series, column charts are not interchangeable with line charts: column charts allow you to compare individual data points, while a line chart shows the trend; be sure to select what your audience wants to see;
  4. For target/actual series (like budget/actual) overlap them but make sure they can’t be taken for stacked bars; you can do it by using a different column width for each series or by setting filling to none (usually the target series);
  5. Use horizontal bar charts when x labels are too large to be correctly displayed;
  6. The y axis scale should start at zero; this is particularly important if you are using bar charts; make sure you have a (very) good reason to break this rule;
  7. If you really need to label each column try to minimize its impact; in Excel 2003, select Format Data Labels / Alignment / Label Position: Inside Base;
  8. Don’t use multiple colors for a single data series;
  9. Avoid stacked bar charts;
  10. Use category/subcategory to label the x axis. For example, instead of having Mar-2008, Apr-2008… use Mar, Apr and place 2008 in the second line.

As usual, feel free to add your own tips in the comments.

{ 2 comments }