The Dangers of Deceptive Data–Confusing Charts and Misleading Headlines

“You don’t must be an skilled to deceive somebody, although you would possibly want some experience to reliably acknowledge when you find yourself being deceived.”

When my co-instructor and I begin our quarterly lesson on misleading visualizations for the info visualization course we train on the College of Washington, he emphasizes the purpose above to our college students. With the arrival of contemporary expertise, creating fairly and convincing claims about knowledge is less complicated than ever. Anybody could make one thing that appears satisfactory, however incorporates oversights that render it inaccurate and even dangerous. Moreover, there are additionally malicious actors who actively need to deceive you, and who’ve studied a number of the greatest methods to do it.

I usually begin this lecture with a little bit of a quip, wanting critically at my college students and asking two questions:

“Is it a very good factor if somebody is gaslighting you?”
After the final murmur of confusion adopted by settlement that gaslighting is certainly unhealthy, I ask the second query: “What’s the easiest way to make sure nobody ever gaslights you?”

The scholars usually ponder that second query for a bit longer, earlier than chuckling a bit and realizing the reply: It’s to find out how folks gaslight within the first place. Not so you may reap the benefits of others, however so you may stop others from benefiting from you.

The identical applies within the realm of misinformation and disinformation. Individuals who need to mislead with knowledge are empowered with a bunch of instruments, from high-speed web to social media to, most not too long ago, generative AI and enormous language fashions. To guard your self from being misled, you could be taught their tips.

On this article, I’ve taken the important thing concepts from my knowledge visualization course’s unit on deception–drawn from Alberto Cairo’s glorious e book How Charts Lie–and broadened them into some normal ideas about deception and knowledge. My hope is that you simply learn it, internalize it, and take it with you to arm your self in opposition to the onslaught of lies perpetuated by ill-intentioned folks powered with knowledge.

People Can not Interpret Space

No less than, not in addition to we interpret different visible cues. Let’s illustrate this with an instance. Say now we have an very simple numerical knowledge set; it’s one dimensional and consists of simply two values: 50 and 100. One approach to characterize this visually is by way of the size of bars, as follows:

That is true to the underlying knowledge. Size is a one-dimensional amount, and now we have doubled it with a purpose to point out a doubling of worth. However what occurs if we need to characterize the identical knowledge with circles? Properly, circles aren’t actually outlined by a size or width. One choice is to double the radius:

Hmm. The primary circle has a radius of 100 pixels, and the second has a radius of fifty pixels–so that is technically appropriate if we wished to double the radius. Nevertheless, due to the best way that space is calculated (πr²), we’ve far more than doubled the realm. So what if we tried simply doing that, because it appears extra visually correct? Here’s a revised model:

Now now we have a unique drawback. The bigger circle is mathematically twice the realm of the smaller one, but it surely not seems that manner. In different phrases, despite the fact that it’s a visually correct comparability of a doubled amount, human eyes have issue perceiving it.

The difficulty right here is attempting to make use of space as a visible marker within the first place. It’s not essentially incorrect, however it’s complicated. We’re rising a one-dimensional worth, however space is a two-dimensional amount. To the human eye, it’s at all times going to be troublesome to interpret precisely, particularly compared with a extra pure visible illustration like bars.

Now, this will seem to be it’s not an enormous deal–however let’s check out what occurs once you prolong this to an precise knowledge set. Beneath, I’ve pasted two photos of charts I made in Altair (a Python-based visualization package deal). Every chart reveals the utmost temperature (in Celsius) throughout the first week of 2012 in Seattle, USA. The primary one makes use of bar lengths to make the comparability, and the second makes use of circle areas.

Which one makes it simpler to see the variations? The legend helps in the second, but when we’re being sincere, it’s a misplaced trigger. It’s a lot simpler to make exact comparisons with the bars, even in a setting the place now we have such restricted knowledge.

Keep in mind that the purpose of a visualization is to make clear knowledge–to make hidden traits simpler to see for the typical particular person. To attain this purpose, it’s greatest to make use of visible cues that simplify the method of constructing that distinction.

Beware Political Headlines (In Any Route)

There’s a small trick query I typically ask my college students on a homework project across the fourth week of sophistication. The project largely includes producing visualizations in Python–however for the final query, I give them a chart I personally generated accompanied by a single query:

Query: There’s one factor egregiously incorrect with the chart above, an unforgivable error in Data Visualization. What’s it?

Most suppose it has one thing to do with the axes, marks, or another visible facet, usually suggesting enhancements like filling within the circles or making the axis labels extra informative. These are fantastic ideas, however not probably the most urgent.

Probably the most flawed trait (or lack thereof, slightly) within the chart above is the lacking title. A title is essential to an efficient knowledge visualization. With out it, how are we imagined to know what this visualization is even about? As of now, we are able to solely verify that it should vaguely have one thing to do with carbon dioxide ranges throughout a span of years. That isn’t a lot.

Many of us, feeling this requirement is simply too stringent, argue {that a} visualization is commonly meant to be understood in context, as half of a bigger article or press launch or different accompanying piece of textual content. Sadly, this line of considering is much too idealistic; in actuality, a visualization should stand alone, as a result of it would usually be the one factor folks have a look at–and in social media blow-up circumstances, the one factor that will get shared extensively. Consequently, it ought to have a title to elucidate itself.

After all, the title of this very subsection tells you to be cautious of such headlines. That’s true. Whereas they’re essential, they’re a double-edged sword. Since visualization designers know viewers will take note of the title, ill-meaning ones may use it to sway folks in less-than-accurate instructions. Let’s have a look at an instance:

Source link

Mastering Hadoop, Part 3: Hadoop Ecosystem: Get the most out of your cluster

The Impact of GenAI and Its Implications for Data Scientists

Nine Pico PIO Wats with Rust (Part 2)

Black Women Are Using Side Hustles to Mitigate the Pay Gap. Is It Helping or Hurting Them?

Google Edits Super Bowl Ad After AI Fact Error

What 2024 Taught Us About ESG Engagement

GPU Programming for beginners. Understanding GPU Programming for… | by Mehul Gupta | Data Science in your pocket | Mar, 2025

Why it’s so hard to use AI to diagnose cancer

Most Popular

What is ANOVA? Types of ANOVA and Their Applications | by Meriç Özcan | Feb, 2025

Branching Out: 4 Git Workflows for Collaborating on ML

The Role of Data Deduplication in Cloud Storage Optimization

Our Picks

UnitedHealthcare Offers Buyouts to Benefits Unit Employees

Announcing the Towards Data Science Author Payment Program

A Deep Dive Into Hospital Readmission Reduction | by Yudeshsubas | Mar, 2025

The Dangers of Deceptive Data–Confusing Charts and Misleading Headlines

People Can not Interpret Space

Beware Political Headlines (In Any Route)

Don’t Use 3D. Please.

Last Ideas

Related Posts