Welcome back as you move along in this series of courses regarding data analytics in the public sector, we hope that you feel that you're developing new skills and data analysis and visualization and also you're deepening your understanding of their important role in the key functions and pillars of the public sector. As we keep saying, data are extremely important and powerful and are used in so many important ways. This also means however, the data can be misused in powerful ways and the unfortunate reality is that there are a lot of pressures in the public sector to use and present data in ways that tell stories and convey information in misleading ways, sometimes in a minor subtle way, but also sometimes in dramatic ways that are intentional. This could be labeled as half truths puffery. Sometimes though it can be labeled as lies, misinformation and even propaganda. We will talk more about this in course four, but for now I would like to show you just a few data visualizations that we can say are misleading. Now we cannot tell from looking at them if they were intentionally designed to mislead or not, But we can see the problems in the outcome. First we have an infamous graph that is showing changes in the number of murders that involved firearms or guns in the state of Florida over time including both before and after the year 2005 when a new law was enacted in the state called the stand your ground law. This law allows florida citizens to quote, stand their ground meaning they could fight back in some kind of altercation, including with the use of a firearm with immunity from prosecution if they feel threatened with death or great bodily harm. Now, there was concern among some groups that this policy was actually going to increase injury and death from firearms rather than deter it. If you quickly glance at this graph, it actually looks like the number of gun deaths in florida decreased after 2005 when the new law went to into effect. The line goes down at that time after increasing after 1990 then leveling off. However, if you look at the Y access, you see that the numbers on the y axis go down. That's weird, right? The access starts at the bottom at 1000 and the values get smaller as you go up the access. So the interpretation of this graph is actually the opposite of what you see. Upon first glance, the number of gun deaths in Florida did increase after the stand your ground law went into effect. Now, we're the makers of this graph trying to confuse or mislead us. It's not clear. One story is that they were not trying to mislead if you notice their choice of red for the background color above the trend line in the graph was perhaps intended to show that more blood was spilling after the policy change than before. Take a look, what do you think? So one point to make is that if you're trying to do something creative and that's not typical or standard in a visualization, you best workshop and test drafts with your intended audience first. Whatever the intent of the designers of this graph, the public outcry was loud and most labeled it as intentionally misleading and as a misuse of data visualization in support Of a questionable policy. This next graph was also judged harshly by the public. The one on the left is showing the number of COVID-19 tests per one million people in the population early in the pandemic in a handful of countries. And the graph shows Argentina somewhat behind some other countries at a single point in time and that you should know. This graph was made by the Argentinian government. If you look closely the height of each bar on the graph doesn't have much to do with the actual number of tests. There's actually no why access with numbers on this graph. So while the graph is showing some level of difference and the testing rates across countries, it's not showing the app absolute difference in the testing rates when this is done. As you see in the graph on the right, you notice that Argentina was testing at a rate that was much much lower than the comparison countries at that point in time. So this comes under the category of a visual visualization that is trying to downplay the extent of a problem. This data visualization is saying yes, Argentina is lagging behind other countries but not by that much When a reality the problem was greater. This next graph is showing the growth in different types of data science jobs between 2010 and 2020. Now at first glance, the graph seems to be suggesting that the number of data science jobs and a number of different subcategories leveled off or decrease towards the end of the time series, which again is in 22. However, there are also a number of problems with this graph. First this pesky Y access. It often has a lot of problems and in this case it's not labeled. So what is this graph even showing? It turns out that what is being plotted here is the percent job growth rate, not the actual number of jobs or job interviews or other indicators that are used in this kind of workforce analys. So it is the rate of job growth in some types of data analysis and data science jobs that has slowed, not the actual number of jobs. Also, this graph doesn't tell us where the data come from our how the main indicator of job growth was measured. So unfortunately it's really hard to interpret the graph and come away with any conclusions. We do know from other Information that the data for this graph was actually from some kind of Internet survey. But this graph is picked up by the media and shared widely with the story that COVID had decreased jobs in 2020 and data science and data analytics. Now you'll be glad to know that better data suggests that while growth rates had indeed slowed a bit in this sector there is still a growing field. This is still a growing field worldwide, both in the public and private sectors. Finally, here's something that recently happened in the United States in the summer of 2022. Now, Nikki Haley is a former governor of a U. S. State and the former U. S. Ambassador to the United Nations. She tweeted this graphic of a receipt from quote joe biden's inconvenience store right before the fourth of july. This receipt shows the cost and the percent inflation over the past year of a number of products that americans tend to have at their fourth of july celebration picnics. The price of hot dogs. According to this receipt increased by 15.6% since just last summer, soda increased by 13.2% ice cream by 9.6% etcetera. And the receipt suggests that for the six i items on the receipt, the total inflation was 67.25% implying that the fourth of july celebration this year costs over 67% more than it did the year before due to inflation and as the tweet says, thank you, President biden. Well of course there's a very basic math error here instead of taking the average amount of inflation for the six products listed on the receipt, the inflation percentages were added up. This of course, is such a basic math mistake that it looks intentional. How could an educated us politician make such a basic and misleading error? The instant this went live on social media, there was severe backlash calling the graphic intentionally misleading, unethical and more when pressed publicly, Nikki Haley's pR team threw her staff under the bus. Their statement was quote, this was a staff error that should not have been published. We realize the calculation error and immediately remove the graphic. So one final lesson here is that if you are ever pressured to create a misleading graphic, first of all, try not to do it and we'll talk more about how to push back and not produce work that you find misleading or unethical and course for. But second, if you do end up producing mislead a data visualizations at someone else's order or authority, just prepare yourself for having to take the blame at the end of the day, If there's public scrutiny or accusations of misrepresentation