Hacking the Matrix: What Data Scientists Can Learn from Investigative Journalism

Martin Robinson
4 min readApr 18, 2023

In the world of data science, where vast amounts of information are processed and analyzed to uncover insights and patterns, there is much to learn from the field of investigative journalism. Both data scientists and investigative journalists share a common goal: to unravel complex stories hidden within data. They seek truth, expose hidden truths, and bring to light important information that can impact decision-making and drive change.

Uncovering the Story: Data and Journalism

Data is the backbone of both data science and investigative journalism. Just as journalists collect data from various sources to piece together a story, data scientists collect and analyze data to extract insights and patterns. Both fields require the ability to work with data in different formats, from structured to unstructured, and use tools and techniques to clean, process, and make sense of the data.

For example, in an article published by the UC Berkeley School of Information, a student data scientist, Joon Park, used data journalism techniques to analyze a large dataset of campaign contributions in California. By examining the data, Park was able to uncover patterns of potential political corruption, which led to the exposure of illegal campaign practices and a subsequent investigation by the Federal Election Commission. This demonstrates how data scientists can leverage their skills to uncover stories hidden within data, just like investigative journalists.

Data scientists can learn from investigative journalists in the following ways:

Follow the Data: A Story is Waiting to be Told

Similar to how investigative journalists follow leads and clues to uncover a story, data scientists should let the data guide them. Data can reveal patterns, trends, and outliers that can lead to important insights. It’s crucial to approach data analysis with curiosity and an investigative mindset, constantly asking questions and exploring the data from different angles.

For example, in the Forbes article “Data Science and Investigative Journalism: Two Fields, One Goal, Similar Mindset,” the author highlights how data scientists can learn from the “dogged persistence” of investigative journalists in following the data trail to uncover the truth. Data scientists should adopt a similar mindset of perseverance and tenacity when analyzing data, not being satisfied with surface-level findings, but delving deeper to uncover hidden stories.

Digging for the Truth: Scrutinize and Verify Data

Investigative journalists are known for their rigorous fact-checking and verification processes to ensure the accuracy and reliability of their stories. Similarly, data scientists should apply the same level of scrutiny to their data. This includes checking for data integrity, identifying and addressing data quality issues, and validating results through multiple methods.

For instance, in an article by Yellowfin BI, the author emphasizes the importance of data accuracy and integrity in data journalism. Data scientists should verify the authenticity of the data they are working with, check for any biases or inconsistencies, and ensure that the data is reliable and trustworthy.

Context is Key: Understanding the Story Behind the Data

Just as investigative journalists seek to understand the context and background of a story, data scientists should also strive to understand the story behind the data. Data can often be complex and nuanced, and without proper context, it can lead to misinterpretation and inaccurate conclusions.

For example, in the UC Berkeley article, Joon Park’s analysis of campaign contribution data was not only based on the data itself but also involved understanding the political landscape, campaign finance laws, and the history of political corruption in California. This contextual understanding enabled Park to uncover the full story hidden within the data.

Visualization: Telling the Story with Impact

Just as journalists use compelling storytelling techniques to engage readers, data scientists can leverage visualization to present their findings in an impactful and memorable way. Visualization can help convey complex information in a clear and concise manner, making it easier for stakeholders to understand and act upon.

For example, in the UC Berkeley article, Joon Park used data visualization techniques to present his findings on campaign contributions in California. He created interactive visualizations that allowed users to explore the data and uncover patterns on their own, making the data more accessible and engaging.

Conclusively, data scientists can learn a lot from investigative journalism in terms of mindset, approach, and techniques. Both fields share a common goal of uncovering the truth and presenting information in a clear and impactful way. By following the data, digging for the truth, understanding the context, and using visualization to tell the story, data scientists can enhance their skills and become more effective at extracting insights and driving change. As we continue to navigate the complex world of data, let us not forget the power of investigative journalism and the lessons we can learn from it.

I hope you guys enjoyed this.

Follow for more! See you next week and tell a friend to tell a friend!

--

--

Martin Robinson
Martin Robinson

Written by Martin Robinson

Discover the future of research and analytics

No responses yet