This repo contains materials for the module on the MA in Data Journalism at Birmingham City University
- Module Leader: Paul Bradshaw (paul.bradshaw@bcu.ac.uk) 0121 331 5367
- Slack: use the
#med7373-data-jchannel for questions, sharing ideas, further resources and examples etc. Direct message @paulbradshaw for non-public questions - Room: MP211
Data Journalism aims to facilitate a flexible and adaptable skillset, including the use of ‘computational thinking’, that provides a basis for you to critically adapt to both new and existing data journalism techniques.
The module begins by building applied understanding of data journalism techniques and issues, before exploring more specific practices around design, analysis and coding. These are closely aligned to the core assignment tasks. A range of newsgathering and storytelling techniques are explored, giving you the basis for initiating and developing contemporary data journalism projects within a professional context.
You will also be learning about other aspects of data journalism in sister modules, for example the Storytelling Techniques module will give you a range of skills for telling data-driven stories using video, audio, visual journalism, visualisation, and interactivity. In Research in Practice you will explore research around newsrooms. In Specialist Reporting, Investigations and Coding you will expand your coding and investigative skills, and in Law, ethics, regulation and security you will build your understanding of legal and security issues in the field.
- Identify, gather and communicate stories based on structured information using data journalism techniques and technologies for an identified audience
- Critically evaluate the professional, legal and ethical contexts surrounding data journalism and apply that to a specific project
This module begins with formal classes and becomes more student-driven as it progresses. You will be expected to experiment with techniques ahead of sessions, so that class time is spent more fruitfully in interactive discussion rather than one-way lectures.
You will also be expected to feed your own experiences into each class - and your own problems and questions - rather than coming to the sessions with nothing to contribute or build on. As independent learners the emphasis is on you to drive your learning forward through conversation rather than accept it passively.
By the end of this week you should be able to describe what data journalism is, and what types of stories you can find and tell with data journalism techniques. You should be able to generate ideas for data journalism stories yourself, and identify some sources of data.
- Workshop: Data journalism story workshop;
- Directed study:
- Review examples of data journalism - what sources do they use? Where has the idea come from? What angle? Make notes. (30-60’)
- Create a news diary (e.g. in Google Calendar)
- Look for data sources and brainstorm ideas. Share your story ideas on Slack and give feedback to others. (30-60’)
- Use the techniques from the pre-class videos and/or Data Journalism Heist to find specific story leads in the gender pay gap (or other) data. Have problems/questions to bring to the next class
- Reading:
By the end of this week you should be able to write about numbers. You should also be able to use core spreadsheet techniques to find stories, including sorting and filtering, pivot tables, and be able to calculate change and proportions.
- Pre-class video: The 3 chords of data journalism (or watch the more detailed Data journalism basics: using sorting, filtering and pivot tables to find stories)
- Workshop: Data journalism basics: sorting, filtering, pivot tables, calculations
- Directed study:
- Use the techniques from this week to write a 2-4 paragraph news story based on a dataset
- Reading:
- Finding Stories in Spreadsheets chapters 4 and 6. (BCU students please ask for your voucher for a free copy)
Additional resources: The New York Times have made their internal data journalism training materials available here and written about their training here.
By the end of this week you should be able to choose the right chart to illustrate a given story, and to edit that chart to improve clarity and accuracy. You should also be able to explain and address critical issues such as accessibility and diversity.
- Pre-class video: Data storytelling: choosing the right chart
- Workshop: telling a number story with Datawrapper
- Task: Create charts for the stories you have written so far
- Task: Create charts for a charticle
- Reading: Alan Smith: How Charts Work ch3+
- Optional additional reading:
By the end of this week you will be able to take a story from pitch to publication, understanding the structure of both a basic 'news in brief' news report and an exploratory feature.
- Pre-class video: Research for journalists playlist
- Hour 1: Drafting the data story
- Directed study hour: Write another story
- Hour 2: Drafting a data feature
- Reading 1: How to: write a news story based on new data
- Reading 2: How to write an 'in numbers' feature
Optional additional reading:
- Borges-Rey (2016) Unravelling Data Journalism: A study of data journalism practice in British newsrooms
- Figl (2017) BIGGER IS NOT ALWAYS BETTER: WHAT WE CAN LEARN ABOUT DATA JOURNALISM FROM SMALL NEWSROOMS
- Usher (2016) Inside the Interactive Journalism Newsroom, from Interactive Journalism: Hackers, Data, and Code
- Young and Hermida (2014) From Mr. and Mrs. Outlier To Central Tendencies
- Zanchellia and Crucianelli (2011) Integrating Data Journalism into Newsrooms
By the end of this week you should be able to use a range of spreadsheet functions - but more importantly, use computational thinking to break down editorial challenges into problems that can be tackled systematically, quickly and effectively, with the potential for automation or semi-automation as algorithms.
- Pre-class video 1: An introduction to functions and formulae for data journalism
- Pre-class video 2: A data journalist's guide to computational thinking
- Workshop: Three functions that will save your life
- Workshop: Getting started with notebooks
- Bonus computational thinking challenge
- Video: Create a roadmap to learn coding
Additional resources: OpenLearn: Computational thinking and automation
By the end of this week you should be able to identify scenarios where the use of programming languages such as Python and R is justified, and to explain critical considerations such as transparency and replicability. You should be able to import and export data using Python and the pandas library. You should also be able to explain how lists and dictionaries store data in Python.
-
Pre-class video: why code?
-
Lecture: Notebooks and literate programming
-
Workshop: Running notebooks, creating variables, and using built-in AI
-
Reading/watching: Winny de Jong’s Python for Journalists OR Ben Welsh’s Python MOOC
-
Lecture: Data objects in Python
-
Workshop: Importing, analysing, and exporting in Python
-
Watching: Playlist of videos on APIs
By the end of this week you should be able to source data from APIs using coding. You should also be able to identify techniques for working with large datasets, and issues surrounding big data, linked data, and open data.
- Workshop: Using APIs in Python
- Workshop: Fork this!: Querying large datasets using SQL
- Task: Development of data journalism project - reflection on data-driven vs hypothesis-driven methods
- Reading:
- Vallance-Jones & McKie (2017) ch5: Working with databases
- Hammond (2015): From computer-assisted to data-driven: Journalism and Big Data
- Data.world's Gitbook on SQL is a useful resource and the platform is a good place to practise SQL queries on your own data.
By the end of this week you should be able to identify common data problems, and use techniques to solve those.
- Workshop: identifying dirty data, and telling stories with accuracy. Choose
- A chapter from Finding Stories With Spreadsheets OR an Open Refine tutorial in the Cleaning repo tutorials list
- One of the tutorials on cleaning in R
- Task: Use Heather Krause’s data biography approach to evaluate a data source(s) you plan to use for a story
- Reading: Stray: The Curious Journalist's Guide to Data
By the end of this week you should be able to create a range of map types (point, shape, heat) and talk about the ethical issues surrounding mapping. You should also be able to use SQL to query data.
- Workshop: fork this! Creating shape and point maps using Datawrapper; creating hex maps using R or JavaScript
- Reading: When to use maps in data visualisation: a great big guide - part 1 and part 2 on shape maps
- Reading: Thinking about maps
- Task: Write a MAPPED! story
- Optional: any chapter(s) from Mapping, Society and Technology
By the end of this week you should be able to create a basic HTML page with CSS styles, and explain the basics of design for mobile devices.
Before the class: read Learning HTML and CSS by making tweetable quotes on Leanpub or in the repo
- Workshop: HTML, CSS and responsive frameworks. Fork this! Intro to responsive web design with Bootstrap
- Task: Create a responsive webpage on GitHub Pages
- Reading: Jeremy Keith: Resilient Web Design
- Reading: Manuel Matuzovic: Writing HTML with accessibility in mind; Writing CSS with Accessibility in Mind
Additional resources:
- YouTube playlist: the UIkit frontend framework (for designing responsive pages, longform and scrollytelling)
This week we review your progress so far, and look ahead to the assignment. You will also find material in this section on future developments such as AI, machine learning and bots.
- Workshops:
- Fork this! Command line and regex:
- Exercise: combining stop and search data CSV files using command line
- Using grep and regex in R
- Task: Development of data journalism portfolio
- Reading - choose from:
You can find more tips on using command line in another repo here
In the final weeks of the semester you will work on your portfolio for assessment. You will find a range of readings on Moodle that you should use to inform your decisions, and that you can draw on in your reflection. Those include:
- Borges-Rey (2016) Unravelling Data Journalism: A study of data journalism practice in British newsrooms
- Figl (2017) BIGGER IS NOT ALWAYS BETTER: WHAT WE CAN LEARN ABOUT DATA JOURNALISM FROM SMALL NEWSROOMS
- Usher (2016) Inside the Interactive Journalism Newsroom, from Interactive Journalism: Hackers, Data, and Code
- Young and Hermida (2014) From Mr. and Mrs. Outlier To Central Tendencies
- Zanchellia and Crucianelli (2011) Integrating Data Journalism into Newsrooms