Skip to content

Fix records with broken sources #3195

@baltpeter

Description

@baltpeter

Inspired by @mal-tee, I wrote a little script to find records with sources that don't exist anymore. In the interest of keeping our records up to date, we should update those.

I'll start working on that myself, but since more than 500 records are affected, I would definitely appreciate some help. :D If anyone is interested in helping, it would be good to announce that here so we don't do duplicate work. I'll be starting from the top, you may then want to pick (and announce) another starting position.

What is there to do?

If a source doesn't exist anymore, it is also possible that the information we have extracted from it has changed, so we should validate all the information in the affected records.

In addition, we'll of course want to replace any broken sources with new ones. Since the purpose of our database is not to collect historical information but only the current contact details, broken sources should (almost) never be replaced with archived links. Instead, if a page hasn't just moved to a different URL, we'll need to find new sources that fit our criteria. If we cannot find a source for some piece of information anymore, it has to be removed from the record.

If anything is unclear, feel free to discuss here.

List of records with broken sources

Also available as NDJSON if anyone needs it: offline-sources.ndjson

Metadata

Metadata

Assignees

No one assigned

    Labels

    keepaliverecordIssue related to the JSON records

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions