Use pywb to create single wacz files #980

@zhabiba24

Really cool to have this pywb tool for creating archives programmatically...

Currently I'm trying to use pywb to create a single wacz file that corresponds to a given URL. However, right now it seems to create warc.gz files that correspond to an entire archiving session.

My ideal flow:

  1. Make a GET request to a local running pywb server
  2. Receive a single wacz file that corresponds to that URL

So far I have got this working:

uv run wb-manager init test-cli-archive
uv run wayback --record --live -a --auto-interval 10
curl http://localhost:8080/test-cli-archive/record/<MY URL>

This then gives me this file:

collections/test-cli-archive/archive/rec-20260123155603209431-<machine id>.warc.gz
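
The request and file lookup above could be wrapped in two small shell helpers. This is only a minimal sketch: the function names `record_url` and `newest_warc` are hypothetical (not part of pywb), and it assumes the server from the commands above is already running on localhost:8080 and that recorded files land in `collections/<collection>/archive/`, as in the path shown.

```shell
#!/usr/bin/env bash
# Hypothetical helpers for illustration only; they are not part of pywb.

# Build the record-mode URL for a base server URL, a collection name,
# and the target URL to archive.
record_url() {
  local base="$1" coll="$2" target="$3"
  # Strip one trailing slash from the base so the result has no "//".
  printf '%s/%s/record/%s\n' "${base%/}" "$coll" "$target"
}

# Print the most recently modified .warc.gz file in a collection
# directory (assumed layout: <collection dir>/archive/*.warc.gz).
# Prints nothing if no such file exists.
newest_warc() {
  local coll_dir="$1"
  ls -t "$coll_dir"/archive/*.warc.gz 2>/dev/null | head -n 1
}

# Example usage (commented out because it needs the running server):
#   curl -s -o /dev/null "$(record_url http://localhost:8080 test-cli-archive "<MY URL>")"
#   newest_warc collections/test-cli-archive
```

Note that this only locates the session's warc.gz file; it does not split out a single URL or produce a wacz.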

But this is a warc.gz file of everything in this archiving session, when in reality I just want a single wacz that corresponds to the URL I just archived.

Is there a way to achieve this workflow?
