Add Incremental Archive Updates #634
base: master
Conversation
Introduces seekable writer abstraction for random-access file operations. Uint8ArraySeekableWriter provides in-memory implementation for testing. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- Add initialized check to writeUint8Array
- Cache writable stream instead of creating a new one on each access
- Validate negative offset in readAt

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
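As a rough illustration of the abstraction these commits describe, an in-memory seekable writer might look like the sketch below. The method names (`writeUint8Array`, `readAt`, `truncate`) follow the commit messages, but the shape is assumed; the real zip.js implementation is stream-based and differs in its signatures.

```javascript
// Minimal, hypothetical sketch of an in-memory seekable writer.
class Uint8ArraySeekableWriter {
	constructor() {
		this.data = new Uint8Array(0);
	}

	// Write bytes at an explicit offset, or append at the current end.
	writeUint8Array(array, offset = this.data.length) {
		const end = offset + array.length;
		if (end > this.data.length) {
			// Grow the backing buffer to fit the write.
			const grown = new Uint8Array(end);
			grown.set(this.data);
			this.data = grown;
		}
		this.data.set(array, offset);
	}

	// Random-access read; rejects negative offsets as the review fix above does.
	readAt(offset, length) {
		if (offset < 0) {
			throw new Error("Invalid offset");
		}
		// Clamp length so offset + length never runs past the end.
		return this.data.subarray(offset, Math.min(offset + length, this.data.length));
	}

	truncate(size) {
		this.data = this.data.subarray(0, size);
	}
}
```

The negative-offset check and the clamped read mirror the review fixes mentioned in the commits above.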
Comprehensive tests covering compression, encryption, dates, attributes, ZIP64, duplicate names, remove, data descriptors, and extended timestamps. Establishes safety net before extracting shared logic for openExisting(). Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Notes code duplication that should be extracted into shared helper. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Allows updating entry metadata (dates, attributes) without rewriting the compressed data. Comment updates only work if new length <= old. The implementation handles duplicate extended timestamp extra fields by removing the old timestamp from rawExtraField when updating. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
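For illustration, stripping a duplicate extra field from `rawExtraField` before appending the updated one could look like the sketch below. The extra field layout (2-byte ID, 2-byte size, then data, all little-endian) and the `0x5455` extended timestamp ID come from the ZIP format; the helper name and exact handling in this PR are assumptions, not zip.js API.

```javascript
// "UT" extended timestamp extra field ID from the ZIP appnote.
const EXTENDED_TIMESTAMP_ID = 0x5455;

// Return a copy of rawExtraField with every record matching fieldId removed.
function removeExtraField(rawExtraField, fieldId) {
	const survivors = [];
	const view = new DataView(rawExtraField.buffer, rawExtraField.byteOffset, rawExtraField.byteLength);
	let offset = 0;
	while (offset + 4 <= rawExtraField.length) {
		const id = view.getUint16(offset, true);        // field ID, little-endian
		const size = view.getUint16(offset + 2, true);  // data size, little-endian
		const fieldEnd = offset + 4 + size;
		if (id !== fieldId) {
			survivors.push(rawExtraField.subarray(offset, fieldEnd));
		}
		offset = fieldEnd;
	}
	// Concatenate the surviving records into one array.
	const total = survivors.reduce((sum, part) => sum + part.length, 0);
	const output = new Uint8Array(total);
	let position = 0;
	for (const part of survivors) {
		output.set(part, position);
		position += part.length;
	}
	return output;
}
```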
Moves valid entries to fill gaps left by removed entries, then truncates the file. Supports progress callback and abort signal. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Allows checking how much dead space would be reclaimed without actually modifying the archive. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
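The dry-run calculation described above amounts to summing the gaps between live entries and before the central directory; a minimal sketch, with the entry shape (`offset`, `totalSize`) assumed for illustration:

```javascript
// Sum the dead space between live entries and before the central directory.
function reclaimableBytes(entries, centralDirectoryOffset) {
	const sorted = [...entries].sort((a, b) => a.offset - b.offset);
	let expectedOffset = 0;
	let gapTotal = 0;
	for (const entry of sorted) {
		gapTotal += entry.offset - expectedOffset;  // dead space before this entry
		expectedOffset = entry.offset + entry.totalSize;
	}
	// Gap between the last entry and the central directory, if any.
	gapTotal += centralDirectoryOffset - expectedOffset;
	return gapTotal;
}
```

compact() would then move each entry back by the accumulated gap before it and truncate the file by this total.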
…ypes

getEntryTotalSize() now sums all extra field components (ZIP64, AES, extended timestamp, NTFS, Unix) instead of just rawExtraField. This prevents size miscalculation during compact() for entries with complex extra fields.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
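Conceptually the fix sums every extra field component rather than `rawExtraField` alone; a hedged sketch, with illustrative property names that may not match zip.js's internals:

```javascript
// Sum the lengths of all extra field components on an entry.
// Property names here are illustrative, not necessarily zip.js's.
function getEntryExtraFieldLength(entry) {
	const parts = [
		entry.rawExtraFieldZip64,
		entry.rawExtraFieldAES,
		entry.rawExtraFieldExtendedTimestamp,
		entry.rawExtraFieldNTFS,
		entry.rawExtraFieldUnix,
		entry.rawExtraField
	];
	return parts.reduce((sum, part) => sum + (part ? part.length : 0), 0);
}
```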
Adds types for SeekableWriter, Uint8ArraySeekableWriter, openExisting(), updateEntry(), and compact(). Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Tests ZIP64, encrypted archives, empty archives, compact edge cases, concurrent updates, and large archive handling. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Adds usage examples and API documentation for openExisting(), updateEntry(), compact(), and SeekableWriter. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Enables in-place ZIP modification on real files using Node.js fs.promises file handles. Also fixes SeekableWriter implementations to support creating a new writable stream after the previous one is closed. This is required for the incremental update API (openExisting) to work correctly when adding multiple files. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
…Access API

Enables in-place ZIP modification in Chrome/Edge using showSaveFilePicker(). Provides seek, truncate, and readAt support via FileSystemWritableFileStream.

- New FileSystemAccessSeekableWriter class in lib/core/io.js
- Export from lib/zip-core-base.js
- TypeScript definitions with usage examples
- Documentation with browser-specific examples

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
- Add boundary validation in readAt() to clamp length when offset+length > size
- Auto-detect file size in init() when fileHandle is provided
- Remove inline comments for consistency with codebase style
- Update TypeScript definitions to document auto-detection behavior
- Simplify documentation example for opening existing files

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
…prependZip

When importing entries from existing archives via openExisting() or prependZip(), the CRC32 (signature) field was not being written to the header array. This caused the central directory to have zeroed CRC32 values for existing entries, corrupting the archive.

Root cause: getHeaderArrayData() creates the header but doesn't set CRC32. For new entries, setEntryInfo() writes the signature later. For existing entries, this never happened.

Fix: After getHeaderArrayData(), write entry.signature to headerView at HEADER_OFFSET_SIGNATURE for both prependZip() and openExisting().

Fixes #1

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
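A minimal sketch of the fix described above: write the entry's CRC-32 into the already-built header bytes. ZIP stores the CRC-32 little-endian; the offset constant below is an assumption standing in for zip.js's internal `HEADER_OFFSET_SIGNATURE`, not the library's actual value.

```javascript
// Assumed CRC-32 offset within the header array; the real constant
// lives in zip.js internals and may differ.
const HEADER_OFFSET_SIGNATURE = 10;

// Patch the entry's CRC-32 signature into the header bytes in place.
function writeEntrySignature(headerArray, signature) {
	const headerView = new DataView(headerArray.buffer, headerArray.byteOffset, headerArray.byteLength);
	headerView.setUint32(HEADER_OFFSET_SIGNATURE, signature, true); // little-endian per the ZIP format
}
```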
When calculating the append offset in openExisting(), the code was using the central directory's extra field length. However, the local header can have a different extra field length (common with archives from external tools like Info-ZIP). This caused openExisting() to calculate the wrong append offset, resulting in new entries overlapping existing data and corrupting the archive.

Fix: Read the local header extra field length directly from offset 28 of each entry's local header, rather than using the central directory's extra field length.

Fixes #2

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
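Offset 28 is fixed by the ZIP local file header layout: the 30-byte fixed header carries the filename length at offset 26 and the extra field length at offset 28, both little-endian. A sketch of computing the data offset from the local header itself (the function name is illustrative):

```javascript
// Compute where an entry's compressed data starts, trusting the local
// header's own extra field length rather than the central directory's.
function getLocalHeaderDataOffset(localHeaderBytes, entryOffset) {
	const view = new DataView(localHeaderBytes.buffer, localHeaderBytes.byteOffset, localHeaderBytes.byteLength);
	const filenameLength = view.getUint16(26, true);
	const extraFieldLength = view.getUint16(28, true); // may differ from the central directory value
	// Data follows the 30-byte fixed header, filename, and extra field.
	return entryOffset + 30 + filenameLength + extraFieldLength;
}
```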
Node.js fs operations require BigInt for file positions exceeding 2^31-1 (2147483647 bytes, ~2GB). Without this conversion, operations on large files crash with assertion errors.

Changes:

- Add toBigIntIfNeeded() helper to convert large offsets to BigInt
- Apply to readAt(), writeUint8Array(), and truncate() methods
- Handle BigInt stats.size in init() for very large existing files
- Add test for BigInt conversion logic and normal operations

Fixes #3

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Force-pushed from dd45a15 to 3dec97a
Testing revealed that Node.js v25+ handles Number positions correctly for >2GB file offsets. The BigInt conversion introduced in 3dec97a actually breaks writes on fresh files - the file position is ignored and writes go to position 0. This reverts the toBigIntIfNeeded() function and its usage. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
…y overflow

Previously, openExisting() loaded the entire archive into memory using readAt(0, size), which failed for archives >2GB due to Node.js buffer limits.

Changes:

- Add SeekableWriterReader adapter class that wraps SeekableWriter to provide Reader interface for ZipReader
- openExisting() now reads only metadata structures (EOCD, central directory, local headers) instead of entire archive
- FileHandleWriter uses options object form for read/write operations
- Remove auto-update of _size in writeUint8Array to avoid double-counting when pipeTo is used with bufferedWrite

Tested with 4.5GB archive - now works correctly.

Fixes #3

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
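The adapter described above just forwards random-access reads so a ZipReader can consume a writer; roughly like the sketch below, with method names taken from the commit message and the actual zip.js Reader contract assumed.

```javascript
// Hypothetical adapter: expose a SeekableWriter's readAt() through
// a Reader-style interface (init/readUint8Array/size).
class SeekableWriterReader {
	constructor(seekableWriter) {
		this.writer = seekableWriter;
		this.size = seekableWriter.size;
	}

	async init() {
		// The writer is assumed to already know its current size.
		this.size = this.writer.size;
	}

	async readUint8Array(offset, length) {
		// Delegate random-access reads to the underlying writer.
		return this.writer.readAt(offset, length);
	}
}
```

This lets openExisting() read just the EOCD, central directory, and local headers through the reader instead of buffering the whole archive.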
Well, functionally, this works... Or at least creates archives that work perfectly with zip/unzip on Linux, even with gaps in them. I'll post another update once I get this integrated with my app. I'll be using this to use zip as a file container for a video editing application I'm building. I wanted to be able to edit all the small assets without it needing to completely rewrite potentially GBs of video on every save, so this was my solution. Another cool application for a feature like this might be a backup app with incremental backups. |
Thanks for the PR. I've skimmed through it and can confirm that the code was generated using AI and deserves to be studied closely. At first glance, here are a few more specific comments. The […]

Finally, I have already thought about this kind of feature and imagined implementing a class of type `ZipEditor`. I need to think about this slowly. For now, I'll keep your PR open so I can remember it, and think about this (probably slowly).

PS: Feel free to use my answer as a prompt to see if Claude agrees 😜
Here are the details on the TODO from the report:

## Technical Debt

### TD-1: Code Duplication in openExisting/prependZip

Location: […]

Issue: ~66 lines of entry import logic duplicated between `openExisting()` and `prependZip()`.

Resolution: Extract shared helper function.

Priority: Low - working correctly, refactor when convenient.

TODO added: commit […]
Here's Claude's response (I just got it to read the comment thread):

# Code Review Updates Plan

PR: #634 (gildas-lormeau/zip.js)

## Summary of Feedback

Gildas-lormeau provided the following feedback: […]
## Proposed Changes

### 1. Rename SeekableWriter → RandomAccessStream

Rationale: The current name […]

New name: `RandomAccessStream`

Files to modify: […]
### 2. Extract Shared Entry Import Logic

Issue: ~66 lines duplicated between `prependZip()` and `openExisting()`.

Solution: Create helper function:

```js
/**
 * Import entries from a ZipReader into this ZipWriter's internal state.
 * Used by both prependZip() and openExisting().
 *
 * @param {ZipWriter} zipWriter - Target ZipWriter instance
 * @param {Entry[]} entries - Entries from ZipReader
 * @param {number} baseOffset - Offset for zip64 calculations
 */
function importEntriesFromReader(zipWriter, entries, baseOffset) {
	zipWriter.filenames = new Set(entries.map(entry => entry.filename));
	zipWriter.files = new Map(entries.map(entry => {
		// ... shared entry import logic
		return [entry.filename, entry];
	}));
}
```

Files to modify: […]
### 3. Move Node.js-Specific Writers to Separate Module

Issue: […]

Solution: Create […]

New file structure: […]

Files to create/modify: […]
### 4. Consider ZipEditor Architecture (Future)

Gildas mentioned he imagined a `ZipEditor`-type class.

Option A: Current Approach (Extend ZipWriter)

```js
// Static factory method on ZipWriter
const zipWriter = await ZipWriter.openExisting(writer);
```

Option B: Separate ZipEditor Class (Gildas's Vision)

```js
// Dedicated class for editing existing archives
class ZipEditor {
	constructor(randomAccessStream) { ... }
	async open() { ... }
	async add(name, reader, options) { ... }
	async remove(entry) { ... }
	async close() { ... }
}
```

Recommendation: Keep current approach for now but structure code to allow easy migration: […]
### 5. Improve Tree Shaking Support

Current issue: All I/O classes bundled together.

Solution: Ensure each writer class can be imported independently:

```js
// Allow selective imports
import { ZipWriter } from "zip.js/core";
import { FileHandleRandomAccessStream } from "zip.js/node";
import { FileSystemAccessRandomAccessStream } from "zip.js/browser";
```

Files to modify: […]
## Implementation Checklist

Phase 1: Code Quality (Non-Breaking) […]

Phase 2: Naming (Minor Breaking) […]

Phase 3: Module Structure (Minor Breaking) […]

Phase 4: Documentation […]
## Backward Compatibility

To minimize breaking changes: […]

```js
// Deprecated alias for backward compatibility
export { FileHandleRandomAccessStream as FileHandleWriter };

// SeekableWriter keeps working but warns on construction
// (a plain `as SeekableWriter` re-export would conflict with this declaration)
export class SeekableWriter extends RandomAccessStream {
	constructor(...args) {
		console.warn("SeekableWriter is deprecated, use RandomAccessStream");
		super(...args);
	}
}
```

## Priority

[…]
## Questions for Maintainer

Before implementing major changes, clarify with @gildas-lormeau: […]
Hi, I needed this feature in an app I'm building, and I let Claude loose on this. I haven't gotten a chance to test this out yet (damn session limits 😅), but I wanted to submit this to see if you're interested in it.
This could be total garbage, but I'll add an update sometime over the next few days if it works with my app. I realize this is a huge branch, so if you just close it, I'll understand completely.
Cheers!