-
Notifications
You must be signed in to change notification settings - Fork 1
Open
Labels
enhancementNew feature or requestNew feature or request
Description
At the moment, one needs to have twice the storage page for all requests and responses (one for WARC, one for the WACZ). As this is not always known beforehand, and could potentially be larger from one crawl to another, it would be helpful to make it work also when not enough storage space is available.
Is it possible to store the WACZ incrementally while the spider is running? How could this be done?
The desire is to do this directly on object storage. But skipping the saving of WARCs first would also be some improvement.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request