Skip to content

SHA Mismatch: Wikimedia Provides SHA1, But Databus Uses SHA256 #3

@tech0priyanshu

Description

@tech0priyanshu

Body

I'm currently working on integrating Wikimedia dump files into the Databus and ran into an issue related to checksum verification.

Wikimedia provides only SHA1 hashes for its dumps while the Databus expects and stores SHA256 hashes (sha256sum).

Since SHA256 is not provided by Wikimedia, I am currently forced to hardcode the SHA256 values manually during the publish process to get around this. This is obviously not ideal and could cause integrity issues or maintenance overhead in the future.

Future Solution

Allow alternate hash formats during the Databus publishing flow?

Add a script to automatically calculate the SHA256

Or provide guidance on how best to handle this mismatch?

cc:- @kurzum, @ronitblenz, @m1ci

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions