Price Statistics Data Catalogue

Welcome to the Price Statistics Data Catalogue project repository!

Find out more about how the catalogue works!

How to register an open dataset to the catalogue?

Raison d'être

Status quo in the price statistics discipline

There are many datasets that researchers in the field price statistics can use to do empirical research. Specifically, they can use

internal (such as to a National Statistical Office) datasets that only they and their colleagues have access to,
proprietary datasets (i.e. available to researchers at a cost) that may be too expensive for other researchers to acquire, or
openly available or public datasets.

If researchers use the first two types of datasets, replicability is a challenge for the discipline as others may not be able to easily get access to the same datasets to validate or replicate the results. Researchers who may wish to try to use the third type of datasets with the aim of making their research reproducible (or even replicable, or robust) may be dissuaded from trying if it is too challenging to find applicable datasets or if the datasets that are available are poorly documented, requiring researchers to document the dataset for others as part of their project. Our observation is that currently in the price statistics discipline, most research is done with internal or proprietary datasets for this reason.

Purpose of the price statistics data catalogue

The aim of this data catalogue is to considerably simplify the process outlined above and make it easy to find and use open datasets in price statistics. Our hope is that with this process considerably simplified, more research will tend towards open datasets, allowing better reproducibility in the discipline. Thus the catalogue lists the main datasets that are availible within the discipline and describes how to download them, as well as outlines how they are structured to provide a common interface for researchers. We accept two types of datasets:

Open datasets that are available to others - includes datasets that are made available by various researchers or organizations for research purposes.
Proprietary but free datasets - includes datasets that are 'owned' by someone else but are summarized here so that they are easily discoverable to researchers.

Note on this version of the catalogue

This data catalogue is a proof of concept. We are aiming to use this version to demonstrate the use and add value to the discipline. If this proves a success, we will look for a more permanent and powerful catalogue.

Contributing a dataset:

To contribute a dataset to this catalogue, please submit either an issue to this repository outlining the dataset and why it should be added to the catalogue, or a pull request with the proposed changes for us to review and approve. See the full contriubting proces flushed out on the price statistics reproducibility project site.

Technical context

We are exploring the use of data contract cli as a way to document and track datasets as the python library comes with a data catalog.

Who are we?

This catalogue is maintained by the reproducibility project team (which is a workstream of the UN Task Team for Scanner data). Read more about us here.

Config

To run this locally, install datacontract-cli and run:

datacontract catalog --files "datasets/*.odcs.yaml" --output _site

The post-processing.py executes afterwards to clean the html files

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
.github		.github
bitex		bitex
datasets		datasets
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
post-processing.py		post-processing.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Price Statistics Data Catalogue

Raison d'être

Status quo in the price statistics discipline

Purpose of the price statistics data catalogue

Note on this version of the catalogue

Contributing a dataset:

Technical context

Who are we?

Config

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Price Statistics Data Catalogue

Raison d'être

Status quo in the price statistics discipline

Purpose of the price statistics data catalogue

Note on this version of the catalogue

Contributing a dataset:

Technical context

Who are we?

Config

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages