Skip to content

Tool to help document archaeological dataΒ #3

@zackbatist

Description

@zackbatist

An interface that prompts users to document various aspects of their datasets and highlight or explain the implicit relationships between tables or variables. May be especially valuable for organizing series of scattered spreadsheets.

  1. User selects a series of spreadsheets, MS Access databases or SQL databases, PDFs of blank physical recording sheets, images of blank physical tags, directory structures, etc
    • For physical recording sheets and tags, user selects and highlights the fields that are represented on those media, which are then included in subsequent stages
  2. Prompt to delimit scope
    • Written explanation, i.e. this documentation explains the extensive portion of our database that deals with lithics processing
    • Select which tables or components to include
    • The scope of file directories, what they are meant to contain (e.g.: /trench001/ contains all files pertaining to trench 001 including trench report and section drawings, /trench001/context0001 contains all info pertaining to context 0001 including folder for pics, special finds from within context 0001, etc)
    • Notes regarding the project as a whole, why the data is being collected, what kinds of work can or will be done with it, etc.
  3. Prompt to document each table
    • Why does it exist? What is it meant to include?
    • Who contributes to it? Provide names and contact info
    • How are these tables populated? e.g.: web forms, physical recording sheets copied over, API access, etc
  4. Prompt to document the variables for each table
    • Identify and explain the composition of indexes, why indexes require or do not require unique values, etc
    • Identify and explain the reasons behind each relationship between table indexes (populated based on fuzzy searches to similarly named variables elsewhere)
    • Identify implicit groupings among dependent variables, e.g.: if different survey point collection methods have different variables associated with them (dog leash samples are associated with values in radius variable, whereas grab samples are not)
    • If values are selected from a preset list, what does each value in the list represent?
  5. Contact info of key personnel in charge of managing the project and its data
  6. Generate fancy visualizations and reports
    • Colour-coded variable groupings
    • Flow charts representing the drawn-out processes through which data is filled in to various tables
  • Also, optional suggestions to relate concepts to CIDOC-CRM or ARIADNE reference model somehow?

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions