Skip to content

and then I loaded 300 projects / 6,815 repos into facade and tried building cache 🤣 #31

@sgoggins

Description

@sgoggins

128 gig of ram, solid state drives …. got all the repos and analysis_data .. its been 21 hours loading project_weekly_cache .. no cpu usage, so I am guessing I have the database eating disk …

I’ve already made a set of modification and database config notes on my fork at sgoggins/facade:augur branch … I’m thinking I an rewrite the query that loads cache to go after one repository or project group at a time .. since this is a nominal, 4 hour thing for me (a very experienced database guy / formerly well compensated Oracle DBA) I thought I would circle back and see if you would approve a pull request that modularized some of the functions in facade-worker.py into another python file. Or how you would recommend doing this.

The refactoring would change how cache is built and have options for execution. I think:

  1. Cache would not be rebuilt at the project level when "recache" is tagged automatically
  2. Cache at the project level would be rebuilt one project at a time
  3. I will explore a process of accumulating project level details from repository level cache, which may require some changes to the repository cache.
  4. Cache would be rebuilt at the repo level one repo at at time.
  5. i would take a parameter that enabled wholesale cache building at the project and repo level as is the case today for smaller scale implementations
  6. I will explore the potential to keep cache without destroying it on each recache

What do you think @brianwarner ?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions