128 GB of RAM, solid state drives. Got all the repos and analysis_data loaded, but it's been 21 hours loading project_weekly_cache. There's no CPU usage, so I am guessing the database is eating disk.
I’ve already made a set of modifications and database config notes on the augur branch of my fork (sgoggins/facade:augur). I’m thinking I can rewrite the query that loads the cache to go after one repository or project group at a time. Since this is nominally a 4 hour thing for me (a very experienced database guy / formerly well compensated Oracle DBA), I thought I would circle back and see if you would approve a pull request that modularizes some of the functions in facade-worker.py into another Python file, or hear how you would recommend doing this.
The refactoring would change how cache is built and have options for execution. I think:
- Cache would not be rebuilt at the project level when "recache" is tagged automatically
- Cache at the project level would be rebuilt one project at a time
- I will explore a process of accumulating project level details from repository level cache, which may require some changes to the repository cache.
- Cache would be rebuilt at the repo level one repo at a time.
- I would add a parameter that enables wholesale cache building at the project and repo level, as is the case today, for smaller scale implementations
- I will explore the potential to keep cache without destroying it on each recache
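To make the proposal concrete, here is a rough sketch of the one-repo-at-a-time rebuild with a wholesale option. It is only an illustration, not the code in facade-worker.py: it uses sqlite3 as a stand-in for the real database, the table columns (repo_id, week, added) are a simplified, hypothetical schema, and the function names rebuild_repo_cache, rebuild_cache and make_demo_db are made up for this example.

```python
import sqlite3


def rebuild_repo_cache(conn, repo_id):
    """Rebuild the weekly cache for one repo in its own small transaction.

    Only this repo's rows are deleted and re-inserted, so cache rows for
    other repos are left untouched.
    """
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "DELETE FROM repo_weekly_cache WHERE repo_id = ?", (repo_id,)
        )
        conn.execute(
            "INSERT INTO repo_weekly_cache (repo_id, week, added) "
            "SELECT repo_id, week, SUM(added) FROM analysis_data "
            "WHERE repo_id = ? GROUP BY repo_id, week",
            (repo_id,),
        )


def rebuild_cache(conn, repo_ids, wholesale=False):
    """Rebuild cache per repo by default, or all at once when wholesale=True."""
    if wholesale:
        # One big statement over everything: fine for small installations.
        with conn:
            conn.execute("DELETE FROM repo_weekly_cache")
            conn.execute(
                "INSERT INTO repo_weekly_cache (repo_id, week, added) "
                "SELECT repo_id, week, SUM(added) FROM analysis_data "
                "GROUP BY repo_id, week"
            )
        return
    for repo_id in repo_ids:
        rebuild_repo_cache(conn, repo_id)


def make_demo_db():
    """Create an in-memory database with the hypothetical simplified schema."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE analysis_data (repo_id INT, week INT, added INT)")
    conn.execute("CREATE TABLE repo_weekly_cache (repo_id INT, week INT, added INT)")
    conn.executemany(
        "INSERT INTO analysis_data VALUES (?, ?, ?)",
        [(1, 1, 10), (1, 1, 5), (1, 2, 3), (2, 1, 7)],
    )
    conn.commit()
    return conn


def cache_rows(conn):
    """Return all cache rows in a stable order for inspection."""
    return conn.execute(
        "SELECT repo_id, week, added FROM repo_weekly_cache "
        "ORDER BY repo_id, week"
    ).fetchall()
```

Calling rebuild_cache(conn, [1]) and later rebuild_cache(conn, [2]) touches only the named repo each time, which is the "keep cache without destroying it" behavior I have in mind. Passing wholesale=True keeps the current all-at-once behavior for smaller installations.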
What do you think @brianwarner ?