Docs: Add job dispatch and resource tiers documentation#271

Open
erick-GeGe wants to merge 2 commits into main from docs/job-dispatch

Conversation

@erick-GeGe
Contributor

Description

Please include a summary of the changes, relevant motivation and context.

Issue

  • Github Issue ID.

Checklist before requesting a review

  • I have performed a self-review of my code.
  • My code follows the style guidelines of this project.
  • I have made corresponding changes to the documentation.
  • New and existing tests pass locally with my changes.
  • If this change is a core feature, I have added thorough tests.
  • If this change affects or depends on the behavior of other estela repositories, I have created pull requests with the relevant changes in the affected repositories. Please, refer to our official documentation.
  • I understand that my pull request may be closed if it becomes obvious that I did not perform all of the steps above.

@erick-GeGe erick-GeGe requested a review from joaquingx March 27, 2026 14:01
Contributor

@joaquingx joaquingx left a comment


Good coverage of the dispatch system, but I think this reads more like internal code notes than documentation. A few suggestions:

Too much implementation detail

  • Things like the Redis lock command (SET spider_jobs_lock 1 NX EX 120), internal function names (_get_cluster_resources(), _dispatch_single_job()), and the "Key Files" table are implementation details that will go stale as code changes. These belong in code comments or docstrings, not in docs.

Suggestion: Simplify to focus on what users need to know (tiers, statuses, config options) and drop the code-level details. The code should document itself.
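For context on the lock command quoted in the comment above, the semantics of `SET spider_jobs_lock 1 NX EX 120` (set only if the key is absent, expire after 120 seconds) can be sketched with a minimal in-memory stand-in. The key name and TTL mirror the command in the bullet; the class itself is purely illustrative, not estela code:

```python
import time

class FakeRedis:
    """Minimal stand-in that models Redis SET ... NX EX semantics."""
    def __init__(self):
        self._store = {}  # key -> (value, expiry timestamp or None)

    def set(self, key, value, nx=False, ex=None):
        now = time.monotonic()
        entry = self._store.get(key)
        if entry and entry[1] is not None and entry[1] <= now:
            entry = None  # key has expired; treat it as absent
        if nx and entry is not None:
            return None  # NX: refuse to overwrite a live key
        self._store[key] = (value, now + ex if ex else None)
        return True

r = FakeRedis()
first = r.set("spider_jobs_lock", 1, nx=True, ex=120)   # acquires the lock
second = r.set("spider_jobs_lock", 1, nx=True, ex=120)  # a second dispatcher is refused
print(first, second)  # True None
```

With the real redis-py client the equivalent call is `r.set("spider_jobs_lock", 1, nx=True, ex=120)`, which returns `True` on acquisition and `None` when the lock is already held, which is what lets a second dispatch cycle exit immediately.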

exits immediately.

2. **Fetch queued jobs**: Queries jobs with `IN_QUEUE` status, ordered by creation
date (FIFO), limited to `RUN_JOBS_PER_LOT` (default 100, 1000 in production).
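The fetch step quoted above (oldest `IN_QUEUE` jobs first, capped at `RUN_JOBS_PER_LOT`) amounts to a filtered, sorted, limited query. A plain-Python sketch of that selection logic, with hypothetical job records standing in for the ORM rows:

```python
from datetime import datetime

RUN_JOBS_PER_LOT = 100  # default in base.py; prod.py raises it to 1000 (per this thread)

# Hypothetical job records standing in for queued spider jobs.
jobs = [
    {"id": 3, "status": "IN_QUEUE", "created": datetime(2026, 3, 27, 12, 5)},
    {"id": 1, "status": "IN_QUEUE", "created": datetime(2026, 3, 27, 12, 0)},
    {"id": 2, "status": "RUNNING",  "created": datetime(2026, 3, 27, 12, 1)},
]

def fetch_queued(jobs, limit=RUN_JOBS_PER_LOT):
    """Oldest IN_QUEUE jobs first (FIFO), capped at `limit`."""
    queued = [j for j in jobs if j["status"] == "IN_QUEUE"]
    queued.sort(key=lambda j: j["created"])
    return queued[:limit]

print([j["id"] for j in fetch_queued(jobs)])  # [1, 3]
```

In Django ORM terms this would correspond to something like `SpiderJob.objects.filter(status=IN_QUEUE).order_by("created")[:RUN_JOBS_PER_LOT]`; the model and field names there are assumptions for illustration, not taken from the estela source.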
Contributor


Is 1000 the recommended value for production?

Contributor Author


It's not a recommendation — it's the current value in config/settings/prod.py. The default in base.py is 100, but prod.py overrides it to 1000.
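A settings fragment illustrating the override described in this reply. The file paths come from the reply itself; the exact file contents are an assumption:

```python
# config/settings/base.py
RUN_JOBS_PER_LOT = 100   # default lot size per dispatch cycle

# config/settings/prod.py
from .base import *  # noqa: F401,F403
RUN_JOBS_PER_LOT = 1000  # current production value, not a documented recommendation
```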

## Cluster Resource Checking

The `_get_cluster_resources()` function queries the K8s API to determine available
capacity on worker nodes. Nodes are selected by label (`role=<SPIDER_NODE_ROLE>`,
Contributor


What happens if DEDICATED_SPIDER_NODES is set to true?

Contributor Author


When DEDICATED_SPIDER_NODES=True, spider pods are scheduled only on nodes labeled with role=<SPIDER_NODE_ROLE>, and _get_cluster_resources() checks capacity only on those nodes. When set to False, pods have no nodeSelector and can land on any node — but the capacity check won't work accurately since it doesn't know which nodes to measure.
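The effect described in this reply can be sketched as the pod-spec construction below. This is a hypothetical helper, not estela's actual code; it assumes only the Kubernetes `nodeSelector` field and the `role=<SPIDER_NODE_ROLE>` label mentioned in the quoted docs:

```python
SPIDER_NODE_ROLE = "spider"  # assumed label value, for illustration only

def build_pod_spec(dedicated_spider_nodes: bool) -> dict:
    """Sketch: attach a nodeSelector only when dedicated spider nodes are enabled."""
    spec = {
        "containers": [{"name": "spider", "image": "estela-spider:latest"}],
    }
    if dedicated_spider_nodes:
        # Pods land only on nodes labeled role=<SPIDER_NODE_ROLE>, so a
        # capacity check restricted to those nodes stays accurate.
        spec["nodeSelector"] = {"role": SPIDER_NODE_ROLE}
    # When disabled, no nodeSelector is set: pods can land on any node,
    # and a label-based capacity check has no defined node set to measure.
    return spec

print(build_pod_spec(True).get("nodeSelector"))   # {'role': 'spider'}
print(build_pod_spec(False).get("nodeSelector"))  # None
```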

Comment on lines +128 to +133
- **`MULTI_NODE_MODE` must be `"True"`**: This is **critical**. When `MULTI_NODE_MODE`
is enabled, spider pods are scheduled with a `nodeSelector` matching `SPIDER_NODE_ROLE`,
and `_get_cluster_resources()` queries only those labeled nodes. If `MULTI_NODE_MODE`
is `"False"`, pods have no `nodeSelector` and the capacity check has no way to
accurately measure available resources. The sequential dispatch system is designed
to work with `MULTI_NODE_MODE=True`.
Contributor


If this is mandatory, why is there an option to deactivate it?

Contributor Author


It's not strictly mandatory — it's the recommended setup for production or any infrastructure running many spiders at scale. With DEDICATED_SPIDER_NODES=True, you get accurate capacity checking and isolation between spider workloads and system components. However, for smaller setups where everything runs on one or a few nodes, you may want spiders to be scheduled on any available node, so the option exists for that flexibility.
