Skip to content

Revise README with updated project details#4

Merged
416rehman merged 1 commit intomainfrom
416rehman-patch-2
Apr 16, 2026
Merged

Revise README with updated project details#4
416rehman merged 1 commit intomainfrom
416rehman-patch-2

Conversation

@416rehman
Copy link
Copy Markdown
Owner

Updated the README to reflect new features and descriptions for DeepZero, including changes to the pipeline architecture and installation instructions.

Updated the README to reflect new features and descriptions for DeepZero, including changes to the pipeline architecture and installation instructions.
Copilot AI review requested due to automatic review settings April 16, 2026 05:57
@416rehman 416rehman merged commit c4eb85c into main Apr 16, 2026
4 checks passed
@416rehman 416rehman deleted the 416rehman-patch-2 branch April 16, 2026 05:57
Copy link
Copy Markdown

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Updates the project README to reflect DeepZero’s current positioning (agentic vulnerability research pipeline), pipeline architecture primitives, CLI usage, installation extras, and processor authoring guidance.

Changes:

  • Rewrites the top-level project description and architecture section (Ingest/Map/BulkMap/Reduce).
  • Expands installation and CLI usage examples (run/resume/status/validate/list-processors/init/interactive/serve).
  • Adds detailed YAML pipeline anatomy, processor reference formats, built-in processor list, and repository structure.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment thread README.md
deepzero interactive -m openai/gpt-4o

# Start the REST API server
deepzero serve --host 127.0.0.1 --port 8420 -w work/
Copy link

Copilot AI Apr 16, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The deepzero serve example uses -w work/, but run state is written under the pipeline-specific work dir (settings.work_dir + / + pipeline.name). With the current engine, work/ will typically not contain run.json, so the API will show no data. Update the README example to point at the pipeline work dir (e.g. -w work/loldrivers).

Suggested change
deepzero serve --host 127.0.0.1 --port 8420 -w work/
deepzero serve --host 127.0.0.1 --port 8420 -w work/loldrivers

Copilot uses AI. Check for mistakes.
Comment thread README.md
# Stage 3: external processor from processors/ directory
- name: decompile
processor: my_decompiler/my_decompiler.py
parallel: 0 # 0 = use max_workers from settings
Copy link

Copilot AI Apr 16, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In the YAML example, parallel: 0 is documented as "use max_workers from settings", but the runner treats parallel <= 0 as auto-scaling to os.cpu_count() (and does not use settings.max_workers for stage parallelism). Please adjust the README comment (and/or the example) so it matches the actual behavior.

Suggested change
parallel: 0 # 0 = use max_workers from settings
parallel: 0 # 0 or less = auto-scale to available CPU count

Copilot uses AI. Check for mistakes.
Comment thread README.md
processor: file_discovery # bare name = built-in processor
config:
extensions: [".exe", ".sys"]
extensions: ["*"]
Copy link

Copilot AI Apr 16, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The YAML example sets extensions: ["*"] for file_discovery, but the current implementation treats each entry as a file extension and prefixes a dot when missing (so * becomes .* and the glob becomes *.*, which won’t match files without a dot). If the intent is “all files”, omit extensions or set it to an empty list instead.

Suggested change
extensions: ["*"]
extensions: []

Copilot uses AI. Check for mistakes.
Comment thread README.md
Comment on lines +113 to +114
require:
is_executable: true
Copy link

Copilot AI Apr 16, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The YAML example’s metadata_filter stage requires is_executable: true, but the preceding file_discovery ingest stage only emits sha256 and size_bytes (no is_executable). As written, this filter will exclude every sample. Consider changing the example to filter on fields that actually exist (e.g. min_size_bytes) or switching the ingest stage to one that provides is_executable metadata.

Suggested change
require:
is_executable: true
min_size_bytes: 1

Copilot uses AI. Check for mistakes.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants