Commit 788c13a

committed

ras/serviceability: Refactor to use Podman utility with comprehensive VLLM testing

- Migrated from manual podman commands to avocado.utils.podman utility class - Added setUp/tearDown methods for proper test lifecycle management - Implemented four test methods for VFIO Spyre device access scenarios: * test_root_in_sentient: Root user with sentient group access (expects success) * test_root_not_in_sentient: Root user without sentient group (expects VFIO failure) * test_user_in_sentient: Non-root user with sentient group access (expects success) * test_user_not_in_sentient: Non-root user without sentient group (expects VFIO failure) - Added wait_for_vllm_startup() helper for container readiness monitoring - Created comprehensive YAML configuration with 4 variants: * RAG: 4 AIUs, 200G memory, granite-3.3-8b-instruct model * Entity Extraction: 1 AIU, 100G memory * Re-Ranker: 1 AIU, 50G memory, bge-reranker-large model * Embedding: 1 AIU, 50G memory, granite-embedding-125m-english model - All container runtime parameters now configurable via YAML: * Memory, shared memory, device, privileged mode, PIDs limit * User namespace, group add, port mapping * VLLM-specific options: prefix caching, additional arguments - Removed hardcoded credentials for security - Added comprehensive inline documentation in YAML - Registry login with API key support - Container image pull before test execution Signed-off-by: Abdul Haleem <abdhalee@linux.vnet.ibm.com>

1 parent 1ea4e7f commit 788c13aCopy full SHA for 788c13a

2 files changed

+533

-0

lines changed

ras
- serviceability.py.data
  - serviceability.yaml
- serviceability.py

2 files changed

+533

-0

lines changed

Comments

(0)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Commit 788c13a

2 files changed

2 files changed

File tree

2 files changed

2 files changed

0 commit comments