Generic Pattern: Collect → Evaluate → Report¶

Operator / Author summary¶

Every piece of automation SE runs — checking a lab is ready, grading a candidate, archiving a session — is the same three steps:

Collect — gather facts from the live lab (run show commands on devices, read files).
Evaluate — compare those facts against the author's rules (regex, parse, thresholds).
Report — produce a typed result document.

As an author, you describe each step in your content: what to collect (scenarios), how to grade it (grading rules), and which report to emit. As an operator, this is why "readiness check", "grade", and "archive" all feel the same — they are the same engine with different inputs.

The process type picks which report you get:

Process type	Collects	Evaluates	Report
`Initialization`	Boot/health facts	Lab-is-ready rules	ReadinessReport
`Grading`	Candidate config/output	Grading rubric	ScoreReport
`Change`	Before/after facts	Change rules	ChangeReport
`Submission`	Final state	Submission rules	ALII / submission report
`Archive`	Final artifacts	—	ArchiveReport

Collect → Evaluate → Report is a soft grouping (the stage of each step), not a rigid template. A real job usually also has a setup stage first — seed files, wipe devices, run the candidate's solution — before it can collect. You express all of it with the same closed set of primitives, described next.

Architect detail¶

The two layers — scenarioFunctions vs JobDefinitions¶

The DSL has exactly two layers (see ADR-057):

scenarioFunction — a code-defined, versioned, trusted primitive in the SE registry (an @scenario(name, version) class). The set is closed and orthogonal. Authors and LLMs never write these; adding one is a code PR + version bump.
JobDefinition — a content-defined phase body: an ordered DAG of steps, each of which uses: a scenarioFunction@version. Authors (and LLMs) write only this declarative wiring.

This keeps content sandboxed and machine-validatable. A JobDefinition lives in PAv1/jobs/<name>.yaml and is bound to a phase from lifecycle.yaml (definition: <name>@<version>); the runtime instance is a Job.

The closed primitive set (scenarioFunctions)¶

`uses:`	Stage	Purpose	Legacy origin
`pause@v1`	setup	Wait/settle	`tPause`
`exec@v1`	setup	Run command/script on a connector, capture output, gate	`tExecute`, `tExecuteBatch`
`copy@v1`	setup	Push a content file to the POD host	`tScp`
`cml.bounce_interface@v1`	setup	Bounce an interface via the control node	`bounce_interface`
`cml.wipe@v1`	setup	Wipe devices via the control node	`cmlctl --action wipe`
`cml.power@v1`	setup	Start/stop a node or ext-conn	`cmlctl --action stop`
`cml.lab_resolve@v1` / `cml.lab_start@v1` / `cml.lab_stop@v1`	setup	Lab lifecycle	(native)
`collect@v1`	collect	Run a `show` command on a device, capture output	`verify subject='commandOutput'`
`evaluate.regex@v1`	evaluate	Regex check a captured var → pass/fail + issue	`verify subject='parse'`, `tVerify`
`report.score@v1` / `report.readiness@v1` / …	report	Assemble the typed report	`reportClass='LabletReport'`

evaluate.regex is the single check primitive: in a grading stage it feeds report.score; in a setup stage its captured passed flag feeds a later step's when: gate (the legacy tVerify … set='X.OK' → if='X.OK' pattern). cml.* operations are first-class primitives — the author names the operation, SE owns the mechanics and the cml_password.

The step shape¶

Every step has one consistent shape:

- id: <unique-in-job>            # stable id; also the capture namespace
  uses: <scenarioFunction>@<ver> # must exist in the SE registry
  target: <connector-name>       # optional — omitted for pause/report/cml.*
  with: { <input>: <value|expr> }# inputs; values may be ${ jq } over the scopes
  capture: { <var>: <output> }   # write named outputs into vars.*
  when: "${ <jq-bool-expr> }"    # optional gating
  on_error: { action: fail|continue|retry, retries?, backoff? }
  timeout: <seconds>
  stage: setup|collect|evaluate|report   # soft grouping (default: setup)

target: selects a connector from PAv1/connectors.yaml (the port model from the legacy pod.xml). Steps run in document order, honouring when and on_error; a step may read any vars.* an earlier step captured.

Data-flow scopes¶

Every value a step reads or writes lives in one of four namespaced scopes (see ADR-058). References use jq in ${ }.

Scope	Writable?	Holds	Replaces legacy
`session.*`	read-only	candidate / exam / timeslot metadata (`mosaic_meta.json` + Session)	—
`content.*`	read-only	lab-root path, packaged files, form FQN	`${config.core.paths.lab_root}`
`runtime_env.*`	read-only	POD facts: device ports, prompts, creds, `cml_password`, worker IP	hard-coded `5052`, `trackNMC50`, `rtr01#`
`vars.*`	read/write	task-captured intermediates (`step.capture`)	`{files}`, `$(rtr01.show_int_loop0)`

Only vars.* is writable; the other three are resolved at job submit and frozen. Secrets and ports are never literal in content — they resolve from runtime_env.*.

# Example: the legacy tScp PAT push, fully scoped
- id: push_package
  uses: copy@v1
  target: workstation_22
  with:
    source: "${ content.files.desktop_package }"   # was ${config.core.paths.lab_root}/…
    dest: "/home/cisco/Desktop/tmp/desktop_package.tgz"
    via_port: "${ runtime_env.devices.workstation.pat_port }"  # was port=5052
  capture: { ok: scp_ok }

Canonical PAv1 layout (one shape)¶

There is one content layout. A single-part lablet uses the top level; a multi-part session repeats the per-part subtree under parts/.

PAv1/
├── manifest.yaml          # definition metadata + pod_type  OR  SessionDefinition + parts[]
├── lifecycle.yaml         # phases -> { native_steps_by_pod_type, jobs[] }   (CPA + SE seam)
├── connectors.yaml        # connector/port model (from legacy pod.xml)
├── topology/
│   ├── devices.json       # instance config
│   └── ports.json         # per-device serial/vnc/pat ports
├── jobs/                  # JobDefinitions — the step DAGs (SE)
│   ├── post_init.yaml
│   └── grade.yaml
├── grading/rubric.yaml    # EvaluationRuleset (graded items + checks + points)
├── reports/score_report.yaml   # ProcessReportSpec (report shape)
└── files/                 # packaged payloads pushed by copy@v1 (desktop_package.tgz)

The single lifecycle shape is phases[].{native_steps_by_pod_type, jobs[]}:

A single-part lab uses the cml_on_aws (or none) entry of native_steps_by_pod_type — this subsumes the older native_steps: [] form.
A multi-part session applies the same jobs[] per part under part_workflow — this subsumes the older part_workflow / native_steps_by_pod_type forms.
jobs[] always reference a JobDefinition file (definition: <name>@<version> → jobs/<name>.yaml); the step DAG never lives inline in lifecycle.yaml. Orchestration is in lifecycle.yaml; bodies are in jobs/.

See the LAB-1.1.1 golden port for a full 1:1 port of legacy content into this layout, and LAB-0.1 / LAB-0.2 for the minimal and multi-part references.

Worked example — `post_init` (from `LAB-0.1`)¶

The minimal LAB-0.1 package ports the legacy sb_post_init.xml scriptblock into jobs/post_init.yaml. It shows the setup primitives end-to-end: pause → exec (mkdir) → copy (seed the desktop over the PAT port) → exec (list) → an evaluate.regex gate → a gated exec (unpack) → cml.bounce_interface → cml.power (isolate the Internet). The gate is the key mechanic — legacy set='file.OK' / if='file.OK' becomes capture: feeding a downstream when::

- id: list_tmp                     # capture the `ls` output into vars.files
  uses: exec@v1
  target: workstation_22
  with: { command: "ls -la /home/cisco/Desktop/tmp/" }
  capture: { stdout: files, ok: cmd1_ok }

- id: verify_package               # gate: is the package present?
  uses: evaluate.regex@v1
  when: "${ vars.cmd1_ok }"
  with: { source: "${ vars.files }", regex: "desktop_package\\.tgz", mode: positive }
  capture: { passed: file_ok }

- id: unpack                       # only runs when the gate passed
  uses: exec@v1
  target: workstation_22
  when: "${ vars.file_ok }"
  with: { command: "tar -C /home/cisco/Desktop/tasks/ -xzf …/desktop_package.tgz" }

The companion jobs/grade.yaml ports sb_pre_collect.xml (setup stage) plus grade.xml (collect → evaluate → report) into one job — the canonical Collect→Evaluate→Report flow.

Reuse & iteration — today vs. the deferred hybrid¶

The v1 model is flat: a job lists its steps, and reuse is copy-paste. In a two-device lab this is fine; at scale it shows. The LAB-1.1.1 golden port already hand-writes near-identical step pairs that differ only by device — c_rtr01_lo / c_rtr02_lo, c_sw01_vlan / c_sw02_vlan — and repeats the same Loopback0 check per device in the rubric. The evaluate stage already dodges this by expanding grading/rubric.yaml into N evaluate.regex steps from one ruleset-driven step — i.e. constrained iteration over content in all but name.

ADR-057 §2.8 records a deferred, opt-in hybrid that generalises this without re-opening the sandbox:

CompositeScenario — a content-defined, parameterised group of closed primitives (typed parameters in, export out), invoked from any step via uses: composite:<name>@<ver>. It is indistinguishable from a primitive at the call site and runs in an isolated vars.* frame (ADR-058 §2.5) — never code, never a new primitive.
for_each — a step modifier that runs a step once per list element (e.g. one collect over a device list), collapsing the c_rtrNN_* duplication.

These are specified, not yet built: v1 ships the flat model (Model B), and the hybrid (Model C) is promoted only when duplication in real authored content crosses a pain threshold. Every v1 job stays valid under the hybrid, so the deferral is reversible upward.

Where a job comes from¶

A session phase binds to a JobDefinition(name@version) in the content's lifecycle.yaml. When CPA enters that phase, it calls SE submit_job with the session context (session id, pod/bucket handle, pod type). SE resolves the JobDefinition, builds the pipeline, runs Collect→Evaluate→Report, persists a ProcessReport, and emits a CloudEvent that CPA consumes to advance the phase.

flowchart LR
    subgraph Content["Authored content (PAv1)"]
        LC["lifecycle.yaml<br/>phase → JobDefinition@version"]
        JB["jobs/*.yaml<br/>(step DAG + collect stage)"]
        GR["grading/rubric.yaml<br/>(Evaluate spec)"]
    end
    LC --> JD["JobDefinition"]
    JB --> JD
    GR --> ER["EvaluationRuleset"]
    JD --> JOB["Job (run instance)"]
    ER --> JOB
    JOB --> PR["ProcessReport"]

    style JOB fill:#0d9488,color:#fff
    style PR fill:#475569,color:#fff

C4 — Component view of the Scenario Engine¶

C4Component
    title Component View — Scenario Engine

    Container_Boundary(se, "scenario-engine") {
        Component(api, "Job API", "Neuroglia controllers", "submit_job, cancel_job, get_job, sync_content, list_scenarios")
        Component(runner, "Job Runner", "Application service", "Orchestrates Collect→Evaluate→Report")
        Component(collector, "Collect stage", "Adapter layer", "Resolves scenario, gathers device/file facts")
        Component(evaluator, "Evaluate stage", "Rules engine", "Applies EvaluationRuleset (parse/regex/compare)")
        Component(reporter, "Report stage", "Report builder", "Emits typed ProcessReport")
        Component(content, "Content store adapter", "S3 client", "Loads PAv1 package from RustFS")
        Component(roc_adapter, "ROC adapter", "HTTP client", "Delegates raw device collection")
    }

    ContainerDb(sedb, "SE MongoDB", "MongoDB", "Job + reports")
    ContainerDb(blob, "RustFS / S3", "Object store", "Content bytes")
    System_Ext(roc, "ROC", "RADkit devices")
    System_Ext(cpa, "CPA", "Session manager")

    Rel(cpa, api, "Triggers job", "HTTP")
    Rel(api, runner, "Starts run")
    Rel(runner, content, "Loads JobDefinition + scenarios + rubric")
    Rel(content, blob, "Reads", "S3")
    Rel(runner, collector, "1. Collect")
    Rel(collector, roc_adapter, "Device commands")
    Rel(roc_adapter, roc, "POST /execute/bulk", "HTTP")
    Rel(runner, evaluator, "2. Evaluate")
    Rel(runner, reporter, "3. Report")
    Rel(reporter, sedb, "Persists ProcessReport")
    Rel(reporter, cpa, "Result", "CloudEvent")

    UpdateLayoutConfig($c4ShapeInRow="2", $c4BoundaryInRow="1")

Sequence — one job run¶

sequenceDiagram
    autonumber
    participant CPA as CPA (Session)
    participant SE as SE (Job Runner)
    participant S3 as RustFS
    participant ROC as ROC
    participant DB as SE MongoDB

    CPA->>SE: submit_job(definition, session_ctx, process_type)
    SE->>S3: load PAv1 (lifecycle, scenarios, rubric)
    S3-->>SE: content bytes
    SE->>DB: create Job (status=running)
    Note over SE: 1. COLLECT
    SE->>ROC: POST /devices (provision targets)
    SE->>ROC: POST /execute/bulk (show commands)
    ROC-->>SE: bulk_cmd_uuid
    SE->>ROC: GET /execute/bulk/{uuid}
    ROC-->>SE: device outputs
    Note over SE: 2. EVALUATE
    SE->>SE: apply EvaluationRuleset (parse/regex/points)
    Note over SE: 3. REPORT
    SE->>DB: persist ProcessReport
    SE-->>CPA: CloudEvent: job.completed(report ref)
    CPA->>CPA: advance session phase

Native LCM step vs SE job — the boundary¶

A phase may run a native step, an SE job, or both in sequence. The rule of thumb:

Native (CPA/controllers): anything about infrastructure and session bookkeeping — worker_lab_resolve, pod_locator, ports_alloc, lds_register, mark_ready, archive, schedule. These rarely change and are not author-editable.
SE job (content-driven): anything about the lab's behaviour and assessment — lab_start/stop/wipe, collect_evidence, grade_item, score_report, and report generation. These are defined by content and change per lab.

Commands & queries used¶

Direction	Operation	Kind
CPA → SE	`submit_job`, `cancel_job`, `sync_content`	Command
CPA → SE	`get_job`, `list_scenarios`	Query
SE → ROC	`POST /devices`, `POST /execute/bulk`, `GET/DELETE /execute/bulk/{uuid}`	HTTP
SE → CPA	`job.completed` / `job.failed`	CloudEvent