Ticket: P1.5-S5-06
Type: Execution | Est: 2d
Goal: All 30 incidents run through ARIA on real GCP infrastructure in both invocation modes. Results captured in a structured file for AC scoring.
Scope (depends on P1.5-S5-05 test harness):
- Tool mode: direct
ARIAPipeline.run(incident_number) for all 30 incidents; capture per-incident: classification_label, confidence_band, affected_ci_matched, log_recall, duration_ms
- API mode:
POST /api/v1/pipeline/run for all 30 incidents via HTTP; capture: HTTP response time, notification_sent, duration_ms, AC-01 timestamp delta, AC-06 timestamp delta
- Results written to
tests/acceptance/results_round2.json
Acceptance criteria:
Ticket:
P1.5-S5-06Type: Execution | Est: 2d
Goal: All 30 incidents run through ARIA on real GCP infrastructure in both invocation modes. Results captured in a structured file for AC scoring.
Scope (depends on P1.5-S5-05 test harness):
ARIAPipeline.run(incident_number)for all 30 incidents; capture per-incident:classification_label,confidence_band,affected_ci_matched,log_recall,duration_msPOST /api/v1/pipeline/runfor all 30 incidents via HTTP; capture: HTTP response time,notification_sent,duration_ms, AC-01 timestamp delta, AC-06 timestamp deltatests/acceptance/results_round2.jsonAcceptance criteria:
results_round2.jsoncreated with per-incident per-mode data for all 60 runs