VoiceLive & Chat SOP: Multi-Environment Deployment
Standard Operating Procedures for Voice and Chat Services
Document Version: 1.0
Last Updated: December 2025
Classification: Internal Operations
Compliance: NIST AI RMF, SOC2 Type II
1. Environment Overview
| Level | Environment | Purpose | Auth Model | Data Classification |
|---|---|---|---|---|
| 1 | Staging POC | Proof of concept, demos | Dev tokens + API Key | Non-sensitive |
| 2 | Development | Feature development | Dev tokens | Synthetic only |
| 3 | Test | Integration testing | Service Principal | Synthetic only |
| 4 | UAT | User acceptance | Entra ID (test tenant) | Anonymized |
| 5 | Production | Live operations | Entra ID + MFA | Production data |
1.1 Customer Tenant Replication Checklist (Minimum Viable Chat + Voice)
This section is written to be customer-safe (no tenant IDs, no secrets). Replace placeholders with customer values.
Required Azure components (minimum)
- Frontend (UI): Azure Static Web Apps (SWA) hosting the Engram UI.
- Backend API: Azure Container Apps (ACA) running the Engram FastAPI service.
- Azure AI Gateway (Chat): an OpenAI-compatible gateway (often APIM) that exposes `.../openai/v1/...` and supports `POST /chat/completions`.
- Azure AI Services (VoiceLive): an Azure AI endpoint that supports VoiceLive realtime sessions (via the `azure-ai-voicelive` SDK).
- Secrets store: Azure Key Vault (recommended) or equivalent for API keys.
Optional but recommended:
- Zep (memory service): chat will still run if Zep is unavailable (best-effort enrich/persist), but memory features degrade.
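If the customer is starting from an empty resource group, the following is a minimal provisioning sketch for the required components above. Resource names, region, port, and image tag are placeholders; adapt them to the customer's naming and IaC standards.
# Minimal provisioning sketch (placeholder names/region; image tag from this repo's registry)
az group create --name <RG> --location eastus2
az keyvault create --name <KV_NAME> --resource-group <RG> --enable-rbac-authorization true
az containerapp env create --name <ACA_ENV> --resource-group <RG> --location eastus2
az containerapp create --name <ENGRAM_API_APP> --resource-group <RG> \
  --environment <ACA_ENV> --image ghcr.io/zimaxnet/engram/backend:latest \
  --target-port 8082 --ingress external --system-assigned
az staticwebapp create --name <CUSTOMER_SWA_NAME> --resource-group <RG> --location eastus2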
Backend (ACA) required configuration
Set these environment variables on the Engram API container app:
# --- POC mode (fast validation; NOT for production) ---
ENVIRONMENT=development
AUTH_REQUIRED=false
# --- CORS: must include your SWA hostname and any local dev origins ---
# Example: ["https://<CUSTOMER_APP>.azurestaticapps.net","http://localhost:5173"]
CORS_ORIGINS=["https://<CUSTOMER_SWA_HOSTNAME>"]
# --- Chat (OpenAI-compatible gateway) ---
AZURE_AI_ENDPOINT=https://<CUSTOMER_AI_GATEWAY_HOST>/<PATH>/openai/v1
AZURE_AI_DEPLOYMENT=gpt-5-chat
AZURE_AI_KEY=<secret> # store in Key Vault; inject via container-app secretRef
# Optional (only when using *.services.ai.azure.com Foundry endpoints, not APIM /openai/v1):
# AZURE_AI_PROJECT_NAME=<CUSTOMER_PROJECT_NAME> # example in this tenant: zimax
# --- VoiceLive (Realtime) ---
AZURE_VOICELIVE_ENDPOINT=https://<CUSTOMER_AI_SERVICES>.services.ai.azure.com
AZURE_VOICELIVE_MODEL=gpt-realtime
AZURE_VOICELIVE_VOICE=en-US-Ava:DragonHDLatestNeural
# Preferred auth: Managed Identity (no key env var).
# Optional API-key mode (NOT recommended long-term): AZURE_VOICELIVE_KEY=<ai-services-key>
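The key-based values above should never be set as plain environment variables in anything beyond a short-lived POC. A minimal sketch of wiring a Key Vault secret into the container app via a secret reference (vault, app, and secret names are placeholders; assumes the app's system-assigned identity already has read access to the vault):
# Store the key in Key Vault, reference it from the container app, expose it as an env var
az keyvault secret set --vault-name <KV_NAME> --name azure-ai-key --value "<gateway-or-ai-key>"
az containerapp secret set --name <ENGRAM_API_APP> --resource-group <RG> \
  --secrets "azure-ai-key=keyvaultref:https://<KV_NAME>.vault.azure.net/secrets/azure-ai-key,identityref:system"
az containerapp update --name <ENGRAM_API_APP> --resource-group <RG> \
  --set-env-vars "AZURE_AI_KEY=secretref:azure-ai-key"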
Production hardening (summary):
- Set `ENVIRONMENT=production` and `AUTH_REQUIRED=true`, and enforce Entra JWT validation.
- Restrict CORS to the exact SWA origin(s).
- Prefer Managed Identity to access Key Vault and Azure AI (VoiceLive + chat where supported); otherwise use rotated keys.
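A minimal sketch of flipping an existing POC container app into hardened mode (values are illustrative; prefer making this change through your IaC or deployment workflow rather than ad hoc against production):
az containerapp update --name <ENGRAM_API_APP> --resource-group <RG> \
  --set-env-vars \
    "ENVIRONMENT=production" \
    "AUTH_REQUIRED=true" \
    'CORS_ORIGINS=["https://<CUSTOMER_SWA_HOSTNAME>"]'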
VoiceLive RBAC prerequisite (Managed Identity)
When using Managed Identity, grant the backend identity permission on the Azure AI Services account (or narrower project scope if your org requires):
- Role: `Cognitive Services Speech User`
- Scope: `/subscriptions/<SUB>/resourceGroups/<RG>/providers/Microsoft.CognitiveServices/accounts/<AI_ACCOUNT>`
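A sketch of the role assignment, assuming the backend container app uses a system-assigned identity (resource names are placeholders):
# Look up the backend's managed identity, then grant it VoiceLive access on the AI Services account
PRINCIPAL_ID=$(az containerapp show --name <ENGRAM_API_APP> --resource-group <RG> \
  --query identity.principalId -o tsv)
az role assignment create \
  --assignee "$PRINCIPAL_ID" \
  --role "Cognitive Services Speech User" \
  --scope "/subscriptions/<SUB>/resourceGroups/<RG>/providers/Microsoft.CognitiveServices/accounts/<AI_ACCOUNT>"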
Frontend (SWA) required build configuration
Set these build-time variables for the UI build:
VITE_API_URL=https://<CUSTOMER_ENGRAM_API_FQDN>
VITE_WS_URL=https://<CUSTOMER_ENGRAM_API_FQDN> # frontend converts https→wss internally
SWA routing prerequisite:
- Ensure `staticwebapp.config.json` is deployed (place it in `frontend/public/` so Vite copies it into `dist/`). It must include a SPA fallback so deep links like `/voice` don't 404.
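A minimal `staticwebapp.config.json` with an SPA fallback; adjust the exclude list to the UI's actual asset layout:
cat > frontend/public/staticwebapp.config.json <<'EOF'
{
  "navigationFallback": {
    "rewrite": "/index.html",
    "exclude": ["/assets/*", "*.{css,js,ico,png,svg,json}"]
  }
}
EOF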
Validation (copy/paste)
API_BASE="https://<CUSTOMER_ENGRAM_API_FQDN>"
# Health
curl -sS "$API_BASE/health"
# Chat (POC mode: no Authorization header required)
curl -sS -H "Content-Type: application/json" \
-d '{"content":"What is the capital of France?","agent_id":"elena"}' \
"$API_BASE/api/v1/chat"
# VoiceLive status
curl -sS "$API_BASE/api/v1/voice/status"
Voice transcripts → Zep (episodic memory)
Engram persists VoiceLive interactions into Zep so voice becomes part of the Context Engine:
- User speech: captured from VoiceLive STT events (input audio transcription) and stored as a `user` turn
- Assistant response: captured from VoiceLive text events (preferred) and stored as an `assistant` turn
- Write path: best-effort (timeouts + errors swallowed) so real-time voice does not stall if Zep is slow/unavailable
Implementation notes:
- Backend: `backend/api/routers/voice.py` initializes an `EnterpriseContext`, populates its semantic layer with facts from Zep, and uses this context to enrich the system instructions. It appends `Turn`s to this context and calls `persist_conversation(...)` after each assistant response.
- Enrichment: At session start, the backend fetches up to 20 facts from Zep, adds them to the `EnterpriseContext`, and generates a context summary for the instructions.
- Event sources:
  - user: `CONVERSATION_ITEM_INPUT_AUDIO_TRANSCRIPTION_COMPLETED`
  - assistant: `RESPONSE_TEXT_DONE` (fallback: `RESPONSE_AUDIO_TRANSCRIPT_DONE` / `RESPONSE_DONE`)
Unified “one episode” across Chat + Voice (recommended)
enrich_context(...) searches memory within the current session id. To let Chat benefit from Voice memory (and vice-versa), Engram uses a single shared session id across the UI:
- Frontend: generates one `sessionId` per browser tab (stored in `sessionStorage` key `engram_session_id`)
- Chat: sends it in `POST /api/v1/chat` as `session_id`
- Voice: uses it as the websocket path param: `wss://<API>/api/v1/voice/voicelive/<session_id>`
- Dictation: ChatPanel microphone uses the standard Web Speech API to dictate into the chat input, maintaining the same chat session context.
If you want per-interaction sessions instead, pass a different session_id for chat and voice (they will become separate Zep sessions/episodes).
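To exercise the shared-session behavior from the command line, pass the same id to chat that the voice websocket used. A sketch in POC mode (no Authorization header):
API_BASE="https://<CUSTOMER_ENGRAM_API_FQDN>"
SESSION_ID="<same id the voice websocket used>"
curl -sS -H "Content-Type: application/json" \
  -d '{"content":"Summarize what we just discussed by voice.","agent_id":"elena","session_id":"'"$SESSION_ID"'"}' \
  "$API_BASE/api/v1/chat"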
Validate voice→memory end-to-end (manual)
1) Open Engram UI and run a short voice exchange (2–3 turns), or use the ChatPanel microphone to dictate a message.
2) In browser DevTools, read the shared session id:
sessionStorage.getItem("engram_session_id")
3) Fetch the transcript from the backend:
API_BASE="https://<CUSTOMER_ENGRAM_API_FQDN>"
SESSION_ID="<paste from sessionStorage>"
curl -sS "$API_BASE/api/v1/memory/episodes/$SESSION_ID"
You should see new user + assistant messages corresponding to what was spoken/heard.
Reference: Azure AI Foundry VoiceLive Playground (env vars + SDK behavior)
The Azure AI Foundry "Azure Speech Voice Live" playground follows the same `azure-ai-voicelive` SDK patterns that Engram uses. This is a sanitized version of the variables you'll commonly see in Foundry/azd sample templates:
# Base AI Services endpoint (account-scoped)
AZURE_VOICELIVE_ENDPOINT="https://<AI_ACCOUNT>.services.ai.azure.com/"
# Optional: AI Project context (project-scoped endpoint)
AZURE_VOICELIVE_PROJECT_NAME="<AI_PROJECT_NAME>"
AZURE_EXISTING_AIPROJECT_ENDPOINT="https://<AI_ACCOUNT>.services.ai.azure.com/api/projects/<AI_PROJECT_NAME>"
AZURE_EXISTING_AIPROJECT_RESOURCE_ID="/subscriptions/<SUBSCRIPTION_ID>/resourceGroups/<RG>/providers/Microsoft.CognitiveServices/accounts/<AI_ACCOUNT>/projects/<AI_PROJECT_NAME>"
# API version used by the VoiceLive realtime WS
AZURE_VOICELIVE_API_VERSION="2025-10-01"
What URL the SDK actually connects to
azure.ai.voicelive.aio.connect() derives the final WebSocket URL by appending /voice-live/realtime and query params:
wss://<AI_ACCOUNT>.services.ai.azure.com/voice-live/realtime?api-version=2025-10-01&model=gpt-realtime
If you pass the project endpoint instead:
https://<AI_ACCOUNT>.services.ai.azure.com/api/projects/<AI_PROJECT_NAME>
the SDK will connect under that path:
wss://<AI_ACCOUNT>.services.ai.azure.com/api/projects/<AI_PROJECT_NAME>/voice-live/realtime?api-version=2025-10-01&model=gpt-realtime
Authentication modes (important)
The VoiceLive SDK supports two auth modes:
- API key mode: sends `api-key: <KEY>` during the WS handshake.
  - Use an AI Services key (Cognitive Services account key), not an APIM/OpenAI gateway subscription key.
- Token credential mode: sends `Authorization: Bearer <TOKEN>` during the WS handshake.
  - The SDK requests tokens for scope `https://ai.azure.com/.default`.
  - For Engram in Azure Container Apps, this means the backend Managed Identity must have:
    - Role: `Cognitive Services Speech User`
    - Scope: the AI Services account (or narrower approved scope)
Engram recommendation:
- Use Managed Identity for VoiceLive (preferred).
- Only use `AZURE_VOICELIVE_KEY` when the customer cannot use MI yet (and store it as a separate secret).
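Two quick checks, one per auth mode (resource names are placeholders; the token check runs as your signed-in identity, so it only proves the scope is issuable, not that the container app's MI holds the required role):
# Token credential mode: confirm a bearer token can be issued for the VoiceLive scope
az account get-access-token --scope "https://ai.azure.com/.default" --query expiresOn -o tsv
# API key mode: the key must come from the AI Services account, not the APIM gateway
az cognitiveservices account keys list --name <AI_ACCOUNT> --resource-group <RG> --query key1 -o tsv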
2. Level 1: Staging POC
2.1 Purpose
- Demonstrate capabilities to stakeholders
- Validate Azure AI integration
- Test VoiceLive real-time features
- Quick iteration without security overhead
2.2 Configuration
# Environment Variables
ENVIRONMENT=development
AUTH_REQUIRED=false
DEBUG=true
# VoiceLive Configuration
AZURE_VOICELIVE_ENDPOINT=https://zimax.services.ai.azure.com
AZURE_VOICELIVE_MODEL=gpt-realtime
AZURE_VOICELIVE_VOICE=en-US-Ava:DragonHDLatestNeural
AZURE_VOICELIVE_KEY=<key-vault-secretref>
# Chat Configuration (APIM Gateway)
AZURE_AI_ENDPOINT=https://zimax-gw.azure-api.net/zimax/openai/v1
AZURE_AI_DEPLOYMENT=gpt-5-chat
AZURE_AI_KEY=<key-vault-secretref>
# Optional when using Foundry endpoints (this tenant example):
# AZURE_AI_PROJECT_NAME=zimax
2.3 Authentication
# POC mode (AUTH_REQUIRED=false): no auth header required.
# If AUTH_REQUIRED=true, use dev tokens in development or Entra JWTs in production.
# Example
curl -X POST https://staging-env-api.../api/v1/chat \
-H "Content-Type: application/json" \
-d '{"content": "Hello", "agent_id": "elena"}'
2.4 Azure Resources
| Resource | Name | Configuration |
|---|---|---|
| Container App | staging-env-api | 0.5 vCPU, 1GB RAM, scale 0-3 |
| Key Vault | stagingenvkv* | RBAC enabled |
| AI Services | zimax | VoiceLive + Chat |
2.5 Operational Procedures
Deploy
# Trigger via GitHub Actions
gh workflow run deploy.yml -f environment=staging
# Or manual Azure CLI
az containerapp update --name staging-env-api --resource-group engram-rg \
--set-env-vars "AZURE_VOICELIVE_ENDPOINT=https://zimax.services.ai.azure.com"
Test Voice
# Check status
curl https://staging-env-api.gentleriver-dd0de193.eastus2.azurecontainerapps.io/api/v1/voice/status
# WebSocket connection (from frontend)
wss://staging-env-api.gentleriver-dd0de193.eastus2.azurecontainerapps.io/api/v1/voice/voicelive/<session_id>
Test Chat
curl -X POST https://staging-env-api.../api/v1/chat \
-H "Content-Type: application/json" \
-d '{"content": "Analyze requirements for CRM migration", "agent_id": "elena"}'
2.6 Monitoring
- Azure Container Apps logs (basic)
- No alerting required
- Manual health checks
3. Level 2: Development
3.1 Purpose
- Active feature development
- Developer testing
- Integration with local IDEs
- Debugging and tracing
3.2 Configuration
# Environment Variables
ENVIRONMENT=development
DEBUG=true
LOG_LEVEL=DEBUG
# VoiceLive - Same as staging
AZURE_VOICELIVE_ENDPOINT=https://zimax.services.ai.azure.com
AZURE_VOICELIVE_MODEL=gpt-realtime
# Chat - Development deployment
AZURE_AI_ENDPOINT=https://zimax-gw.azure-api.net/zimax/openai/v1
AZURE_AI_DEPLOYMENT=gpt-5-chat
# Tracing enabled
APPLICATIONINSIGHTS_CONNECTION_STRING=<dev-app-insights>
OTEL_EXPORTER_OTLP_ENDPOINT=http://localhost:4317
3.3 Authentication
# Dev tokens with role simulation
Authorization: Bearer dev_<user_id>_<role>
# Examples
dev_alice_analyst # Business Analyst role
dev_bob_admin # Admin role
dev_carol_viewer # Read-only role
3.4 Local Development Setup
# Clone and setup
git clone https://github.com/zimaxnet/engram.git
cd engram
python -m venv venv && source venv/bin/activate
pip install -r backend/requirements.txt
# Create .env from template
cp .env.example .env
# Edit with dev credentials
# Run locally
uvicorn backend.api.main:app --reload --port 8082
# Test voice locally
curl http://localhost:8082/api/v1/voice/status
3.5 Operational Procedures
Feature Branch Workflow
# Create feature branch
git checkout -b feature/voice-enhancement
# Make changes, test locally
pytest backend/tests -v
# Push and create PR
git push -u origin feature/voice-enhancement
gh pr create --title "Voice: Add interruption handling"
Debug VoiceLive
# Enable detailed logging in code
import logging
logging.getLogger("backend.voice").setLevel(logging.DEBUG)
# Check WebSocket frames
# Use browser DevTools Network tab > WS filter
3.6 Monitoring
- Local logging to stdout
- Application Insights (dev workspace)
- Jaeger for distributed tracing (optional)
4. Level 3: Test
4.1 Purpose
- Automated testing
- CI/CD integration
- Performance baseline
- Security scanning
4.2 Configuration
# Environment Variables
ENVIRONMENT=test
DEBUG=false
LOG_LEVEL=INFO
# VoiceLive - Dedicated test resource
AZURE_VOICELIVE_ENDPOINT=https://zimax-test.services.ai.azure.com
AZURE_VOICELIVE_MODEL=gpt-realtime
# Chat - Test deployment with rate limiting
AZURE_AI_ENDPOINT=https://zimax-gw.azure-api.net/zimax-test/openai/v1
AZURE_AI_DEPLOYMENT=gpt-5-chat
# Test database
DATABASE_URL=postgresql://test_user@test-postgres/engram_test
4.3 Authentication
# Service Principal (non-interactive)
AZURE_CLIENT_ID=<test-sp-client-id>
AZURE_CLIENT_SECRET=<test-sp-secret>
AZURE_TENANT_ID=<tenant-id>
# DefaultAzureCredential picks up automatically
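For manual checks against the Test environment with the same service principal CI uses, a sketch (DefaultAzureCredential reads the three variables directly; `az login` is only needed if you also want to run az commands as that principal):
export AZURE_CLIENT_ID=<test-sp-client-id>
export AZURE_CLIENT_SECRET=<test-sp-secret>
export AZURE_TENANT_ID=<tenant-id>
# Optional: log the CLI in as the same principal for az-based checks
az login --service-principal \
  --username "$AZURE_CLIENT_ID" --password "$AZURE_CLIENT_SECRET" --tenant "$AZURE_TENANT_ID"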
4.4 Azure Resources
| Resource | Name | Configuration |
|---|---|---|
| Container App | test-env-api | 0.5 vCPU, 1GB RAM, scale 0-2 |
| AI Services | zimax-test | Separate quota |
| PostgreSQL | test-postgres | Basic tier, auto-purge |
4.5 Operational Procedures
Run E2E Tests
# Via GitHub Actions
gh workflow run e2e-tests.yml -f environment=test -f test_suite=all
# Manual test run
npx playwright test --grep @voice
npx playwright test --grep @chat
Performance Testing
# Load test chat endpoint
k6 run scripts/load-tests/chat-load.js
# Voice WebSocket stress test
k6 run scripts/load-tests/voice-ws-stress.js
Security Scan
# Dependency scan
pip-audit -r backend/requirements.txt
# Container scan
trivy image ghcr.io/zimaxnet/engram/backend:latest
4.6 Monitoring
- Application Insights (test workspace)
- Test result artifacts in GitHub Actions
- Weekly security reports
5. Level 4: UAT (User Acceptance Testing)
5.1 Purpose
- Business user validation
- Workflow verification
- Accessibility testing
- Final sign-off before production
5.2 Configuration
# Environment Variables
ENVIRONMENT=uat
DEBUG=false
LOG_LEVEL=INFO
# VoiceLive - Production-like
AZURE_VOICELIVE_ENDPOINT=https://zimax.services.ai.azure.com
AZURE_VOICELIVE_MODEL=gpt-realtime
AZURE_VOICELIVE_VOICE=en-US-Ava:DragonHDLatestNeural
# Chat - Production gateway, UAT deployment
AZURE_AI_ENDPOINT=https://zimax-gw.azure-api.net/zimax/openai/v1
AZURE_AI_DEPLOYMENT=gpt-5-chat
# UAT database with anonymized data
DATABASE_URL=postgresql://uat_user@uat-postgres/engram_uat
5.3 Authentication
# Entra ID with test tenant
# Users authenticate via browser SSO
# Test accounts provisioned in Entra ID test tenant
# Required claims
- oid (Object ID)
- preferred_username
- roles (engram.user, engram.analyst, engram.admin)
5.4 User Provisioning
# Create UAT test users in Entra ID
az ad user create \
--display-name "UAT Tester 1" \
--user-principal-name uat.tester1@zimaxtest.onmicrosoft.com \
--password "TempP@ss123!"
# Assign roles
az ad app role assignment add \
--id <app-id> \
--principal-id <user-oid> \
--role-id <engram.analyst-role-id>
5.5 Operational Procedures
UAT Deployment
# Deploy to UAT
gh workflow run deploy.yml -f environment=uat
# Verify health
curl https://uat-env-api.../health
curl https://uat-env-api.../api/v1/voice/status
UAT Test Scenarios
## Voice Test Cases
1. [ ] Start voice session with Elena
2. [ ] Speak requirements, verify transcription
3. [ ] Switch to Marcus mid-conversation
4. [ ] Verify voice characteristics match agent
5. [ ] Test interruption handling
6. [ ] Test poor network simulation
## Chat Test Cases
1. [ ] Send text message, verify response
2. [ ] Test multi-turn conversation
3. [ ] Verify memory persistence
4. [ ] Test agent switching
5. [ ] Verify facts extraction
Sign-off Process
## UAT Sign-off Checklist
- [ ] All test cases passed
- [ ] Accessibility review complete
- [ ] Performance acceptable
- [ ] Security review approved
- [ ] Business stakeholder sign-off
- [ ] Ready for production deployment
5.6 Monitoring
- Application Insights (UAT workspace)
- User feedback collection
- Session recordings (with consent)
6. Level 5: Production
6.1 Purpose
- Live customer operations
- Full security enforcement
- High availability
- Compliance audit trail
6.2 Configuration
# Environment Variables
ENVIRONMENT=production
DEBUG=false
LOG_LEVEL=WARNING
# VoiceLive - Production with HA
AZURE_VOICELIVE_ENDPOINT=https://zimax.services.ai.azure.com
AZURE_VOICELIVE_MODEL=gpt-realtime
AZURE_VOICELIVE_VOICE=en-US-Ava:DragonHDLatestNeural
# Chat - Production gateway with premium tier
AZURE_AI_ENDPOINT=https://zimax-gw.azure-api.net/zimax/openai/v1
AZURE_AI_DEPLOYMENT=gpt-5-chat
# Production database
DATABASE_URL=postgresql://prod_user@prod-postgres/engram_prod
# Compliance
AUDIT_LOG_ENABLED=true
DATA_RETENTION_DAYS=90
6.3 Authentication
# Entra ID production tenant
# MFA enforced
# Conditional Access policies applied
# Required security
- MFA for all users
- Compliant device required
- Geographic restrictions (optional)
- Session timeout: 8 hours
6.4 Azure Resources
| Resource | Name | Configuration |
|---|---|---|
| Container App | prod-env-api | 1 vCPU, 2GB RAM, scale 1-10, zone redundant |
| AI Services | zimax | Premium tier, 100K TPM |
| PostgreSQL | prod-postgres | General Purpose, HA, geo-backup |
| Key Vault | prodenvkv* | HSM-backed keys |
| Private Link | All services | No public endpoints |
6.5 Operational Procedures
Production Deployment
# Requires approval gate
gh workflow run deploy.yml -f environment=production
# Blue-green deployment
# 1. Deploy to staging slot
# 2. Run smoke tests
# 3. Swap slots
# 4. Monitor for 15 minutes
# 5. Rollback if issues
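On Container Apps, the "slot swap" above is typically implemented as revision traffic shifting. A sketch, assuming multiple-revision mode and placeholder revision names:
# List revisions, shift a small share of traffic to the new one, then promote (or roll back)
az containerapp revision list --name prod-env-api --resource-group engram-rg-prod -o table
az containerapp ingress traffic set --name prod-env-api --resource-group engram-rg-prod \
  --revision-weight <new-revision>=10 <old-revision>=90
# After the 15-minute soak looks clean:
az containerapp ingress traffic set --name prod-env-api --resource-group engram-rg-prod \
  --revision-weight <new-revision>=100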
Rollback Procedure
# Immediate rollback
az containerapp revision activate \
--name prod-env-api \
--resource-group engram-rg-prod \
--revision <previous-revision>
# Verify rollback
curl https://prod-env-api.../health
Incident Response
## VoiceLive Incident
1. Check Azure AI Services status
2. Review Container App logs
3. Check network connectivity
4. Verify Managed Identity permissions
5. Escalate to Azure Support if needed
## Chat Incident
1. Check APIM gateway status
2. Verify API key in Key Vault
3. Check rate limiting
4. Review error logs
5. Escalate if persistent
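A few triage commands covering the first steps of both runbooks (names match the Production resources table; the Key Vault name is a placeholder):
# Recent backend errors
az containerapp logs show --name prod-env-api --resource-group engram-rg-prod --tail 200
# VoiceLive: confirm the backend MI still holds the Speech role on the AI Services account
PRINCIPAL_ID=$(az containerapp show --name prod-env-api --resource-group engram-rg-prod \
  --query identity.principalId -o tsv)
az role assignment list --assignee "$PRINCIPAL_ID" -o table
# Chat: confirm the gateway key secret is present and current in Key Vault
az keyvault secret show --vault-name <PROD_KV_NAME> --name azure-ai-key --query attributes.updated -o tsv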
6.6 Monitoring & Alerting
# Alert Rules
- name: VoiceLive Error Rate
condition: error_rate > 5%
severity: High
action: PagerDuty + Slack
- name: Chat Latency
condition: p95_latency > 3s
severity: Medium
action: Slack
- name: Active Sessions Spike
condition: sessions > 100
severity: Low
action: Email
- name: Authentication Failures
condition: auth_failures > 10/min
severity: High
action: PagerDuty + Security Team
6.7 Compliance
## Audit Requirements
- All API calls logged with user identity
- Voice recordings retained per policy
- Chat transcripts encrypted at rest
- Monthly access reviews
- Quarterly penetration testing
## Data Handling
- PII masked in logs
- No sensitive data in error messages
- GDPR right to erasure supported
- Data residency: East US 2
7. Environment Comparison Matrix
| Feature | Staging POC | Dev | Test | UAT | Prod |
|---|---|---|---|---|---|
| Auth | Dev tokens | Dev tokens | Service Principal | Entra ID | Entra ID + MFA |
| VoiceLive | ✅ | ✅ | ✅ | ✅ | ✅ |
| Chat | ✅ | ✅ | ✅ | ✅ | ✅ |
| Managed Identity | ✅ | ❌ (local) | ✅ | ✅ | ✅ |
| Private Link | ❌ | ❌ | ❌ | ❌ | ✅ |
| Audit Logging | ❌ | ❌ | ✅ | ✅ | ✅ |
| HA/DR | ❌ | ❌ | ❌ | ❌ | ✅ |
| Scale Min | 0 | 0 | 0 | 1 | 1 |
| Scale Max | 3 | 2 | 2 | 5 | 10 |
| Cost/Month | ~$50 | ~$30 | ~$40 | ~$100 | ~$500+ |
8. Secrets Management
8.1 GitHub Secrets (CI/CD)
# Required for all environments
AZURE_CREDENTIALS # Service Principal JSON
CR_PAT # Container Registry token
# Per-environment
AZURE_AI_ENDPOINT # AI Services endpoint
AZURE_OPENAI_KEY # APIM API key
POSTGRES_PASSWORD # Database password
8.2 Key Vault Secrets
# Staging POC
azure-ai-key # AI Services key
zep-api-key # Memory service key
# Production (additional)
encryption-key # Data encryption
audit-storage-key # Audit log storage
8.3 Rotation Schedule
| Secret | Rotation | Procedure |
|---|---|---|
| API Keys | 90 days | Automated via Key Vault |
| DB Password | 180 days | Manual with downtime window |
| Service Principal | 365 days | Create new, update, delete old |
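A sketch of the 90-day API-key rotation for the AI Services key (placeholder names; rotate the key the app is *not* currently reading first, or accept a brief overlap while the secret propagates):
# Regenerate the AI Services key and re-point the Key Vault secret the container app reads
az cognitiveservices account keys regenerate --name <AI_ACCOUNT> --resource-group <RG> --key-name key2
NEW_KEY=$(az cognitiveservices account keys list --name <AI_ACCOUNT> --resource-group <RG> --query key2 -o tsv)
az keyvault secret set --vault-name <KV_NAME> --name azure-ai-key --value "$NEW_KEY"
# Restart the active revision so the app picks up the new secret value
az containerapp revision restart --name <ENGRAM_API_APP> --resource-group <RG> --revision <active-revision>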
9. Troubleshooting Guide
9.1 VoiceLive Issues
| Symptom | Cause | Resolution |
|---|---|---|
| `voicelive_configured: false` | Missing endpoint | Set AZURE_VOICELIVE_ENDPOINT |
| `key must be a string` | Missing/empty key in API-key mode | Set AZURE_VOICELIVE_KEY (or use Managed Identity auth) |
| `Failed to establish WebSocket connection: 401` | Managed Identity lacks VoiceLive permission | Assign Cognitive Services Speech User to the backend MI at the AI Services resource scope |
| 401 Unauthorized | Invalid credential | Check Managed Identity / Key Vault |
| WebSocket disconnect | Network timeout | Increase client timeout, check firewall |
| Audio quality poor | Codec mismatch | Ensure PCM16, mono; prefer 24kHz (Foundry samples), and resample if needed |
9.2 Chat Issues
| Symptom | Cause | Resolution |
|---|---|---|
| 401 Missing token | No auth header | Add Authorization: Bearer <token> |
| 429 Rate limited | Too many requests | Implement backoff, check quota |
| Slow responses | Model latency | Check Azure AI status, consider caching |
| Empty responses | Context too long | Reduce conversation history |
9.3 Common Commands
# Check container logs
az containerapp logs show --name staging-env-api --resource-group engram-rg --tail 100
# Restart container
az containerapp revision restart --name staging-env-api --resource-group engram-rg --revision <rev>
# Check env vars
az containerapp show --name staging-env-api --resource-group engram-rg \
--query "properties.template.containers[0].env"
# Test health
curl https://<api-url>/health | jq
curl https://<api-url>/api/v1/voice/status | jq
10. Appendix
10.1 API Endpoints
| Endpoint | Method | Auth | Description |
|---|---|---|---|
| /health | GET | None | Health check |
| /api/v1/voice/status | GET | None | VoiceLive status |
| /api/v1/voice/config/{agent} | GET | None | Agent voice config |
| /api/v1/voice/voicelive/{session_id} | WS | None (POC) | Voice WebSocket (VoiceLive proxy) |
| /api/v1/chat | POST | Depends | Chat message (POC: no auth; Prod: Bearer JWT) |
| /api/v1/agents | GET | None | List agents |
10.2 Voice WebSocket Protocol
// Client -> Server
{"type": "audio", "data": "<base64-pcm16>"}
{"type": "agent", "agent_id": "marcus"}
{"type": "cancel"}
// Server -> Client
{"type": "agent_switched", "agent_id": "elena"}
{"type": "transcription", "status": "listening|processing|complete", "text": "..."}
{"type": "audio", "data": "<base64-pcm16>", "format": "audio/pcm16"}
{"type": "agent_switched", "agent_id": "marcus"}
{"type": "error", "message": "..."}
10.3 Contact & Escalation
| Level | Contact | Response Time |
|---|---|---|
| L1 | DevOps Team | < 15 min |
| L2 | Platform Engineering | < 1 hour |
| L3 | Azure Support | < 4 hours |
| Security | Security Team | Immediate |
Document maintained by Platform Engineering. For updates, submit PR to docs/sop-voicelive-chat-environments.md