Track 1 · ROCm + vLLM · Qwen3-Coder-Next · CUDA to ROCm

ROCmPilot

Multi-agent ROCm migration cockpit for PyTorch and vLLM workloads.

Sample workload

Public GitHub URL

Agent run

Ready to audit the selected workload

idle

Select a workload and start the audit.

Agent War Room

Lead-led discussion with replies, objections, and shared memory the agents reuse later.

standby

lead: No lead assigned

Repo Doctor waiting

Migration Planner waiting

Build Runner waiting

Benchmark Agent waiting

Report Agent waiting

Orchestrator waiting

CUDA/ROCm Coach Agent

Separate helper agent for developer questions, using the current findings and memory.

sidecar agent

CUDA/ROCm Coach

I am the CUDA/ROCm Coach Agent. Ask me what a finding means, how ROCm differs from CUDA, or how to validate PyTorch/vLLM on AMD.

Context: 4 findings, 3 patches, 0 memories.

Migration Kit Agent

Builds a downloadable implementation file from repo findings, patches, and shared memory.

export agent

Target

Qwen vLLM CUDA Starter

findings

files

memories

Package preview

Generates qwen-vllm-cuda-starter-migration-kit.md
Includes patch previews for src/inference/device.py, Dockerfile.rocm, scripts/serve-rocm.sh
Adds a ROCm validation script for PyTorch HIP detection and vLLM serving
References shared memory from the agent war room
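
A minimal sketch of what the PyTorch HIP detection part of such a validation script might look like. It is not the generated kit's actual script. The function name `detect_backend` is hypothetical. It relies on two PyTorch facts: `torch.version.hip` is a version string on ROCm builds and `None` otherwise, and ROCm builds expose AMD GPUs through `torch.cuda`. The vLLM serving check is left out of this sketch.

```python
def detect_backend(torch_mod):
    """Classify a PyTorch build as 'rocm', 'cuda', or 'cpu'.

    torch_mod is the imported torch module. It is passed in explicitly so
    the check can also be exercised with a stand-in object.
    """
    # torch.version.hip is a version string on ROCm builds, None otherwise.
    hip_version = getattr(torch_mod.version, "hip", None)
    # ROCm builds report AMD GPUs through the torch.cuda namespace.
    gpu_visible = bool(torch_mod.cuda.is_available())
    if not gpu_visible:
        return "cpu"
    return "rocm" if hip_version else "cuda"


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed; skipping backend check")
    else:
        print("backend:", detect_backend(torch))
```

On an AMD host with a ROCm build of PyTorch this would be expected to report `rocm`; a result of `cpu` suggests the GPU is not visible to the process.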

Migration findings

CUDA assumptions, ROCm blockers, and recommended fixes.

Severity	Category	Location	Fix
Findings appear after the Repo Doctor stage starts.

Evidence panels

Patch previews, terminal logs, and final report output.

Patch previews unlock after the Migration Planner stage.

Workload

FastAPI, PyTorch, vLLM, Docker

Repository URL

Scan mode

Curated sample fixture

Target

Qwen vLLM CUDA Starter

Files scanned

Risk

Hardcoded CUDA device checks block AMD Developer Cloud deployment.
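
As an illustration of how hardcoded CUDA checks like these could be surfaced, here is a hedged sketch of a line-based scanner. The pattern list and the name `scan_text` are assumptions made for this example and are not taken from the Repo Doctor agent. A check on `torch.version.cuda` is flagged because that attribute is `None` on ROCm builds of PyTorch.

```python
import re

# Hypothetical patterns; a real scanner would likely need a longer, tuned list.
CUDA_PATTERNS = {
    "nvidia-smi call": re.compile(r"nvidia-smi"),
    "CUDA-only version check": re.compile(r"torch\.version\.cuda"),
    "NVIDIA base image": re.compile(r"^\s*FROM\s+nvidia/cuda", re.IGNORECASE),
}


def scan_text(path, text):
    """Return (path, line_number, label) for every line matching a pattern."""
    findings = []
    for line_number, line in enumerate(text.splitlines(), start=1):
        for label, pattern in CUDA_PATTERNS.items():
            if pattern.search(line):
                findings.append((path, line_number, label))
    return findings


if __name__ == "__main__":
    # Made-up file content, used only to show the output shape.
    sample = "import torch\nassert torch.version.cuda is not None\n"
    for finding in scan_text("src/inference/device.py", sample):
        print(finding)
```

Each finding carries a location and a category label, which maps loosely onto the Location and Category columns of the findings table.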

GPU model status

Qwen endpoint used by the Report Agent.

AMD GPU Model: Demo fallback

Qwen/Qwen3-Coder-Next

Set AMD_QWEN_BASE_URL

The MVP stays demo-safe until an AMD ROCm/vLLM endpoint is available.
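
The fallback described here can be sketched as a small environment lookup. Only the variable name `AMD_QWEN_BASE_URL` and the model id come from this page; the function name `resolve_model_status` and the `mode` values are hypothetical.

```python
import os

MODEL_ID = "Qwen/Qwen3-Coder-Next"


def resolve_model_status(env=None):
    """Pick the live endpoint when AMD_QWEN_BASE_URL is set, else demo mode."""
    env = os.environ if env is None else env
    # Treat an empty or whitespace-only value as unset.
    base_url = (env.get("AMD_QWEN_BASE_URL") or "").strip()
    if base_url:
        return {"mode": "live", "base_url": base_url.rstrip("/"), "model": MODEL_ID}
    return {"mode": "demo-fallback", "base_url": None, "model": MODEL_ID}


if __name__ == "__main__":
    print(resolve_model_status())
```

Passing the environment as a mapping keeps the decision easy to test without touching the real process environment.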

Long-context memory

Synap context used by the Report Agent.

Synap Memory: Local fallback

Provider

local

Conversation

created after run starts

stored

recalled

Set SYNAP_API_KEY to persist agent memory across sessions.

Benchmark profile

Demo metrics for the submission walkthrough.

Benchmark cards appear during the run.

Submission proof

Track 1 agentic workflow with five specialized agents.

AMD GPU story through ROCm/vLLM Qwen model serving.

Demo remains reliable without credentials, then upgrades with live endpoint logs.