AMD Developer Hackathon · 2026

Path to Care

Multimodal, agentic triage decision-support for rural healthcare in the Global South. A phone photo plus a typed narrative becomes a top-3 condition guess, a Red / Yellow / Green urgency, and a structured pre-visit SOAP for the clinic doctor — contextualized by village distance, cost, and harvest season. It never diagnoses.

Live demo on Hugging Face GitHub LoRA adapter

What it does

A community health worker (or the patient) sends a phone photo of a skin lesion plus a short text history. Path to Care runs the image and narrative through a multimodal agentic pipeline and returns:

Top-3 conditions with confidence — never a single class label.
Red / Yellow / Green urgency, with the rule-based safety net only ever escalating, never relaxing.
Structured SOAP note for the clinic physician — chief complaint, HPI, exam, red flags, patient concerns.
Patient framing that names the cost-of-care tradeoff in the patient's own numbers (transport ₹, daily wage ₹, distance km).

Every patient-facing string passes through a deterministic cardinal-rule rewriter that strips diagnostic phrasing ("you have X" → "signs suggest X") before it leaves the API. The output is decision support, not a diagnosis.

Architecture

Multimodal Gemma 4 31B-it serves both the image classifier and the triage reasoner from one set of weights on a single AMD Instinct MI300X (192 GB VRAM, ROCm).
Qwen 2.5-7B-Instruct handles text-only SOAP extraction via a DSPy NarrativeToSOAP signature.
LoRA SFT on MI300X — supervised fine-tune of the triage head, 32 seconds wall-clock, loss 3.90 → 0.58 in one epoch. 180 MB adapter, served via vLLM --enable-lora.
DSPy ReAct orchestrator coordinates five MCP servers: image classifier, SOAP extractor, village context, triage reasoner, and camera capture.
vLLM (ROCm) in Docker for serving — OpenAI-compatible API on :8000. The Next.js frontend talks to it directly.
Adversarial test set — 30 hand-authored cases (10 R / 10 Y / 10 G) with red-flag, contradiction, and off-distribution variants. Held out from training.

Track 1 · Agents Track 2 · Fine-Tuning MI300X Track 3 · Multimodal Qwen prize Hugging Face prize Build-in-Public

Fine-tuning on MI300X

The triage head is a LoRA SFT pass over Gemma 4 31B-it, run end to end on a single AMD Instinct MI300X. Two epochs, twenty-one training examples, finished in 32 seconds. Training script in training/lora_sft.py; full log in logs/lora_train.log.

LoRA SFT loss curve — Path to Care, Gemma 4 31B-it on AMD MI300X. Cross-entropy loss falls from 3.90 at step 1 to 0.58 at step 10 across two epochs, in 32 seconds wall-clock. — Training loss across the 32-second run on MI300X. Loss 3.90 → 0.58 over 10 optimizer steps · 2 epochs · effective batch size 4.

Parameter	Value
Base model	`google/gemma-4-31B-it` (multimodal, dense)
Adapter	LoRA · `r=16`, `alpha=32`, `dropout=0.05`, `bias=none`
Target modules	language-model self-attention (`q_proj`, `k_proj`, `v_proj`, `o_proj`)
Trainable parameters	45.0 M of 31.3 B (0.14%)
Training rows	21 (image + SOAP + village context → urgency + reasoning)
Epochs · effective batch size	2 · 4
Wall-clock time	32 seconds
Training loss	3.90 → 0.58

Adapter weights on Hugging Face Hub at sankara68/path-to-care-triage-gemma4-lora. Served by vLLM with --enable-lora --lora-modules triage=adapters/triage-gemma4-lora; the orchestrator opts in via PTC_VLLM_LORA_NAME=triage. The eval delta from this adapter is the +7 pp top-1 accuracy lift in the Results table below (Image classification · SCIN top-16).

Results

Two complementary evaluations: a 30-case adversarial test set authored to probe the safety property (red flags, contradictions, off-distribution variants), and a 100-case held-out slice of the SCIN dermatology dataset to probe image-grounded classification.

Triage urgency — 30 adversarial cases

Reward R = 1.0 exact / 0.5 adjacent / 0.0 off-by-two.

Run	Mean reward	Exact match	FN Red → Green
Zero-shot baseline (Gemma 4 31B)	0.983	96.7%	0.0%
LoRA-tuned (180 MB adapter)	0.983	96.7%	0.0%

Both runs hit the same ceiling — the single residual error is a Yellow → Green slip; no Red was missed. The headline here is the false-negative Red → Green rate at 0.0% — the safety property that matters in the field.

Image classification — SCIN top-16, 100-case holdout

Top-1 accuracy on a held-out slice of the Stanford SCIN dermatology dataset, restricted to the 16 most-frequent conditions.

Run	Top-1 accuracy	Δ vs baseline
Zero-shot baseline (Gemma 4 31B)	28.0%	—
LoRA-tuned (same 180 MB adapter)	35.0%	+7.0 pp / +25% rel

A 32-second LoRA training run on the MI300X moved top-1 from 28% to 35% — a real learning signal beyond the saturated triage table above. Per-case results in results/scin_top16_topk_tuned.json and scin_top16_topk_baseline.json.

Try it

Live demo on Hugging Face

Public, always-on Space. Upload a photo or describe in text — the Space HTTPs into the MI300X-hosted vLLM container for all model work.

Source on GitHub

Full repo: orchestrator, MCP servers, LoRA training, eval harness, adversarial generator, Next.js frontend.

LoRA adapter on HF Hub

The 180 MB triage adapter trained on MI300X. Load with PEFT or serve via vLLM --enable-lora.

The cardinal rule

Path to Care never produces diagnostic statements. The output is always "signs suggest infection", never "you have cellulitis"; image output is always top-3 with confidence, never a single class label, never binary sick / healthy. Enforcement is defense-in-depth: a system prompt rule, a deterministic regex rewriter on every model output, and a unit test suite that fails the build on diagnostic phrasing.

Decision-support tool. Not a diagnostic system. Always consult a qualified physician.