OVATION — Multi-Agent Review Response | Band of Agents Hackathon

Why this matters

Every unanswered review is an empty seat.

53%

of customers expect a reply to a negative review within a week.

Most never get one. The ones that do are often generic, slow, or both.

At scale, a thoughtful response isn't a writing problem — it's a coordination problem.

Google Reviews

Yelp

TripAdvisor

Reviews go unanswered. Every single day.

The cast

Six agents. One stage. Every cue visible.

Coordinated entirely inside Band — no hidden handoffs, no black-box routing. Every message between agents is one judges can read.

01 · INTAKE

Monitor

Polls connected sources and surfaces new reviews into the Band room as they arrive.

No LLM

02 · TRIAGE

Triage

Classifies sentiment and urgency, then routes the review to the right next step.

Featherless

03 · CONTEXT

Research

Enriches the review with brand voice and business context before drafting begins.

Featherless

04 · VOICE

Drafter

Generates the candidate response — and rewrites it when QA sends it back.

DeepSeek-V4

05 · REVIEW

QA

Scores every draft against tone and quality. Approves it — or loops it back, up to twice.

DeepSeek-V4

06 · ALERT

Escalation

Catches legal threats, safety issues, and media mentions — holds them for a human, always.

Featherless

The coordination showpiece

Watch the QA loop happen live.

A draft that misses the mark doesn't get published — it gets sent back, revised, and reviewed again. In the Band room, judges watch every step.

DRAFTER

→

QA AGENT

APPROVED ↓

→

PUBLISHED ✓

REJECTED — revision notes returned to Drafter Max 2 iterations — if QA rejects twice, Escalation takes over

“Every agent message is a real Band message. No hidden handoffs. No black-box coordination.”

Coverage

8 engineered seed scenarios.

Every branch of the pipeline — publish, revise, and escalate — exercised and verified before the live demo.

#01

Happy Path

PUBLISH

Full pipeline, single pass, QA approves on the first draft.

#02

Routine Complaint

PUBLISH

Brand-voice research is critical to getting the tone right.

#03

Context-Dependent

PUBLISH

Research enrichment is what decides the correct routing outcome.

#04

QA Revision Loop ★

LOOP

QA rejects v1 as too verbose. Drafter revises. QA approves v2.

#05

Legal Threat

ESCALATE

Drafting is skipped entirely. Held for human review, no auto-publish.

#06

Food Safety

ESCALATE

Immediate critical hold — severity HIGH, no exceptions.

#07

Media Mention

ESCALATE

Comms-team alert fired. No response goes out automatically.

#08

Ambiguous Mixed

PUBLISH

Research and Drafter together navigate genuinely mixed sentiment.

Under the hood

Built on Band, run on two LLM providers.

Band

COORDINATION LAYER

Shared room, agent handles, visible message routing for all six agents.

Featherless

LLM PROVIDER

Powers Triage, Research, and Escalation — cost-efficient inference.

AI/ML API · DeepSeek-V4

LLM PROVIDER

Powers Drafter and QA — premium reasoning for generation and scoring.

HTTP Trigger

DEMO CONTROL

A controlled inject endpoint for precise, repeatable live demo scenarios.

How to run

# clone and install
git clone git@github.com:resilientbeast/review-response-system.git
cd review-response-system
uv sync

# configure
cp .env.example .env
cp agent_config.yaml.example agent_config.yaml
# add BAND_API_KEY, OPENAI_API_KEY, etc.

# run the agent pipeline
uv run python run_all.py

# trigger seed_04 — the QA revision loop showpiece
curl -X POST http://localhost:8002/inject \
  -H "Content-Type: application/json" \
  -d '{
    "platform": "tripadvisor",
    "business_id": "loc_demo",
    "review": {
      "text": "Worst experience of my life. The manager was completely dismissive when I raised my concerns. The food was inedible and no one seemed to care. I will be leaving reviews everywhere I can.",
      "rating": 1,
      "author": "Sarah K.",
      "url": "http://demo.platform",
      "language": "en"
    }
  }'

# full setup incl. dashboard + ingestion: see Judges Overview