105+ tests across 14 rounds • 10 test websites • 3 models • Updated March 11, 2026
| Level | S1 | S2 | S3 | S4 | S5 | Avg |
|---|---|---|---|---|---|---|
| Full | 8 demo | 9 demo | 9 demo | 8 demo | 9 demo | 8.6 |
| Medium | 8 demo | 7 demo | 9 demo | 9 demo | 9 demo | 8.4 |
| Lean | 9 demo | 9 demo | 8 demo | 8 demo | 9 demo | 8.6 |
| Zero | 8 demo | 8 demo | 8 demo | 8 demo | 9 demo | 8.2 |
| Source | S1 | S2 | S3 | S4 | S5 | Avg |
|---|---|---|---|---|---|---|
| Real | 8 demo | 9 demo | 8 demo | 9 demo | 9 demo | 8.6 |
| Stock | 7 demo | 5 demo | 7 demo | 7 demo | 4 demo | 6.0 |
| Mix | 8 demo | 9 demo | 8 demo | 9 demo | 9 demo | 8.6 |
| None | 9 demo | 9 demo | 8 demo | 8 demo | 9 demo | 8.6 |
| Control | S1 | S2 | S3 | S4 | S5 | Avg |
|---|---|---|---|---|---|---|
| Specified | 9 demo | 9 demo | 9 demo | 9 demo | 9 demo | 9.0 |
| Vibe | 9 demo | 9 demo | 8 demo | 9 demo | 4 demo | 7.8 |
| Free | 9 demo | 9 demo | 9 demo | 9 demo | 9 demo | 9.0 |
| Model | S1 | S2 | S3 | S4 | S5 | Avg | $/M in | $/M out |
|---|---|---|---|---|---|---|---|---|
| Pro | 9 demo | 9 demo | 9 demo | 9 demo | 9 demo | 9.0 | $1.25 | $10.00 |
| Flash | 9 demo | 9 demo | 8 demo | 9 demo | 9 demo | 8.8 | $0.10 | $0.40 |
| Lite | 7 demo | 4 demo | 8 demo | 6 demo | 8 demo | 6.6 | $0.02 | $0.10 |
| Temp | S1 | S2 | S3 | S4 | S5 | Avg |
|---|---|---|---|---|---|---|
| 0.3 | 9 demo | 9 demo | 8 demo | 9 demo | 9 demo | 8.8 |
| 0.5 | 9 demo | 9 demo | 9 demo | 9 demo | 9 demo | 9.0 |
| 0.7 | 9 demo | 8 demo | 9 demo | 9 demo | 8 demo | 8.6 |
| Site | Score | Demo |
|---|---|---|
| S1 Professional Lawyer | 9 | demo |
| S2 The Groomer | 9 | demo |
| S3 Now Consultant | 9 | demo |
| S4 PAL Auto Garage | 9 | demo |
| S5 Jazz Lounge Spa | 9 | demo |
| Method | S1 | S2 | S3 | S4 | S5 | Avg |
|---|---|---|---|---|---|---|
| Lean | 9 demo | 9 demo | 8 demo | 8 demo | 9 demo | 8.6 |
| Checklist | 9 demo | 8 demo | 8 demo | 9 demo | 9 demo | 8.6 |
| Skeleton | 9 demo | 8 demo | 9 demo | 7 demo | 9 demo | 8.4 |
| Site | Niche | Pass 1 | Pass 2 | Δ | Img P1→P2 | Demos |
|---|---|---|---|---|---|---|
| S1 | Law Firm | 8 | 9 | +1 | 7→8 | P1 P2 |
| S2 | Pet Grooming | 7 | 7 | 0 | 5→4 | P1 P2 |
| S3 | Accounting | 8 | 9 | +1 | 6→9 | P1 P2 |
| S4 | Auto Repair | 9 | 9 | 0 | 9→9 | P1 P2 |
| S5 | Spa | 9 | 8 | -1 | 9→5 | P1 P2 |
| S6 | Barbershop | 9 | 9 | 0 | 9→9 | P1 P2 |
| S7 | Tax Attorney | 8 | 9 | +1 | 7→9 | P1 P2 |
| S8 | Auto Service | 7 | 7 | 0 | 3→5 | P1 P2 |
| S9 | Criminal Law | 7 | 9 | +2 | 2→9 | P1 P2 |
| S10 | Family Law | 7 | 7 | 0 | 4→5 | P1 P2 |
| Site | P1 | P2 | Img P1 | Demos |
|---|---|---|---|---|
| S7 Tax Attorney | 9 | 9 | 8 | P1 P2 |
| S9 Criminal Law | 7 | 8 | 3 | P1 P2 |
| S10 Family Law | 8 | 8 | 5 | P1 P2 |
| Site | P1 | P2 | Img P1 | Demos |
|---|---|---|---|---|
| S7 Tax Attorney | 9 | 9 | 8 | P1 P2 |
| S9 Criminal Law | 9 | 9 | 8 | P1 P2 |
| S10 Family Law | 9 | 9 | 8 | P1 P2 |
| Site | P1 | P2 | Img P1 | Demos |
|---|---|---|---|---|
| S7 Tax Attorney | 9 | 8 | 7 | P1 P2 |
| S9 Criminal Law | 7 | 9 | 2 | P1 P2 |
| S10 Family Law | 7 | 8 | 3 | P1 P2 |
| Site | Visual | Sections | Images | Copy | Mobile | Overall | Demo |
|---|---|---|---|---|---|---|---|
| S7 Tax Attorney | 9 | 10 | 9 | 9 | 8 | 9 | demo |
| S9 Criminal Law | 9 | 10 | 9 | 9 | 8 | 9 | demo |
| S10 Family Law | 9 | 10 | 9 | 9 | 8 | 9 | demo |
| Site | Visual | Sections | Images | Copy | Mobile | Overall | Demo |
|---|---|---|---|---|---|---|---|
| S7 Tax Attorney | 9 | 9 | 9 | 9 | 8 | 9 | demo |
| S9 Criminal Law | 9 | 10 | 9 | 9 | 8 | 9 | demo |
| S10 Family Law | 9 | 10 | 8 | 9 | 8 | 9 | demo |
| Site | R9 | R10 | R11 | R12 | R13 ⭐ | R14 |
|---|---|---|---|---|---|---|
| S7 Tax | 8→9 | 9→9 | 9→9 | 9→8 | 9 (img:9) | 9 (img:9) |
| S9 Criminal | 7→9 | 7→8 | 9→9 | 7→9 | 9 (img:9) | 9 (img:9) |
| S10 Family | 7→7 | 8→8 | 9→9 | 7→8 | 9 (img:9) | 9 (img:8) |
| Parameter | Value | Why |
|---|---|---|
| Instructions | Skeleton + niche rules | HTML skeleton ensures all sections, niche rules add polish |
| Images | Curated image bank (38) | Solved #1 bottleneck: avg 4-8 → consistent 9 |
| Design | Niche direction | Navy/Gold + DM Serif Display/Inter |
| Layout | 4 creativity rules | Asymmetry, scale contrast, variety, atmosphere |
| Copy | Niche-specific | Outcome headlines, proper CTAs, trust signals |
| Prompt | Markdown | XML didn't improve. Simpler and faster. |
| Model | gemini-3.1-pro | 9.0/10. Flash at 8.8 for 10x less |
| Temp | 0.5 | Beats 0.3 and 0.7 |
| Passes | Single | 9/10 first try. Pass 2 only when <9 |
| 1 | Images are everything. AI image bank (Nano Banana Pro, $0.01/image) eliminated broken/irrelevant images. |
| 2 | Fewer rules beat more. R11 (4 rules, 60 words) = 9/10. R12 (14 rules, 200 words) = 7-9/10. |
| 3 | Niche direction > generic freedom. Palette + fonts + copy style adds polish without penalty. |
| 4 | XML and temp 1.0 are hype. R14 matched or underperformed R13. |
| 5 | Feedback loop has diminishing returns. Great for 8→9, risky at 9 (can regress). |
| 6 | Skeleton templates work. Pre-defined sections ensure completeness without limiting creativity. |