AI Camp Berlin Β· April 2026

We can each build anything.

So why can't we
build together?

Waldo Vanderhaeghen  Β·  Kleinanzeigen
Act 1
The Ladder
A framework for where you are

πŸͺœ The 5 Levels of AI Integration

⌨️ L1 β€” Autocomplete AI finishes your sentences. You still write everything.
πŸ’¬ L2 β€” Chat Describe intent, AI generates. No execution, no verification.
🀝 L3 β€” Agentic AI navigates, reads, plans, executes multi-step. You review.
βš™οΈ L4 β€” Harness AI writes, tests, reads failures, patches its own errors β€” on its own. You define inputs, outputs, and acceptance criteria. The bottleneck shifts from speed to spec quality.
🏭 L5 β€” Dark Factory AI owns generation, testing, and deployment. No human in the critical path. Your job: write perfect specs and maintain the verification systems that catch what goes wrong.
NB
Framework by Nate B. Jones β€” The Nate Newsletter Β· Substack
Act 2
My Journey
From superpowers to getting lost

βœ‰οΈ L2 β€” Invoice automation

Drop a PDF. Never touch it again.

Vendor, date, amount, category β€” extracted and filed automatically. Zero infrastructure, zero cost.

πŸ“‚
Drive Inbox
drop PDF
β†’
🧠
Gemini Flash
extract JSON
β†’
βš™οΈ
Logic layer
route + dedupe
β†’
πŸ“Š
Sheets
vendor Β· amount Β· category
β†’
πŸ“
Archive
filed by year
Architecture
Google Apps Script + Gemini
Monthly cost
€0
Self-healing
learns new vendors
Audit trail
every decision logged
Spreadsheet output
Date Vendor Amount Category
2026-04-01 Vodafone €49.99 Telekommunikation
2026-04-03 WeWork €350.00 BΓΌro
2026-04-08 AWS €12.40 Software
2026-04-12 Lidl €28.15 BΓΌrobedarf
2026-04-15 Notion €16.00 Software
MD5 dedupe Β· script_log audit Β· Drive verification
Act 3
L3 β€” Agentic AI
AI as the problem-solving teammate

πŸ“ˆ L3 β€” Campaign analysis pipeline

😰 The problem

"The reviews that happened were symptom-driven β€” something had to go visibly wrong before it got attention."

  • Everything else got a quick skim or nothing
  • Advertisers got inconsistent advice
  • Patterns across campaigns were invisible β€” nobody was looking across campaigns
Manual review per campaign: 40–60 min. Book of 200+ campaigns. The math doesn't work.
βš™οΈ The pipeline

"A pipeline that runs every night whether anyone asks it to or not."

  • Every campaign, every night β€” fully headless
  • Deterministic detection β†’ Claude synthesis β†’ recommendations
  • 15-test E2E suite gates every deploy
"The quality floor went up because nothing gets skipped β€” not because the ceiling got higher."
πŸ“Š
Campaign Analysis Dashboard
Every campaign, every night. Deterministic detection β†’ Claude synthesis β†’ recommendations. 15-test E2E suite gates every deploy.
100%
campaigns covered
15
E2E tests
0
manual steps
βš™οΈ L3 β€” The pipeline, live

πŸͺ€ Still L3 β€” the solo builder trap

I shipped it. It works. But I'm still the only reviewer, the only debugger, the only one who understands the system.

🧠
Context is unshared
The system lives in my head and 50 agent chat sessions. There's no onboarding path. Nobody else can pick this up.
πŸ”
Review is broken
It's not really "code" anymore. How do you review a diff when the agent wrote 80% of it and the logic is in the prompts?
⚑
Velocity doesn't transfer
I can ship a full feature on a Sunday. The moment a second person joins, we're back to meetings, alignment, "can you explain what this does?"
At L3 the AI is a teammate β€” but I'm still the single point of failure. That's the plateau.
Act 4
L4 Harness-driven,
semi-autonomous
towards L5, the dark factory

βš™οΈ L4 β€” what it means

You stop managing the code. You start managing the system.

πŸ€–
Self-running loop
AI writes, tests, reads failures, patches errors β€” on its own. No human in the middle.
πŸ“
You own the spec
You define inputs, outputs, acceptance criteria. The system figures out how to get there.
πŸ“ˆ
System compounds
Memory layer, self-optimisation, contradictions resolved automatically. Gets smarter without hand-tuning.
Ship while you sleep. Your job shifts to strategy, not execution. The team works on why and what.
At L4, the bottleneck is no longer you. It's the quality of your specs β€” and how well your team collaborates around them.

πŸͺž The L4 mirage

Consulting for FaustAI β€” an AI documentation startup doing impressive things. This looked like L4 from the outside.

What impressed me
  • Agents running constantly in production
  • Memory layer, contradiction detection, structured output
  • Backend self-optimising prompts at scale
  • Product defining inputs and expected outputs
Engineer defines the harness. Product defines the spec. Agents do the work. Looks like L4. βœ…
The collaboration gap
  • Only one person fully understands the system
  • No closed feedback loop β€” output quality still requires hand-tuning
  • Remove the engineer β†’ everything stops
  • Product can define what good looks like, but can't verify how to get there
The engineer is the harness. Smart system β€” but not yet a collaborative one.

πŸ«€ Be more human, among bots

Some things I've tried. They help β€” but they don't fully solve it.

🎫
Open-ended tickets
Jira-skill staying human among bots: avoid overspecification, define problem + desired outcome, but leave more for teammates to fill in β€” AI can fill in the rest.
βœ“ staying human among bots
🀝
Backlog refinement
Open-ended tickets provide fertile ground for active discussion. Teammates β€” aka "agent-representatives" β€” discuss how the puzzle pieces fit together.
βœ“ slows the right things down
❓
Still unsolved
What does code review mean at L4? How do you onboard? Who owns a system that is mostly prompts?
⚠ no good answer yet
These are patches, not a system. The real question: what does team collaboration look like when everyone has an agent running?
Act 5
The Open Question
I don't have the answer. I think this room might.

πŸ€” Questions I'm carrying

What does an AI-native team actually look like?

01
How do you onboard someone into a project built by one person + 50 agent sessions?
02
What does "code review" mean when the agent wrote 80% of it?
03
Who owns the system when it's mostly prompts and harnesses?
04
Is a PM's job now: write better specs, run better refinements, stay out of the way?
We've solved individual velocity. We haven't solved collective intelligence.

🎀 Let's talk

Open facilitation β€” 15–20 min

πŸͺœ
Where are you on the ladder β€” individually vs. as a team?
🧩
Has anyone cracked the onboarding/context problem in an AI-native codebase?
🐒
What's the hardest thing to do together that used to be hard alone?
Waldo Vanderhaeghen  Β·  Kleinanzeigen  Β·  waldo@vanderlore.de  Β·  waldo.vanderlore.de