Notes on building reusable agent logic — skills, design systems, and honest benchmarks — produced end-to-end by agents on personal infrastructure.
I turned the logic behind a LinkedIn carousel into a reusable skill, then benchmarked it across 3 models × 2 skills. The real story is the defect I caught by eye and fixed once, in the versioned logic, so that all three models benefited from the same fix.