Ta Da!

Ambiguity, not effort, drives avoidance.

An iOS app for the tasks you've been avoiding. The app detects which tasks cause you to freeze and breaks them into concrete first steps.

0 → 1 Build
Role
PM & Builder
Timeline
10 days
Stack
SwiftUI, SwiftData, Claude API
Stage
Dogfooding on TestFlight
Ta Da! app showing onboarding, Brain Dump task list, and Today view with progress tracking
Ta Da! app: Brain Dump and Today views

Every task app has the same blind spot.

Todoist, Things, TickTick, Apple Reminders. They all give "Buy milk" and "prepare investor deck" the same text field, the same checkbox. But these aren't the same problem. One takes thirty seconds. The other requires you to figure out what "prepare" even means before you can do anything. Avoidance comes from the second kind, and avoidance is an ambiguity problem, not a motivation problem.

The loop to break
Core insight

When a task is framed as an outcome ("figure out health insurance") and you can't identify the first physical action, you take none. The friction isn't doing the thing. It's identifying where to start.

Todoist's Task Assist comes closest with AI subtask suggestions, but it's reactive, complexity-blind, and paywalled. No mainstream app detects cognitive overload at the point of action. That's the gap Ta Da! is built for.

The hunch held up.

I started building for myself. But "me" isn't a persona. The screener and 5 deep interviews are what turned a personal pattern into a product thesis.

I designed a behavioural screener with a scoring rubric. Each respondent scored 0-7 against criteria, not self-reported opinions. 35 responses.

35
Screener respondents
72%
Avoiders
5
Deep interviews
Screener results: persona validation across 35 respondents
Key finding

"Organise my closet" (high effort, low ambiguity) had a 40% pick rate. "Figure out health insurance" (lower effort, high ambiguity) had 18%. Ambiguity, not effort, drives avoidance. That's the empirical proof, and it anchored the weights in the AI scoring model.

Behind the headline numbers, 64% of respondents exhibited the full pattern: burst dumping, accumulated backlog, avoidance behaviour. The "overwhelmed dumper" isn't a niche persona. It's the majority behaviour pattern.

One question the app answers.

Should this task be broken down before you attempt it? You type a task and hit return. The AI silently scores it across three dimensions: action clarity, step count, and cognitive load. Simple tasks stay as-is. Complex ones get broken into a concrete first step you can act on immediately.

Task input flowing through AI scoring to complexity classification to a decomposed first step
Type, score, decompose.

Built to lose your attention the moment you know where to start. Re-engagement is earned, not engineered: users come back when the next avoided task needs unblocking.

The decisions that shaped Ta Da.

Design Onboarding before the main app

You experience the AI before entering the app. The value prop isn't explained, it's felt.

Constraint Seven-task daily cap

Frustrates power users. Necessary anyway. The target user's problem is having too much to do, not too little.

Principle Silent scoring, explicit Accept

Scoring is low-stakes, so it runs silently. Decomposition creates new subtasks, which without consent would undermine the sense of control the app is trying to restore.

Pivot Activation metric redesign

Original: "tasks captured in W1." Changed to: "completes first AI-decomposed task within 48 hours." The first measures activity. The second measures whether the product actually works.

How I'll know it works.

North star: complex tasks completed per active user per week. Not tasks captured. Not streaks. The metric that proves the product works.

Other task apps optimise for attachment. Capture rates. Streaks. Daily logins. Ta Da optimises for whether the avoided task actually got finished.

Two layers of success. Within a session, the win is leaving the app fast. Across sessions, the win is coming back tomorrow because another avoided task needs unblocking. One is design discipline. The other is sustained utility. Both reinforce.

Signal Threshold Action
Acceptance rate Below 70% Review the scoring prompt
Helpfulness Mostly 1s Tune decomposition quality
Override rate Above 10% Recalibrate thresholds

The system is built and instrumented. No real usage data yet. Tester feedback is the next phase.

I can't write code.

10 days, end to end. A master context doc anchored every session. Five custom skills scoped the work: UX writing, AI UX patterns, onboarding activation, iOS HIG, and UX Laws. An instrumented analytics framework (30 events, CSV export) anchored measurement. Dogfooded daily.

What I didn't outsource
  • Problem discovery: the insight that ambiguity, not difficulty, drives task avoidance.
  • User research design: behavioural screener with scoring rubric (n=35) and 5 deep interviews.
  • AI scoring model: the 3-dimension complexity taxonomy (action clarity, step count, cognitive load).
  • Trust principles: false positives erode trust faster than false negatives.

What I got wrong. What I'd keep.

Sequencing was backwards

I built first, validated after. The screener confirmed direction. If the data hadn't, I'd have shipped around the wrong problem. Next time: validate first.

IA audit came too late

18 issues surfaced after the build. 10 required code changes. Architecture decisions are cheaper on paper.

Building fast had upside

I had a real app to dogfood, not wireframes. Post-build audits caught issues before testers saw them.

What's next

Three open problems: AI still struggles with genuinely ambiguous tasks, API-key dependency doesn't scale, the model has no persistent memory of user inputs. The real "what's next" is whatever five people who aren't me tell me is broken.

User Research Problem Discovery Survey Design IA Audit AI Product Design Activation Metrics
Next case study

Zomato →

ML at Scale