toby/strategy/bets.md

Diff

5dca57bb → head

---
title: Toby — Rolling Bets Queue
last_updated: 2026-05-10T00:00:00Z
---

# Toby — Rolling Bets Queue

_Append + edit. Killed bets stay with autopsies so future runs don't reinvent them. Score is ICE: Impact (1-10) × Confidence (1-10) × Ease (1-10), where Ease = inverse of effort. MoSCoW: Must / Should / Could / Won't. Every bet declares a falsifying signal in plain English._
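
A minimal sketch of the ICE arithmetic, in Python. The `Bet` class and its field names are illustrative assumptions, not part of any Toby codebase; the I/C/E values are the ones recorded for the four in-flight bets in this file.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Bet:
    """Hypothetical container for one bet's ICE inputs."""
    slug: str
    impact: int
    confidence: int
    ease: int  # inverse of effort: 10 = trivial, 1 = very costly

    def __post_init__(self) -> None:
        # Each factor is scored on the 1-10 scale the queue defines.
        for name in ("impact", "confidence", "ease"):
            value = getattr(self, name)
            if not 1 <= value <= 10:
                raise ValueError(f"{name} must be in 1..10, got {value}")

    @property
    def ice(self) -> int:
        # ICE is the plain product of the three factors.
        return self.impact * self.confidence * self.ease


bets = [
    Bet("reliability-blank-page-fix", 9, 8, 8),
    Bet("phase-2-welcome-ab", 8, 6, 4),
    Bet("cws-narrative-repair", 7, 7, 8),
    Bet("pricing-reality-reconcile", 6, 10, 10),
]

# Highest score first.
ranked = sorted(bets, key=lambda b: b.ice, reverse=True)
for bet in ranked:
    print(f"{bet.ice:>4}  {bet.slug}")
```

Because the score is a product, a 10 on Ease can outrank a higher Impact: `pricing-reality-reconcile` (600) sorts ahead of `reliability-blank-page-fix` (576).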

---

## In-flight

### `reliability-blank-page-fix` — Hotfix the new-tab blank-page incident
- **Status**: in-flight
- **Hypothesis**: A guarded error-state UI ("your tabs are safe; tap to recover") + a synthetic check that asserts the new-tab page renders collections within 2s will (a) cut current 1-star CWS-review velocity and (b) reverse the CWS rank penalty that's compounding because the 2026 algorithm weights WAU + review recency + Core Web Vitals.
- **Why now**: live blank-page incident is cross-confirmed in CWS reviews, Vivaldi 7.0 forum, Orion issue tracker. v3 research flagged it as the highest-leverage fix available (see: research-docs/toby-delta-2026-05-05-v3.md, 2026-05-05). Upstream of every other strategic move — every hour this lingers, the 2026 CWS algorithm penalizes Toby.
- **Success metric**: zero new 1-star reviews citing "blank screen" within 14 days of hotfix release; CWS review average stops declining; synthetic check passes 99.5%+.
- **Falsifying signal**: if we ship a fix and 1-star "blank screen" reviews continue within 14 days, the bug isn't fixed — re-open with broader reproduction (browser × extension combinations).
- **Owner**: TBD — operator decision needed.
- **ICE**: I=9, C=8, E=8, score=576
- **MoSCoW**: Must
- **Started**: 2026-05-05 (per v3 research)
- **Last review**: 2026-05-10

### `phase-2-welcome-ab` — Onboarding welcome-screen A/B test
- **Status**: in-flight (planned, no commits yet — silent slip suspected)
- **Hypothesis**: The presence of a dedicated welcome / Get Started screen lifts D7 retention by ≥5pp over the V2-only baseline (32.92%). Activation moment is **persistence**, not "AI organized my tabs."
- **Why now**: cohort retention W4=39.2%, W12=30.5% per `toby/01-personas.md`; Day-1 retention 46% per `product/learnings.md`. The first 4 weeks are where the funnel leaks — every other persona has 83-94% weekly stickiness; New Adopters convert at 58.4%. Phase 2 is fully specced (12 slices, halt triggers, canary stages) in `tasks/phase2-todo.md`.
- **Success metric**: ≥34% D7 retention on at least one variant at n≥2,000/arm by 2026-05-26 decision review.
- **Falsifying signal**: kill if neither variant hits 34% D7 at n≥2,000/arm by 2026-05-26 (the team's own pre-defined kill criterion — preserve it).
- **Owner**: Jad (per `toby/00-state-of-the-project.md`).
- **ICE**: I=8, C=6, E=4, score=192
- **MoSCoW**: Must
- **Started**: planned week of 2026-04-20, zero commits visible as of 2026-05-10. **Open question**: is work on an unpushed branch or genuinely deferred?
- **Last review**: 2026-05-10
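
The success metric and kill criterion for `phase-2-welcome-ab` can be written as one decision rule. The sketch below is a hedged illustration: the `review` function, its return labels, and the `(enrolled, retained)` input shape are assumptions, and it applies the thresholds as stated without any significance testing.

```python
from datetime import date

D7_TARGET = 0.34                    # ≥34% D7 retention
MIN_ARM_SIZE = 2_000                # n ≥ 2,000 per arm
DECISION_DATE = date(2026, 5, 26)   # decision review date


def review(arms: dict[str, tuple[int, int]], today: date) -> str:
    """Return 'success', 'kill', or 'keep-running'.

    arms maps a variant name to (users_enrolled, users_retained_at_d7).
    """
    passing = [
        name
        for name, (enrolled, retained) in arms.items()
        # The size check runs first, so an empty arm never divides by zero.
        if enrolled >= MIN_ARM_SIZE and retained / enrolled >= D7_TARGET
    ]
    if passing:
        return "success"
    if today >= DECISION_DATE:
        # Neither variant hit the target by the decision review.
        return "kill"
    return "keep-running"


# Hypothetical counts, for illustration only.
print(review({"welcome": (2_100, 735), "control": (2_050, 670)}, date(2026, 5, 20)))
```

An arm below 2,000 users never counts as passing, so an early lucky week cannot trigger a premature "success".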

### `cws-narrative-repair` — Listing rewrite + cloud-sync + social proof + CWV
- **Status**: in-flight (planned, no commits — listing copy still hasn't been refreshed)
- **Hypothesis**: Retitling to `Toby — Tab Manager: Save Sessions, Cloud Sync & Notes`, rewriting the description with explicit cloud-sync mention, surfacing "used daily by teams at Netflix/Amazon/HubSpot/IBM" social proof, and publishing a Core Web Vitals benchmark will lift CWS install-conversion on the residual high-intent traffic (currently 30% conversion on ~250 daily installs).
- **Why now**: post-Oct-8 CWS page-view collapse is permanent (5,170 → 897/day, not recovered). The remaining traffic is high-intent, so even small conversion lifts compound. Two-way door — easily reversible.
- **Success metric**: +20% lift in CWS install-conversion on the 4-week window post-rewrite.
- **Falsifying signal**: if 4 weeks post-rewrite there's no measurable lift, the problem is not narrative — it's ranking. Pivot to acquisition-channel diversification.
- **Owner**: TBD — operator decision needed.
- **ICE**: I=7, C=7, E=8, score=392
- **MoSCoW**: Must
- **Started**: pending
- **Last review**: 2026-05-10

### `pricing-reality-reconcile` — Close the $4.50 vs $6/$10 contradiction
- **Status**: in-flight (1-hour audit, not yet done)
- **Hypothesis**: Either a price increase shipped publicly without internal docs being updated, or the Efficient.app listing is stale. Until we know, every pricing experiment is built on a guess.
- **Why now**: blocks `role-based-paywall-gating`. v3 research flagged it as a 1-hour single cross-check that resolves a major business-model assumption (see: research-docs/toby-delta-2026-05-05-v3.md).
- **Success metric**: one short doc (`product/metrics/pricing-reconciliation.md`) listing current Stripe price IDs in production, CWS listing, gettoby.com landing, and a single authoritative number.
- **Falsifying signal**: if the audit reveals a 3rd price point we didn't know about, scope expands — the contradiction is bigger than v3 captured.
- **Owner**: TBD — operator decision needed.
- **ICE**: I=6, C=10, E=10, score=600
- **MoSCoW**: Must
- **Started**: 2026-05-10
- **Last review**: 2026-05-10

---

## Proposed (queued, awaiting decision)

### `cliff-renewal-offer-ladder` — Pre-stage the Feb-2027 mega-cliff with an offer ladder + funnel instrumentation
- **Status**: proposed
- **Hypothesis**: Feb 2027 sees 2,354 legacy ThankYou subs / 2,700 seats / $12,447.50 MRR renew in a single month (31% of total MRR, 26% of paid subs); the broader Oct-26 / Jan-27 / Feb-27 cliff window is $22.6K MRR / 55% of total reaching a renewal decision. The cohort is mostly at legacy $4.50/seat/yr pricing. Without a pre-staged offer ladder + instrumented funnel + A/B'd email cadence, the cliff resolves either as silent churn (no offer) or undifferentiated re-price (all-or-nothing).
- **Why now**: ~8.5 months from the first cliff month and the cohort is already leaking (17 subs / $81 MRR pre-cancelled this week). Build target Q3 2026 (Jul-Sep) so it's in production by the Oct-26 leading edge. Blocked on `pricing-reality-reconcile` (O3 KR1, due 2026-05-13) — the offer ladder has no anchor price to ladder from until the three inconsistent prices ($4.50 internal / $6/$10 Efficient.app / $9 TheTab) reconcile to one.
- **Success metric**: at the Feb-2027 cliff month, ≥60% of the renewal-decision cohort engages with the offer (open or click); ≥40% of engaged subs renew (vs. modeled cold-renewal baseline ~25%); ≥$7.5K MRR retained of the $12.4K Feb-27 wave.
- **Falsifying signal**: if 60 days post-launch engagement is <30% OR the instrumented funnel shows >70% drop-off between offer-view and renewal-decision, the offer-ladder framing is wrong. Switch to a single calm one-touch email + landing page; abandon the ladder.
- **Owner**: TBD. Blocked on `pricing-reality-reconcile`.
- **ICE**: I=9, C=6, E=4, score=216
- **MoSCoW**: Should
- **Last review**: 2026-05-12 _(promoted from TOBY-7 during backlog triage; ticket cancelled as the work is bet-shaped, not ticket-shaped)_
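
The falsifying signal for `cliff-renewal-offer-ladder` has two independent triggers, which a small check makes explicit. This is a sketch under assumptions: the function name, argument names, and the idea of reading raw funnel counts at the 60-day mark are all hypothetical. Only the 30% and 70% thresholds and the 2,354-sub cohort size come from the bet.

```python
COHORT_SUBS = 2_354  # legacy ThankYou subs renewing in Feb 2027, per the hypothesis


def ladder_falsified(
    engaged: int,
    offer_views: int,
    renewal_decisions: int,
    cohort: int = COHORT_SUBS,
) -> bool:
    """Apply the bet's falsifying signal to funnel counts taken 60 days post-launch."""
    engagement_rate = engaged / cohort
    # With no offer views at all, treat the funnel as fully dropped.
    drop_off = 1.0 if offer_views == 0 else 1 - renewal_decisions / offer_views
    # Either trigger alone is enough to abandon the ladder.
    return engagement_rate < 0.30 or drop_off > 0.70


# Hypothetical counts, for illustration only.
print(ladder_falsified(engaged=1_000, offer_views=900, renewal_decisions=400))
```

The `or` matters: healthy engagement does not rescue the ladder if most viewers abandon between offer-view and renewal-decision.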

### `role-based-paywall-gating` — Restructure paywall around team/admin/sharing, not card limits
- **Status**: proposed
- **Hypothesis**: Gating *team / admin / sharing* features behind paid (instead of card limits) hits the under-monetized Multi-User Collaborator persona (4,908 active multi-team users; only 79 yearly Team subs cover them — ~2.6% of multi-team active users pay) and the Free-Tier Archivist persona (1,848 users averaging 224 lists, 28.4% labels, 18.6% public-share rate — the heaviest organizers, paying $0). v3 research benchmarks this at **2× conversion**.
- **Why now**: v3 research identified this as the highest-leverage monetization restructure. Unblocks once `pricing-reality-reconcile` lands.
- **Success metric**: free→paid conversion among multi-user-collaborator + free-tier-archivist segments doubles within 90 days post-launch (baseline: ~2.6% and ~0% respectively).
- **Falsifying signal**: if conversion among either target segment moves <30% in 60 days post-launch, the gating axis is wrong. Roll back; the feature gates aren't the bottleneck.
- **Owner**: TBD.
- **ICE**: I=8, C=6, E=3, score=144
- **MoSCoW**: Should
- **Last review**: 2026-05-10

### `public-collection-pride-loop` — Surface, reward, and amplify public-list creators
- **Status**: proposed
- **Hypothesis**: Free-Tier Archivists (1,848 users, 18.6% public-share rate, avg 224 lists) are Toby's only native viral surface — and we don't ask them to share, surface their work, or close the recognition loop. A "public collection of the week" series on X (already in `toby/x-content-pipeline.md` Post 5) + a curator-spotlight slot on `gettoby.com` + a one-tap "feature my collection" submission flow would activate the only content-loop Toby owns. This is the under-pulled growth lever.
- **Why now**: only 3% of users have ever made a public list; 14,306 active card-share links exist; the share-link surface is largely dormant from a growth-team perspective. Zero engineering required for the X/blog version — operator-driven, low cost.
- **Success metric**: 10 curated public collections featured by 2026-06-30; +25% week-over-week growth in new public-list creation; ≥3 inbound CWS installs trackable via UTM from the curator-spotlight URL within 6 weeks.
- **Falsifying signal**: if 10 weeks of "public collection of the week" generates <50 trackable installs total (across X + blog combined), the loop isn't viral — kill and reallocate to paid-channel diagnosis.
- **Owner**: TBD (likely toby-x-strategist + toby-blog-seo coordination).
- **ICE**: I=6, C=7, E=8, score=336
- **MoSCoW**: Should
- **Last review**: 2026-05-10
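
Both the success metric and the falsifying signal for `public-collection-pride-loop` depend on installs being attributable via UTM. A minimal sketch of tagging a spotlight link with Python's standard `urllib.parse` follows. The `/spotlight` path and the specific source, medium, and campaign values are assumptions made up for illustration; the bet does not specify them.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def add_utm(url: str, source: str, medium: str, campaign: str) -> str:
    """Return url with utm_source / utm_medium / utm_campaign set."""
    parts = urlsplit(url)
    # Keep any existing query parameters; repeated keys collapse to the last value.
    query = dict(parse_qsl(parts.query, keep_blank_values=True))
    query.update(
        {"utm_source": source, "utm_medium": medium, "utm_campaign": campaign}
    )
    return urlunsplit(parts._replace(query=urlencode(query)))


# Hypothetical spotlight URL and campaign name.
print(add_utm(
    "https://gettoby.com/spotlight",
    source="x",
    medium="social",
    campaign="public-collection-of-the-week",
))
```

Using one campaign value across X and the blog, with different `utm_source` values, would let the combined install count in the falsifying signal be split by channel afterwards.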

### `lapsed-pro-reactivation-campaign` — Targeted reactivation flow for 101k cancelled monthly Pros
- **Status**: proposed
- **Hypothesis**: 101,229 cancelled monthly Pro subs is 14× the active Pro base. Cancellation reasons are 39% `not_using`, 37% `too_expensive`, 22% other, 2% `missing_features` (per `toby/01-personas.md`). For the `not_using` cohort specifically, a "your library is still here — open Toby and see what you've forgotten" reactivation email (no discount, just recognition + persistence promise) plus a 7-day return-to-Pro free trial should pull a measurable percentage back.
- **Why now**: even 5% reactivation moves $7-15K MRR — material vs. the $14.8K gap to breakeven (see: product/learnings.md). Costs are email + a small flow; doesn't compete with engineering bandwidth on reliability or Phase 2.
- **Success metric**: ≥3% trial-start rate on the campaign; ≥40% trial-to-paid conversion (vs. global benchmark); net +$5K MRR within 60 days of full rollout.
- **Falsifying signal**: if 60 days post-launch trial-start rate is <1% OR trial-to-paid <20%, the lapsed cohort has structurally moved on. Don't retry without a meaningfully-different angle.
- **Owner**: TBD.
- **ICE**: I=7, C=5, E=6, score=210
- **MoSCoW**: Should
- **Last review**: 2026-05-10
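
The three success thresholds for `lapsed-pro-reactivation-campaign` multiply into an MRR figure, so it can help to run the funnel arithmetic before launch. The sketch below is illustrative only: the `projected_mrr` helper is hypothetical, the price points are borrowed from the unreconciled figures in `pricing-reality-reconcile`, and treating them as per-month prices is an assumption.

```python
def projected_mrr(
    audience: int,
    trial_start_rate: float,
    trial_to_paid_rate: float,
    monthly_price: float,
) -> float:
    """MRR added if the funnel converts at exactly the given rates."""
    return audience * trial_start_rate * trial_to_paid_rate * monthly_price


CANCELLED_PRO_SUBS = 101_229   # cancelled monthly Pro subs, per the hypothesis
NOT_USING_SHARE = 0.39         # share citing `not_using` as the cancel reason

# Assumed monthly prices; the authoritative number is still open.
for price in (4.50, 6.00):
    whole_base = projected_mrr(CANCELLED_PRO_SUBS, 0.03, 0.40, price)
    not_using_only = projected_mrr(
        round(CANCELLED_PRO_SUBS * NOT_USING_SHARE), 0.03, 0.40, price
    )
    print(
        f"${price:.2f}/mo: whole base ≈ ${whole_base:,.0f}, "
        f"not_using only ≈ ${not_using_only:,.0f}"
    )
```

Since the projection is linear in audience size, restricting the campaign to the `not_using` cohort scales the result by roughly 0.39; whether the +$5K MRR target is reachable at the minimum rates depends on which audience and price are assumed.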

### `chrome-133-vs-toby-comparison-page` — AI-search-friendly comparison page
- **Status**: proposed
- **Hypothesis**: Publishing a short comparison page explaining what Chrome 133 native sync covers vs. what Toby still does (visual collections, sharing, notes per tab, multi-team, deep history) earns recommendation slots in Perplexity / ChatGPT / Claude / Atlas answers when users ask "Chrome 133 vs tab manager" — and also captures organic search.
- **Why now**: Chrome 133 saved-tab-group cross-device sync is live; the question is being asked. v3 research flagged this as a 1-week action. We are not in this conversation today.
- **Success metric**: page indexed within 30 days; at least 1 confirmed AI-search citation (Perplexity / ChatGPT) within 60 days; +100 trackable installs from the page within 90 days.
- **Falsifying signal**: if 90 days post-publish there's no AI-search citation and <20 trackable installs, the AI-recommendation channel is closed for our shape of product. Don't try a second time without a fundamentally different angle.
- **Owner**: toby-blog-seo + landing-page eng.
- **ICE**: I=5, C=6, E=8, score=240
- **MoSCoW**: Should
- **Last review**: 2026-05-10

### `x-relaunch-soft` — Restart @TobyForTabs at 3 posts/week with Tier A engagement
- **Status**: proposed
- **Hypothesis**: @TobyForTabs is dormant (2 posts in 14 months, 2,913 followers) while demand on X is loud and unattached (dozens of organic "too many tabs" complaints per week, per `toby/x-strategy.md`). A calm, in-character relaunch — 3 posts/week + Tier A engagement on 5-10 ICP-shape accounts/day per `toby/x-engagement-targets.md` — earns brand presence at low cost and feeds CWS installs via UTM attribution.
- **Why now**: account credentials need verification (🔒 acct gate). Once verified, the relaunch is operator-driven, costs zero engineering. Best-window timing already mapped.
- **Success metric**: median impressions/original-post ≥ 5,000 within 6 weeks (baseline ~2,500); ≥1 ICP-shape reply per original post by week 6; ≥15% follow rate on first-touch replies.
- **Falsifying signal**: if 8 weeks of disciplined cadence yields median <2,000 impressions/post and <3 ICP replies per month, the channel isn't worth the operator's time — wind down or sell to a contractor.
- **Owner**: TBD (operator-led; toby-x-strategist drafts).
- **ICE**: I=4, C=6, E=7, score=168
- **MoSCoW**: Could
- **Last review**: 2026-05-10

### `seo-content-cadence-2-weeks` — Ship one calm essay every 2 weeks
- **Status**: proposed
- **Hypothesis**: A 2-weekly blog cadence (P1/P5 mix per `toby/blog-pipeline.md`) compounds an organic acquisition channel that doesn't depend on CWS rank. SERPs for "too many tabs", "OneTab alternative", "bookmarks vs tab manager" are dominated by listicles and weak essays; Toby's calm voice + brand authority can win foundational queries.
- **Why now**: first draft (`blog-why-you-have-so-many-tabs-open.md`) shipped to wiki, not yet published. Pipeline is sequenced. Operator owns publish step + image hand-off.
- **Success metric**: 6 posts published by end of Q3 2026; one post ranks top-10 on its primary keyword within 90 days; +200 trackable installs from blog-attributed traffic within 6 months.
- **Falsifying signal**: if after 6 published posts none rank top-10 and total blog-attributed installs <50, SEO is not the channel — the SERPs are tougher than they look or our voice isn't winning. Hold the cadence at 1/month and divert effort to `public-collection-pride-loop`.
- **Owner**: toby-blog-seo drafts; operator owns publish.
- **ICE**: I=6, C=5, E=6, score=180
- **MoSCoW**: Should
- **Last review**: 2026-05-10

---

## Validated (succeeded — graduated to ongoing motion)

### `session-start-heartbeat` — 4h `Session Start` event for intra-day retention analysis
Shipped in v1.13.0 on 2026-04-14 (commit 0f3aa38d). Halt threshold 180k/day with 60k/day projection. Now the foundation for Phase 2's D7 retention measurement and the persona-shape analysis in `toby/01-personas.md`. Durable lesson: instrument the metric before running the experiment that depends on it. **Date validated**: 2026-04-14.

### `retention-discount-all-cancellation-reasons` — Retention discount eligible for any cancel reason
Backend live (commit cbc92a78d). Removed the "valid reason" filter so the discount surfaces for every churning user. Durable lesson: a small backend rule change can unlock significant top-of-flow exposure to retention offers; UI/UX investment can come later. **Note**: the frontend wiring in `CancelSubscription.tsx` is flagged as pending in `worklog.md` (Jan 2026) but no commit confirms it shipped — operator confirm. **Date validated**: 2026-01-30.

### `monorepo-flatten-turborepo` — Monorepo flatten + Turborepo / pnpm workspaces migration
Shipped late March 2026 (commits 87bec6267, a90230ce1, 134f9bb90, ec843c5a2, c5545cbd5, 2574b5379, 5bd961266). `apps/{api,extension,landing,mobile}` + shared `packages/`. Durable lesson: collapsing 4 submodules into a monorepo with a single dep graph cut release-coordination cost; the migration was a one-time cost paid before the Phase 2 onboarding work that depends on shared analytics packages. **Date validated**: 2026-03-31.

### `cws-review-monitor-with-ai-drafts` — Cloud Run job posting AI-drafted CWS review responses
Shipped 2026-03-30 (commit ba247d9a) with fallback Slack message added 2026-04-29 (commit b9bea18c). Durable lesson: low-engineering-effort automation closed a brand-hygiene gap that was previously not staffed. **Date validated**: 2026-03-30.


---


## Killed (preserved so we don't reinvent)


### `deeper-discount-retention-offer-50pct` — 50% off save offer to ThankYouLegacy churners

Killed: failed at <2% save rate (only 14 of 740 ThankYouLegacy churners took it). Why it failed: the 50% offer ($2.25/mo) is still more than 2× what ThankYouLegacy users were paying ($0.99/mo). Price wasn't the lever: the cohort was attracted by the 78% legacy discount specifically; they were never going to pay $3+/mo. What would have to be different to revive: only if we found a structurally cheaper-to-serve plan ($1-2/mo with explicit feature limits) aimed at retaining an explicitly price-sensitive cohort. **Date killed**: 2026-02-15 (per `product/learnings.md`).
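
The arithmetic behind this autopsy can be checked with a short sketch. The counts, prices, and discount percentages come from the paragraph above; the full list price is not stated in this section and is only inferred here from the 50% offer, so treat `implied_list_price` as an assumption.

```python
# Figures quoted in the autopsy above.
takers = 14
churners = 740
offer_price = 2.25    # $/mo at the 50% save offer
legacy_price = 0.99   # $/mo at the 78% legacy discount

# Share of churners who accepted the save offer.
save_rate = takers / churners

# How much more the save offer costs than what the cohort was paying.
price_multiple = offer_price / legacy_price

# Assumption: list price inferred from the 50% offer, not stated in the source.
implied_list_price = offer_price / 0.50
# Cross-check: 78% off the inferred list price should land near the legacy price.
legacy_price_check = implied_list_price * (1 - 0.78)

print(f"save rate: {save_rate:.2%}")
print(f"offer vs legacy price: {price_multiple:.2f}x")
print(f"implied list price: ${implied_list_price:.2f}/mo")
print(f"legacy price at 78% off: ${legacy_price_check:.2f}/mo")
```

Under these inputs the save rate comes out just under 2% and the offer is a little over twice the legacy price, which matches the figures quoted in the autopsy.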


### `ai-feature-pre-launch-q3-2026` — Pre-announce AI organize / smart-collection naming in Q3 2026

Killed: deferred to Q4 2026 per v3 research delta. The AI-browser threat is slower-burn (18-24 months) than v2 implied (12 months), so the urgency to compete on AI-feature parity has dropped. What killed it: independent reporting framing Atlas/Comet as "struggling to dominate." Pre-announcing creates an expectation gap. What would have to be different to revive: only if Q4 build slips again, in which case the decision is "hold Q4 target" not "pre-announce earlier." **Date killed**: 2026-05-05 (per `research-docs/toby-delta-2026-05-05-v3.md`).


### `onboarding-signup-position-AB` — A/B test signup position (beginning vs. end of onboarding)

Killed: "end" variant shipped as the winner (commit bc5e45305, 2026-04-09). Removed 644 lines of dead experiment machinery in the same commit. Durable lesson: ship the winning variant and remove the experiment scaffold in one PR; fragments left in code become Phase 2 plumbing tax. What would have to be different to revive: nothing, since this experiment is complete. **Date killed (variant shipped)**: 2026-04-09.


### `action-0-untracked-wau-instrument` — Instrument the "270K untracked WAU" opportunity

Killed: the opportunity was a mirage. CWS WAU (~380K) is inflated 5-6× because it counts disabled extensions, ghost installs, and multi-device duplicates via Chrome's Omaha update protocol. Cross-verified against Toby DB heartbeats (61,852) and Amplitude (75,123 devices / 41,111 identified users). Real active base is ~62-75K. Why it failed: the premise that the untracked WAU was a conversion opportunity was wrong. Those users are not "Toby users we haven't reached"; they're dead accounts and ghost installs. What would have to be different to revive: nothing; the metric was the problem, not the strategy. **Date killed**: 2026-03-21 (per `product/strategy/next-actions.md`).
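
As a quick check on the "5-6×" inflation claim, the sketch below divides the approximate CWS WAU figure by each of the two cross-verification counts quoted above. The variable names are illustrative, and ~380K is treated as exactly 380,000 for the purpose of the calculation.

```python
# Figures quoted in the autopsy above; 380_000 is the rounded "~380K" CWS number.
cws_wau = 380_000
db_heartbeats = 61_852       # Toby DB heartbeats
amplitude_devices = 75_123   # Amplitude device count

# Inflation factor of the CWS metric relative to each independent count.
inflation_vs_db = cws_wau / db_heartbeats
inflation_vs_amplitude = cws_wau / amplitude_devices

print(f"CWS WAU vs DB heartbeats: {inflation_vs_db:.1f}x")
print(f"CWS WAU vs Amplitude devices: {inflation_vs_amplitude:.1f}x")
```

The two ratios bracket the quoted range: roughly 5× against Amplitude devices and roughly 6× against DB heartbeats.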


### `team-plan-as-primary-monetization` — Push Team plans as the main upsell path

Killed (implicitly, as a *primary* strategy): only 79 active yearly Team subs and 96 multi-member paid teams in total vs. 7,058 active yearly Pro subs. Teams is <1% of revenue. Why it failed as a primary axis: a B2B sales motion wasn't built, and team buyers don't surface this need on public X; they're inside Slack. What would have to be different to revive as primary: dedicated B2B SDR motion + LinkedIn/Slack-community presence. **Note**: Team plan as a *secondary* monetization lever (via `role-based-paywall-gating`) remains live. The kill is on Teams-as-the-headline-bet, not on team features altogether. **Date killed (as headline bet)**: 2026-03-21.


### `paid-acquisition-channel` — Paid ads at $54/yr ARPU

Killed: the math doesn't work at current $54/yr ARPU with <5% full-price conversion (per `product/strategy/next-actions.md`). LTV doesn't cover CAC at any realistic channel cost. What would have to be different to revive: a structurally higher ARPU (Team plan working, role-based gating shipped) AND a conversion path >5% AND a channel under $5 CAC. None present today. **Date killed**: 2026-03-21.
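
A rough sketch of the unit economics follows. Only the $54/yr ARPU, the 5% conversion figure, and the $5 figure come from the paragraph above. The source does not say whether "CAC" is measured per signup or per paying customer, so the sketch assumes per signup. It also ignores churn and gross margin, and the function names are hypothetical.

```python
ARPU_PER_YEAR = 54.0  # $/yr, from the paragraph above


def cost_per_paying_customer(cost_per_signup: float, conversion_rate: float) -> float:
    """Spend needed to land one paying customer.

    Assumption: the channel cost is paid per signup, and only
    `conversion_rate` of signups go on to pay full price.
    """
    if not 0 < conversion_rate <= 1:
        raise ValueError("conversion_rate must be in (0, 1]")
    return cost_per_signup / conversion_rate


def payback_years(cac_paying: float, arpu_per_year: float = ARPU_PER_YEAR) -> float:
    """Years of revenue needed to recover acquisition cost (no churn, no margin)."""
    return cac_paying / arpu_per_year


# Illustrative inputs at the thresholds named in the text.
example_cac = cost_per_paying_customer(cost_per_signup=5.0, conversion_rate=0.05)
example_payback = payback_years(example_cac)

print(f"cost per paying customer: ${example_cac:.2f}")
print(f"payback: {example_payback:.2f} years")
```

Under the per-signup reading, even the threshold inputs take more than a year of ARPU to pay back, and any conversion rate below 5% lengthens that further. If the $5 figure is instead meant per paying customer, the picture is very different, so the unit should be confirmed before reusing this calculation.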


### `aggressive-pricing-on-legacy-cohort` — Aggressively re-price legacy users to full price

Killed: one-way door, destroys trust, accelerates churn. ThankYouLegacy 12-month auto-transition (78% off → 33% off, a 3× price jump) drove 23.5% cumulative cohort churn, which is the data we already have on what happens when this lever is pulled too hard. What would have to be different to revive: a transition floor where legacy users see *value-delivered-since* and the price ramps with the value, not against it. Not currently designed. **Date killed**: 2026-03-21.

