Zero to App Store: Designing quso.ai App Experience from Scratch

Zero to App Store: Designing quso.ai App Experience from Scratch

Zero to App Store: Designing quso.ai App Experience from Scratch

This case study breaks down how I built a fast, focused MVP for AI-generated captions on mobile. Launched in just a month, it helped us validate demand, learn from real users, and set the direction for quso.ai’s mobile roadmap.

My role

Research

& Product Design

Team

Founder, Product Manager,

Product Designer

2 Developers

Timeline

1 month

Context

Context

Designing a sharp MVP for quso.ai ’s mobile launch

Designing a sharp MVP for quso.ai ’s mobile launch

Designing a sharp MVP for quso.ai ’s mobile launch

When Quso set out to bring its full suite of AI tools to mobile, we quickly realized shipping everything at once would slow us down — and likely miss the mark. So we stripped it down to just one high-impact feature: AI Captions. This case study walks through how we turned that narrow scope into a fast, focused MVP, and how I led the end-to-end design to learn quickly, validate real behavior, and pave the way for Quso’s mobile roadmap.

Project scope

Project scope

Designing for speed, clarity, and early validation

Designing for speed, clarity, and early validation

Designing for speed, clarity, and early validation

We designed the MVP to focus on one clear outcome: fast AI-powered captions on mobile. No sign-ups, no complex editors, no fluff. Users could upload or record, pick a language, and generate ready-to-post videos in seconds. We limited editing options to a few animations and styles, added paywalls at just the right moments, and built the whole flow to learn what mattered most: speed, quality, or control.

Impact

Impact

Outcomes That Shaped What’s Next

Outcomes That Shaped What’s Next

Outcomes That Shaped What’s Next

10+ happy clients

5+ years of experiences

10k+ downloads

Trusted by many

11.6% conversion rate

8+ conversions on day 1

10k+ downloads

Trusted by many

11.6% conversion rate

8+ conversions on day 1

10k+ downloads

Trusted by many

Launching the mobile app wasn’t about going multi-platform. This MVP gave us the clarity we needed to keep it fast, keep it simple, and build for creator habits, not just feature checklists.

Vedant

Founder at quso.ai

Launching the mobile app wasn’t about going multi-platform. This MVP gave us the clarity we needed to keep it fast, keep it simple, and build for creator habits, not just feature checklists.

Vedant

Founder at quso.ai

Launching the mobile app wasn’t about going multi-platform. This MVP gave us the clarity we needed to keep it fast, keep it simple, and build for creator habits, not just feature checklists.

Vedant

Founder at quso.ai

Superwall Integration

A/B tested paywalls to learn what drives conversions

Superwall Integration

A/B tested paywalls to learn what drives conversions

Web Product Insights

Web Product Insights

Web learnings guided every mobile decision

Web learnings guided every mobile decision

Web learnings guided every mobile decision

Since quso.ai’s web platform was already live, we didn’t start from zero. I had the opportunity to interact directly with web users, from individual creators to agencies managing multiple clients, and their feedback shaped much of our mobile strategy.

Solo creators wanted a mobile-first solution; they shoot, edit, and post from their phones and needed a faster way to add captions.

Customization fatigue was real; users didn’t want endless options; they just wanted clean, good-looking captions, fast.

Mobile app demand was strong, especially from individuals who valued speed, simplicity, and no-login workflows.

Competitor Research

Competitor Research

What others got right and where we saw opportunity

What others got right and where we saw opportunity

What others got right and where we saw opportunity

To shape the mobile experience, I analyzed over 8+ apps across various verticals, ranging from niche caption tools like CaptCut and Veed to broader content tools like Canva and Instagram. The audit helped identify proven UX patterns, friction points, and opportunities for differentiation.

Create-first flow is the norm

Start a Project

Most apps opened with a strong “Create” CTA on the home screen, often launching a bottom sheet with feature options like upload or record. We adopted this structure in early iterations to give users a clear starting point.

Create-first flow is the norm

Start a Project

Most apps opened with a strong “Create” CTA on the home screen, often launching a bottom sheet with feature options like upload or record. We adopted this structure in early iterations to give users a clear starting point.

Create-first flow is the norm

Start a Project

Most apps opened with a strong “Create” CTA on the home screen, often launching a bottom sheet with feature options like upload or record. We adopted this structure in early iterations to give users a clear starting point.

Paywalls vary wildly

Start a Project

Some apps showed the paywall early, either right when opening the app or in the middle of using a key feature to encourage quick upgrades. We also noticed a clear pattern: some apps used simple, straight-to-the-point paywalls, while others relied on heavy visuals and animations to make the screen feel more premium.

Paywalls vary wildly

Start a Project

Some apps showed the paywall early, either right when opening the app or in the middle of using a key feature to encourage quick upgrades. We also noticed a clear pattern: some apps used simple, straight-to-the-point paywalls, while others relied on heavy visuals and animations to make the screen feel more premium.

Paywalls vary wildly

Start a Project

Some apps showed the paywall early, either right when opening the app or in the middle of using a key feature to encourage quick upgrades. We also noticed a clear pattern: some apps used simple, straight-to-the-point paywalls, while others relied on heavy visuals and animations to make the screen feel more premium.

Templates = instant gratification

Start a Project

Apps like CapCut and Captions leaned heavily into presets to reduce decision fatigue. This validated our decision to offer ready-to-use styles and template-based workflows.

Templates = instant gratification

Start a Project

Apps like CapCut and Captions leaned heavily into presets to reduce decision fatigue. This validated our decision to offer ready-to-use styles and template-based workflows.

Templates = instant gratification

Start a Project

Apps like CapCut and Captions leaned heavily into presets to reduce decision fatigue. This validated our decision to offer ready-to-use styles and template-based workflows.

Less is more for solo creators

Start a Project

Heavier feature sets often came from platforms built for users who needed advanced editing or pro-level video control. We focused instead on a fast, minimal MVP tailored to creators who valued speed over depth.

Less is more for solo creators

Start a Project

Heavier feature sets often came from platforms built for users who needed advanced editing or pro-level video control. We focused instead on a fast, minimal MVP tailored to creators who valued speed over depth.

Less is more for solo creators

Start a Project

Heavier feature sets often came from platforms built for users who needed advanced editing or pro-level video control. We focused instead on a fast, minimal MVP tailored to creators who valued speed over depth.

Design process

Design process

The thinking behind the screens

The thinking behind the screens

The thinking behind the screens

Home Screen Iteration 1

We wanted a clear, action-first entry point for new users as well as existing.

The Create CTA offered users a clear starting point, along with the flexibility to kick off with UGC templates.

A bottom sheet allowed us to present all creation options without overwhelming the screen.

This pattern is familiar to users, reducing the learning curve and helping them move fast.

Home Screen Iteration 1

We started by aiming to launch all three core features - Short Clips, Captions, and Influencers in one app.

  • The goal was to prioritize quick creation and project access.

  • But it lacked visibility with all features packed in Create. Users would click to choose between Clips, Captions, or Influencers.

  • This led to the next iteration, aligning more closely with the web layout for reduced guesswork for existing users.

Home Screen Iteration 1

We started by aiming to launch all three core features - Short Clips, Captions, and Influencers in one app.

  • The goal was to prioritize quick creation and project access.

  • But it lacked visibility with all features packed in Create. Users would click to choose between Clips, Captions, or Influencers.

  • This led to the next iteration, aligning more closely with the web layout for reduced guesswork for existing users.

Home Screen Iteration 2

We also explored an iteration where feature cards were shown directly on the home screen, instead of being tucked behind a Create tap, to make the app’s value more immediately discoverable.

The goal across versions was to reduce the learning curve and make it easier for web users to adopt the app.

Home Screen Iteration 3

At this stage, we made two major shifts:

We decided to launch only AI Captions and not all three features

We launched the app as a separate, lowkey MVP under the name "CC Captions & Subtitles" — not under the quso brand

With this focused scope, we needed a leaner, faster home experience.


Why it worked: This approach worked because it delivered quick value with minimal friction. New users instantly understood the app’s purpose, and the interface felt intuitive without needing onboarding. With the floating Create button as the single CTA, action happened quickly, keeping users focused and reducing cognitive load. The result was a light, fast, and purposeful experience, exactly what mobile-first creators needed.

Home Screen Iteration 3

At this stage, we made two major shifts:

We decided to launch only AI Captions and not all three features

We launched the app as a separate, lowkey MVP under the name "CC Captions & Subtitles" — not under the quso brand

With this focused scope, we needed a leaner, faster home experience.

Why it worked: This approach worked because it delivered quick value with minimal friction. New users instantly understood the app’s purpose, and the interface felt intuitive without needing onboarding. With the floating Create button as the single CTA, action happened quickly, keeping users focused and reducing cognitive load. The result was a light, fast, and purposeful experience — exactly what mobile-first creators needed.

Home Screen Iteration 3

At this stage, we made two major shifts:

We decided to launch only AI Captions and not all three features

We launched the app as a separate, lowkey MVP under the name "CC Captions & Subtitles" — not under the quso brand

With this focused scope, we needed a leaner, faster home experience.

Why it worked: This approach worked because it delivered quick value with minimal friction. New users instantly understood the app’s purpose, and the interface felt intuitive without needing onboarding. With the floating Create button as the single CTA, action happened quickly, keeping users focused and reducing cognitive load. The result was a light, fast, and purposeful experience — exactly what mobile-first creators needed.

Generating AI Captions

The preview screen offered only essential options, language selection, and trim, keeping the experience clean while giving users light control before generating captions.

The “Generate Captions” CTA was made prominent and primary to drive quick action and maintain momentum.

We added a loading screen to bridge the AI processing time (which takes roughly half the duration of the uploaded video).

We also included a message on the loader that let users exit the screen and return later, reducing frustration during longer waits and allowing the process to feel less blocking.

Editor

After generating captions, users landed in an editor that felt lightweight yet powerful.

Users could quickly preview & apply changes, making the editor feel responsive and creative.

It encouraged exploration without confusion, a win for mobile-first creators who want results, not learning curves.

The interface made even first-time users feel like they knew what to do — no guesswork, no clutter.

Styles - Effects Customization with Real-Time Preview

I put all the styling options like font, size, color, and effects into a full-screen bottom sheet to give users a clean, focused space to customize. Adding a preview to show changes right there made it easy to play around without distractions.

We did think about using a smaller sheet so users could see the main video while editing, but that required more complex state management and rendering logic, syncing real-time style changes with the base video player outside the sheet. For the first version, we went with a setup that was simple, stable, and still felt easy to use.

Export

We wanted creators to instantly visualize their content in a social context, so the Reel-style preview made the outcome feel relevant and ready for sharing.

We showed the upgrade options right away, so users could make quick decisions without losing their flow.

Profile

I kept the profile screen simple on purpose, focused only on what mattered: managing plans and accounts. Since login wasn’t needed for the first 5 videos, most users didn’t need to interact with it early on.

But for those who upgraded, it offered a clear, dedicated space to manage subscriptions. This helped us support a frictionless, no-login MVP while still laying the groundwork for future account-based features.

Paywall A/B Test

Paywall A/B Test

Paywall A/B Test

We used Superwall to test monetization early, giving us speed, remote config, and analytics without custom dev work.

Ran a 50/50 A/B test between two paywall designs:


Design 1: Simple layout with full info and both plans upfront
Design 2: Visual-led with animation and pricing shown later

Design 1 outperformed Design 2 on both platforms:

→ iOS: 11.6% vs 7.4%, Android: 1.5% vs 1.1%

Key takeaway: Clarity beats polish — users convert better when pricing and benefits are immediately visible.

Designed an in-house paywall (3rd design) for full visual control, but ultimately chose Superwall to move faster and learn quicker.

Edge Cases & Alerts

Edge Cases & Alerts

Edge Cases & Alerts

Explorations That Didn’t Make the Cut

Explorations That Didn’t Make the Cut

Explorations That Didn’t Make the Cut

  1. First design layered too many controls over the video. The traditional horizontal thumbnail scrubber felt unnecessary.

  2. With a linear caption editor and short-form content, the thumbnail element didn’t provide meaningful value — it only complicated the layout.

  3. Third design's approach appeared dynamic, but in practice, it increased cognitive load. Users now had to scan multiple areas for controls.

What I Learned

What I Learned

What I Learned

Launching Captions & Subtitles was a crash course in fast decision-making, scope control, and designing for speed without losing clarity. I learned that simplicity outperforms, and that shipping a focused MVP, even under a different name, can teach you more than a full-featured launch ever could.

The learnings from this launch are now informing the next phase of quso’s mobile strategy, with deeper editing, smarter defaults, and expanded AI tools. And personally, it reminded me how valuable it is to design for progress, not perfection.