LongCut logo

A Keyboard-Less Future: Reinventing a 150-Year-Old Interface with Wispr Flow

By Notable Capital

Summary

Topics Covered

  • Founders Who Say Yes and No
  • Smaller Bets, Bigger Changes
  • Build Bigger Models, Not Smaller
  • The Best UI Is No UI
  • Keyboards Are a 150-Year-Old Discrimination

Full Transcript

the time between when they see the dictation show up to pressing enter, [music] it's half a second. Most people

don't even read what Flow produces anymore cuz they're like, "It's probably fine." They did a better job than I

fine." They did a better job than I would have anyway. What I care about as a user is getting to hit send as quickly as possible. [music]

as possible. [music] Our mission is to make interacting with technology feel just as effortless and natural [music] as talking to a close friend. Hi, I'm Chelsea Taylor and today

friend. Hi, I'm Chelsea Taylor and today I'm excited to be joined by my colleague Hans Tongue [music] and sit down with the co-founders of Whisper, TA Cathari and Sahes Gar. Whisper is tackling one

of the most ambitious challenges in technology, [music] reimagining the primary interface between humans and machines. Their first product, Whisper

machines. Their first product, Whisper Flow, [music] is seamless speech to text in every application on your iPhone or computer. At Notable, we're so excited

computer. At Notable, we're so excited to back Whisper and recently had the opportunity to lead their Seriesa A [music] extension round. Today we'll be sitting down with Tana and Sahed to dive

into the behindthescenes story of building whisper and a future of a voice powered world.

Well gentlemen, thank you for being on the show. Um you guys were uh college

the show. Um you guys were uh college roommates in Stanford. Uh tell us how what was the early days like? How did

you guys meet um in the same dorm and then uh how did you guys decide that you want to do something together?

Yeah. So, Donna and I actually met literally on the first day of our freshman dorm. Um, so we were both in

freshman dorm. Um, so we were both in Donner at Stanford. Great community of former founders from there as well. And

we instantly clicked cuz we both had so many of the same interests. We loved

math. We love computer science. We would

stay up on weekends taking extra graduate courses just for fun.

[laughter] Yeah. Just just for fun. He was the only

Yeah. Just just for fun. He was the only person I could go to and be like, "Dude, I'm really sad I got an A and not an A+." and he would get me. [laughter]

A+." and he would get me. [laughter]

I think the the hardest part about being friends is I think we actually both gave each other so much imposter syndrome being at Stanford. And in many ways that's telling even now, right? We both

learn so much from working with each other in so many different ways. Um we

like thought we were really similar people freshman year and then we ended up rooming together sophomore year and we just realized how different we were but how many interests we shared. Um

like both of us started dating our partners in high school. We've been

together for 10 years. Kind of crazy to see those kinds of similarities as well.

Um which really just strengthened the friendship over the next 3 years.

How do you guys compliment each other?

I am a very impulsive decision maker. S

is way more thoughtful which is great cuz in the company like sometimes you need to like make things make decisions quickly. Sometimes you need to think

quickly. Sometimes you need to think very deeply about stuff before going ahead.

So you balance each other out. Sahed is

very focused on uh more core research pushing the limits of technology which really adds to a lot that we do whisper side and my head goes

a lot towards marketing growth product etc and so I spend a lot of my time there so even though we're both have a technical background it's the exact opposite kind of technical

background that you can imagine the way I think of it is there's two parts of running a company one part is saying yes and the other part is saying no.

And we both compliment each other because he's the guy who will say yes, I'll say no to instinctively. And on the flip side, like I re us back in and make sure we focus on the half of the things and do those things in a really incredible way.

Yeah.

Mhm.

How many iterations did it take you guys to get to the first idea? You actually

to build something together?

You know, initially when we were in college, we'd wanted to do things together. So, we would work on a bunch

together. So, we would work on a bunch of projects and like stay up till 4:00 a.m. every night building something new.

a.m. every night building something new.

And then SE wanted to do a PhD and I wanted to build software companies and we were like this is completely different. We'll never start anything

different. We'll never start anything together. And then one day in 2021 S

together. And then one day in 2021 S calls me and he's like d I'm leaving my job. I'm not doing a PhD. I want to

job. I'm not doing a PhD. I want to start a company.

And at the time I had a job and I was like you know what doesn't matter. This is an opportunity

doesn't matter. This is an opportunity of a lifetime. I want to do something together with this guy. And so we started and funnily the current idea for whisper and the the

vision behind it was the first thing that I talked about uh or that that we talked about with each other and then we were like okay let's like not jump in so quickly and so we actually went through

about 100 other ideas 100 other ideas 100 other ideas of like different verticals and spaces and problems in the world that we wanted to solve and eventually we just narrowed down on

whisper It just felt right.

What clicked for you that you know what I'm going to not pursue PhD anymore and I would want to build something and I'm going to call today.

The thing that I really love is solving really hard problems that make impact in people's lives very directly and being very problem motivated and um with the

role that I had before whisper I was running the AI team at a startup called Luminous Computing. There we were

Luminous Computing. There we were building optical computing hardware for ML acceleration using light to accelerate data movement computation.

Crazy technically. And the thing I loved about that was figuring out what problems it solved for people in the ML community and designing the computer from the ground up to actually solve those problems. And as [clears throat] a

result of that, I really wanted to focus on industry research where we could take problems that we knew mattered, devote all our energy toward solving those problems, shipping them to users, making sure that they were actually the things

that people need, and then iterating from there.

It's been really great to hear about your guys's story of how you meet freshman year and then you come today to build Whisper. Um, but today, I know you

build Whisper. Um, but today, I know you also have another interesting story.

Whisper is not the first company you built. you actually built a company when

built. you actually built a company when you were younger, maybe 13, um that you described that Google maybe shut down.

Maybe tell us a little bit about that story and how was that maybe formative for you as you think about being a CEO now.

So this was when I first started to learn how to code. I was 9 10 years old and I wanted to build Jarvis from Iron Man and so me and a friend of mine built one

of the world's first voice assistants and at that time I also loved music and so if you remember Limewire was really popular then as the best way you could download music without paying 99 cents

to iTunes and when Limewire shut down it was painful to download music so we added a thing where you could say stuff like hey play me the latest song by Imagine Dragons and it would download find that song in the internet download

that for you. But one thing we added was if it didn't find the MP3 on like MP3 Skull or one of the other websites, it would actually go to YouTube, find the

video, convert it to an MP3, and then download that for you. So, and Google was like, "Why are we getting so many requests from this scrub server?"

[laughter] And and they actually like sent us the notice. I was terrified.

I showed it to my dad. He didn't even know I was doing anything like that cuz I was I would only code at night after he'd gone to bed.

And I shut it down. And initially

I was just really mad and also really sad. Like I'd spent 2 years building

sad. Like I'd spent 2 years building this thing. We had gotten to 2 and a

this thing. We had gotten to 2 and a half million users in just 6 months.

And now this whole thing was dead. But

after a couple of days, it just hit me that they just scaled my app. I can

still do whatever I want.

And so I started building something else again. And to me that

again. And to me that essentially what it did was I felt less personally related to the

things that I was building. So if the world or anything at some point was like kill something or some of the work I did went to waste. It's like I'm still here.

I can still do more things and and keep going. Cuz that happened a lot over the

going. Cuz that happened a lot over the years. That happened multiple times with

years. That happened multiple times with whisper when the world reser said like now you're not doing this thing anymore and and we realize like hey I'm here we got some great engineers I got this guy

[laughter] uh and we could just continue building the things that we think are all right with whatever we've learned now.

Yeah but I mean that's a super important lesson and I think it's part of also the story like you just alluded to for Whisper Flow um and where you guys are today. We obviously know you as the

today. We obviously know you as the best-in-class voice dictation um product in the market, but it started in a very different way. Maybe tell us about that

different way. Maybe tell us about that kind of pivot story. And again, you alluded to this has happened to you before, so you weren't scared to kind of switch and dive in again.

Oh, I was terrified.

Okay. [laughter]

To clarify, you were scared.

Our goal since the beginning for Whisper has been to solve the same problem, which is that we want interacting with our devices to feel like talking to a friend. We want to be stuck less behind

friend. We want to be stuck less behind screens and have more and more natural interfaces. And we started the company

interfaces. And we started the company thinking the problem with that was that people wouldn't use voice around other people. Think about how awkward it is to

people. Think about how awkward it is to use Siri when you're around others. And

so we started by building a wearable brain computer interface that could understand what you were thinking silently to yourself.

You had a hardware background, so it was easy.

I had a hardware background. Done

machine learning work. Um, and so it was a great fit for the things that D and I knew how to do.

Right. And fast forward a couple of years, the R&D for that was going extremely well. We actually hooked up

extremely well. We actually hooked up the prototype to all the voice interfaces that existed then like Siri, Alexa, and Chacht. And we found that none of us wanted to use our own product

because we hardly even use those interfaces even when we were alone in a closed room. We weren't going to wear a

closed room. We weren't going to wear a wearable right now for those kinds of interactions. And so we started pushing

interactions. And so we started pushing ourselves internally to build software for the hardware, the operating system for voice, for interaction that would actually make it so that we would want to interact in a more natural way. And

that eventually turned into whisper flow. Um, and we realized just how much

flow. Um, and we realized just how much there was to do there to make that take off. And I think the lesson for us there

off. And I think the lesson for us there was um, the market wasn't ready for solving the problem of using voice everywhere at that time, right? And the moment the market's ready

right? And the moment the market's ready for that, which is actually going to be pretty soon, there are going to be really interesting things that we need to build to make that possible. But

we'll know exactly how to solve that problem because we know what the user's pain point is. We see them. We see how they're trying to do it right now and we can help them fix that. But it's been a it's been a wild ride.

So you're saying we're going to go from voice dictation to voice controlled or voice managed services?

Yeah.

Very soon.

Very soon. It's what city should have always been.

Yeah. Figuring out what users would be willing to do is such a hard problem to solve. How did you decide and stumble on

solve. How did you decide and stumble on onto the whisper flow was the right uh UE or UI to make it work? What was that 0ero to1 journey like? So we actually

started with a lot of things that didn't work. So we started by building a lot of

work. So we started by building a lot of really well functioning voice agent or voice assistant like applications. This

is like right around when chat GPT was about taking off. And you could do things like say, "Hey, like find me dinner with Hans nearby and uh invite

him." And it would actually go do the

him." And it would actually go do the search, book a reservation, send you the invite, do all those things. And what

happened was we only ever tested it internally, but like in practice, I I get dinner with you like at most once a week, you know, that would be a [laughter] delightful week for me, you know.

And so we were thinking about hey what's the interaction that we actually do the most all the time and hey what we do all the time is we communicate we type and we communicate and so that's the moment

where we started building whisper flow um the the version of AI enabled voice dictation and for me at least the moment where I got a lot of conviction in it was when I fell in love with the product

myself because despite being a startup founder not somebody who falls in love with startup products very easily like I'm not an early adopter of a lot of tech and I fell in love and I was like, "Wow, if

this is how quickly this can shift my behavior, if we can round out all the edges, make it so that it actually gets you flawlessly every time, make it so that it can start to take the actions on your behalf that you actually want to

take and have them work every time instead of work 50% of the time, it will completely transform me." That's my side

of it. Anything else for you that kind

of it. Anything else for you that kind of got you to feel that magic moment?

The thing that was shocking to me and so we were working on Whisper Flow while we were building the hardware as well. You

were building the hardware so that people could use voice around everybody else.

And I just vividly remember one day walking into the office and I see half our team with these mics. So we we've had these the the gooseseneck mics for a while.

They're on their table using these mics to talk to their computers. And I look at this office and it just it just hit me like there is no problem to solve here

necessarily that's that major. We're

solving most of it with this software and a $10 mic that we bought, right?

And that to me was the first time I had seen us build something that actually creates behavior change. And then diving into like what created the behavior change, the biggest thing was like how

frequently do you do a particular action? You might like book a flight

action? You might like book a flight once a month. Uh but you type hundreds of times a day.

So if that is the thing that we can hook on to that is like people use whisper hundreds of times a day. It's bound to build incredible habit. It's bound to change human behavior. And

seeing that initially happen with the first group of 10 15 people was [clears throat] honestly what Sahes and I bet on at that time this is now barely one and a half years ago uh that

I think there might be something here and we pivoted the whole company to just focus on that.

I remember the first time you told me that story. Um, you remind me of Slack

that story. Um, you remind me of Slack or sort of as a game studio, Pale Games, next generation games across four or five different locations. So, it needed something like a Slack to communicate

and ends up being that the studio didn't go anywhere. But the communication tool

go anywhere. But the communication tool is something that everybody used all the time and that become in a new version the the new product, a new company. Yeah, the

Slack I think the Slack an analogy makes a ton of sense and um something that we also read when we were thinking and we're really excited about partnering with you guys is something you wrote on product hunt actually and you talked a

lot about this pivot story but one thing that really resonated with us is you said not only was kind of your pivot a focus on changing the product but it was also it it pushed you to change the way you make decisions at the company and

you already alluded to this earlier you complement each other very well maybe one person's saying yes while the other is saying know um but maybe elaborate that on that a little bit more. How has

this you know pivot or the new iteration of what Whisper Flow is? Um how has that kind of inspired the way that you lead the organization in the company?

There's a bunch of different ways. Um

two that are the biggest for me is the first is starting with problems that we see people experiencing and that we see people experiencing frequently. So

that's the root to us for all behavior change. You get people who are doing

change. You get people who are doing something. They're going about their

something. They're going about their lives. They're busy. They can't think

lives. They're busy. They can't think too much about learning a new tool. They

just want to change one behavior that they're doing that fits right into what they already experienced as a painoint and make their life better. And

it feels small to start to only start by changing one tiny thing that's frequent.

But if you do that, then we get the opportunity to build the second and the third and the fourth and the fifth. And

all of a sudden it creates this ecosystem of behavior change. And

[snorts] so you know the hardware was a big bet on changing people's behavior dramatically, right? Have them adopt a new wearable,

right? Have them adopt a new wearable, learn a new way of, you know, thinking silently to theirelves, then adopt voice, then do all of these kinds of things. And that's too much to ask of a

things. And that's too much to ask of a person all at once. There's that half of it. I think the other half of it is also

it. I think the other half of it is also when it comes to building new product surfaces that really stick. It's a

little bit of humility around what we know and what we don't know. A lot of times you have to just build it, try it, feel it yourself, and see what feels

magical and what doesn't and be willing to acknowledge those hard truths to yourself. And so you can spend all of

yourself. And so you can spend all of the time that you want to thinking about something or designing something perfect, but in reality just shipping a quick experiment as fast as possible so

you can learn is the most valuable thing that we can do.

I think a lot of founders try to be the next uh uh Steve Jobs or the next Elon Musk and try to do things are are really grand. Most of the time they don't they

grand. Most of the time they don't they don't they don't work out. um being

focused on changing one behavior at a time, but it's a high frequency um kind of behavior is the best way to go about building something new that can last.

I think the thing that I appreciate now is I actually think Steve Jobs and Elon actually only sometimes change one behavior at a time. Like if you think about the iPhone, the problem is solved initially is I was carrying around an

iPod and a phone in my pocket and they just put them in one device.

That's like without an iPod may not be an iPhone.

Yeah. And the the app store wasn't even there until iPhone 3. Like that wasn't something that the iPhone came out with.

It changed our behavior one step at a time starting with something small. Even

Tesla, okay, I'm going to take a vehicle and it's going to be electric, but I know how to drive a car and it's going to be a car and it will solve that problem really, really well. And

um there's a tremendous amount of work that goes into solving a simple problem.

Not to diminish that, but like they both actually built incredible products by distilling things down to their essence and only changing a small thing in a person's life.

Something that I appreciate that you said is voice dictation on its surface may seem simple, but we both know and we've learned a ton from you guys over the last couple of months. Like this is a really complex technical challenge. So

be curious to know kind of how you think of it holistically to tackle this problem.

So the way we think about it is is pretty simple. What I care about as a

pretty simple. What I care about as a user is getting to hit send as quickly as possible.

And so for typing, you know, I can type whatever I want to. For voice, that includes getting the results back and how long it takes for me to fix all the mistakes. So what we track there is what

mistakes. So what we track there is what we call zeroedit dictation. What

fraction of your dictations are come out perfect on the first try. So you don't actually have to fix anything. And that

means like us going on your behalf and fixing when you correct yourself, when you ramble for a while, when you have runon sentences, um when you might not be structured in what you're saying. And

[clears throat] what we do is we constantly work on iterating here.

There's a bunch of things that we have learned that I think are different from how other people are approaching it. So

most people have approached Voy as hey let's deploy smaller and smaller models on the edge on your phone. But what

happens there is then it makes more mistakes. Then you spend more time

mistakes. Then you spend more time correcting them and it's not actually worth it. On the flip side, what we're

worth it. On the flip side, what we're trying to do is actually deploy bigger and bigger and bigger models as we train our own speech foundation models. And

the power of that is if it never makes a mistake, the endto-end user experience is so much better. And here we're measuring it in all sorts of hard conditions. you know, with people

conditions. you know, with people whispering into the microphones in the office, when people are walking on a busy street and there are loud cars going by, all of those kinds of

settings. And the core technical

settings. And the core technical challenges are let's train these models to actually work in real world settings, right? They work great on the benchmark

right? They work great on the benchmark on all these offline statistics where people are like 99% and you know, none of us use it and feel that ourselves.

And then the second is making the inference of these feel instantaneous, right? It's got to be in the cloud, but

right? It's got to be in the cloud, but it has to feel so so fast.

And [snorts] so it's this constant tugof-war between train a bigger model, make it faster. Train a bigger model, make it faster. And that's that's the magic that will make it so that voice becomes something that people can trust.

At this point, for kind of top users or folks that are using voice flow a lot, what has been kind of their behavioral shift in terms of, for example, how quickly they press send um after

utilizing the dictation?

Usually the median person the the time between when they see the dictation show up to pressing enter, it's half a second.

Wow.

Half a second.

Half a second. Most people don't even read what Flow produces anymore cuz they're like, "It's probably fine."

They did a better job than I would have anyway.

And that to us was shocking. And also

the thing that we really uh cuz we expected people to trust Whisper, but not so quickly. And so that's what we push for more and more. And so

internally we're looking at, hey, who are the kinds of people? What are they doing where it's taking them longer, where they're reading the text, which means they haven't built the trust yet.

And so I think a lot about what we're building at Whisper is gaining the user's trust because that is incredibly hard for any product to win over. But

once you have that, you can do so much more for that person with that person.

And um that is even harder for us because we decided not to just build this for Silicon Valley techies, but just the average person out there.

The thing is like my dad is never going to write a system prompt. My grandfather

doesn't know what a prompt is. Like most

LLMs and these AI tools make you configure so much beforehand.

We actually said, "Hey, you know what?

We don't need you to tell us how you write. We don't need you to teach us

write. We don't need you to teach us like Dragon Dictation how you say each word and the names that are important to you. What we're going to do is we're

you. What we're going to do is we're just going to learn from you just like a person would. We're just going to be

person would. We're just going to be this like new person who's joined, who's with you side by side." And that actually adds so much more complexity to the technology that we're building. But

the benefit we get from that is it's not just the few million techsavvy folks in the world who use Whisper. It's actually accessible to

Whisper. It's actually accessible to billions of people around the world who can just get started and press one button and get flowing. It's because

communication is just so core to everything we do, right? I've seen this with so many people. The better and the more effectively they're able to communicate, it just completely transforms the impact they can have in

their career or the level of connectedness they can feel with their family and their friends. And so being able to do something that unlocks that ability for people who otherwise have barriers to communication is tremendous.

And then even for people who don't have barriers to communication, just making it more and more frictionless allows people to be more connected.

I think I shared this when we were getting to know each other that I had gotten my family to start using Whisper Flow and in particular my mom loves Whisper Flow now. And similar to what you alluded to with your dad, like she's not someone who's going to be learning

kind of the most robust ways to build like an AI agent, but she now can communicate much quicker with me with my brother and it's like breakthrough. But

then additionally, I I don't know. I

want to ask you a little bit about use cases, but similarly, I was um over the weekend was doing some work with my partner and he's an architect and he uses Whisper Flow to actually edit um

drawings like architectural drawings.

Yeah. So, you should talk to him about how he does it, but he's [laughter] obsessed with Whisper Flow because a lot of what he's doing is dictating and describing how he wants a drawing to look. And so, he'll actually dictate

look. And so, he'll actually dictate that into his into Revit, which I won't go too deep into architectural softwares, but um he's using it too. So,

it proved to me through my own kind of learning, I can use it for email, but then this entire different ecosystem um can also use use this software really effectively. Um, those are my examples

effectively. Um, those are my examples of use cases, but I'd love to hear from you both. What are some of the most

you both. What are some of the most surprising use cases you've seen kind of come from Whisper Flow users?

The one I remember is uh this guy, he's one of my favorite guys, uh, Greg. So, Greg messaged me one day and

he was like, "Dude, you know what would be cool if I wrote a book." And at this time, like Whisper had a 100 users, right? And he was like, I'm going to

right? And he was like, I'm going to write a book with Whisper. I was like, I don't know how you're going to do it.

Like, we just support five minute dictations. [laughter] And then and then

dictations. [laughter] And then and then a month later, he sends me a manuscript.

And he actually published that book. He

published another book a few months later. And we were spending time with

later. And we were spending time with him and learned actually so much about how convoluted and hard it is to be a writer

and work with editors and publish books and how much strain it puts on their lives and the fact that they can just be so much more creative. Uh

it's easier just to say something than actually write something.

Yeah. U say things. used it a lot to edit his book and I think he's writing a third one now. Like he's he's on a roll.

He's like publishing book after [laughter] book and he's like I would write one book every two years and I'm writing one book every two months.

Uh which was huge. And the other really fun one that happened recently is there was this guy who was who ran a marathon.

I saw this while [laughter] white coding an app. He had a backpack on his computer. He was wearing AR glasses and just using Whisper to talk to it and he actually got a pretty good

time like better than than mine or Sahedes [laughter] at this marathon as well. Um and so I'm

just uh people surprise me all the time.

You know, you have to squint that hard, you can see how this could be the beginning of a new operating system or user interface that changed lives.

Yeah. No, genuinely. Um, and I think on that as we build this more and more, um, I still talk to a lot of users every day.

And what I realized is there are 10 million people in the world who have Parkinson's. Their hands actually shake,

Parkinson's. Their hands actually shake, so they can't type on their phone or computer. [snorts]

computer. [snorts] Whisper for the first time lets them use technology cuz keyboards actually are a big obstacle to using anything, right? Uh there are 700 million people

right? Uh there are 700 million people with dyslexia and we literally tell them like hey if you want to use technology you need to literally spell words which is the genuine hardest thing that is there for

them in in life. That's 10% of the world's population. And so for all of

world's population. And so for all of these people whisper just ends up leveling the play field and removing discrimination that keyboards which look so innocuous actually have been doing

[snorts] for the last 150 years. Um, and

so, so those messages to me are are so heartwarming when I get those from people cuz yes, like for some people makes you really productive, saves you hours, but so for so many more, it lets them do something they could never before.

We spent a lot of time talking about how big the opportunity is just today with the product that you have, which is voice dictation. It levels the playing

voice dictation. It levels the playing field. It allows people to be more

field. It allows people to be more creative. It allows people to be more

creative. It allows people to be more productive. Um, but we all know that you

productive. Um, but we all know that you guys have a bigger vision. Um, talk to us a little bit like what happens next for Whisper Flow or what are you what are you most excited about?

Our mission is to make interacting with technology feel just as effortless and natural as talking to a close friend.

And to us, the first thing that we needed to do was crack voice input. Make

it something so you can just blindly trust a system that I will say things and it will understand things from me.

that ship product now has a cult consumer following and that gets us to the next part which I've been really excited for for years which is whisper not just writing what you say but

actually doing things for you yes and helping you proactively these two things are really important because people start using whisper everywhere on

their computer and now they're expecting it to do more there's so many things that we learn or whisper learns about the user as as they use it for a while and there are incredible number of

opportunities to help them in places to save time make them better at everything that they're trying to achieve and the way I think about how we prioritize

doing these things is number one whenever I'm talking to users I ask them to think about what is work in their life that feels like grunt work that they would rather not spend time doing so they have time to do what really

matters that gives them energy that gets them excited that gets their creativity out and so you take all the grunt work and those are the things that I want to start automating away with whisper

so that it more and more of their life just happens through whisper and they have they have time left uh to do everything else and so that is what the

next phase of the company looks like and a lot of what we're going to be shipping next year and the really important thing here that I I it keeps me up at night a

lot There are so many companies that have tried to build this. So many companies like earlier they used to be called assistants now they're called agents and

uh agentic workflows and it's all at the end game of being able to do tasks on behalf of people and the biggest thing that I want to ensure we deliver to users is

reliability that if whisper tells you it can do something it actually can. The

counter example to me that I always think about the failure case for this is Siri where it promised a thousand things. It can do 50 and it can maybe do

things. It can do 50 and it can maybe do two really well which is why we all just use it to set timers and alarms. And that is what happens when you focus on

building capabilities and promising a lot more without building the reliability underneath it.

Um, that's the thing that made Whisper Flow really successful and that's something we really really really need to focus on for this next phase of whisper. If I think about that from a

whisper. If I think about that from a user's perspective, right, I use voice because it's faster than typing.

Mhm.

But if I have to go clean up its mistakes when it takes an action on my behalf, like if I said, "Hey, message Hans and Chelsea saying that I had a great time at this podcast recording."

And it messages some other Hans, I will be so furious, right? And so everything here is about

right? And so everything here is about figuring out how we actually build the voice agent interactions for the consumer that feel better, right? If I'm

interacting in real time with it, it has to be so fast so I'm not sitting and waiting for my AI browser for 15 seconds for it to click the wrong button on the screen for me, which is the experience that I have

today. Like 15 seconds and it'll hit

today. Like 15 seconds and it'll hit send before the email was actually written correctly, right? And [snorts] so very

right? And [snorts] so very [clears throat] very focused on that.

And I think one of the key ways that we can actually fit fit right in here is so there's right now which is voice to text. Then there's kind of next phase

text. Then there's kind of next phase which is hey how do we go to helping you complete certain tasks. So for example all throughout the day I have all sorts of thoughts going through my mind and I

kind of I can't solve them all at once.

So I need to queue them off to somewhere and so I can jot them down as a note to myself. And what if every time I did

myself. And what if every time I did that using Whisper, I don't have to context switch anywhere, right? I can do that from anywhere on my mobile or my computer. Goes to my, you know, whisper

computer. Goes to my, you know, whisper notes and it kicks off some work in the background. So when I come back, I have

background. So when I come back, I have something that I can work off of. Like

if I'm, you know, planning a trip, I actually don't want an agent to plan my whole trip for me because I I have preferences at the end of the day. But

if I say, "Hey, I need to plan something for my trip." Uh, and then when I come back, I'm like, "Okay, great. Here are

like eight suggestions of hotels. here

are some ideas for these things. Now I'm

already kickstarted right on my process and based on that we can figure out hey which of these things can we do really well really interactively for the user and you can start to do that in real time right support all those kinds of

interactions that people want all the time and we hear this for example for you know small things that are huge quality of life changes like send a Slack message without opening Slack and without seeing the barrage of 50

notifications that suck you away from whatever you were focused on in the first place. And so there are so many of

first place. And so there are so many of these interactions that we hear people asking for day in and day out. And if we can do them well and execute them well

in a real time user interactive environment, right? So it feels good,

environment, right? So it feels good, then it's going to completely change what my interaction with my device looks like.

Amazing. Thank you guys so much. This

was really fun. Yes. [laughter]

Thanks for listening to Notable Perspectives, a podcast from the team at Notable Capital. If you enjoyed the

Notable Capital. If you enjoyed the show, be sure to give us a rating and subscribe on your favorite podcast app.

You can learn more about Notable Capital by visiting notablecap.com.

Loading...

Loading video analysis...