LongCut logo

Raw Day Inside Wispr Flow

By Will Phillips

Summary

Topics Covered

  • AI Is Changing Far More Than Code
  • Voice Is the Most Natural Interface
  • The Three Buckets of AI Value
  • Roadmaps Are Dead Outside High-Level Direction
  • Preserving Voice Authenticity Against Standardization

Full Transcript

It's kind of unprecedented [music] the speed at which AI companies are able to scale right now.

30 40 50% month-over-month growth is the norm. Building the infrastructure to

norm. Building the infrastructure to keep up with that kind of growth is really challenging.

Early on in a startup's journey, [music] every founder is out there pitching, chasing investors, vendors, customers, [music] chasing anyone who listen. But there

comes a point where the tide turns, where suddenly everyone wants what you have. The thing you take is that AI is

have. The thing you take is that AI is only changing execution, writing the code itself. That's not true. It's

code itself. That's not true. It's

changing a bunch of different things.

In a world where we're all just exchanging slop, maybe flow [music] can help you sound like yourself while still helping you work faster and express your ideas more efficiently. Today I'm going

[music] behind the scenes with Whisper Flow, a startup that's raised a total of $81 million with their recent [music] round valuing them at $700 million to

revolutionize voice to text as a communication tool around the world.

We're not a destination. We're something

that puts adrenaline or gold dust into [music] your existing stack or into your existing workflows. That seems to be

existing workflows. That seems to be hard to communicate and we try in many different ways, but really seeing is believing. I strongly believe that a

believing. I strongly believe that a keyboard and a screen are just not a good way of interacting with a device and that voice is the most natural way that we communicate with another person.

Those two statements I can actually just believe no matter how much the world changes. [music] The question no longer

changes. [music] The question no longer remains whether the product works and whether the market has conviction. The

question now becomes how big can this truly get and can [music] they continue to deliver on the level of growth that they've maintained all this time.

Ah, here's a boy.

What's up, man?

How are you?

How are you? How you doing?

Just giving them a little tour of the office for Well, this tour is going to be irrelevant.

Yeah, because in one day we're moving. I

mean, today we're moving, but [laughter] later today.

Later today. Should we get the last glimpse before you guys leave? This

is our all hands meeting room. A bit

crazy. The team has grown a lot over the last two years. So before pretty empty and now you can see we have the bench over there, the chairs around here.

Man, you going to miss it?

Yeah, I guess a lot of memories, but I mean I'm excited. It's going to be cool.

The offices upstairs are sick. You know,

we started in hardware. Did you know that?

I had no idea.

But basically, you put this on like this and it could like read your movements and then like I don't know some crazy stuff. I'm not technical so I don't know

stuff. I'm not technical so I don't know exactly. But

exactly. But as in what like connect it up to like like electro pulses in your brain or something? Wow. So, yeah, some crazy

something? Wow. So, yeah, some crazy stuff.

[music] [music] Hector's just mentioned you guys going from hardware a little while ago. So,

could you tell me about that journey a little bit? So our goal has always been

little bit? So our goal has always been to build a voice interface that a billion people can use every day as their primary interaction with all their devices. And we started the company and

devices. And we started the company and we thought, hey, the thing that's going to block people is their ability to use voice around other people. Like I don't want to speak to my laptop when I'm sitting right next to somebody. And so

we started by building a wearable brain computer interface that could understand when you're thinking silently to yourself. We had a team of

yourself. We had a team of neuroscientists, hardware engineers, data collection technicians, all working on building towards that. And we

actually had a functioning prototype.

And what we realized as we started using it was we didn't even want to use the best voice interfaces that existed then when we were sitting in a closed room.

So what were we going to do? Well, we

had to build the voice interface, the software for the hardware. And that's

actually what led to the birth of whisper flow. And we saw our own

whisper flow. And we saw our own behavior shift. We saw the things that

behavior shift. We saw the things that we were doing changing. And that's what actually led us to put all of our energy into building that.

So I know we got a standup happening, right?

What are [snorts] we covering in the standup this morning? what's going on cuz we'll be in there with you guys.

There's a couple different things. So,

the first one is about how people are developing new workflows using AI. I

think everybody's doing lots of different experimentation about what they like and dislike and people take for granted the things that they're doing and don't always share it with

everybody. And so first half of it is

everybody. And so first half of it is about talking about that and the second part is actually just updating people on the different developments on the ML side, on the research side, on the product side for the new features that

we're building.

[music] [music] As a reminder, as always, we are focused on current devs. And the reason why is because we want to be building the first voice interface that a billion people can use every day as their primary

interaction. Are there things that you

interaction. Are there things that you do that you don't know that other people are doing that you think might make them more effective when using AI?

The way that I organize my thoughts around this topic is that I generally bucket things into one of three things.

So, first bucket is quality. So how this is where AI helps me get to a better answer than I would have without it or on my own. The second one is

compression. So AI gets me to the same

compression. So AI gets me to the same answer but it closes the gap or makes it much quicker things that took a week.

And so that's increasing the velocity of the uh important decision- making I need to make every day and so there the focus then comes well what decisions should I focus on? And then the third bucket is

focus on? And then the third bucket is uh project management. So, how is AI making sure that all the things I committed to in my day across a milestones and timelines actually get

done the next day? And I found that that's been helpful then because to deconstruct the ways that AI can help you rather than thinking across the board. This is a little weird or

board. This is a little weird or uncomfortable thing where I kind of believe road maps are kind of dead outside of high level direction. So,

it's like this is the direction we're heading. But if I can scope it down to

heading. But if I can scope it down to we're going to do this, then this, then this, then this, then this, and I can scope it to any detail, then it's almost done. And so it's kind of like, okay,

done. And so it's kind of like, okay, great. We're like able to rapidly

great. We're like able to rapidly execute, and then once we do a thing, we learn the next thing to do. Like, we

couldn't have known the next thing to do until we did the first thing. I know

where we're headed. I know that we're going to have to solve many problems along that way. And how we solve those problems and the specificity of that road map is going to be a lot bigger than it probably used to be. And one of the things that we've learned in content

centers this week is that how you're prototyping should also match the problem you're trying to solve. And so

for example, now that it's easy to jump to a prototype in cloud code or any other tools, like maybe a really good skill or a really good prompt where you're learning what the actual functionality is that you want, the

output that you want is actually what you need to learn. Um, and so it's both finding a problem and figuring out the right type of prototype or the right level for what you're trying to learn because for me personally, sometimes

I've jumped thinking about the UIUX when actually I just needed a damn good prompt to teach me what I wanted it to do and then get to that.

[music] [music] How was the morning stand out this morning?

I had a good walk about yesterday with Sahes. This notion of like what is the

Sahes. This notion of like what is the road map when the road map can change completely in a week or two just based on the velocity of what you can ship.

It's a really interesting challenge. So

increasingly we're really focusing on like how do we build a brand that resonates and is distinct not get lost in the fast growing tech bubble. Right?

I use the product every day for four or five months. And so I know that once

five months. And so I know that once people get into the product, things work beautifully from there. Meaning we

activate very well. People build a new habit after a while they're pretty comfortable paying for it and then they stick around for a very long time. That

whole part of the funnel is bestin-class of any software company I've ever worked at or seen. So the biggest challenge is no one's heard of us, right? And so a disproportionate amount of my time is

thinking about well how do we change that? We currently have EU campaigns

that? We currently have EU campaigns which are reaching the UK with generic content. So are you able to see

content. So are you able to see benchmarks from those campaigns and then compare the performance of this UK?

We can do the country breakdown within EU. We haven't done the baseline

EU. We haven't done the baseline comparison because that for me is really the real we're trying to understand which is should we bother doing specific stuff at a country level. My inclination just

because we're bucketing multiple markets at the same time is metas in like team efficiency right so it's just going to serve to the most likely candidates

sometimes that means you sometimes that means p Germany where wherever so it's not necessarily like a clean onetoone comparison but that is a good question in that like maybe there's some insight

to help us build the baseline for this this market speaking of building baselines for this market we also have India specific creative that has been running with Namisha. Um we have two

separate um well we've had our Android uh campaign going since launch but we now have a desktop specific testing campaign within India um with quite a

bit of variety of um UTC based [music] ads.

[music] [music] So basically the idea is that when you're in office setting like if you're dictating and like it can pick it up well with like your MacBook but number

one if your MacBook is closed the mic isn't isn't enabled so won't pick up the audio and then secondly you also feel kind of awkward because you might be like speaking a bit louder than than usual. So with the mic, if you have it

usual. So with the mic, if you have it like up close here, I'll show you guys how good whisper flow is. Actually

whispered into the product. So

hey, let me know if you're running me like wild. And I was running behind.

wild. And I was running behind.

[laughter] But yeah, so if with the mic then if if you put that into consideration, then people are like working and they can just be like whispering without creating like a lot of background noise in an office or in like an open space like

this where everyone has mics and we're all talking with Whisper. Um it creates for like a less noisy environment. And

the idea is that if we send these to offices, um then Whisper will be adapted faster and more people will start using it and they'll be like, "Oh, why do you have a mic on your desk?" And it's like, "Oh, I'm using Whisper." And they're like, "Oh, like what is that?" And then

it creates like that open conversation to people for people to get introduced into whisper.

So as a linguist my main role is to kind of bridge the gap between the ways in which we speak with all its inherent messiness. So we put in a lot of filler

messiness. So we put in a lot of filler words like um and like. We also speak in long run on sentences in general. And

what we expect when we write is to see not that right. We want precise sentences. We want punctuation. Things

sentences. We want punctuation. Things

that are implied by what we do when we speak but really aren't there. And so my job is to kind of bridge that gap so that we're still honoring what people said in the tone that they said it in the words that they chose by meeting

their expectations as if we are writing exactly as if they would have typed it themselves.

[music] I saw you writing a lot in the morning stand up this morning. Tell me about that.

So, I think what's really cool about flow is it helps you clarify your thoughts. Um, but there's no substitute

thoughts. Um, but there's no substitute for pen and paper to get down certain types of thoughts. We're saying that voice is one way to share your thoughts and it's fast and it helps you be very

conversational and natural and that helps me get out more things than I would have if I were just writing all day. So each has its place. But yeah, we

day. So each has its place. But yeah, we write to think, we talk to think, we talk to express ourselves. And flow

[clears throat] is just one way of capturing that, making sense of it, and helping me do it a lot faster. And every

afternoon people would come and do some follow-ups. I would take notes. Um, and

follow-ups. I would take notes. Um, and

then we also did a hanging contest.

That's where I made the leaderboard. And

this actually came from on the day of our launch we like initial desktop launch we had a product hunt out and at every hundred up votes I was like let's

all like someone has to do something. So

we had people do like pull-ups like at every hundred we had to do an activity.

I've got a question here. Where are you on this list?

Well I did a challenge with Tobin and loser had to buy sushi and let's just say I didn't pay for sushi so I won. But

I'm not I I did I never wrote my name on here but I have I have video. Oh, I'm

sure. I'm sure. I have video evidence. I

do. I do.

The audience is going to have to see that. It's like It's like picture it

that. It's like It's like picture it didn't happen. [laughter]

didn't happen. [laughter] I saw you got mogged in uh one of the YouTube videos where the the founder started doing muscle ups and then you got called out. You're like, "No, no,

I'm not on peptides yet."

[music] Right now, you speak and Whisper Flow will write for you. But we also wanted to do things for you. And so we have a team spun up that is running 20 experiments a day of how we want those

actions to work.

For a long time in in different environments saying meant not getting stuff done. [music] And so that paradigm

stuff done. [music] And so that paradigm shift we are actually saying is reducing the time from idea to capture and capture to execute is a really positive thing.

That is something that is just so special because what we sound like is what makes us individually us, right?

preserving people's voice [music] and authenticity when every other tool seems to just be saying like do it one way do it faster and that standardization is just something that like I worry about as an industry [music]

if you know that people want exactly what you're building on the research side makes a ton of sense to spend a bunch of time and energy building one big bet that you believe that you hold

and if it's an observation about the world then it's not even just like a thing that you believe that could be false that's [music] the only thing that's like really kept me grounded

personally with all of the the ways in which things change. [music]

Yeah. So, you're like dictating like tasks to employees or whatever. Let's

just say our product is pretty damn

Loading...

Loading video analysis...