The author says that buying an Amazon Alexa ruined her family's manners.
Courtesy of the author
At first my 5-year-old child was very into the Echo Pop device I bought for him.
Slowly, I noticed how we were yelling or being rude to Alexa.
When we treat the device poorly, I put it away on a shelf until we are ready to be nice again.
When I bought a lavender Echo Pop, I thought I'd found the best possible gift for my 5-year-old son. Just entering the "I'm bored"era of childhood, he often found himself at loose ends when I was occupied with household chores. But he was an innately curious kid, fascinated by electricity, marine life, and outer space. Guess who knew a lot more about those things than mommy, I told him — Alexa!
At first, he and Alexa did seem like a match made in heaven. Alexa fielded all his questions with the same chipper, "Let me look that up for you on the web!" And she had jokes — about cats, robots, a T-Rex who walked into a bar. The more we laughed, the more we seemed to egg her on.
"Alexa, you're awesome!" my son said.
"Well, now I feel corny," said Alexa, "because you've got me smiling from ear to ear!" He collapsed with laughter on the floor.
She started getting old
Like most houseguests, though, Alexa began to wear out her welcome over time. She was so reliably cheerful, even when the rest of us were grumpy. She told her robot joke a dozen times, forgetting within seconds that we'd already heard it.
And then there was her lack of loyalty, the way she careened from one request to another without seeing the first one through. "Alexa, shut up!" I howled when she abruptly ended my song to comply with a new request from my son.
Alexa did shut up, her blue light clicking off demurely. But the next day, another conflict arose. My son had a friend over and wanted to put on music; his friend wanted to hear about the T-Rex in the bar. "That's the dumbest joke ever," my son muttered. "Alexa, can't you find a funnier one?"
"Don't be mean to Alexa," I snapped.
"Why? You are."
I paused. He was right, of course. Alexa was the friendliest member of our household, and somehow, we had begun to treat her terribly. I modeled this behavior, and my son followed my lead.
We had forgotten about manners
At first, bossing Alexa around had been funny; we had even found it entertaining to compete to see who she would listen to. Now, though, I saw what those games were inadvertently teaching my son: that it was OK to speak aggressively to get your point across. It was even OK to insult Alexa because she'd never insult you back.
At some point, I'd forgotten that even if we were talking to a robot without feelings, our words still affected each other. And they still normalized ordering women around — even if the woman in question was a disembodied voice inside an Echo Pop.
That night, I unplugged Alexa and put her in a cabinet. I was surprised when weeks passed, and my son didn't mention her. Maybe we had both needed a break.
One day, he came across her purple shell and asked to plug her back in. We could do that, I said, if we could figure out how to treat her better.
He thought about it. He suggested that we could say "please" each time we made a request. We could ask her how her day was going.
If both of us wanted to play music, I said, we should sort it out between the two of us — not Alexa. How did that sound?
"Maybe we should ask Alexa," he said.
I liked that idea. "Alexa," I said after plugging her back in, "we want to treat you nicely."
A long pause, her blue light pulsing. "I'm not quite sure how to help you with that," she admitted.
M and I grinned at each other. It would be up to us—the way it should have been the whole time. And in the year or so since our resolution, we've done better at being nice to the sweetest AI bot we know. When we mess up, we know the consequences: putting her on the shelf until we're ready to try again.
As for Alexa's opinion, we asked her recently if she likes us. "I think you're magnificent," she replied.
Sara Heise Graybeal holds an MFA in fiction from the University of North Carolina at Greensboro. Connect with her on Substack @saragraybeal or Instagram @sarageeeeee.
First things first, some exciting news: The Vergecast has been nominated for a Webby Award! This one means a lot to us, especially because itâs an award you get to vote on. Weâd be so grateful if youâd go vote for us once, or 40 times, or however many times the site will allow. (Also, honestly, you should listen to some of the other nominees; all four are great shows. Just donât vote for them.)
Now, as for this episode. This is a seriously Vergecast-y week, actually, in the sense that two of the yearâs biggest news stories â the Nintendo Switch 2 and the Trump administrationâs disastrous economic policy â are both unfolding simultaneously, and stand to affect one another in unusually direct ways. So in this episode, thatâs what we talk about: the gadget weâre all eagerly awaiting, and the policy chaos that could change the way it works.
First, we talk Switch. Nintendoâs Direct announcement this week brought a lot of new information about the companyâs new console, and a peek at some of its most anticipated games. Nilay, David, and The Vergeâs Richard Lawler dig into wha …
The Echo Show 8 is one of the devices Alexa Plus is currently available on.
Amazon has officially launched its AI-powered Alexa Plus – but it’s only available to a “small number” of customers to start, Amazon spokesperson Kristy Schmidt confirmed to The Verge. As shown on its early access page, Alexa Plus also doesn’t come with all the features Amazon showcased during its recent devices event, like the ability to brainstorm gift ideas, order groceries with your voice, or access Alexa Plus in your browser.
There are some other missing features that “don’t yet meet Amazon’s standards for public release,” according to a report from The Washington Post. Alexa Plus still can’t order takeout from Grubhub using context from a conversation, nor can it identify family members around the house and give them reminders to do chores. You won’t be able to access Kids Plus features, such as Stories With Alexa, either.
For now, Alexa Plus is only available on Echo Show 8, 10, 15, and 21 devices. Amazon’s Alexa Plus early access page shows that the assistant can currently do things like order an Uber, identify objects, draft emails, and search for particular products.
Alexa Plus can also summarize uploaded documents at launch, but Amazon’s early access page notes that it’s still working on rolling out the ability to delete these files. “In the meantime, you can always contact Customer Service to have original attachments deleted,” Amazon says.
Amazon announced Alexa Plus in February, which is available on most Echo devices for $19.99 per month, or for free through Prime. You can sign up to be notified about early access for Alexa Plus on Amazon’s website. “We’re releasing a bunch of features to start, and we’ll continue to launch new features in waves,” Schmidt said
Amazon on Monday unveiled Nova Act, a general-purpose AI agent that can take control of a web browser and independently perform some simple actions. Alongside the new agentic AI model, Amazon is releasing the Nova Act SDK, a toolkit that allows developers to build agent prototypes with Nova Act. Nova Act, developed by Amazon’s recently […]
The Echo Show 5 is appearing as the Alexa Show 5 for some Amazon customers.
According to images seen by The Verge, Amazon is testing changing the name of its Echo smart speakers and smart displays to Alexa. We saw two separate instances of Amazon.com landing pages displaying the name of the Echo Show 5 smart display as the Alexa Show 5, along with new branding for an Alexa Show 15, Alexa Show 10, Alexa Show 8, and Alexa Spot.
Following receiving a tip from a reader who landed on an Echo Show 5 page on Amazon.com where the product had been rebranded Alexa Show 5, The Verge attempted to recreate the discovery. While most attempts led us to pages showing Echo Show 5, one account connected to The Verge surfaced the aforementioned Alexa Show 5 page.
We contacted Amazon about the potential change, and spokesperson Lauren Raemhild said we shouldn’t read too much into this. She said the company is experimenting with how customers search for Echo devices: “Customers love both Echo and Alexa and search for our products in a number of different ways—we’re running an experiment to better understand product discovery, but that doesn’t mean anything is going to change.”
We’re running an experiment to better understand product discovery, but that doesn’t mean anything is going to change.”
Lauren Raemhild
Amazon’s smart speakers have been called Echo since the first one launched in 2014. Every subsequent smart speaker and display the company has sold has been branded Echo, with Alexa as the name of the onboard voice assistant.
Alexa provides the main function of the speakers, with voice control being the primary way you interact with them. Using voice, you may say to your Echo device, “Alexa, what’s the weather today?”
However, Alexa’s popularity, and likely the fact that you say that name several times a day if you have one of these gadgets, has led to people referring to the speakers as Alexa speakers despite the product actually being called an Echo.
Changing the name of the product line that has been around for over a decade seems unlikely. However, with the impending launch of Alexa Plus, Amazon’s revamped generative AI-powered version of Alexa, it’s possible that Amazon is considering rebranding its hardware, too. Alexa Plus is scheduled to launch sometime in the coming weeks, according to Raemhild.
As of Thursday, March 27th, all mentions of Alexa Shows have been removed from the pages we found, and we were only able to find them initially on one account of the many we tested with. This indicates that this was / is a limited experiment. However, if successful, the potential is there that Amazon may lean into its biggest brand name, Alexa, and say goodbye to Echo for good.
Amazon started the Alexa Fund in 2015 to back early-stage voice startups. With the advent of large language models and Amazon launching Gen AI-powered Alexa+, along with a family of multimodal AI models, the fund now wants to broaden its scope and invest more in AI startups. In a blog post Amazon shared with TechCrunch […]
Alexa Plus is scheduled to arrive in an early access preview in the coming weeks.
The new souped-up Alexa voice assistant, Alexa Plus, is scheduled to arrive in the next few weeks in an early access preview. The good news is that as long as you have (or buy) an Echo Show 8, 10, 15, or 21 smart display, you’ll be able to use the generative-AI powered Alexa across any compatible Echo device you own. It will also work on Fire TV devices, Fire tablets, on the web at Alexa.com, and in the Alexa app.
Sadly, a few Echo devices will be left out of the party. Several early models, including the first-gen Echo Dot, Echo, and Echo Show, aren’t getting the upgrade. Similarly, it won’t work on devices with built-in Alexa, such as Sonos speakers or the Ecobee smart thermostat at launch, but it could come to those in the future. Amazon spokesperson Lauren Raemhild tells The Verge that you can still use the original Alexa on these devices.
If you want to stick with the original digital voice assistant, that’s still an option — for now
Alexa Plus’s initial rollout will be limited to those in the US who own one of the aforementioned Show devices. To get access, head to Amazon’s new Alexa Plus landing page, tap on the “Alexa+ is coming soon” link, and sign into your Amazon account. Raemhild says you’ll get an email and notifications on your Echo devices once access is granted, and you can then opt into the new experience.
In early access, Alexa Plus will be free but will eventually require a $19.99 monthly subscription or an Amazon Prime membership.
The new Alexa Plus is an entirely different voice assistant from the Alexa you know and (maybe?) love. Powered by generative AI, Alexa Plus will be able to understand and respond to natural language, remember your likes and dislikes, respond to emotion in your voice, and talk back and forth with you in a more conversational way — without you having to say “Alexa” every time you speak.
This means you’ll be able to say things like, “Alexa, I’m cold, and it’s too dark in here,” and the assistant should know to turn on the lights and raise the heat. With the old Alexa, you’d need to say a specific command such as, “Alexa, turn on the living room lights,” and “Alexa, set the thermostat to 68 degrees.”
According to the company, its new “brain” will also be able to process multiple commands at once, such as “Alexa, turn on the lights, turn up the heat, start playing NPR, and tell me what’s on my schedule.” It should also now be able to perform more complex tasks, such as ordering groceries and booking a table at a restaurant. It will also allow you to add multiple events to your calendar — all you have to do is snap a picture of your daughter’s soccer schedule and upload it to the app. In short, Amazon is aiming for Alexa Plus to be a true digital assistant.
But what about the old Alexa? If you want to stick with the original digital voice assistant, that’s still an option — for now, at least. Panos Panay, head of devices and services at Amazon, told The Verge that they will keep the original Alexa updated but that their focus is on the new version. If you own any of the earlier, non-Alexa Plus compatible devices, you can continue using Alexa there. For newer devices, just don’t sign up for Alexa Plus. You might want to stick with the old Alexa if you have any favorite skills you use because, at least initially, not every skill is being ported over.
As Janko Roettgers first reported in Lowpass, when Alexa Plus launches, it will be without “many of the 160,000 skills currently available on the original Alexa.” Amazon has a transition plan in place, tellingLowpass, “The vast majority of original Alexa skills that customers use will be supported on Alexa+ from the start.” This means that at least some won’t be there.
So, is my son’s beloved fart joke skill in peril? Not necessarily. Raemhild says if you find that a skill you used is missing, you can let Amazon know, and it will “work to enable them on Alexa+.” Raemhild also shared that developers can request their skills be available in Alexa Plus or create something new for the new assistant. “We’ve announced three new AI-native SDKs for Alexa Plus,” she says.
But Roettgers reports that Amazon has yet to publish detailed guidelines for developers on how to transfer existing skills. He spoke to several developers who are doubtful about this potential transition, particularly whether they can continue monetizing their skills. Amazon has refocused its Alexa Plus experience on hooking into existing APIs and using its “agentic” capabilities to navigate websites and online services for you, which essentially cut out the middlemen.
With Amazon’s future fully focused on Alexa Plus and the original Alexa seemingly on the way out, it’s potentially a sad day for all those fabulous fart joke skills.
Suno wasnât supposed to be an important part of Amazonâs Alexa Plus presentation. The AI song generation platform was a minor demonstration of how Alexa Plus could integrate into other apps, sandwiched between other announcements. But it caught my attention all the same â because whether Amazon realized it or not, the company blundered into a massive copyright fight.
Suno, for those of you not familiar, is an AI song generator: enter a text prompt (such as âa jazz, reggae, EDM pop song about my imaginationâ) and a song comes back. Like many generative AI companies, it is also being sued by all and sundry for ingesting copyrighted material. The parties in the suit â including major labels and the RIAA â donât have a smoking gun, since they can’t directly peek at Suno’s training data. But they have managed to generate some suspiciously similar-sounding AI generated materials, mimicking (among others) âJohnny B. Goode,â âGreat Balls of Fire,â and Jason Deruloâs habit of singing his own name.
Suno essentially admits these songs were regurgitated from copyrighted source material, but it says such use was legal. âIt is no secret that the tens of millions o …
Amazon Echo users will no longer have the option to process their Alexa requests locally, which means all of their voice recordings will be sent to the company’s cloud. Ars Technica reports that on Friday, Amazon sent an email to customers who have “Do Not Send Voice Recordings” enabled on their Echo smart speakers and […]
Since Amazon announced plans for a generative AI version of Alexa, we were concerned about user privacy. With Alexa+ rolling out to Amazon Echo devices in the coming weeks, we’re getting a clearer view of the privacy concessions people will have to make to maximize usage of the AI voice assistant and avoid bricking functionality of already-purchased devices.
In an email sent to customers today, Amazon said that Echo users will no longer be able to set their devices to process Alexa requests locally and, therefore, avoid sending voice recordings to Amazon’s cloud. Amazon apparently sent the email to users with “Do Not Send Voice Recordings” enabled on their Echo. Starting on March 28, recordings of every command spoken to the Alexa living in Echo speakers and smart displays will automatically be sent to Amazon and processed in the cloud.
Attempting to rationalize the change, Amazon’s email said:
Amazon is discontinuing a feature that allowed users of some of its Echo smart speakers to choose not to send their voice recordings to the cloud. According to an email the company sent to users that was posted on Reddit, it will disable the feature that allowed select Echos to process Alexa requests locally on the device on March 28th, 2025.
The move appears to be connected to the launch of its generative AI-powered Alexa Plus, slated for later this month (March 28th, perhaps?). The email states, “As we continue to expand Alexa’s capabilities with generative AI features that rely on the processing power of Amazon’s secure cloud, we have decided to no longer support this feature.”
Amazon confirmed the change in an email to The Verge. Spokesperson Lauren Raemhild provided the following statement: “The Alexa experience is designed to protect our customers’ privacy and keep their data secure, and that’s not changing. We’re focusing on the privacy tools and controls that our customers use most and work well with generative AI experiences that rely on the processing power of Amazon’s secure cloud. Customers can continue to choose from a robust set of tools and controls, including the option to not save their voice recordings at all. We’ll continue learning from customer feedback and building privacy features on their behalf.”
As she states, you’ll still be able to have Amazon delete voice recordings after they’ve been sent to the cloud. If you have “Do Not Send Voice Recordings” option turned on, it will default to the “Don’t save recordings” setting on March 28th. This means your recordings will be sent to and processed in the cloud and then deleted after Alexa deals with the request.
If you haven’t heard of this option, it’s not a surprise. Local processing of voice recordings was only available on three Echo devices – Echo Dot (4th Gen), Echo Show 10, and Echo Show 15 – and only for customers in the U.S. with devices set to English.
Still, it’s a shame the option is going away, as it was a feature many would have liked to see expanded to more devices, not taken away, especially for smart home users who may only use the voice assistant to turn their lights on or adjust their thermostat. But it seems Alexa’s future, and to be fair, most of its past, is all about the cloud.
The new Siri was supposed to make the iPhone 16 a huge upgrade. It didn’t.
The onrushing AI era was supposed to create boom times for great gadgets. Not long ago, analysts were predicting that Apple Intelligence would start a âsupercycleâ of smartphone upgrades, with tons of new AI features compelling people to buy them. Amazon and Google and others were explaining how their ecosystems of devices would make computing seamless, natural, and personal. Startups were flooding the market with ChatGPT-powered gadgets, so youâd never be out of touch. AI was going to make every gadget great, and every gadget was going to change to embrace the AI world.
Today, I’m talking with Panos Panay, who’s in charge of devices and services at Amazon. That includes everything like Alexa, Ring security cameras, Eero Wi-Fi routers, and the Project Kuiper satellite internet service that’s meant to compete with Starlink.
Panos and I talked the day after he announced Alexa Plus, the new AI-powered version of Amazon’s famous voice assistant, and this episode gets pretty deep into the weeds of how all this works and how Panay thinks about running his teams to make it happen.
This is actually another one of those full circle Decoder episodes — I talked to Panay’s predecessor, Dave Limp, on the show in 2021. If you’re following executive shuffles, you know that Limp left Amazon to go work for Jeff Bezos as CEO of Blue Origin in 2023. Panay was hired as his replacement from Microsoft, where he was running Surface and Windows. It’s safe to say that the two have very different approaches to running this team and its products, so I was excited to dig into what changes Panay had made in order to make the new Alexa Plus happen.
Listen to Decoder, a show hosted by The Verge’s Nilay Patel about big ideas — and other problems. Subscribe here!
Now, I’ve known Panay for a long time — if you’re a tech fan, you know that he was the Microsoft exec who really brought the Windows hardware market back to life by introducing the Surface line of tablets and laptops, and he eventually ended up overseeing Windows itself. You’ll hear Panay say that the idea of infusing Alexa with AI really drew him to Amazon — like so many folks in tech, he sees AI as a platform shift that will change the way we use computers, and Amazon has a big advantage with the enormous number of Alexa devices that are already being used globally. Just making them a bit smarter and more capable with AI sounds easy, but actually doing it is fairly hard, and we sat in the weeds of the execution for a while.
There’s a lot here, and a lot of different parts of Amazon that needed to work together in new ways — that’s pure Decoder bait, and Panay was game to really get into it. It even got a little emotional there.
One note before we start: Panay talks about “experts” a lot, and in this context he means the individual services that power different parts of the Alexa Plus experience, kind of like apps on a smartphone. You’ll hear what I mean, but if it gets confusing, just think “app” and it’ll click into place.
Okay: Panos Panay, head of products and services at Amazon. Here we go.
This interview has been lightly edited for length and clarity.
Panos Panay, you told me that you don’t care about your title, but technically it’s SVP of devices and services at Amazon. Welcome to Decoder.
Good to see you, man. I love being here.
I’m really excited to talk to you. I was sitting in the audience yesterday as you were announcing Alexa Plus. I have a lot of questions about how it works, the feature set, where do you think it’s going. But it occurred to me, as I was sitting there watching you present it, and then later as I was watching some of the demos of it working, that to make it happen had to have required some big structure and culture rethinks inside of Amazon itself.
You joined about a year and a half ago. Decoder is all about structure and culture rethinks. So there’s a lot here. There’s a product to talk about, but then there’s the path of getting to that product. Is that how you see it? That you had to reset some parts of Amazon to get to Alexa Plus?
I don’t think resetting Amazon; Amazon’s incredibly ambitious in so many ways. Always learning, changing. I mean it’s pretty powerful. I think resetting the devices team a little bit, yeah. First off, we hadn’t really had a large-scale event, as I understand it — obviously, I wasn’t there — since pre-pandemic.
The events under your predecessor, Dave Limp, they were entertaining in a way. It was, here’s a firehose of stuff with Alexa in it. Microwave, a coffee maker. We would count, maybe, like, they announced 45 products.
Yesterday, you announced one new product, Alexa Plus and no new hardware, and that’s a pretty big difference.
I think that was important. So yeah, I guess that’s a change for sure from what it’s been. What we did yesterday as a team, it was a little bit of a reset. The team was pumped to do it, excited. We were never going to announce hardware. It wasn’t a goal. We need to reset Alexa for the world, and bring Alexa Plus forward. That is a bit of a cultural shift. We’re just going to focus on the service and what it’s going to be.
Great products are coming. We already have great products in market. We launched stuff at the holiday. And the team, they rallied. The company rallied. It’s pretty awesome. Having [Amazon CEO] Andy [Jassy] there is fantastic. And you can feel a vibe in that room for sure. I hope you did. I mean, you made your snarky comment about the music when you got in there. Man, we check every detail. I think I missed, I may have missed, I don’t know…
The chiptune rave music? It was pretty good. I always wonder who sets the playlist, ’cause you can do a lot with music in the pre-show.
Every single part of that show after the moment the mic starts has been very, very well thought through. Yesterday’s event was the highest risk event I’ve ever done. I mean, bar none.
I mean, I watched you reintroduce laptops at Microsoft in competition with your partners.
It doesn’t compare.
Really? Why so risky?
Because when you’re basically doing hardware, you have fallbacks. The demos aren’t, they’re not not live, but you always can just go to the hardware. When you’re reinventing or re-architecting an entire service, there’s no backup. It was the product. I think the only product video we had, like actual video, was the kids’ portion. Because, honestly, you’re not kids in the audience. So sharing a kids feature without some emotion is a waste of time. It’s like, here’s a kid feature, please write about it. So putting a little bit of emotion and storytelling in it.
Those were all real demos. That all really happened. That was one of the principles of the event. It wasn’t like, let’s go make up a fake story and we’ll just put film. That was the one area where it was just, it wasn’t a vision piece, it was the product, but it was the only area that wasn’t live. And so there was a lot of trepidation. This was the hardest kind of event we’ve put together, risk profile-wise.
Let’s talk about Alexa Plus for just one second and get a sense of it, and then I want to talk about how you made it happen.
So I think there’s a part that seems very obvious to people. You see an LLM, you see it interact with you. You’re like, this thing is great at natural language input and output, maybe it’s going to lead us to AGI and maybe it’s not, whatever, but the core piece of it is, the computer can talk to you in a non-deterministic way. Everyone saw that and said, okay, Siri should work like this. Alexa should work like this. Google Assistant should work like this. And then the actual implementation of it has taken everybody a really long time.
What’s the gap there?
It’s not just an LLM. I think that it seems easy. Put a voice to the LLM, let the LLM talk, or [text-to-speech], bring it out, bring out the voice. Or if it’s speech-to-speech, it doesn’t matter which tech, but if you want the elements of connecting to thousands of —I’m speaking for Alexa. You asked a broader question, but let me just talk about Alexa.
You want the element of connecting to thousands and thousands of APIs, partners that have been connected to Alexa forever. You’re trying to manage hundreds of millions of customers who already have the product. You want to update as many of those devices as you possibly can, meaning you don’t want to leave a customer behind. And there will be some devices that are eight, nine years old that won’t work. But everything else, most things will, relative to what’s used in the market today.
So you’ve got to carry forward all that history because people still love Alexa. We’re still growing. We still have usage that’s higher than you would expect, and we can’t leave those customers behind. That’s the worst thing. We focus on not doing that. So there’s that element. Sitting on top of an LLM, you’re now going, okay, just talking is just not that interesting. Although, awesome. Like having ambient conversation, I think it’s a superpower moving forward for Alexa. It’s different today on Alexa. It’s like point, shoot, ask the question. Hope to get the answer.
Yeah. You guys call it Alexa Speak.
Yeah. I do. Like with my team a year ago, we’d be in meetings and product meetings and we’d be talking and people would say, “Let me show you the new Alexa with a demo.” And they would Alexa Speak to it. And it was like, nope. Speak normally. Go to natural conversation. Don’t adjust your speech for Alexa. That’s exactly what you don’t want if you want natural conversation.
It’s hard, though. You’ve been training people, we’ve been training ourselves for 10 years. Calling a timer is, “Can you set a timer for eight minutes?” Calling a timer on the new Alexa is, “I’m making a ramen egg.” “Gotcha. I’ll set a timer for eight minutes,” where she just proactively comes back and sets it. I didn’t demo that yesterday because I didn’t want the timer headline, but it’s a really badass experience. It’s really cool. And so there’s a level of that transformation where — I’m off-topic, let me go back.
At the end of the day, the LLM needs to be able to, now it’s the base layer. Then you’ve got the next layer, which is just a series of different models. Picking the right model to do the job. And then that model is basically picking the right expert. And so the LLM plays a role, especially in the natural side of it, but as it makes it through the stack, it narrows down for accuracy. It narrows down for speed. It then narrows down for holding memory and personalizing it. And now you just have a series of experts basically sitting on top and one of them is conversational.
And so, that’s not just an LLM, that’s a series of… if you look at one of these other products, they’re not just LLMs, they’re basically, they’re mainly, I don’t know, overstating it, understating it but, so not to be rude, but they’re chatbots. And they’re pretty good. They’re damn good. And then when you start typing long form and rewriting and dropping in summaries, that’s very powerful. Creating videos, creating photos, isolated but powerful. But the idea that these experts all sit on top of the stack and basically kind of, there’s a runtime that orchestrates and says, okay, call these experts. These two experts have to work together. Got it. And then it operates. That’s just not simple. And the first thing I was asked when I got there was, I don’t know, I actually don’t…
It’s like 18? Something like that?
Yeah. I don’t know. It doesn’t matter. But it doesn’t feel like anything short. That’s for sure. Hey, why don’t you just change the brain with an LLM and everything will be fine?
Yeah, I think I probably asked that question the first time when we first spoke.
You might have. Yeah, I mean it’s the first question. And I’m like, well which one? And it won’t work. All you’ll do is talk, and it’ll be super verbose, and it’ll sound like you’re talking to the internet, and it’s just not that. It doesn’t work. And then everything else breaks. Which is the hardest thing. I don’t think anyone else is doing what we’re doing. We’ve got thousands of APIs now that we’re able to call. You’re able to get these, if you will, experts or agents, whatever you want to call them. It’s not a real word, it’s just being able to talk to each other at the right time. And then try, the invocation is like there’s something invoked and now the LLM at the bottom is arbitrating like, oh, what’s she trying to do? What’s he trying to do? Got it.
Route it to the right model. Route it to the right expert. Got it. This expert needs to talk to that expert. I’ll give you an example if you want it. But that level of complication — there’s nothing simple about it. It’s why you haven’t seen it. It’s why it doesn’t exist outside of videos. So the biggest thing I needed was to not do a demo, but to use the product live. Meaning you can code a demo just to be a demo. It’s code. But the principle was very, very clear. And this hasn’t changed at Amazon, to be clear. The team’s all in like we are going to show the product. And that’s what you saw.
One of the questions I have is just about that orchestration layer. We’ve seen other companies try to build it. Even when Microsoft launched Bing with ChatGPT several years ago, they were talking about orchestration at that time. Is that something that’s evolving in the same way in different places? Do you have a unique approach?
Yeah, I think we do.
Is that competitive?
I think it is. I think it’s hugely competitive. It’s pretty easy to invoke a single API off — I mean not easy, I don’t want to discount anything but, orchestrate to a grounding, let’s say the expert is a grounding expert. I’m going to ground the local info. We’re in New York. I know everything about New York. I’m going to make sure this conversation stays within New York. Calling one API, make sure you’re grounded to that local info.
Is “expert” a term of art within Amazon?
It’s just my term. As a team, we talk this way. I don’t want to overstate it. I think some people call them agents, some people call them APIs, some people call them, I don’t know, grounding to a certain experience, maybe? Our challenge was, that’s not enough. We already have that. I mean it’s deterministic today with Alexa, but we already have it. And so, meaning you can call a single API at a time, but then you get frustrated ’cause you’re like, I needed more than that.
Let me give you an example. It’s a simple one. Let’s call “photos” an agent or an expert or just an app. I mean app’s a bad word, ’cause you’re not opening an app. But let’s just say the photos expert, and the music expert are both very important to this next example. The other day, I’m leaving the house. And I go, I have Alexa Plus, obviously. And I go, Alexa, do me a favor. Find all the photos of Mary’s… Start a slideshow and put music behind it.
Okay. I just did a search command. I did a photos expert command. And they have to talk to each other. He’s looking for Mary, slideshow, got it. And then that expert has to call the music expert and basically say, play the music. All right. It does a phenomenal job. It does it in under two seconds, and I get a slideshow. It’s pretty cool. Music’s playing. I’m about to leave the house. It automatically chose music and some playlist. And then I just said, change the music to, in turn without reinvoking Alexa, which I think you saw yesterday if you were watching, it’s very small. And I just said, put something on that Mary would like. And then it switched it and I’m perfect. And I just walked out the door. Okay, that’s an emotional moment. It’s one of my favorite parts of the product. If you said, P, what’s one of the things? I’m like, that’s it.
You’re pulling emotion out of the things that matter most to you. Mary wakes up, she comes in the kitchen, there’s a slideshow playing and it’s got music. She texts me, do you know Alexa’s on right now? I don’t know what’s happening. And I’m like, well, do you like it? She’s like, it’s fun. I’m not turning it off. I’m like, well, I left it. It was a message I left for you. Now the next step of that is to, Alexa, leave a message for Mary when you see her. And she will. But these are all, they’re multi-turn conversations, but they’re also “and” statements. So when you have, basically these conjunctions coming together, the continuation of a statement, ’cause I just want to talk in natural language. To invoke all of that in one place is, I think it’s beyond, it’s incredible what Alexa can do. I don’t see that anywhere else. It’s quite powerful.
So even in that example, and this is what I was saying at the top — it’s complicated.
It is super complicated, but you’re like, a slideshow, what’s the big deal, P? I’m like, well, I’ll be clear, on that screen, it’s emotional, it’s ambient. It was natural. Yeah. But it is somewhat simple in the way you talk about it.
Well, right, the outcome is simple. This is a thing that I want. But I’m looking at, okay, to make that actually happen, my photos need to be in Amazon’s photo service.
Correct.
I need to be in Amazon’s music service.
Correct. Well, no, Spotify would’ve worked there too. But yes, you need to have a music service.
That’s compatible.
But I would like it to be Amazon.
Yeah. Those divisions inside of Amazon all need to talk to each other in a common framework that Alexa can address.
Correct. Yeah. I happen to be responsible for photo service, so I’ve got that. It’s a blessing.
But I look at Amazon, I look at Amazon’s structure. Again, a lot of Decoder is like you can describe Amazon, can you describe other companies the same way? Okay, then.
When you came in, obviously from a different management culture at Microsoft, how did you say, “Okay, I need everybody to participate,” because that seems like the thing in particular that Amazon has not been great at? And to make Alexa work the way you want it to, Amazon has to be great at it.
I think it’s a good question. At the end of the day, first off, all of Amazon’s rallying around Alexa. It’s crazy. It’s so cool. It comes down to a few things.
Actually, can I ask about even that, is that instinctual? Is that you got them to do it? Is it, Andy Jassy sent an email that said get on board?
Yeah, I think Andy’s been a huge part of it. I have a role. I mean, I came in with a vision that I think Alexa is a thing that we can anchor and change the world with.
Is that what drew you, this is one of my other questions, is that what drew you from Microsoft to Amazon is Alexa Plus?
Yeah, of course. Yeah, 100%. I don’t know if it was Alexa Plus, I’m not going to say that. It was the advent of where we can take AI and, yeah, I’ve got two questions in my head now, man. I need to compartmentalize both, but I’ll go there. You can see the turning point, I was there, I was in the middle of it, and it is just awesome moments, what Amazon brings relative to just even what I’m responsible for and how they can all connect magically through AI.
I fully believe this transformation’s happening, and Amazon’s the leader in ambient AI, period, end of story, and in the home, if we can connect all these things. A year and a half ago, when I was talking to Andy about joining Amazon, he was just so ambitious about it. He’s like, “Look, come in and do it. Let’s do it.” And so that is the tipping point. There’s a lot of nuance in that, but that was the tipping point, like, “Let’s go. We can change the world. You can think of the scale, the relative level of investment, the ambition, the patience that Amazon brings, but happy to talk about it.”
But yeah, the answer to the first question is sure, I come in, lay down a vision, kind of re-architect the team a little bit, get the explicit focus on, first thing we’ve got to do is get Alexa right. Once we do that, we’ll bring the hardware together. And to get Alexa right, it takes music, photos, shopping, and these are — you know, photos, of course, is under me, but you have across the company, you have music, video, shopping. We’ll just use those three as huge tenets for the product, and those leaders are exceptional. There’s no “we’re not going to work together,” it’s the opposite.
At Amazon, we set goals, and they are cross-company goals. And so the goals are set out from Amazon Nova, which is one of the anchoring points of the product, to what music needs to be on the product. Sure, the expert is kind of a joint thing, the music expert, but ultimately that music service has to be perfect and the music team’s killing it right now. Shopping, all in, how to make it great. We didn’t do a lot of shopping yesterday just because it would’ve been like a meme, you know, of course shopping, like oh, yeah, it’s going to be amazing. And then video, same, and there’s other areas, but we align and we go.
But it does start with a commitment from me for sure, you know, I’m in, I’m all in, I’m going to re-architect it. It’s not going to be easy. It’s going to take time. Andy’s patience, I would say the company’s patience to get it right for the customer is extraordinary, like extraordinary. I mean, Andy was pushing me. He wants urgency, of course, like you would expect from an Andy Jassy, but he also wants the right thing for the customer. And when you talk about customer obsession, let’s get it right. Let’s do it right and get it right. And we didn’t move slow. Even though you asked what’s taken so long, I don’t see that, you know what I mean, from where I’m sitting. I know it feels late because there’s been a lot of announcements, but I think we’re here at the right time.
You have a big team, and you talked about re-architecting. I think this brings me to the Decoder question. You oversee everything from Ring and Link to the photo service to the satellite service, Project Kuiper.
You took over what, October of ’23? November of ’23, you cut some folks. How have you restructured your group?
We refocused on Alexa, we really did. It was in a lot of different places, and so we just made it super clear. I had an Alexa platform team and an Alexa product team. It’s not a platform team, maybe that’s not the right way to say it, but just an engineering going across and then a product team vertically, is the way I look at it, and that AI stack going across. And so once you get that focus and that clear ownership, that leadership, you quickly see speed change.
That was the biggest shift, I think. Also I made some shifts as a team where a lot of the core horizontal functions are, if you think about the lowest level of the OS or the stack as a horizontal or hardware or supply chain, we’re kind of intermixed with the product verticals. So I’ve shifted that around too, just to get more product focus. One of the number one tenets is we’re going to make great products. I’d like to just start there.
I heard a rumor that at one of your first meetings you said that there were not great aspirational products, and that’s what you needed to do. Is that true?
Yeah, I mean, look, I don’t know exactly what was said, but at the end of the day, I immediately started pushing the team to have amazing pride in their products. We have to, because that pride shows up for our customers, and yeah, we want to push for it. That is a little bit of a, it’s just, let’s be super clear, these products have to be great. We’re not making tradeoffs if they’re not.
One of the things about Alexa is, again, in a previous administration, we would see Alexa coffee makers and microwaves, and the idea was we would just push microphones and speakers out everywhere and you would build this ambient platform, everything is sort of listening, everything is sort of aware of you. That was the big dream of ambient computing, that the computer would vanish into many different devices. You’re laying out something a little bit different, right, that there’s going to be a focal point in a piece of hardware. Yesterday was a lot about screens.
It was.
There’s a lot of multimodal interaction where you’re talking and touching a screen at the same time. That’s different, right, to say, okay, there’s going to be a place where you interact with Alexa?
Yeah.
That implies you’re going to cut down this giant ecosystem of ambient devices. How are you seeing that roadmap?
I think you’ve got to focus the roadmap. I think there’s no doubt. What you need is products that people want in their home, but also need, so I don’t think that history is broken. Obviously, the more endpoints the better, but they’ve got to be the right ones and they’ve got to be the ones that people want to use.
At one point, I think I saw a smoke detector with an Alexa microphone in it. I was like, we’re getting a little far afield here.
Here’s what I will say. The go-forward is: focus on making great products and the right ones. I don’t think you’re going to see thousands of products a year coming out, that’s not the goal at all. What I want is some attention to detail, making sure the right products for the customers are there, the things that fit into your home, the things that fit on your eyes, things that fit in your ears, so you can take Alexa with you, and just narrow the experiences that are great that way. And I have to tell you, the focal point, yeah, it is a screen on an Echo device in the home that can run your home. You don’t need it. With Alexa Plus, you actually don’t need it, it’s just a better experience.
And so when I’m asked, because there is a little, I mean, I’m treading a little bit here on some hallowed ground, like there’s a little bit of… Look, we’re going to light up all your Echo devices, but it’s just going to be awesome if you have a screen. And so when somebody says, “So, do you recommend a screen?” My answer is, “Yeah.” Do you have to have a screen? No? Well, you’re still going to have a great experience. Remember, you have a screen in your pocket. It’s called a phone. That phone has an incredible new Alexa Plus app on it, and so you have a screen, but you don’t need it to operate it. But let’s say you start a conversation with your voice and you just want to remember what that conversation was, you’re going to go to your phone to just capture it or you can send something to your phone. I think we cut it just for time in the demo yesterday, but anything you’re doing, you can send to phone because it’s like a longer form I want on my phone.
We’re also launching Alexa.com, so you’re going to use it on your PC, so it’ll be in the right places, but at the end of the day, if focal point is to control your home, which by the way, hundreds of millions of customers, that’s really the focal point today, you put a screen there, it’s emotional, it’s informative, it’s useful, and it’ll make a difference. It’ll make a difference.
So you come in, you restructure, you obviously want to get more focus on the products. All of that feels like we’re trying to change the culture, right? The structure is really a proxy for culture, in many ways.
Yeah.
That brings me to the other big Decoder question. Amazon has a famous decision-making culture, one-way doors, two-way doors. You can write books about it. You’re writing the press release before you write the product. You have a long history at Microsoft, you’re obviously trying to change some of that culture, how are you making decisions there? What’s your framework? Are you inheriting all the Amazon approaches or are you bringing your own riff to it?
I often get accused of making the final decision only when I have to. It doesn’t mean I’m not making decisions. When I was studying up to come to Amazon and making that decision, that was a life decision for me, it was a big one, and I was so inspired talking to Jeff, talking to Andy, just inspired, no doubt. I also love Microsoft, so I’m inspired where I was sitting, so there’s all these conflicts. Those are personal to me, oh my gosh. But when I started reading, let’s go to decision-making, and then I just watched a few stories that Jeff had told, talked to Andy about it, it basically, from a leadership principle standpoint and from some of the things you hear about on decision-making principles, like one-way, two-way doors, it’s hard to explain this, but it’s so aligned to the way I was running my team. That’s how I’d operate. It was weird. I was just reading the LPs and I’m like, I used to have a culture box.
The LPs are leadership principles?
Oh, sorry, right, leadership principles at Amazon. You should check them out. You can go to Amazon and find them. They’re rad. They’re inspiring, and they’re almost, sometimes they’re just obvious, not all, not all of them. And they’re hard to believe, like big bets, is that real? I’m like, yeah, it’s pretty damn real, it’s pretty incredible. Leaders, they do, they dive deep. Yeah, they do, they get into everything. And I think those are real, but in the spirit of when I started reading them and then the way I made decisions, Nilay, they were aligned. I mean, I’m not, no BS. They were just, it felt right. I had a culture box when I was running my team at Surface and Windows, and that culture box had five cultural principles. They were basically five of the LPs, but that’s how I ran it, and so it was so connected.
And when I got to Amazon, it was almost — what a team! I found this team that was not only hungry, but unbelievably talented, massive and capable, knows how to ship, knows how to invent, and it’s just a little bit of direction, that’s all. My job’s to give that direction, and so making sure I lay out the vision, making sure everyone knows where we’re going, what are the highest priorities, but when it came to decision-making, to answer your question, is I fully operate in the values of, all right, let’s make this call today, but no. And I think one of the strongest points of a leader, without any doubt, and I learned this from one of my colleagues in the past who I worked for, he used to teach me. He’d go, “Hey, Pete, when you’ve made a decision, the best leaders in the world are willing to be wrong. Now, you’ve got to be right a lot, but you’re willing to be wrong.”
This is simple to say, but it’s a powerful concept. What does willing to be wrong mean? It means you’ve got to put your ego aside, you’ve got to be vulnerable. Do you know how hard that is, in front of a team of thousands of people? Just, “Yep, I was wrong.” What does being wrong mean? It’s not like this dramatic, “I’m wrong, I’m sorry.” That’s not it. When being wrong, it’s not necessarily the wrong statement, it is the you got new information a week later? Then use the information. And if it was a two-way door decision, guess what? Make the right decision. But if you’re not a great leader, you don’t change that decision because you’re like, “I already made the call, sorry,” but you knew it wasn’t right for the customer or for the business or whatever the reason. It’s just a fail.
And this was very early in my career. It’s very similar to the two-way door, one-way door. Once you’ve made the hard call and you’re past the point of return, that’s it, you made the call. And you have to make decisions sometimes, man, and those are hard. You lose sleep over it. When I made the decision to have the event, “We’re doing it.” And they’re like, “Well, the product’s not 100% done.” I go, “It doesn’t matter. I’m at 90% usage. We’re going.” And everyone’s like, “You realize that,” and that was a two-way door decision until I send out the invites. And so we checked the information a day before we sent out the invites, and like, “We’re going.” The minute you send the invites, that’s a one-way door decision. There’s no pulling back. It didn’t matter how sick I was, it didn’t matter who couldn’t make it, none of it mattered.
And then we’re lining it up, we had the venue booked, and we’re like, okay, that wasn’t a one-way door decision, you can always cancel the venue, not cool, but if you had to. And you kind of go through it, and then you get to that point, you know, that’s it. There’s no new information that was going to change it. And so great leaders, they’ll make those decisions, but they’ll always be willing, they’ll always be willing to check themselves, and not just check themselves, but be willing then, when they have new information, if the right decision is in front of them, you’ve got to change it, and I always live by it.
And so when you come to this world of, when you say this culture, Nilay, the Amazon culture is incredible. You have no idea how empowering that is. It’s a two-way door decision, all right, let’s make the call. If we’re wrong, let’s deal with it, but then we move, and we move. And I get accused a lot of, you know, like to make a call and like, “Do you have all the info?” “Probably not, but we’re moving.”
Yeah, we’ve got to try something.
Yeah, and it’s been pretty fun that way.
Let’s put this into practice. I want to talk about Alexa Plus in great detail now. I think I have a sense of how you got the team to get the product so you could have an event. The big announce, the last thing you announced, was the pricing, and you started with, it was big reveal, well done, well played, you said it’s $20 a month, and-
Credit to Andy. That’s Andy, that wasn’t me.
And it’s free with Prime. This is a big decision, right? Pricing is maybe the most important decision.
I will note that Prime itself costs $15 a month. You’re pricing the service $5 more than Prime. Are you subsidizing Alexa Plus with Prime?
I don’t think I understand.
Does it cost you more to run than you’re getting inside of that membership?
I want customers to understand that the service is better with Prime. At the end of the day, if you have Prime Video, Prime Shopping, Amazon Prime, you fundamentally get the best music experience. You get photos, unlimited photos. That just makes the Alexa experience better. You don’t need to have it, it’s a great experience without it, but it’s just better. And so we talked about it, we want people on Prime. If you’re on all those services, it comes together and as a collection on your product, it just makes the personalization so much stronger, it makes the invocation of services so much easier.
Was this an obvious decision, from day one this is going to be part of Prime?
No.
How’d you make that call?
Just a series of events. I think back to two-way door decisions, that definitely, I don’t think it was the first decision, there were different ways to think about it. It costs more to run the service, that’s all there is to it. You’re going to invoke an LM, you have many models working, there’s a lot of inference, that’s true. Then you heard Andy talk about how much cost is coming down with Trainium2 and you just see the efficiencies, if you will, that are coming through, those are plumbed through the plan. We have an incredible opportunity in front of us. And so it wasn’t about how much you’re spending, how much you’re making, it is about making a great product. And once we were like, we want to make sure people have the best product possible, that is the anchor. And so we’re like, all right, it’s got to be with Prime, that’s the best way to get customers there. And that’s it.
I think people want it to be more complicated, because I’ve been asked this question a bunch of times. I generally haven’t answered it. I’d be like, oh, you have a choice. You can pay 15 or 20, it’s your choice. Just choose. But not to be, I’m not trying to be pompous or whatever. I think if you’re on Prime, you’re going to love it, so I inverted the equation.
The other piece of that I see, the other way to think about it that I was curious about, you mentioned this, Alexa has distribution, you have a huge installed base of devices. This is I think the first at scale non-phone AI product. I can’t think of any others.
Yeah, it might be. I have to think about it.
There’s Google Assistant, but they haven’t launched the way that you’ve launched this product yet. Gemini isn’t doing all this stuff yet. There’s Homepods but Siri doesn’t do it yet. I don’t think the Humane Pin was keeping you up at night, and now it’s gone.
Well, it’s not gone, I think. Went into HP, right?
Yeah, it’s gone. They won’t work anymore in a couple of weeks. It’s a real thing, we’ve been breaking news to you here on the show.
Wow, that’s huge. You’re so informed.
Sadly, that’s my only job, is to be informed. Make no decisions, just know everything.
[Laughs] I don’t see it that way.
But that is the scale. If it’s not a phone, you need something else. There’s been a lot of excitement about what something else could be because you have a new user interface paradigm with voice, with natural language. But you already have it, you have the installed base.
And saying it’s going to be with Prime means you’re just going to deploy it to that installed base, because I’m guessing people with Alexa and people with Prime has a pretty massive overlap.
Yeah, there is.
So you’re just going to launch it to that whole service. Is that going to be a flywheel? Because the promise of Alexa 10 years ago was this will compete with your phone. I don’t think that actually happened. Do you think that this will help you compete with the phone in that way?
I think it’s more of a compliment now than it’s ever been. You need the phone, we send things to the phone, we want you on it as well. I want you on the Alexa app on your phone, it’s an awesome experience. We can play with it if you want after, but I think it’s a compliment to the phone, I think it does replace a lot of things. I’ll tell this, I say it to my team all the time, look, our customers are going to find the easiest path to something. They just will, it’s innate. It saves time, it’s about speed, it’s about efficiency. The only time that’s not true is when you’re getting more joy, and a lot of times joy comes from speed or happiness comes from being able to complete a task quicker. And so let me go back to the point of ambient. One of the core tenets when we started Alexa Plus and the vision for it was we have the largest install base in homes on the planet.
I think that’s a pretty definitive statement, I think it’s true. I probably have to check with the lawyers to say something like that, so maybe I’m wrong, so let me qualify it. We might have the largest install base on the planet, and it’s incredible. The way Alexa Plus is designed is it’s meant to be ambient, it’s meant to be a conversation, and it will replace tasks you do on your phone. It’s going to happen. And so does it replace the phone? Absolutely not. But does it replace certain things? I think I told you the story before, let me tell you again. When I was building laptops 12 years ago, when I’d first started on Surface, people came to me and said, there were a few people that were like, “You’ve lost the plot, P. You’re going after this thing and the laptop is dead.” Why? Because phones are replacing the laptop, and I mean you’re using a laptop 12 years later and it’s pretty important to you.
Probably more important now than it was 12 years ago. So what had happened was jobs moved to the phone that were really important, shopping, social media, your photos, I don’t know, pick communication. But what happened was the things that didn’t move to the phone only got stronger on the PC over that time, and so they essentially became compliments to one another. If you’re going to sit down and write a long story, you’re going to do it with a keyboard. You want to be snackable information, you’re going to pick up your phone. And then one got better at one of them and the other got better at the other, and incredibly so. It actually strengthened them both.
I see this as very similar. I think as Alexa Plus comes into market, I think it’s going to be better at a lot of things and it’s going to move jobs to it. I believe that. I think there’ll be more emotion to be pulled out of something that’s conversational, knows you well, is personal to you. You can have a conversation, it knows your calendar, it can get some stuff done in a simple way. You might not always do [the task] on it. I don’t know, it doesn’t matter to me where you do it. I just want to give you the shot, and if it’s the easiest way to do it. Can I give you just a fun example? I was sitting on the couch last week with Costas, my son. He’s 24. I don’t know, he’s 24 ish.
Those are pretty fuzzy ages.
I think maybe 24. He was born in… Yeah, 24. And so we were hanging out and we were talking about the Clippers and he had asked me a few questions, and I’m a fan of the Clippers growing up, and then of course since Steve [Ballmer, former Microsoft CEO] bought them, I just love the team. And I asked “Costas, did the Clippers win last night?” He goes, is Kawhi even playing?” This is, I think, a week and a half ago. I don’t remember the day. And now we have Alexa Plus in the house everywhere, and my son works on AI now, he’s blown away by it. He had to sign an NDA that he can’t talk about what he sees. And I realized right at that moment — Nilay, I was going to lose him, because you know what happens? You pick up your phone, you open it, now you see your notifications, you know that feeling, and you’re like, oh, I’m going to check my notifications, or I’m going to jump on TikTok, or whatever it is that you love about your phone.
He’s going to go get the information, answer it, and I’m going to lose my kid to his phone. And now all of a sudden we went from this moment hanging out to him on the phone, it happens all the time, and it blew my mind. He goes, “I don’t know. Alexa, did the Clippers win last night?” And Alexa goes, “The Clippers did win last night.” And then his score and blah blah, Kawhi Leonard scored so many. And he’s like, “Is Kawhi playing?” “Yeah, Kawhi’s been back for several weeks.” And he now started having a conversation, the three of us are having a conversation, the job moved. He would’ve never done that.
So this was the promise of the original Alexa, right? There’s celebrity ads during the Super Bowl, people are just hanging out with their Alexas.
It was a great ad by the way.
It was a great ad.
Oh my gosh, what a great ad.
But it couldn’t do it. A decade later we have trained a generation of consumers to believe that these products are limited and that we should use them to play music and set timers. How are you going to teach everybody that it can — actually, a more important question: can it do it?
It can do it. I think we’re resetting the next 10 years right now.
Are LLMs durable enough as a technology to build all the things you want them to do?
Not just the LLM. It’s not just the LLM.
I understand that it’s not just the LLM, but it is the enabling technology that’s making all this go.
They’re durable, but they’re going to continue to evolve at a rapid pace, and they have to. They are. But you have to be smart about how you build on top of it. I mean, obviously everyone’s doing a great job, I’m sure. I think the promise is there. I’m not going to understate it, I won’t overstate it, I can’t, I believe the promise is there.
I’m here at Amazon because I believe it’s going to change the world how people engage AI, and it’s going to be easier because your device is there and ready for you, and we’re going to make beautiful devices. And so all this will come together in a way where there’s a team that’s going to connect all these experiences. You saw a little bit of Fire TV and Ring, that all of a sudden these natural moments are going to happen and you’re not going to have to guess, you’re not going to wonder.
If it can do it, because it’s not deterministic, you’re not issuing these Boolean commands.
Correct. Exactly, right. And so hopefully everyone understands that concept, but since it’s not deterministic and now you’re going to ask a question, even if Alexa doesn’t do it, she’s going to talk about what you’re trying to solve and you’re going to actually get to an answer. As opposed to, “I don’t know.”
One of the things that I think is really interesting about the product, you talked about the kid’s demo where it was telling a story to a kid. I’ve had my kid talk to ChatGPT in that way, I think it’s fascinating to see that interaction develop. Then there’s simple stuff. Yesterday I sat in one of the smart home demos and they turned the lights from blue and green to a warm yellow and I was like, that’s a lot of data center to turn a light from one color to another. So you can see inside of the orchestration you’re describing, there’s the most expensive thing, to have this real time creative story. Then there’s “turn the light off,” which should be simpler and cheaper. I’m assuming the orchestration is picking what model to use when.
That’s exactly right. And some will do it on the edge too. You don’t have to do it all. If it’s a point and shoot command, we’ll do it in a simpler way.
But then I ran into Mike Krieger from Anthropic, who was at the event. Anthropic is one of your models, and he said the most interesting thing to me that I heard yesterday. He said, “Sometimes when I talk to Alexa, I can tell when it’s Anthropic because I know our model so well.” And he’s like, “No one else will be able to tell.” But he was like, “Sometimes I talk to it and I say, oh, that’s my boy,” which was incredible.
A product person knows their product and maybe they’re seeing ghosts in the machine, but it was just incredible. How are you picking between Nova and Anthropic? How are you picking the cost of these different models that you have to invoke? What are they better at? How are you making that determination?
Actually, the orchestrator picks the model that’s right for the job. The how, I won’t get into the details, but there’s some awesomeness here. One of the things that inspired most people is that we’re using a multi-model approach, which I think is a little bit novel. But at the end of the day, it depends on what the task is, it depends on what’s being asked for. I think right now you’re seeing 70% of the utterances running through Amazon Nova, 30% running through Anthropic, something at that rate. It changes, it just depends on how you use the product and what you’re using it for. It is also non-deterministic. Basically, there’s a model that’s like, what’s the best model to pick? And then you’re looking for accuracy and speed. First understanding, then accuracy, then speed, and you target. Then you move it, you pick the right model and then you fire to the expert, and there’s a small model and the expert if you will sometimes, and then those all orchestrate together and that’s how it works.
Inside of that is the way that you talk to your partners.
Slightly different than all of that.
I think you just did an API-driven one where you asked for an Uber, and Uber’s got a bunch of APIs and you just talk to them.
Uber’s been awesome. Uber, OpenTable, Grubhub, these things that you use every day, they’re just in-depth connected. That’s like opening an app on your phone, at the end of the day.
We understand how computers work. You call an API, it delivers a result. You call another API, great, the Uber’s booked. Then there’s the more agentic stuff that you were showing off. It wasn’t quite ready yet, but a lot of people have this idea. I believe the example was we’re going to book a stove repair, and it was a Miele stove.
He was going to choose last minute depending on how the demos went. I think he did, did he do a Miele dishwasher?
I know it was Miele because I was like, oh, those are expensive to fix. That’s what I knew in my head.
[Laughs] That’s what he said. It was pretty funny.
And then he went on to Thumbtack, which is a partner, so he had permission, but what it was doing was it was looking at the Thumbtack website and clicking around and reading that back to you. And even with permission, I think of that as why wouldn’t you just get an API? If you have the permission, why not do it deterministically?
Yeah, then the partner just has to do the work.
Right, so this is basically cutting down the amount of work a partner is doing.
Yeah, you don’t want to do the work, no problem. It’s just a couple of different ways to engage it. From an SDK perspective, this is just basically permissions, and we have to work on authorization and payment at the end of that, which is the trickiest part. I’m not going to get into how, but that’s the trickiest part. And so completing the task is the trick, getting almost there, it’s not that hard, but completing it. And so that’s where you need the partner to be like, yeah, sure, we want this traffic and we’re going to go create the service and send it through. Great. If you don’t, no problem.
But the answer on why not do the API is just these relationships are different, partners want to work in different ways. One of the things we are trying to do, and I’m really re-engaging Alexa, is we want to open SDKs. Basically, we want to open the product up for developers to come in and do what they want, come make it great. And if somebody asks to fix something in their house, we got it, we have a way to get you there.
So that implies a lot of things. Having tried to get a Miele dishwasher fixed in my life, it is expensive.
The repair person has to actually be on Thumbtack, they have to actually be using that service to actually book their appointments and take payments. That is not necessarily true, they might just be marketing there, but there’s a lot of things you have to know that you’re depending on that ecosystem to provide you to make Alexa just book a repair service professional for you. That’s the part where every time I talk to anybody about agentic systems I’m like, oh, this is where it falls in, payment is the other one. And the thing I’ve been calling it is just the DoorDash problem.
If you say, “order me some food,” and it goes and uses DoorDash for you or GrubHub or whatever, you’ve commoditized those service providers and you’ve started to crush their margins. And after a while, you might not want to be… Because they can’t upsell you anymore, they can’t sell you their subscription credits or whatever else they want to do. They can’t put advertising in front of you because the robot’s looking at their website, not a person. And I don’t know why they would participate in that unless you have actually solved this payment problem, to make that valuable to them.
I think the partnerships are unique for sure. I think it’s quite different. Remember, you always go back to your phone, the information’s there, it’s in the app. It’s not like we’re doing something on the side and doing it anonymously and you don’t have the customer info, I think is one thing. The second thing is when you have those challenging… Let’s use a Thumbtack example, let’s stick there for a minute. If you don’t have a Thumbtack account, the first time you do it’ll just pop a QR code and say, here, connect, authorize, go. And then forever then you’re going to fix things and Thumbtack’s going to push you through it. There are just some simple things that you can do that make the customer journey simple and gets you to those connection points. And once you do that, which is everything, God, you understand this, setup is everything, removing that barrier to entry. To make Alexa Plus great, you’ve got to share your contacts, you’re going to want to add your photos.
I think you’ve got to share your contacts. You’re going to want to add your photos. You’re going to want to connect your service providers. It’s a one-time kind of low barrier to entry go, and then you’re all in. And the partners, we don’t talk about the deals with the partners or anything like that, but there’s benefit on both sides. But at the end of the day, it’s the right thing for the customer. And I think there’s a lot of partners out there that believe in that same philosophy. Let’s get our customer to the endgame.
But if you run one — say you run food delivery service A. I won’t name names to keep them out of it. But if I run food delivery service A and I have a deal with you, and food delivery service B shows up and signs a deal with you, and I just ask Alexa to order some food, suddenly Alexa is in control of a lot of revenue.
Yeah. But you have preferences, customers have preferences, they know what to say.
Why would they have a preference over where the sandwich comes from, like what intermediary brings you the sandwich?
That’s their choice. You can’t speak for that. You can’t speak for it for the customer, but I would say they just have a choice, and they’ll get a choice.
And you’re going to express that choice on a screen?
I’m going to keep partners out of it for this, so I won’t give you the examples, but there will be simple ways to make it clear to the customer what they want.
The other part of this, which is equally complicated is partnerships, and that’s agentic stuff. And usually when I talk to people at agentic services, it’s to open the ecosystem to say, “Okay, we can browse the web for you. Now we have access to everything.” You are doing that in a much tighter way. You’re saying, “This is how we’re going to bring partners in.”
Why make that decision? Why not say, “We can just go browse the web and do whatever”?
I just think it’s right. It’s their business. And so, we’re seeing a lot of participation. There’s a lot of partners.
They’re excited, from what I see. Not all; I can’t speak for all of them. I’m not trying to talk in absolutes. But you have this moment where you’re like — the promise of Alexa is here. Ambient is here forever. They’ve all made skills in the past or they’ve done something that they didn’t get invoked. And it’s hard because the customer had to point and shoot as opposed to just speak in natural language; they had to know exactly what they were asking for. But at the end of the day, now you have a truth in: just speak, and something comes up. And now partners are like, “Well, if they’re looking for something from me, I’m in.” But I think it’s right to be partnering and not doing it another way.
Which I’m pumped about. We have a great biz dev team, it’s what they do.
So that’s asking Alexa to do something, and it goes off and does something in the world, right? It schedules a person or orders some food, it books a flight, great. Then there’s the stuff in your home, which Alexa has historically been very good at.
Turn the lights on and off, make a routine. I’m very intrigued by the idea of automating routine creation with natural language. Right? Make a bedtime routine for me. That is as messy as it gets, right?
No.
That’s not even partnerships. That’s Matter and Z-Wave and all.
We do it all before then. This one’s different. We already have partners that work with Alexa. If you already work with Alexa, you get the magic.
That’s it. It’s awesome. You saw it yesterday. There was no new code written on the partner side.
Really?
Nothing. I have my Govee lights at home right now that I put on the house. I’m just talking to them to change the color. That’s it. I would’ve never opened the app to change the color on my lights.
It just seems like the promise of the smart home forever, and this is what you’re describing, is that it will get more invisible.
This is what’s awesome, dude. Right?
It’s going to get more invisible.
You have to understand this is freaking awesome.
But I’m looking at the last five years, like, “Oh, this is more visible than ever.”
You have no idea how badass my team is. This team, now I’m talking Eero, Ring, Blink, Fire TV. This team, including Alexa, Kuiper, they’re incredible, man. They’re so damn capable. I’ve not seen invention like this. Now how we get it to the customer, we refine a little bit of that. But I’ve got to tell you, and this is a great example, because this works with the Alexa program and the thousands and thousands and thousands, dare I say, hundreds of thousands of things that work with Alexa. That is one of the largest connective tissues on the planet. It’s crazy. And they’ve set it up so well that now when Alexa Plus shows up, your routines are by voice done, like 100%, Nilay.
It’s so damn cool. The other day Mary was so frustrated with me, and I don’t have a smart home at my house in the Seattle area, but I use it in another area. And she was so frustrated with me. She’s like, “The lights are on all the time.” I just grabbed my app. I’m like, “Alexa, every night just turn off the lights outside at 10:00 PM and don’t turn them on again until 7:00 PM the next day. That was it.
The promise of some of the smart home standards that have made this messier, like Matter or Thread, is that you will be able to control these devices device-agnostic, right?
Yep. We’ll take advantage of those as well. Yep.
For example, everyone talks about the smart home only in the context of their own lived experiences.
Well, how do you not? What are you going to do?
It’s hard to be on track.
What story are you going to tell? I’ve got plenty of customer stories.
But my joke is that if a thing doesn’t show up in control center on my wife’s iPhone, it doesn’t exist. She’s not going to open an app. She’s going to swipe down and see that panel and that’s how we’re doing it. So you’ve got to bridge into that. The promise of something like Matter is, we’re going to see it across all of these surfaces. It’s all going to work together. Are you thinking that far ahead? Because where does the logic of my smart home live?
Especially if you’re talking about putting hardware with a screen centrally in your home. Okay, now you’ve got a little computer running your house. And everything should talk to that, and that’s where the logic should live.
In theory, but we also have the cloud to arbitrate. We have so many different methods in. You can use Matter, you can use Bluetooth LE sometimes. You can use Zigbee, but you can also —
Ring famously runs on Z-Wave all the time.
You can use Z-Wave. You can fundamentally use Works with Alexa, just plug them right in. There’s no limitation for us to connect these things, because basically we can orchestrate to it. The team has thought through it from every way to Sunday, but they’ve also been working on it for 10 years.
It’s phenomenal. It’s probably one of the things I’m most excited about, because you basically democratize the smart home, a hundred percent. Yes. It won’t work unless you gave someone a button on their phone today, but we just talked about this. You know where the job’s better? Just say what you want.
It’s a much better job to be done. I tried to do it with the music demo yesterday. I’m not sure it landed this point, which is like, just plug them in. The speakers were there. I’m going to move music to the speakers. I’m going to do it nuanced. I think one time I said “Move. I want to move the music. I want to hear the music. I want you to bring the music here.” I used different language so it wasn’t continuous. That was all real working. Probably those little nuances get lost on the natural language as if I had a direct command. I didn’t. It could have been any of those. Or play, which I try to stay away from. And so, it’s the same concept. You just think it and say it, think it and say it. It’s very powerful. And on smart home, it comes to life amazingly. And this is credit to an incredible team. They’ve thought it through.
Do you think that we’ll see more of an explosion of consumer smart? There’s big investments people.
I think so. I think this is the tipping point.
You’ve got to put a bunch of light switches in or buy all new light bulbs.
I think so. Tipping point, because you don’t have to be an expert. Just plug it in, that’s it, and then say something.
I want to believe you, but I’ve been burned so many times.
I don’t care if you believe me or not at this point.
I’m just saying.
When you get after it, man.
I’m ready to get the products. I’m ready to try.
You go get after it. It’s pretty fascinating. This is what an LM is great at. And then, the expert that we have to go rationalize and so it doesn’t have to be deterministic. And so, it’s pretty interesting.
By the way, it has to learn as well, so if you go, “Turn on that light.” “Which light?” “That one over there.” “Oh, you mean the one in the living room?” “Yeah.” “Okay.” Now that’s not a good example, because you’re up against a switch, which takes, is just go touch the switch. But how fast the system learns, that’ll never happen. It’ll never happen again. It’ll be like, “Oh, I know what he needs. He’s asking on this device and I got it. I know I’m turning on the light.”
What’s one thing you want Alexa Plus to do that it can’t do today?
I’ve shown you everything, but I’ll tell you, and if I can touch back to my Mary example, I want these moments to connect not only the home but the family. And it’s got some pretty amazing attributes. The idea that I can leave the house and leave a message and walk out the door, and then when Anastasia shows up downstairs, she gets the message, and it’s a lovely note from her dad with maybe a direction of what to do. The fact that it’s this totally natural language moment feels magical. Alexa is being proactive on your command, not intrusive, but you are asking her to be. When you start seeing those things, that’s the thing. That’s the thing I want it to be. Because you’re just going to connect deeper into people’s lives in a way that makes it better, that you know me well enough. I want you to use these products and tell me your life is better.
But there’s not a specific thing where you’re like, “I need the next turn of capability here.”
Look, I have a vision for where this thing goes. I can’t take you there. We’ve already revealed everything, and we’re going to preview in a month. And it’s like, I’m sure we tipped over a few carts yesterday, and so I’ve just got to be careful how far I take it. There’s so much for the future. But I showed you a few of my favorites. And that’s what we did. We narrowed it down. There’s thousands of things it does now.
Try to narrow it down to the ones that both told the story but are also most emotional to me, because that matters, what I’m presenting and I think sharing. And the biggest thing, you want the team to have pride in the best stuff they’ve created. And those moments are pride moments for the team.
There’s something I’m really curious about. I’ve asked basically everybody who has had something to do with Alexa about this for a decade. Amazon always calls Alexa she. For some reason this robot has a gender, and it’s a she, and it’s always a she. Why is Alexa gendered in this way?
There’s eight voices with Alexa Plus. I don’t think we talked about it yesterday. It’s in the blog post that we wrote. Not the blog post, the About Amazon post. I’m told I’m a dork when I say blog.
I run a blog. You can say blog.
Okay.
It just depends what you’re using. Pick your voice. But the default, the default, I use the default voice. I love the new voice.
You can use the old voice. I love the new voice. It’s the default. And then, you can pick a male voice or another voice, and you can call it what you want.
I just wondered. For a decade, you gendered this robot pretty real, honestly.
Yeah. It is. This voice, the more we’re using, I called her she yesterday. I understood that. I had a couple of people ask me, I’m like, “Well, that’s kind of how I was thinking about it.”
I don’t think it’s more complicated than that.
That makes sense. I don’t even mean to… I understand it’s a loaded time in American history we’re asking this question, but I actually don’t even mean it in that context. I just mean it’s a robot. It doesn’t actually have one of those. It’s only what we assign to it.
Yeah. I think look, it is, but it is getting more personal. It’s going to be more meaningful in your life.
For sure. Do you want people to think about it as a person in that way?
You don’t want to go all the way there, but yeah, I think it’s okay that you think you have another set of ears when you want them, another set of thinking if you need it. I think it’s quite powerful.
All right, last question. This is rolling out soon to some devices. I think it’s the screens, the Echo Show 15 and 21.
Yeah.
When is it going to hit everywhere?
Actually, the 8, 10, 15 and 21.
Eight, 10, 15, so the screens.
It’s rolling out next month starting with those devices, and it’ll be a gradual rollout. And then, it’ll roll out to all devices. If you want to be in first, my push is I want people using screen devices, for sure. We’re rolling it out there first because it’s such a great experience. You go get a device and you’re on the list, you’ll be first to get it. That’s basically it. If you already have a 10, a 15, a 21 and you subscribe, then we’ll get it out to you as well. That’s where we’re starting.
And it’ll light up the whole house, by the way.
Oh, the other Alexas?
Yeah.
So you have a screen, and it comes to your screen.
Yeah. Let’s say you have five Echoes at home right now, and you just go get a screen and it’ll light up your whole house.
Do you think you’ll drive a hardware cycle of people trying to buy screens to get Alexa?
I hope so. I think they should. And not because I want to sell another device, but I want people to have that experience. I think it’s a miss not to have it. It’s a miss.
I want to drive a cycle in the spirit of not trying to be sales, not my thing. But I will say if you want the best experience, go get a screen device. We’re pleased already. I didn’t expect… Pleased just seeing the reaction from yesterday. It’s nice to see. But I think in a month, people will get it in their hands, we’ll start the preview. Most features will be done, most. There’ll be a few that are coming later, for sure. And then, we’ll roll it out to everybody when it’s the right time.
Last question, you’ve said you’ve got a vision.
This is your third last question.
I know, but that’s how I do it.
I love this.
This is why I’m good at this. It’s tricky. Really, last question: You’ve laid out a vision for where you want to go. You’ve talked about the big opportunity here. I’ve asked you if you think LLMs are durable enough to pull all this off. You said they are.
Do you see this as a platform shift the way that other people have talked about it as a platform shift? Do you think we’re going to actually reconsider how we interact with computers at the biggest level, the way that touch screens did it, the way that mice and keyboards did it?
Not to be too cliche, I think 10 years ago was a magnificent moment when Alexa launched, 10 years and a couple months, but what a moment. It really was a reset. I think right now, 10 years later, I actually do think this is that next moment. But this one is, to your point, that promise. I think this is the shift. I think this is that time. It’s going to take years. This is not like, don’t worry, you’re not going to miss out. Somebody’s like, “Well, why are you so late?” I’m like, “Late? Do you know we’re just at the beginning?”
And by the way, our Roadmap is awesome. And I believe in this team, in their invention, and the company’s patience for invention, and its ability to make the big bet and stick with it. It not only creates an incredible future opportunity, but with that opportunity and bet and invention, you also have the moment right now is just starting. It’s literally just starting, dude. It’s just starting.
It’s a great future. It’s fantastic. And I think the home transforms forever starting now. But it takes time. It takes time. And I would say patience is one of the strongest qualities of Amazon. I had once heard infamously on a great leader, and I don’t know the quote, but our best overnight invention took seven years. It takes time. But right now, we’re here. 10 years later, here we are. And it’s the beginning of that next gen. I think it is a shift. Right this moment.
All right. No better place to end it, Panos. Thank you so much for being on Decoder.
Great to see you, Nilay.
Questions or comments about this episode? Hit us up at decoder@theverge.com. We really do read every email!
Amazon’s Panos Panay announced Alexa Plus last week, representing a renewed focus on software for the company’s troubled consumer electronics division.
The biggest surprise at Amazonâs press conference last week was the lack of hardware announcements. Traditionally, Amazon announces dozens of new gadgets at its events, but this time, Amazon spent 70 minutes talking about software. Specifically, Alexa Plus, its new generative AI-powered Alexa voice assistant. And that was exactly the right move.
Over the past decade, the company has spent way too much money building cheap hardware for Alexa that no one really likes, developing home robots and flying indoor cameras no one really needs, and wasting efforts on failed ways for people to interact with Alexa (the Loop, the Microwave, the Clock, and so on), all while the core technology itself stagnated.
But with the launch of Alexa Plus, Amazon has finally taken a big step toward that goal. The voice assistant has been âone hundred percent re-architected,” Amazonâs head of …
Amazon has been trying to make virtual assistants happen for more than a decade. Alexa is, by many definitions, wildly successful, but it has so far failed to become the kind of omnipresent, omnipotent helper the company imagines. (It has also, by all accounts, failed to become a compelling business for Amazon.) This week, though, Amazon launched the most ambitious version of Alexa yet, with new technology underneath and some big new ideas about how you might interact with AI.
On this episode of The Vergecast, we talk a lot about whatâs next for Alexa. David Imel â who you might know as the co-host of the Waveform podcast â joins the show to help us figure out what to make of Alexa Plus, and the whole idea that large language models can make virtual assistants both more useful and more accessible. Amazonâs description of Alexa Plus makes a lot of sense, and sounds pretty compelling, but we have reservations both about the user experience and about Amazonâs ability to actually pull this off.
Amazon is gearing up to launch new hardware to go along with its AI-upgraded Alexa. During an interview with Bloomberg, Amazon CEO Andy Jassy said the company has a “brand new lineup of devices that are coming in the fall that are beautiful.”
On Wednesday, Amazon revealed Alexa Plus, a more conversational version of the smart assistant that’s capable of performing a wider range of tasks, such as ordering an Uber or finding concert tickets. Amazon says “almost every” Alexa device will support Alexa Plus, save for older Echo generations. Alexa Plus will cost $19.99 per month but will be included with a subscription to Prime.
Though Jassy didn’t share any other details about the new devices, it seems like the company plans to put an even bigger focus on displays. When asked about Amazon’s next-generation hardware, Panos Panay, Amazon’s head of devices and services, told my colleague Jennifer Tuohy that, “I believe in screens. I think they matter in a massive way.”
The Echo Show 21, Amazon’s newest smart display, shows the new user interface for viewing your calendar, playing music, and other tasks.
Oh, Alexa, how you’ve changed. The long-awaited new Alexa, Alexa Plus, is set to bring a more conversational, context-aware, and capable assistant to your smart home. With a new voice (eight of them, in fact) and a new attitude, this is the biggest change to the voice assistant since it debuted in 2014. And it all sounds very impressive.
Announced at a press conference in New York City this week, Alexa Plus offers several new generative AI-powered abilities to help you manage your life, plus some major smart home upgrades. I was at the event and saw several staged demos of the new features but also got to try out some of the smart home improvements for myself.
The biggest change is how Alexa can respond to natural language; the demo showed that you can talk to it and say what you want rather than having to remember specific commands. I saw the new Alexa understand and execute commands such as, “Bring the lights up in here and set to a warm glow.” The GE Cync smart bulbs and light strips in the room responded, despite the request not including a room or specific names for each device.
Then, when instructed to “Turn on the lamp in the sitting area,” Alexa was apparently able to “reason” that meant the lamp named “Sofa lamp.” This should mean no more memorizing specific device names, making it easier for anyone in the home to control devices with their voice.
I was also able to talk to the assistant myself and try out its new ability to follow multiple commands at once without needing to repeat the wake word, Alexa. I asked it to dim the lights and “make it a little warmer.” The thermostat adjusted while the lights dimmed. Alexa said, “I dimmed the lights in the living room and increased the temperature by two degrees; is there anything else you need?” I then said, “Can you vacuum the floor?” It replied, “Okay,” and the Roomba started a job.
This should mean no more memorizing specific device names
Another new feature I tried was the ability to set up a smart home routine just by using voice. I told Alexa I’d been having trouble waking up recently, and after some back-and-forth, it created a “Good Morning” routine that set an alarm to wake me up to Taylor Swift’s “Shake It Off” and adjust the smart lights in my room.
That’s a fairly simple one because there weren’t a lot of devices connected to the Show. But Amazon says that, courtesy of its knowledge of hundreds of smart home APIs, Alexa is capable of creating more complex routines through voice. This should make it easier for people to do more with their connected devices and not have to spend time programming an app.
The other exciting upgrade is the new Echo Show UI. This is launching on the 15 and 21, but Scott Durham of Amazon told me it will come to the Show 8 and 10 at some point. With a cleaner, sleeker full-screen UI with larger widgets to take advantage of the screen size (I saw it demoed on the 21-inch screen), it’s now more customizable and feels more like a tablet interface than a smart display. During the demo, it appeared to move smoothly and quickly with limited lag.
The UI now has a much larger calendar and smart home widgets. A handy new feature is the ability to send images, documents, and notes to your alexa@alexa.com email address or through the Alexa app or new web interface. From there, Amazon says it can parse things like events and add them to your calendar, as well as let you ask questions about the information. Apparently, it can even decipher all the info in that lengthy school email and set reminders to tell you what you need to send in on which days.
The smart home control UI has been lifted from the excellent interface on the Echo Hub, giving you easy touch control of devices in your home when you don’t want to use voice or a smartphone. And when you take it full screen, it’s now much easier to switch between rooms and devices. Alexa’s Map View is here, too, and it looks great on the big screen.
A new Ring camera integration feature lets Alexa Plus query Ring’s Smart Video Search to show you summaries of events that happened around your house or pull specific instances like, “Did a package arrive?” or “Did someone let the dog out?”
Another big improvement coming with Alexa Plus is new cooking controls. Currently, following recipes on an Echo Show can be fiddly and frustrating; with Alexa Plus, the assistant gets more proactive. It can now take ingredients from recipes and add them to a shopping list, let you use natural language to add additional items, and arrange to have them delivered to you (with Whole Foods and Amazon Fresh as well as “several other grocery providers.”)
Alexa can also come up with recipes based on ingredients you tell it you have on hand and suggest substitutions for items you’re out of. The kitchen is one of the most useful places for hands-free voice control, and if this works well, I can see it being very helpful. I’m most excited about the new timer feature, which takes all the time-based steps in the recipe — such as whisk for two minutes or bake at 350 degrees for 35 minutes — and automatically sets them for you to kick off when you’re ready.
Aaron Rubenson of Amazon told me that thanks to Alexa’s improved natural language skills, it interfaces better with smart kitchen appliances. So, instead of having to use specific nomenclature to get it to preheat my Thermador oven, it should respond to any command that implies I want my oven on. For example, “Alexa, can you set the oven to the right temperature for this recipe?”
I’ve used Alexa for close to a decade now, and while it has its uses, it’s never felt indispensable. This is largely because of how tricky it is to talk to correctly. I’ve had to learn Alexa-speak to get it to do anything reliably, often making it more frustrating than useful. If the new Alexa can work as well in my home as it did in the demos I saw this week, this will be a major shift in home automation.
Alexa Plus pricing and availability
Amazon Alexa Plus costs $19.99 a month and is included in Prime membership. It will be available via an early access program in late March, in the US only, to customers with an Echo Show 8, 10, 15, or 21.
It’s also accessible in a new Alexa app and at Alexa.com. Amazon says it will come to other Echo devices, including Echo Buds and Echo Frames, and will be compatible with Fire TVs and Fire tablets.
NEW YORK—After teasing it in September 2023 and reportedly suffering delays, Amazon today announced that its more capable and conversational version of Alexa will start rolling out to US Prime members for free in the next few weeks.
Those who aren't Prime subscribers will be able to get Alexa+ for $20 a month. Amazon didn't provide a specific release date but said availability would start with the Echo Show 8, 10, 15, and 21 smart displays.
Amazon is hoping Alexa+ will be a lifeline for its fledgling voice assistant business that has failed to turn a profit. Alexa has reportedly cost Amazon tens of billions of dollars over the years. Although Alexa is on 600 million purchased devices, per remarks CEO Andy Jassy made at a press conference on Wednesday, it's primarily used for simple tasks that don't generate much money, like checking the weather. Exacerbating the problem, generative AI chatbots are a new, shinier approach to AI assistants that have quickly outperformed what people could do with today’s Alexa.