Amazon uses audio fingerprints to prevent mentions of ‘Alexa’ in broadcast media from triggering devices

Amazon has published some interesting information about how it prevents Alexa devices from activating when their wake word is mentioned in movies, TV shows, ads, radio, and more. An audio fingerprinting system identifies and stores individual media mentions of Alexa, which are then used to determine when the wake word should be ignored. This is done both in the cloud, to quietly turn Alexa devices back off after a media trigger, and locally on Alexa devices themselves, to completely prevent devices from waking up in the first place.

Every time an Alexa device hears its wake word, that mention is compared to digital fingerprints of known media mentions of Alexa. A small subset of media mentions, such as the upcoming Alexa Super Bowl ad, is stored on the Alexa device itself. If a wake word mention matches the subset of locally stored fingerprints, the Alexa device never even reacts to the wake word. Alexa devices cannot check against all media fingerprints locally, due to device CPU limits, so the rest are handled in the cloud.
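To make the two-tier check concrete, here is a minimal Python sketch of the flow described above. It is not Amazon's implementation. The names `fingerprint`, `LOCAL_FINGERPRINTS`, `CLOUD_FINGERPRINTS`, and `handle_wake_word` are hypothetical, and an exact SHA-256 hash stands in for a real audio fingerprint, which would have to tolerate noise and distortion.

```python
import hashlib


def fingerprint(audio: bytes) -> str:
    # Stand-in only: a real audio fingerprint matches noisy, distorted
    # recordings; an exact hash matches byte-identical input only.
    return hashlib.sha256(audio).hexdigest()


# Hypothetical data: a small on-device subset and a larger cloud-side set.
LOCAL_FINGERPRINTS = {fingerprint(b"super-bowl-ad")}
CLOUD_FINGERPRINTS = LOCAL_FINGERPRINTS | {fingerprint(b"older-tv-spot")}


def handle_wake_word(audio: bytes) -> str:
    fp = fingerprint(audio)
    # Tier 1: the device checks its small local subset and never wakes on a match.
    if fp in LOCAL_FINGERPRINTS:
        return "ignored_on_device"
    # Tier 2: the device has woken up; the cloud checks the full set
    # and silently turns the device back off on a match.
    if fp in CLOUD_FINGERPRINTS:
        return "woke_then_cancelled_by_cloud"
    # No match anywhere: treat it as a genuine customer request.
    return "handled_as_request"
```

Calling `handle_wake_word(b"super-bowl-ad")` takes the on-device path, while `handle_wake_word(b"older-tv-spot")` wakes the device and is then cancelled by the cloud check.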

If a known media mention of Alexa is heard that isn’t among the locally stored subset, the Alexa device will react to the wake word, but then silently turn back off once Amazon’s cloud servers identify it as a match to a known media mention fingerprint. Where things get really interesting is how Amazon is building their database of known media mentions.

When Amazon’s cloud servers receive an Alexa mention, that wake word audio is compared to a fraction of the other wake words that came in around the same time. If the audio matches requests from at least two other customers, it’s identified as a media mention and added to the database of audio fingerprints to ignore.
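The matching rule above can be sketched in a few lines of Python. This is an illustration under assumptions, not Amazon's code: the class `MediaMentionDetector`, the 30-second window, and the use of exact string equality between fingerprints are all made up for the example. The article only says "around the same time" and "a fraction of other wake words", and the sampling of that fraction is left out here.

```python
class MediaMentionDetector:
    """Flags a wake word as media once enough other customers send the same audio."""

    def __init__(self, window_seconds: float = 30.0, min_other_customers: int = 2):
        self.window_seconds = window_seconds            # assumed value
        self.min_other_customers = min_other_customers  # "at least two other customers"
        self.recent = []          # (timestamp, customer_id, fingerprint)
        self.known_media = set()  # fingerprints to ignore from now on

    def observe(self, timestamp: float, customer_id: str, fp: str) -> bool:
        """Record one wake word event; return True if it is a media mention."""
        # Keep only events that arrived "around the same time".
        self.recent = [r for r in self.recent
                       if timestamp - r[0] <= self.window_seconds]
        # Count distinct OTHER customers who sent matching audio.
        others = {cid for _, cid, f in self.recent
                  if f == fp and cid != customer_id}
        self.recent.append((timestamp, customer_id, fp))
        if fp in self.known_media:
            return True
        if len(others) >= self.min_other_customers:
            # Grow the database of fingerprints to ignore.
            self.known_media.add(fp)
            return True
        return False
```

With the defaults, the first two customers who send a given fingerprint are treated as normal requests, and the third arrival within the window marks it as media. Repeats from a single customer never count, since only distinct other customers are tallied.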

From the sound of things, a live broadcast or first airing of media that mentions Alexa is much more likely to trigger your Alexa devices, because its fingerprint has not been collected yet. Reruns, on-demand, or time-shifted content that includes Alexa mentions is more likely to already be in Amazon’s audio fingerprint collection and be correctly ignored by your Alexa devices. Amazon’s article about this system makes no mention of the alternate wake words, Computer, Echo, and Amazon, so it seems like a device set to one of those will be more likely to be triggered by media.

Follow me on Twitter (@elias) and Instagram (@esaba) to see what I'm up to.


6 comments
  1. Bm says:

    I watch the show “Alexa & Katie” on Netflix (it has 2 seasons). It sure wakes up Alexa on my Dot enough with, “Hmm. I don’t know that one” or whatever she says.

  2. Michael says:

    Laughable. The echo dot lit up last night while watching Arrow. Alexa was not even said.

  3. Michael says:

    My Dot turns on almost every single time the word Alexa is used, even on the Amazon commercials. It seems like they have some more work to do! My Google Assistant never turns on since you have to say “ok” or “hey” before saying “google”. Amazon should turn that option on for us to use as an alternative, then all the research and development they are doing to not have it wake during a commercial wouldn’t be needed.

  4. George says:

    My friend’s girlfriend’s name is Alexa. It sucks, because his damn echo has now become a point of contention in their relationship. She had her name before meeting him and he owned the echo before meeting her. He tried using different wake words for his girlfriend, but that didn’t go over very well. I think the chick needs to just get over herself because Alexa is here to stay!
