• ristoril_zip@lemmy.zip
    11 months ago

    It’s literally impossible for them to not be “analyzing” all the sounds they (perhaps briefly) record.

    [Sound] --> [Record] --> [Analyze for keyword] --> [Perform keyword action] OR [Delete recording]

    Literally all sounds, literally all the time. And we just trust that they delete them and don’t send them “anonymized” to be used for training the audio recognition algorithms or LLMs.

    • bdonvr@thelemmy.club
      11 months ago

      It is possible to analyze the traffic leaving these devices, and AFAIK it hasn’t been shown that they are doing this.

    • Solemn@lemmy.dbzer0.com
      11 months ago

      The way that “Hey Alexa” or “Hey Google” works is by, like you said, constantly analyzing the sounds the device picks up. However, the audio is only analyzed locally for the specific wake phrase, and it is stored in a circular buffer of a few seconds so the device can keep your whole request in memory. If the phrase is not detected, the buffer is continuously overwritten and nothing is sent to the server. If the phrase is detected, the whole request is sent to the server, where more advanced voice recognition can be done.
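To make the buffer idea concrete, here is a rough sketch of that loop. It is a toy model, not how any vendor actually implements it: plain strings stand in for audio frames, an exact string match stands in for the on-device wake-word model, and the names `WAKE_WORD`, `BUFFER_FRAMES`, and `run_device` are made up for illustration.

```python
from collections import deque

WAKE_WORD = "alexa"   # hypothetical wake phrase; real devices match audio, not text
BUFFER_FRAMES = 5     # hypothetical buffer capacity ("a few seconds" of frames)


def run_device(frames, buffer_frames=BUFFER_FRAMES, request_frames=3):
    """Simulate local wake-word detection over a stream of frames.

    Returns the list of uploads; each upload is the wake word plus the
    `request_frames` frames that followed it. Everything else is overwritten
    in the ring buffer and never leaves the device.
    Assumes buffer_frames >= request_frames + 1.
    """
    ring = deque(maxlen=buffer_frames)  # oldest frames are dropped automatically
    uploads = []
    pending = None  # frames still to capture after a wake word, or None if idle

    for frame in frames:
        ring.append(frame)
        if pending is None:
            if frame == WAKE_WORD:      # local check only, nothing is sent yet
                pending = request_frames
        else:
            pending -= 1

        if pending == 0:
            # Request is complete: send wake word + request, then forget it.
            uploads.append(list(ring)[-(request_frames + 1):])
            ring.clear()
            pending = None

    return uploads


stream = ["tv", "noise", "alexa", "play", "some", "jazz", "private", "chat"]
print(run_device(stream))
```

In this toy run, only the frames from the wake word through the end of the request end up in `uploads`; the frames before and after are overwritten or left sitting in the buffer and never sent.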

      You can very easily monitor the traffic from your smart speaker to see if this is true. So far I’ve seen no evidence that this has stopped being the common practice, though I’ll admit to not reading the article, so maybe this has changed recently.

      • uzay@infosec.pub
        11 months ago

        If they were to also listen for a set of predefined product-related keywords, they could take note of any matches and inconspicuously send that info to their servers without sending any audio recordings. It doesn’t have to be as precise as voice command recognition either; it’s just ad targeting.

        Not saying they do that, but I believe they could.