cross-posted from: https://lemmy.ca/post/37011397

!opensource@programming.dev

The popular open-source VLC video player was demonstrated on the floor of CES 2025 with automatic AI subtitling and translation, generated locally and offline in real time. Parent organization VideoLAN shared a video on Tuesday in which president Jean-Baptiste Kempf shows off the new feature, which uses open-source AI models to generate subtitles for videos in several languages.

  • WalnutLum@lemmy.ml
    12 hours ago

    It’s actually using whisper.cpp

    From the README:

    Memory usage

    Model    Disk      Mem
    tiny     75 MiB    ~273 MB
    base     142 MiB   ~388 MB
    small    466 MiB   ~852 MB
    medium   1.5 GiB   ~2.1 GB
    large    2.9 GiB   ~3.9 GiB

    Those are the model sizes

    • Eagle0110@lemmy.world
      8 hours ago

      Oh wow, those are pretty tiny memory requirements for a decent modern system! That’s actually very impressive! :D

      Many people can probably even run this on older media servers or even just a plain NAS! That’s awesome! :D
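
For anyone wondering which model an older box could handle, here is a rough sketch that picks the largest model fitting a given amount of free memory. The figures are copied from the memory table quoted above and rounded to MB; the function name, the `headroom` factor, and the idea of leaving 20% of memory free are my own assumptions for illustration, not anything from whisper.cpp or VLC.

```python
# Approximate runtime memory per model in MB, taken from the table quoted
# above. The GB/GiB rows are rounded to MB (an assumption of this sketch).
MODEL_MEM_MB = {
    "tiny": 273,
    "base": 388,
    "small": 852,
    "medium": 2100,
    "large": 3900,
}


def largest_model_that_fits(free_mem_mb, headroom=0.8):
    """Return the biggest model whose memory need fits the budget, or None.

    `headroom` is a hypothetical safety factor: only this fraction of the
    free memory is treated as usable by the model.
    """
    budget = free_mem_mb * headroom
    fitting = [name for name, mem in MODEL_MEM_MB.items() if mem <= budget]
    if not fitting:
        return None
    # Pick the candidate with the highest memory requirement.
    return max(fitting, key=MODEL_MEM_MB.get)


if __name__ == "__main__":
    for free in (256, 2048, 8192):
        print(free, "MB free ->", largest_model_that_fits(free))
```

This only compares numbers from the table; whether a given NAS can keep up in real time depends on its CPU as well, which the table does not cover.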