• ILikeTraaaains@lemmy.world
    20 hours ago

    It has been a long time since I needed to download Wikipedia, but it is very easy. AFAIK Wikimedia publishes backups (dumps) of all its wiki sites.

    The weird file sizes are just because the files are compressed SQL and XML. It may be a bit more complex to run Wikipedia locally, but the content itself is easy to retrieve.

    The only issue is that SQL and XML are plain text files, and plain text compresses very well, so a 30 GB backup can easily become 100 GB uncompressed.
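
    If disk space is the worry, one option is to read the compressed file as a stream instead of unpacking it first. A rough sketch, assuming the dump is a bz2-compressed XML file with page and title elements (the function name and the file name in the usage comment are made up for illustration, I have not checked them against a current dump):

```python
import bz2
import xml.etree.ElementTree as ET


def iter_titles(dump_path):
    """Yield page titles from a bz2-compressed XML dump without unpacking it to disk.

    Assumption: the dump contains <page> elements, each with a <title> child.
    """
    with bz2.open(dump_path, "rb") as fh:
        # iterparse reads the decompressed stream incrementally,
        # so the full uncompressed file never has to exist on disk.
        for _event, elem in ET.iterparse(fh, events=("end",)):
            # Tags may carry a namespace prefix like "{...}title"; strip it.
            tag = elem.tag.rsplit("}", 1)[-1]
            if tag == "title":
                yield elem.text
            elif tag == "page":
                # Drop the finished page's text and children to limit memory use.
                elem.clear()


# Hypothetical usage (the file name is a placeholder):
# for title in iter_titles("some-wiki-dump.xml.bz2"):
#     print(title)
```

    This only trades disk space for CPU time, since the file is decompressed again on every read, and it does nothing for the SQL files.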