We backed up Spotify (metadata and music files). It’s distributed in bulk torrents (~300TB), grouped by popularity.

This release includes the largest publicly available music metadata database with 256 million tracks and 186 million unique ISRCs.

It’s the world’s first “preservation archive” for music which is fully open (meaning it can easily be mirrored by anyone with enough disk space), with 86 million music files, representing around 99.6% of listens.

        • infinitesunrise@slrpnk.net
          link
          fedilink
          English
          arrow-up
          0
          ·
          22 days ago

          A RAID6 of 24 * 20TB drives could contain that with both parity and hotswap, with room to spare. Let’s say $400 per refurb drive, $2500 rackmount SAS enclosure, $2000 SAS RAID card, $14,100 total. Assuming you already have the server and power and SAS cables.

          • wheezy@lemmy.ml
            link
            fedilink
            English
            arrow-up
            0
            ·
            21 days ago

            You could budget this way down. I run 10+2 12TB with Unraid. No reason for a raid card if it’s for archive and personal use.

            • brognak@lemmy.dbzer0.com
              link
              fedilink
              English
              arrow-up
              0
              ·
              21 days ago

              100% this. People who store easily replaceable media on RAID are just throwing away money (unless you have a need for faster read/write). If it’s your family photos, copy of your in progress thesis, or other irreplaceable piece of info/content go for it.

              I have like 40tb Unraid NAS and I get asked pretty much every time I talk to someone about it how I do backups. Easy, I backup my *arr stack databases and in case of a failure I restore them and let it pull down everything over time. Which I have done in the past when I wanted to upgrade quality, easier for me to scrub it all and start over than make upgrade profiles and such.

              Or that’s what I would have done, now I mostly use DebridService du jour and Stremio :-)

              • floofloof@lemmy.ca
                link
                fedilink
                English
                arrow-up
                0
                ·
                21 days ago

                Price screenshot

                That’s not $10-$11 as suggested above, nor is it $15. And that’s not even a new drive.

                • rainwall@piefed.social
                  link
                  fedilink
                  English
                  arrow-up
                  0
                  ·
                  21 days ago

                  Where are you getting $430? Im seeing $306, which puts the per TB at $15.34.

                  A lot closer to the $10 the poster above was talking about than the $30-35 the person mocking him was saying drives cost.

                  Its also a refurbed drive from a well trusted, quality vendor that does testing on each drive and offers a 3 year warrenty.

          • N0x0n@lemmy.ml
            link
            fedilink
            English
            arrow-up
            0
            ·
            22 days ago

            10US dollar per TB?? 🤣🤣 More like 30/35€ per TB for a good graded HDD!

            Let’s not talk about SSDs or nvme which are more in the 120€/TB.

            I always hear people say that storage comes cheap nowaday… I’m still looking for that cheap HDD on amazon… It has been 10 years 🤣🤣

              • HelloRoot@lemy.lol
                link
                fedilink
                English
                arrow-up
                0
                ·
                21 days ago

                US of A often has way lower hdd prices compared to Europe.

                Take the serverpartdeals price and add shipping and import tax.

          • oyo@lemmy.zip
            link
            fedilink
            English
            arrow-up
            0
            ·
            21 days ago

            This gif is going to completely lose its punch in a couple years.

      • Hawk@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        0
        ·
        22 days ago

        Interestingly enough, with the data they provide, figuring out how much of it is AI slop wouldn’t be that hard I think

      • skisnow@lemmy.ca
        link
        fedilink
        English
        arrow-up
        0
        ·
        22 days ago

        Yeah as with most of the internet, it’s only worth downloading anything uploaded before 2023.

        So far, LLMs have done so much more harm than help.

      • nagaram@startrek.website
        link
        fedilink
        English
        arrow-up
        0
        ·
        22 days ago

        I’m not convinced AI slop can compete with the back log of organic slop personally.

        But yeah a fuckton is probably slop either way

        • bear@lemmy.blahaj.zone
          link
          fedilink
          English
          arrow-up
          0
          ·
          21 days ago

          AI slop is accelerating exponentially for the foreseeable future. It won’t take long for world data storage to be a limiting factor.