• stephen01king@piefed.zip
    link
    fedilink
    English
    arrow-up
    0
    ·
    5 days ago

    Was there a court case where the decision was that pirated data is legally allowed to be used for LLM training?

    • far_university1990@reddthat.com
      link
      fedilink
      English
      arrow-up
      0
      ·
      5 days ago

      https://www.cnet.com/tech/services-and-software/meta-won-its-ai-fair-use-lawsuit-but-judge-says-authors-are-likely-to-often-win-going-forward/

      Meta’s use of copyrighted books to trains its Llama AI was fair use, a judge ruled.

      “This ruling does not stand for the proposition that Meta’s use of copyrighted materials to train its language models is lawful,” he wrote. “It stands only for the proposition that these plaintiffs made the wrong arguments and failed to develop a record in support of the right one.”

      The plaintiffs focused their arguments on how Meta’s AI models can reproduce exact snippets from their works and how the company’s Llama models hurt their ability to license their books to AI companies. These arguments weren’t as compelling in Chhabria’s eyes – he called them “clear losers” – so he sided with Meta.

      That’s different from the Anthropic ruling, where Judge William Alsup focused on the “exceedingly transformative” nature of the use of the plaintiff’s books in the results AI chatbots spit out. Chhabria wrote that while “there is no disputing” that the use of copyrighted material was transformative, the more urgent question was the effect AI systems had on the ecosystem as a whole.

      Maybe? Not lawyer, but sound like train might fair use? And generate not?

      • stephen01king@piefed.zip
        link
        fedilink
        English
        arrow-up
        0
        ·
        5 days ago

        But that judgement clearly had nothing to do with the use of pirated material, right? It might give a partial pass to the use of copyrighted material for training LLM, but it says nothing about pirating material being legal if it is used for training LLM, which the top comment was alluding to.

    • i_stole_ur_taco@lemmy.ca
      link
      fedilink
      English
      arrow-up
      0
      ·
      5 days ago

      It’s 2026 in the worst timeline. You don’t ask that anymore. You ask if any entity faced consequences for doing it.

      • stephen01king@piefed.zip
        link
        fedilink
        English
        arrow-up
        0
        ·
        5 days ago

        If you don’t know, where did you get the idea it would be okay to pirate books if it is used to train an LLM?