• fartsparkles@lemmy.world
    1 month ago

    If this passes, piracy websites can rebrand as AI training material websites and we can all run a crappy model locally to train on pirated material.

  • AfricanExpansionist@lemmy.ml
    1 month ago

    Obligatory: I’m anti-AI, mostly anti-technology

    That said, I can’t say that I mind LLMs using copyrighted materials that they access legally/appropriately (lots of copyrighted content may be freely available to some extent, like news articles or song lyrics).

    I’m open to arguments correcting me. I’d prefer to have another reason to be against this technology, not arguing on the side of frauds like Sam Altman. Here’s my take:

    All content created by humans follows consumption of other content. If I read lots of Vonnegut, I should be able to churn out prose that roughly (or precisely) includes his idiosyncrasies as a writer. We read more than one author; we read dozens or hundreds over our lifetimes. Likewise musicians, film directors, etc etc.

    If an LLM consumes the same copyrighted content and learns how to copy its various characteristics, how is it meaningfully different from me doing it and becoming a successful writer?

    • droplet6585@lemmy.ml
      1 month ago

      “and learns how to copy its various characteristics”

      Because you are a human. Not an immortal corporation.

      I am tired of people trying to have iNtElLeCtUaL dIsCuSsIoN about/with entities that would feed you feet first into a wood chipper if they thought they could profit from it.

    • Pennomi@lemmy.world
      1 month ago

      Right. The problem is not that it consumes the information; the problem is when the user uses it to violate copyright. It’s just a tool, after all.

      Like, I’m capable of violating copyright in infinitely many ways, but I usually don’t.

      • SoulWager@lemmy.ml
        1 month ago

        The problem is that the user usually can’t tell if the AI output is infringing someone’s copyright or not unless they’ve seen all the training data.

  • NewOldGuard@lemmy.ml
    1 month ago

    Oh no, not the plagiarism machine, however would we recover???

    Please fail and die, OpenAI, thx

    Also, copyright is bullshit and IP shouldn’t exist, especially for corporate entities. Free sharing of human knowledge and creativity should be a right. Machine plagiarism to create uninspired mimicries isn’t a necessary part of that process and should be regulated heavily.