OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling’s Harry Potter series::A new research paper laid out ways in which AI developers should try and avoid showing LLMs have been trained on copyrighted material.

  • SMITHandWESSON@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    ·
    edit-2
    1 year ago

    What the article is explaining is cliff notes or snippets of a story. Isn’t that allowed in some respect? People post notes from school books all the time, and those notes show up in Google searches as well.

    I totally don’t know if I’m right, but doesn’t copyright infringement involve plagiarism like copying the whole book or writing a similar story that has elements of someone else’s work?

    • dantheclamman@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 year ago

      I don’t know what’s considered fair use here. But the point is it’s taking words that aren’t theirs, which will deprive websites of traffic because then people won’t click through to the source article.

      • SMITHandWESSON@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        edit-2
        1 year ago

        Ok I get now. I can definitely see both sides of the argument, and it’s not going to be easy to solve.

        Copyright law needs to be updated to deal with all the new ways people and companies are using tech to access copyrighted material.