OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling’s Harry Potter series

A new research paper laid out ways in which AI developers should try to avoid showing that LLMs have been trained on copyrighted material.

  • FMT99@lemmy.world
    1 year ago

    But wouldn’t this training and the subsequent output be so transformative that being based on the copyrighted work makes no difference? If I read a Harry Potter book and then write a story about a boy wizard who becomes a great hero, anyone trying to file a copyright strike against it would be laughed at.

    • Sentau@lemmy.one
      1 year ago

      Your probability of getting a copyright strike depends on two major factors:

      • How similar your story is to Harry Potter.

      • Whether you are making money off that story.

      • uis@lemmy.world
        1 year ago

        It doesn’t matter how similar it is. Copyright doesn’t protect meaning; copyright protects form. If you read HP and then draw a picture of it, said picture becomes its own separate work, not even a derivative one.