Two authors sued OpenAI, accusing the company of violating copyright law. They say OpenAI used their work to train ChatGPT without their consent.

  • phx@lemmy.ca
    link
    fedilink
    English
    arrow-up
    6
    arrow-down
    1
    ·
    1 year ago

    If you’re doing research, there are actually some limits on the use of the source material and you’re supposed to be citing said sources.

    But yeah, there’s plenty of stuff where there needs to be a firm line between what a random human can do versus an automated intelligent system with potential unlimited memory/storage and processing power. A human can see where I am in public. An automated system can record it for permanent record. An integrated AI can tell you detailed information about my daily activities including inferences which - even if legal - is a pretty slippery slope.

    • jecxjo@midwest.social
      link
      fedilink
      English
      arrow-up
      4
      ·
      1 year ago

      a firm line between what a random human can do versus an automated intelligent system with potential unlimited memory/storage and processing power.

      I think we need a better definition here. Is the issue really the processing power? Do we let humans get a pass because our memories are fuzzy? From your example you’re assuming massive details are maintained in the AI situation which is typically not the case. To make the data useful it’s consumed and turned into something useful for the system.

      This is why I’m worried about legislation and legal precedent. Most people think these AI systems read a book and store the verbatim text off somewhere to reference when that isn’t really the case. There may be fragments all over, and it may be able to reconstitute the text, but we don’t seem to have the same issue with data being synthesized in a similar way with a human brain.

      • phx@lemmy.ca
        link
        fedilink
        English
        arrow-up
        2
        ·
        1 year ago

        A continuous record of location + time or even something like “license plate at location plus time” is scary enough to me, and that’s easily data a system could hold decades of