• ExLisper@lemmy.curiana.net
    link
    fedilink
    English
    arrow-up
    3
    arrow-down
    6
    ·
    2 days ago

    What AI are you talking about? Are you suggesting the commercial models from OpenAI are trained using CP? Or just that there are some models out there that were trained using CP? Because yeah, anyone can create a model at home and train it with whatever. But suggesting that OpenAI has a DB of tagged CP is a different story.

    • surewhynotlem@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      1 day ago

      Open AI just scours the Internet. 100% chance it’s come across someone illegal and horrible. They don’t pre-approve its training data.

      • ExLisper@lemmy.curiana.net
        link
        fedilink
        English
        arrow-up
        3
        arrow-down
        4
        ·
        1 day ago

        But you have to describe it. It doesn’t just suck in images at random. I imagine someone will remove CP when the images are reviewed. Or do you think they just download all images and add them to the training set without even looking at them?

        • surewhynotlem@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          24 hours ago

          I think that’s exactly what they do. Curation at the quantities that they’re working at would require an army.

          • ExLisper@lemmy.curiana.net
            link
            fedilink
            English
            arrow-up
            1
            ·
            15 hours ago

            So you think to train AI you just show it random images without describing what they represent and AI just magically learns? If I then ask AI to create an image of a computer, how does it know what a computer is? Does it just learn this on it’s own from all the random images?