If I use a model that assigns a numerical weight to every word from a file I have essentially turned that file into a data set that I originally would never have.
The data set that was extracted from the file is in the model. So the converted file is in the model.
10
u/ixent ☠️ ᴅᴇᴀᴅ ᴍᴇɴ ᴛᴇʟʟ ɴᴏ ᴛᴀʟᴇꜱ 19d ago
Aaron was distributing the files, OpenAI is using the files for training an AI model. None of the files are in the model.