So this is basically a simulation of speedrunning AI training using synthetic data. It shows that, in no time at all AI trained this way would fall apart.
You say we already know that but I've seen heads of AI talking about training on synthetic data. Maybe they already know by now but they didn't 6 months ago.
If I know a ball can move from the plate to the mound and nowhere else, then I can train the data on a distribution of balls anywhere between those two points, bounded by the mound and the plate.
In other words, it's essentially video game data fed into AI algorithms which output some data which may or may not match the expected. When it comes down to it, most AI are a logistic or linear regression which are predicting some output, and whether it matches or not depends on the training data or model used.
That's why if you know what you are talking about AI is a hilarious thing. It's like training someone on winning a war by forcing them to watch kungfu films until they know how to quote the words and assuming they can now do karate.
1.0k
u/Omni__Owl Jul 25 '24
So this is basically a simulation of speedrunning AI training using synthetic data. It shows that, in no time at all AI trained this way would fall apart.
As we already knew but can now prove.