What other uses does self driving data have industry wide for training?
And most data that OpenAI used is much older, even if it was scraped recently. You also are mistakenly under the impression that all the data in datacenters (per your link) is able to used. It is not. There is a reason most companies don't put proprietary data into public models.
Let me ask you this. You claim to be an investor right? What's the core AI product you invested in? Because I get the feeling you are very far from the actual product and don't actually know how the sausage is made. I'm no expert but I work in the data industry and see a lot more of this than your average person.
It is clear you are no expert. Your questions in this area ar child like. Data is NOT locked to the segment it is created in. It gets aggregated and used in ways you will never see. If you think Bank data is only used by banks and retail data only by retailers, you really are foolish.
I am not an expert either but i do associate regularly with people who are. i was very involved with an AI incubator program at one of the top Universities as an entrepreneur in residence, in this area, when AI was really emerging. I can text right now a person considered in the Top5 fathers of Ai and he will text me back almost immediately.
And again you are simply wrong in thinking these large AI data sets are mostly older legacy data and they are not also drawing from the much more abundant data that has ramped up in the last 5 years, such as from eBanking, Wearable device data, Fitness data, Retail data, auto data, Smart home data, smart phone data, Social Media data, et, etc, etc,
ALL OF THIS DATA is being aggregated and compiled and used in these huge database sets and we have entered a new Data age that is vastly eclipsing what we had prior.
And yet here you are arguing the data of the future will not elipse the historical data despite the FACT 90% of todays data was developed in the last two years. And data creation is GROWING year over ear at a fast rate and here you are still saying 'nope the historical data will dominate for the foreseeable future'.
Your position is laughably ignorant. Not worthy of serious discussion.