Nvidia releases its own brand of world models

bariscan | 69 points

> Nvidia wouldn’t say where this training data came from, but at least one report — and lawsuit — alleges that the company trained on copyrighted YouTube videos without permission.

Under the EU's AI act[1] there is now a legal obligation to disclose the source of the training data. Is this correct then that either the models cannot be used in the EU, or we'll get to know where the training data came from?

[1]: https://oeil.secure.europarl.europa.eu/oeil/en/procedure-fil... - "General-purpose AI systems, and the GPAI models such as ChatGPT they are based on, must meet certain transparency requirements including compliance with EU copyright law and publishing detailed summaries of the content used for training."

thih9 | 10 hours ago

Why go to techcrunch and not directly to their only source of information on this? There are also some actual technical details there.

https://www.nvidia.com/en-us/ai/cosmos/

_0ffh | 10 hours ago

What stops Nvidia from cutting out the middlemen? They have the chips.

actionfromafar | 10 hours ago

I have given up on AI folks using a scientific definition of “world model,” yet I am still amazed at how viciously dishonest NVIDIA is being here:

  “Cosmos learns just like people learn,” the spokesperson said. “To help Cosmos learn, we gathered data from a variety of public and private sources and are confident our use of data is consistent with both the letter and spirit of the law. Facts about how the world works — which are what the Cosmos models learn — are not copyrightable or subject to the control of any individual author or company.”
Cosmos definitely does not learn facts about how the world works! This is just a ridiculous lie. It accumulates a bunch of data hopefully demonstrating how the world works, and hopefully some facts fall out of it. Given that this failed completely for Sora, which obviously knows nothing about physics, I am confident that Cosmos also knows nothing. It has no facts, just data. And unless they somehow integrated touch sensors it doesn’t even get physical data the same way toddlers do. So “learns just like people learn” is also a lie.

Some AI hype is people getting ahead of themselves and believing their own marketing. But here NVIDIA is just lying their asses off, presumably to stoke investor hype, but also because they’re trying to monetize a bunch of copyrighted data they stole. These are bad people.

aithrowawaycomm | 9 hours ago

The AI race is one of the most impressive examples of Capitalism making the market efficient. Or at least I've witnessed in my life.

We went from Google having complete control. To Open AI releasing GPT2 which really inspired a lot of people to try it. Then GPT3+ convinced the world to try it.

After that, Gemini, LLaMa, every type of fine-tune... The noteworthy thing is that LLaMA was good enough that ChatGPT had competition. Then within 1 year of that, we have a dozen companies with models that are good enough.

The competition has been the best type of brutal.

resource_waste | 10 hours ago
[deleted]
| 9 hours ago