That claim doesn’t prove your premise. I get that it feels clever, but it isn’t.
Just because they’re very good at reproducing information from highly pared down and compressed forms does not mean they are not reproducing information. If that were true, you wouldn’t be able to enforce copyright on a jpeg photo of a painting.
If it was a compression algorithm then it would be insanely efficient and that’d be the big thing about it. The simple fact is that they aren’t able to reproduce their exact training data so no, they aren’t storing it in a highly compressed form.
They are physically unable to just copy paste stuff. The models are tiny compared to the training data, they don’t store it.
That claim doesn’t prove your premise. I get that it feels clever, but it isn’t.
Just because they’re very good at reproducing information from highly pared down and compressed forms does not mean they are not reproducing information. If that were true, you wouldn’t be able to enforce copyright on a jpeg photo of a painting.
If it was a compression algorithm then it would be insanely efficient and that’d be the big thing about it. The simple fact is that they aren’t able to reproduce their exact training data so no, they aren’t storing it in a highly compressed form.