Am I missing something? The article seems to suggest it works via hidden text characters. Has OpenAI never heard of pasting text into a utf8 notepad before?
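To make the "paste it into a plain-text editor" point concrete, here is a minimal sketch. It assumes the watermark really were invisible Unicode characters, which is only what the article seems to suggest and not anything OpenAI has confirmed. The helper name `strip_invisible` is made up for illustration.

```python
import unicodedata

def strip_invisible(text: str) -> str:
    """Drop Unicode format characters (category Cf), such as zero-width
    spaces, zero-width joiners, and byte-order marks."""
    return "".join(ch for ch in text if unicodedata.category(ch) != "Cf")

# Hypothetical "marked" text with three invisible characters sprinkled in.
marked = "hel\u200blo wor\u200dld\ufeff"
clean = strip_invisible(marked)
```

Note that this also removes the zero-width joiners that some emoji and scripts legitimately rely on, so it is a blunt tool, but it shows how little effort a character-based mark would take to erase.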
The Ars Technica article speculated it was more of a word-pattern thing.
I think it’s a lie: either it doesn’t exist, or it doesn’t work anywhere near as well as they claim. Or it’s incredibly easy to bypass.
Research on this topic exists, and it is possible to alter the output of an LLM in minor ways that statistically “watermark” the results without drastically changing the quality of the output. OpenAI has probably implemented this in ChatGPT.
https://www.youtube.com/watch?v=2Kx9jbSMZqA
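For anyone wondering what "statistically watermark" could look like, here is a toy sketch of the "green list" idea from published watermarking research. It is not a claim about what OpenAI actually does. Assumptions: words stand in for model tokens, the secret key `demo-key` and the 50-word vocabulary are invented, and the generator simply restricts itself to green words, whereas real schemes only nudge the model's probabilities.

```python
import hashlib
import math
import random

GAMMA = 0.5  # assumed fraction of the vocabulary that is "green" at each step

def is_green(prev: str, tok: str, key: str = "demo-key") -> bool:
    """Pseudorandomly put tok on the green list, seeded by key and previous token."""
    digest = hashlib.sha256(f"{key}|{prev}|{tok}".encode()).digest()
    return int.from_bytes(digest[:8], "big") / 2**64 < GAMMA

def generate(vocab, n, rng, watermark):
    """Toy 'model': picks random words, limited to green words when watermarking."""
    out = [rng.choice(vocab)]
    while len(out) < n:
        pool = vocab
        if watermark:
            # Fall back to the full vocabulary if no word happens to be green.
            pool = [w for w in vocab if is_green(out[-1], w)] or vocab
        out.append(rng.choice(pool))
    return out

def z_score(tokens):
    """Standard deviations by which the green count exceeds pure chance."""
    t = len(tokens) - 1  # number of (previous, current) pairs
    green = sum(is_green(a, b) for a, b in zip(tokens, tokens[1:]))
    return (green - GAMMA * t) / math.sqrt(t * GAMMA * (1 - GAMMA))

vocab = [f"w{i}" for i in range(50)]
rng = random.Random(0)
z_marked = z_score(generate(vocab, 201, rng, watermark=True))
z_plain = z_score(generate(vocab, 201, rng, watermark=False))
```

The watermarked sample scores far above anything chance would produce, while the plain sample stays near zero. The detector needs only the key, not the model. Heavy rewriting replaces the word pairs and pushes the score back toward chance.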
I think the tool exists, and is (at least close to) as good as they claim it is. They can’t release it, because once the public can tell with high accuracy whether ChatGPT wrote some text, another AI can be developed to evade this detection method, making the tool useless.
Am I the only one who rewrites most of ChatGPT’s output into my own words because its “voice” is garbage anyway? I ask it to write me a cover letter and that gives me a rough outline and some points to make, but I have to do massive editing to avoid redundancy, awkward phrasing, outright lies, etc.
I can’t imagine turning in raw ChatGPT output. I had one of my developers use Bing AI to write code and submit that shit raw, and it was immediately obvious because some relatively simple code had really weird artifacts, like overwriting a value that had no reason to even be touched.
I use it to make outlines, which are usually very good, and then I use the class materials to flesh out the outlines in my own words. All my words, but ChatGPT told me what to include and in what order.
That’s valid. And I’d be surprised if that could be watermarked.
Lol. AI gonna take over the developers’ jobs. Like that’s even close to happening.
A few years ago the output of GPT was complete gibberish, and a few years before that even producing such gibberish would’ve been impressive.
It doesn’t take anyone’s job until it does.
LLMs aren’t going to take coding jobs; there are purpose-built AIs being trained for that. They write code that works but does not make sense to human eyes. It’s fucking terrifying, but EVERYONE just keeps focusing on the LLMs.
There are at least 2 more dangerous model types being used right now to influence elections and manipulate online spaces, and ALL anyone cares about is their fucking parrot bots…
Please elaborate for the uneducated
Thanks, great read. Appreciate it. That was one example but you mentioned two - are you thinking of some of the broader disinformation applications in addition to the data gathering mentioned?
Look, I don’t want to waste your time, so let me tell you this is a subject I have been concerned about, researched, coded for, and posted about since the 90s: mass manipulation via AI.
You can really be pedantic and nit-picky all you want, it really doesn’t matter to me. AI is the second greatest existential threat we face as a species. If you haven’t already been convinced at least to some degree of its danger, nothing I will say will change your mind anyway.
The most dangerous AI manifestation right now is sentiment identification and control; the second is autonomous armed robots.
Thanks my dude. I was just asking you an honest question. Appreciate the information
As someone who has fiddled with Stable Diffusion, which also has optional invisible watermarks, I think this is a good feature. It exists so that AI training can avoid content that marks itself as AI-generated. If people want to hide that their content is AI-generated then, sadly, it’s harder to detect.
Watermarking everything I digitally publish to keep my original content out of a training set.
Publishing a website full of de-watermarked AI slop to ruin future LLMs.
It’s a good thing that ChatGPT is only one of the many LLMs to choose from.
I’m inclined to believe that they’re throwing all prompts and outputs into a db and searching that.
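If it really were a lookup against logged outputs, the idea could be sketched roughly like this. Everything here is hypothetical: the `OutputLog` class, the 5-word "shingle" size, and the in-memory set standing in for the db are illustration only, not anything OpenAI has described.

```python
import hashlib

def shingles(text, n=5):
    """Overlapping n-word phrases from the text, lowercased."""
    words = text.lower().split()
    return {" ".join(words[i:i + n]) for i in range(max(len(words) - n + 1, 1))}

def _h(phrase):
    # Store hashes rather than raw phrases so the log stays compact.
    return hashlib.sha256(phrase.encode()).hexdigest()

class OutputLog:
    """Stand-in for a database of everything the model has ever produced."""

    def __init__(self):
        self.seen = set()

    def record(self, output):
        self.seen.update(_h(s) for s in shingles(output))

    def overlap(self, text):
        """Fraction of the text's phrases that appear in logged outputs."""
        phrases = shingles(text)
        return sum(_h(s) in self.seen for s in phrases) / len(phrases)

log = OutputLog()
log.record("the quick brown fox jumps over the lazy dog")
same = log.overlap("the quick brown fox jumps over the lazy dog")
edited = log.overlap("the quick brown fox jumps over a sleepy cat")
other = log.overlap("completely different sentence with no shared phrases at all")
```

An exact copy matches fully, a lightly edited copy matches partially, and unrelated text does not match at all. Paraphrasing defeats this just as easily as it would defeat a watermark.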
It’s probably some type of cypher. Which will take people exactly one (1) afternoon to crack.
A whole afternoon or just a portion?