OpenAI has built a text watermarking method to detect ChatGPT-written content

PenisDuckCuck9001@lemmynsfw.com · 1 month ago

OpenAI has built a text watermarking method to detect ChatGPT-written content

CameronDev@programming.dev · 1 month ago

The Ars Technica article speculated it was more of a word-pattern thing.

I think it’s a lie: either it doesn’t exist, or it doesn’t work anywhere near as well as they claim. Or it’s incredibly easy to bypass.

https://arstechnica.com/ai/2024/08/openai-has-the-tech-to-watermark-chatgpt-text-it-just-wont-release-it/

deadcade@lemmy.deadca.de · 1 month ago

Research on this topic exists, and it is possible to alter the output of an LLM in minor ways that statistically “watermark” the results without drastically changing the quality of the output. OpenAI has probably implemented this in ChatGPT.

https://www.youtube.com/watch?v=2Kx9jbSMZqA

I think the tool exists, and is (at least close to) as good as they claim it is. They can’t release it, because once the public can tell with high accuracy whether ChatGPT wrote some text, another AI can be developed to circumvent detection from this method, making the tool useless.
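To make the statistical watermark idea above concrete, here is a minimal toy sketch of one approach from the public research: a "green list" scheme, where a secret key and the previous token pseudo-randomly mark part of the vocabulary as preferred, and the detector counts how often the text lands on preferred tokens. This is an assumption for illustration only, not OpenAI's actual method, which has not been published. The names `is_green`, `generate`, `z_score`, the key `demo-key`, and the 50% green fraction are all hypothetical, and the "model" here just picks random words.

```python
import hashlib
import math
import random

GREEN_FRACTION = 0.5  # assumed share of the vocabulary marked "green" at each step


def is_green(prev_token: str, token: str, key: str = "demo-key") -> bool:
    """Pseudo-randomly put `token` on the green list, seeded by the key and previous token."""
    digest = hashlib.sha256(f"{key}|{prev_token}|{token}".encode()).digest()
    return digest[0] / 256 < GREEN_FRACTION


def generate(vocab, length, bias, rng, key="demo-key"):
    """Toy stand-in for an LLM: uniform random tokens, nudged toward green ones.

    With probability `bias` the next token is drawn only from the green list;
    otherwise it is drawn from the whole vocabulary.
    """
    tokens = ["<s>"]
    for _ in range(length):
        candidates = vocab
        if rng.random() < bias:
            green = [t for t in vocab if is_green(tokens[-1], t, key)]
            candidates = green or vocab  # fall back if the green list is empty
        tokens.append(rng.choice(candidates))
    return tokens[1:]


def z_score(tokens, key="demo-key"):
    """How many standard deviations the green-token count sits above chance."""
    prev, hits = "<s>", 0
    for tok in tokens:
        hits += is_green(prev, tok, key)
        prev = tok
    n = len(tokens)
    expected = GREEN_FRACTION * n
    std = math.sqrt(n * GREEN_FRACTION * (1 - GREEN_FRACTION))
    return (hits - expected) / std


if __name__ == "__main__":
    rng = random.Random(0)
    vocab = [f"w{i}" for i in range(50)]
    print("watermarked z:", z_score(generate(vocab, 200, 0.9, rng)))
    print("plain z:      ", z_score(generate(vocab, 200, 0.0, rng)))
```

Only someone holding the key can run the detector, and paraphrasing the text reshuffles which tokens follow which, so the score drifts back toward chance. That fits the worry about circumvention once a detector is public.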

MagicShel@programming.dev · edit-2 1 month ago

Am I the only one who rewrites most of ChatGPT’s output into my own words because its “voice” is garbage anyway? I ask it to write me a cover letter and that gives me a rough outline and some points to make, but I have to do massive editing to avoid redundancy, awkward phrasing, outright lies, etc.

I can’t imagine turning in raw ChatGPT output. One of my developers used Bing AI to write code and submitted that shit raw, and it was immediately obvious because some relatively simple code had really weird artifacts, like overwriting a value that had no reason to even be touched.

ValenThyme@reddthat.com · edit-2 1 month ago

I use it to make outlines, which are usually very good, and then I use the class materials to flesh out the outlines in my own words. All my words, but ChatGPT told me what to include and in what order.

MagicShel@programming.dev · 1 month ago

That’s valid. And I’d be surprised if that could be watermarked.

JeeBaiChow@lemmy.world · 1 month ago

Lol. AI gonna take over the developers’ jobs. Like that’s even close to happening.

Thorny_Insight@lemm.ee · 1 month ago

A few years ago the output of GPT was complete gibberish, and a few years before that even producing such gibberish would’ve been impressive.

It doesn’t take anyone’s job until it does.

Angry_Autist (he/him)@lemmy.world · 1 month ago

LLMs aren’t going to take coding jobs; there are specific-case AIs being trained for that. They write code that works but does not make sense to human eyes. It’s fucking terrifying but EVERYONE just keeps focusing on the LLMs.

There are at least 2 more dangerous model types being used right now to influence elections and manipulate online spaces and ALL everyone cares about is their fucking parrot bots…

Mobilityfuture@lemmy.world · 1 month ago

Please elaborate for the uneducated

Angry_Autist (he/him)@lemmy.world · 1 month ago

https://www.bbc.com/news/business-54348456#:~:text=But Palantir's rise has been shadowed by concerns,right to privacy and is ripe for abuse.

Mobilityfuture@lemmy.world · 1 month ago

Thanks, great read. Appreciate it. That was one example but you mentioned two - are you thinking of some of the broader disinformation applications in addition to the data gathering mentioned?

Angry_Autist (he/him)@lemmy.world · edit-2 1 month ago

Look, I don’t want to waste your time, so let me tell you this is a subject I have been concerned about, researched, coded for, and posted about since the 90s: mass manipulation via AI.

You can be as pedantic and nit-picky as you want, it really doesn’t matter to me. AI is the second greatest existential threat we face as a species. If you haven’t already been convinced at least to some degree of its danger, nothing I say will change your mind anyway.

The most dangerous AI manifestation right now is sentiment identification and control; the second is autonomous armed robots.

Mobilityfuture@lemmy.world · 1 month ago

Thanks, my dude. I was just asking you an honest question. Appreciate the information.

JackbyDev@programming.dev · 1 month ago

As someone who fiddled with Stable Diffusion, which also has optional invisible watermarks, this is a good feature. The point is to let AI training skip content that marks itself as AI-generated. If people want to hide that their content is AI-generated then, sadly, it’s harder to detect.

Todd Bonzalez@lemm.ee · 1 month ago

Watermarking everything I digitally publish to keep my original content out of a training set.

Publishing a website full of de-watermarked AI slop to ruin future LLMs.

arthurpizza@lemmy.world · 1 month ago

It’s a good thing that ChatGPT is only one of the many LLMs to choose from.

nickwitha_k (he/him)@lemmy.sdf.org · 1 month ago

I’m inclined to believe that they’re throwing all prompts and outputs into a db and searching that.

TerkErJerbs@lemm.ee · 1 month ago

It’s probably some type of cipher, which will take people exactly one (1) afternoon to crack.

BearOfaTime@lemm.ee · 1 month ago

A whole afternoon or just a portion?

OpenAI has built a text watermarking method to detect ChatGPT-written content — company has mulled its release over the past year