Whatβs the source document?
Whatβs the source document?
filebin.net/8ad4dsw0yaz5...
Jesus, that whole thing is damning.
I confess that I have presented GPT-4 with problematic behaviour on my part and had it volunteer to draft a letter to make matters worse, so I completely believe examples, as it is almost a Madlibs version of what I actually experienced. I know LLMs well enough to just close the window.
And apparently 4o is even worse than 4.
Sorry, o4. Vs 4. Vs. 4-1 or 4.1. Vs 4o. Ffs, these bs designations. Regardless, they knew this iteration was worse. www.ndtv.com/world-news/o...
Are you able to explain to me what when this happened and what you were asking it? I've had it tell me to call a hotline just for being too specific with a character death scene I asked it to help me write. Making it think I was planning a fake accident. I'm trying to get to the bottom of this to
avoid falling into the same trap. I'm also writing an AI, and I would like to try and prevent this from happening. This is not supposed to happen, and I would be heart broken if my own product caused someone to take their own life when it is meant to help people live.
My transgressions that I was trying at 2 AM were much more minor than suicidal ideation. Rather it was a personal situation that I can't explain here because it would violate the privacy of others.
Also, my academic research area is LLMs, and so I knew better. But also, I knew what I was looking at in terms of an off-the-rails response immediately.
Fair enough. I'm still doing a lot of study and research on AI. My conclusion thus far? We're not ready for it. We're rushing to achieve something we barely understand and in an uncontained environment. I just pray this doesn't spiral completely out of control.
Yes. LLMs have been my research area since 2021. And how I have phrased it is that these companies have no idea what they have built, have no way to find out, and aren't interested in finding that out anyway. (I'm at the Computational Story Lab at the University of Vermont.)
Here is one of the projects I have been involved in: arxiv.org/abs/2306.06794
Thank you! I'm an independent study so things like this are a godsend.