Sometimes, following instructions too precisely can land you in hot water — if you’re a large language model, that is.

That’s the conclusion of a new, Microsoft-affiliated scientific paper that examined the “trustworthiness” — and toxicity — of large language models (LLMs), including OpenAI’s GPT-4 and its predecessor, GPT-3.5.

The co-authors write that, possibly because GPT-4 is more likely to follow the instructions of “jailbreaking” prompts that bypass the model’s built-in safety measures, GPT-4 can be more easily prompted than other LLMs to spout toxic, biased text.

In other words, GPT-4’s good “intentions” and improved comprehension can —...