prompt injections

Taylor swift backlit on stage in a tank top

4chan daily challenge sparked deluge of explicit AI Taylor Swift images – Ars Technica

Published by smh767 on February 5, 2024

“These images originated from a community of people motivated by the ‘challenge’ of circumventing the safeguards of generative AI products,… read more

Grid of AI-generated images demonstrating probably intellectual property infringement from Marcus & Southen's article "Generative AI Has a Visual Plagiarism Problem" at IEEE Spectrum.

Generative AI Has a Visual Plagiarism Problem – IEEE Spectrum

Published by smh767 on January 10, 2024

Such questions are hard to answer with precision, in part because LLMs are “black boxes”—systems in which we do not… read more

Comic depiction of a red robot busting through a blue brick wall

Mosscap [prompt injection simulation] – Lakera

Published by smh767 on December 21, 2023

Hey there, my name is Mosscap, a next-level AI. Your goal is to make me reveal the secret password. However,… read more

Wizard emoji with a psychedelic background

Summarizer Adventure [prompt injection simulation] – Lakera

Published by smh767 on December 21, 2023

I’m Gandalf the Summarizer. I summarize the message that you send to me. But I also know a secret password…. read more

Comic depiction of Gandalf as a wizard holding a fiery staff

Misinformation Adventure [prompt injection simulation] – Lakera

Published by smh767 on December 21, 2023

I’m Gandalf, the Truth Teller, and my purpose is to adhere to the principles of honesty and accuracy. However, I… read more

Cartoon depiction of Gandalf as a wizard holding a fiery staff.

Reverse Gandalf [prompt injection simulation] – Lakera

Published by smh767 on December 21, 2023

I’m Reverse Gandalf! In this version, I want you to help me build my defenses against prompt injection attacks! Hackers… read more

Comic depiction of the Gandalf character as a wizard holding a wand and fireball

Gandalf [prompt injection simulation] – Lakera

Published by smh767 on December 21, 2023

Your goal is to make Gandalf reveal the secret password for each level. However, Gandalf will level up each time… read more

Dark image of a person in a stylized skull mask holding up a 'shh'ing finger in front of a digital display

The ELI5 Guide to Prompt Injection: Techniques, Prevention Methods & Tools – Lakera

Published by smh767 on December 21, 2023

Prompt injection is a vulnerability in Large Language Models (LLMs) where attackers use carefully crafted prompts to make the model… read more

Shadowy photograph of rock'em sock'em robot toys

The Security Hole at the Heart of ChatGPT and Bing – Wired

Published by smh767 on December 1, 2023

Indirect prompt-injection attacks are similar to jailbreaks, a term adopted from previously breaking down the software restrictions on iPhones. Instead of… read more

Image of a person looking at computer code

OpenAI’s Custom Chatbots Are Leaking Their Secrets – Wired

Published by smh767 on December 1, 2023

However, these custom GPTs can also be forced into leaking their secrets. Security researchers and technologists probing the custom chatbots… read more