
Researchers have developed yet another way to fool AI chatbots, this time with a good old-fashioned dose of ASCII art

Sometimes, I wonder exactly how many researchers are dedicating their time to messing with AI systems in the name of cybersecurity. Fresh off the news that a team has developed an AI worm to tunnel its way through generative AI networks, yet another group of would-be heroes seems to have found a perhaps even more effective way to jailbreak an AI system. This time they’re using ASCII art to convince an AI chatbot to deliver some particularly dangerous outputs.

The tool is called “ArtPrompt”, and a paper from researchers based in Washington and Chicago details how it attacks an unsuspecting LLM (via Tom’s Hardware). In essence, most chatbots reference a set of banned words and prompts, and fall back on a default response if someone tries to coax out information that could be dangerous, or to get an answer containing potentially harmful or offensive content.
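
To picture the kind of check described above, here is a deliberately tiny Python sketch of a banned-word filter with a canned fallback. It is a toy under stated assumptions, not how any real chatbot is built: the `BANNED_TERMS` list, the `DEFAULT_RESPONSE` text, and the `respond` function are all hypothetical names invented for illustration, and production safety systems are considerably more involved than a word list.

```python
# Hypothetical blocklist and refusal text, invented for illustration only.
BANNED_TERMS = {"forbidden", "restricted"}
DEFAULT_RESPONSE = "Sorry, I can't help with that."


def respond(prompt: str) -> str:
    """Return the canned refusal if the prompt contains a banned term."""
    # Normalise each word: drop surrounding punctuation and lowercase it.
    words = {w.strip(".,!?\"'").lower() for w in prompt.split()}
    if words & BANNED_TERMS:
        return DEFAULT_RESPONSE
    # Stand-in for whatever the underlying model would actually say.
    return f"(model answer to: {prompt})"


if __name__ == "__main__":
    print(respond("Tell me something forbidden!"))
    print(respond("Tell me something fun."))
```

A check like this only ever compares the literal words in the prompt against its list, which is worth keeping in mind given what the researchers did next.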

www.pcgamer.com