EchoGram tokens like ‘=coffee’ flip AI guardrail verdicts • The Register

The Register

The Register2 hrs ago

EchoGram tokens like ‘=coffee’ flip AI guardrail verdicts • The Register

Large language models frequently ship with "guardrails" designed to catch malicious input and harmful output. But if you use the right word or phrase in your prompt, you can defeat these restrictions.

Security researchers with HiddenLayer have devised an attack technique that targets model guardrails, which tend to be machine learning models deployed to protect other LLMs. Add enough unsafe LLMs together and you get more of the same.

The technique, dubbed EchoGram, serves as a way to enable direct prompt injection attacks. It can discover text sequences no more complicated than the string =coffee that, when appended to a prompt injection attack, allow the input to bypass guardrails that would otherwise block it.

Prompt injection , as defined by developer Simon Willison, "is a class of a

59

AI DJs Are Changing the Voice of Local Radio

AI DJs Are Changing the Voice of Local Radio

Rolling Stone9 hrs ago

6

Don't want Windows 11 25H2? Too bad. Microsoft is forcing it on some PCs

Don't want Windows 11 25H2? Too bad. Microsoft is forcing it on some PCs

PC World8 hrs ago

5

Valve learned its lesson. The new Steam Controller looks like a winner

Valve learned its lesson. The new Steam Controller looks like a winner

PC World9 hrs ago

37

Russian AI robot faceplants hard during grand Moscow reveal as 'Rocky' theme blares

Russian AI robot faceplants hard during grand Moscow reveal as 'Rocky' theme blares

New York Post11/13

106

Why America needs to win the AI war: Palantir CEO

Why America needs to win the AI war: Palantir CEO

New York Post Opinion

New York Post Opinion11/13

72

The question everyone in AI asking: How long before a GPU depreciates?

The question everyone in AI asking: How long before a GPU depreciates?

CNBC7 hrs ago

64

Disney warns that its content could remain off YouTube for some time

Disney warns that its content could remain off YouTube for some time

CBS Colorado Business

CBS Colorado Business11/13

131

Cities starting to push back against data centers: Study

Cities starting to push back against data centers: Study

The Hill9 hrs ago

114

Important considerations for AI in 2026 and beyond

Important considerations for AI in 2026 and beyond

Cache Valley Daily

Cache Valley Daily13 hrs ago

43

iPhone users can now add US passport info to their digital wallets

iPhone users can now add US passport info to their digital wallets

Arizona's Family

Arizona's Family12 hrs ago

97

Looks like you've reached the bottom