Anthropic says its most advanced systems may be learning not just to reason, but to reflect internally on how they reason.

Why it matters: These introspective capabilities could make the models safer — or, possibly, just better at pretending to be safe.

The big picture: The models are able to answer questions about their internal states with surprising accuracy.

• "We're starting to see increasing signatures or instances of models exhibiting sort of cognitive functions that, historically, we think of as things that are very human," says Anthropic researcher Jack Lindsey, who studies models' "brains."

• "Or at least involve some kind of sophisticated intelligence," Lindsey tells Axios.

Driving the news: Anthropic says its top-tier model, Claude Opus, and its faster, cheaper sibling,
