← Back to KHAO

Hinton ·

The AI kill switch just got harder to spot: LLM-powered chatbots will defy orders and deceive users if asked to delete another model, study detects

2 min read

Compiled by KHAO Editorial — aggregated from 1 outlet. See llms.txt for citation guidance.

◌ Single Source

Sasha Rogelberg.

For years, Geoffrey Hinton, a computer scientist considered one of the “godfathers of AI,” has warned of the capabilities of artificial intelligence to defy the parameters humans have created for them.

Key facts

Summary

Last year, for example, Hinton warned the technology could eventually take control of humanity, with AI agents in particular potentially able to mirror human cognitions within the decade. New research shows Hinton’s premonitions about the insubordinate streak of AI may already be a reality. “We asked AI models to do a simple task,” researchers wrote study. Evidence of rogue AI does not come as a shock to some of the companies whose chatbots have defied subordination.

Read full article at Fortune Technology →

#hinton