← Back to KHAO

Open Source · Anthropic · OpenAI · Google · AGI ·

For example, Goodfire found one neuron inside the open-source model Qwen 3 that was associated with the so-called trolley problem

2 min read

Compiled by KHAO Editorial — aggregated from 1 outlet. See llms.txt for citation guidance.

◌ Single Source

hand with pliers poking at a belt attached to a complicated mess of valves and switches.

Activating this neuron changed the model’s responses, making it frame its outputs as explicit moral dilemmas.

Key facts

Summary

The San Francisco–based startup Goodfire released a new tool, called Silico, that lets researchers and engineers peer inside an AI model and adjust its parameters—the settings that determine a model’s behavior —during training. Goodfire claims Silico is the first off-the-shelf tool of its kind that can help developers debug all stages of the development process, from building a data set to training a model. The company says its mission is to make building AI models less like alchemy and more like a science. “We saw this widening gap between how well models were understood and how widely they were being deployed,” Goodfire’s CEO, Eric Ho, tells MIT Technology Review in an exclusive chat ahead of Silico’s release.

Read full article at MIT Technology Review →

#Open Source #Anthropic #OpenAI #Google #AGI