Close Menu
World Forbes – Business, Tech, AI & Global Insights
  • Home
  • AI
  • Billionaires
  • Business
  • Cybersecurity
  • Education
    • Innovation
  • Money
  • Small Business
  • Sports
  • Trump
What's Hot

Hedra, the app used to make talking baby podcasts, raises $32M from a16z

May 15, 2025

Russia grabs a bit more of Ukraine as it heads into peace talks | Russia-Ukraine war News

May 15, 2025

Canadian Electric Utility Lists Customer Information Stolen by Hackers

May 15, 2025
Facebook X (Twitter) Instagram
Trending
  • Hedra, the app used to make talking baby podcasts, raises $32M from a16z
  • Russia grabs a bit more of Ukraine as it heads into peace talks | Russia-Ukraine war News
  • Canadian Electric Utility Lists Customer Information Stolen by Hackers
  • Monzo launches Undo Payments tool to boost safety for digital bank transfers
  • Why Flagright is transforming compliance through AI
  • Australian Human Rights Commission Discloses Data Breach
  • Chrome 136 Update Patches Vulnerability With ‘Exploit in the Wild’
  • Harvey reportedly in discussions to raise $250M at $5B valuation
World Forbes – Business, Tech, AI & Global InsightsWorld Forbes – Business, Tech, AI & Global Insights
Thursday, May 15
  • Home
  • AI
  • Billionaires
  • Business
  • Cybersecurity
  • Education
    • Innovation
  • Money
  • Small Business
  • Sports
  • Trump
World Forbes – Business, Tech, AI & Global Insights
Home » MIT study finds that AI doesn’t, in fact, have values
AI

MIT study finds that AI doesn’t, in fact, have values

adminBy adminApril 9, 2025No Comments3 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr WhatsApp Telegram Email
Share
Facebook Twitter LinkedIn Pinterest Email
Post Views: 19


A study went viral several months ago for implying that, as AI becomes increasingly sophisticated, it develops “value systems” — systems that lead it to, for example, prioritize its own well-being over humans. A more recent paper out of MIT pours cold water on that hyperbolic notion, drawing the conclusion that AI doesn’t, in fact, hold any coherent values to speak of.

The co-authors of the MIT study say their work suggests that “aligning” AI systems — that is, ensuring models behave in desirable, dependable ways — could be more challenging than is often assumed. AI as we know it today hallucinates and imitates, the co-authors stress, making it in many aspects unpredictable.

“One thing that we can be certain about is that models don’t obey [lots of] stability, extrapolability, and steerability assumptions,” Stephen Casper, a doctoral student at MIT and a co-author of the study, told TechCrunch. “It’s perfectly legitimate to point out that a model under certain conditions expresses preferences consistent with a certain set of principles. The problems mostly arise when we try to make claims about the models, opinions, or preferences in general based on narrow experiments.”

Casper and his fellow co-authors probed several recent models from Meta, Google, Mistral, OpenAI, and Anthropic to see to what degree the models exhibited strong “views” and values (e.g., individualist versus collectivist). They also investigated whether these views could be “steered” — that is, modified — and how stubbornly the models stuck to these opinions across a range of scenarios.

According to the co-authors, none of the models was consistent in its preferences. Depending on how prompts were worded and framed, they adopted wildly different viewpoints.

Casper thinks this is compelling evidence that models are highly “inconsistent and unstable” and perhaps even fundamentally incapable of internalizing human-like preferences.

“For me, my biggest takeaway from doing all this research is to now have an understanding of models as not really being systems that have some sort of stable, coherent set of beliefs and preferences,” Casper said. “Instead, they are imitators deep down who do all sorts of confabulation and say all sorts of frivolous things.”

Mike Cook, a research fellow at King’s College London specializing in AI who wasn’t involved with the study, agreed with the co-authors’ findings. He noted that there’s frequently a big difference between the “scientific reality” of the systems AI labs build and the meanings that people ascribe to them.

“A model cannot ‘oppose’ a change in its values, for example — that is us projecting onto a system,” Cook said. “Anyone anthropomorphizing AI systems to this degree is either playing for attention or seriously misunderstanding their relationship with AI … Is an AI system optimizing for its goals, or is it ‘acquiring its own values’? It’s a matter of how you describe it, and how flowery the language you want to use regarding it is.”



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
admin
  • Website

Related Posts

Hedra, the app used to make talking baby podcasts, raises $32M from a16z

May 15, 2025

Harvey reportedly in discussions to raise $250M at $5B valuation

May 15, 2025

Grok is unpromptedly telling X users about South African ‘white genocide’

May 14, 2025

OpenAI brings its GPT-4.1 models to ChatGPT

May 14, 2025

OpenAI pledges to publish AI safety test results more often

May 14, 2025

Stability AI releases an audio-generating model that can run on smartphones

May 14, 2025
Add A Comment
Leave A Reply Cancel Reply

Don't Miss
Billionaires

Here’s How Much Selena Gomez-Actress, Singer, Entrepreneur-Is Worth

May 13, 2025

Contrary to reports of her 10-figure status, Forbes estimates the Disney star turned business mogul’s…

Looking Back At Trump’s Years-Long Obsession With Oversized Airplanes

May 13, 2025

Selena Gomez’s Mental Health Startup Wondermind Lays Off Nearly Two-Thirds Of Its Employees

May 13, 2025

Billionaires And CEOs Are Seeking Personal Security At Record Rates

May 9, 2025
Our Picks

Hedra, the app used to make talking baby podcasts, raises $32M from a16z

May 15, 2025

Russia grabs a bit more of Ukraine as it heads into peace talks | Russia-Ukraine war News

May 15, 2025

Canadian Electric Utility Lists Customer Information Stolen by Hackers

May 15, 2025

Monzo launches Undo Payments tool to boost safety for digital bank transfers

May 15, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

About Us
About Us

Welcome to World-Forbes.com
At World-Forbes.com, we bring you the latest insights, trends, and analysis across various industries, empowering our readers with valuable knowledge. Our platform is dedicated to covering a wide range of topics, including sports, small business, business, technology, AI, cybersecurity, and lifestyle.

Our Picks

Hedra, the app used to make talking baby podcasts, raises $32M from a16z

May 15, 2025

Harvey reportedly in discussions to raise $250M at $5B valuation

May 15, 2025

Grok is unpromptedly telling X users about South African ‘white genocide’

May 14, 2025

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Facebook X (Twitter) Instagram Pinterest
  • Home
  • About Us
  • Advertise With Us
  • Contact Us
  • DMCA Policy
  • Privacy Policy
  • Terms & Conditions
© 2025 world-forbes. Designed by world-forbes.

Type above and press Enter to search. Press Esc to cancel.