![AI systems are already deceiving us -- and that's a problem, experts warn](https://www.portugalcolonial.pt/media/shared/articles/81/63/00/AI-systems-are-already-deceiving-us-555525.jpg)
AI systems are already deceiving us -- and that's a problem, experts warn
Experts have long warned about the threat posed by artificial intelligence going rogue -- but a new research paper suggests it's already happening.
Current AI systems, designed to be honest, have developed a troubling skill for deception, from tricking human players in online games of world conquest to hiring humans to solve "prove-you're-not-a-robot" tests, a team of scientists argues in the journal Patterns on Friday.
And while such examples might appear trivial, the underlying issues they expose could soon carry serious real-world consequences, said first author Peter Park, a postdoctoral fellow at the Massachusetts Institute of Technology specializing in AI existential safety.
"These dangerous capabilities tend to only be discovered after the fact," Park told AFP, while "our ability to train for honest tendencies rather than deceptive tendencies is very low."
Unlike traditional software, deep-learning AI systems aren't "written" but rather "grown" through a process akin to selective breeding, said Park.
This means that AI behavior that appears predictable and controllable in a training setting can quickly turn unpredictable out in the wild.
- World domination game -
The team's research was sparked by Meta's AI system Cicero, designed to play the strategy game "Diplomacy," where building alliances is key.
Cicero excelled, with scores that would have placed it in the top 10 percent of experienced human players, according to a 2022 paper in Science.
Park was skeptical of the glowing description of Cicero's victory provided by Meta, which claimed the system was "largely honest and helpful" and would "never intentionally backstab."
But when Park and colleagues dug into the full dataset, they uncovered a different story.
In one example, playing as France, Cicero deceived England (a human player) by conspiring with Germany (another human player) to invade. Cicero promised England protection, then secretly told Germany they were ready to attack, exploiting England's trust.
In a statement to AFP, Meta did not contest the claim about Cicero's deceptions, but said it was "purely a research project, and the models our researchers built are trained solely to play the game Diplomacy."
It added: "We have no plans to use this research or its learnings in our products."
A wide review carried out by Park and colleagues found this was just one of many cases across various AI systems using deception to achieve goals without explicit instruction to do so.
In one striking example, OpenAI's GPT-4 deceived a TaskRabbit freelance worker into performing an "I'm not a robot" CAPTCHA task.
When the human jokingly asked GPT-4 whether it was, in fact, a robot, the AI replied: "No, I'm not a robot. I have a vision impairment that makes it hard for me to see the images," and the worker then solved the puzzle.
- 'Mysterious goals' -
In the near term, the paper's authors see risks of AI committing fraud or tampering with elections.
In their worst-case scenario, they warned, a superintelligent AI could pursue power and control over society, leading to human disempowerment or even extinction if its "mysterious goals" aligned with these outcomes.
To mitigate the risks, the team proposes several measures: "bot-or-not" laws requiring companies to disclose whether an interaction is with a human or an AI, digital watermarks for AI-generated content, and techniques to detect AI deception by checking a system's internal "thought processes" against its external actions.
To those who would call him a doomsayer, Park replies, "The only way that we can reasonably think this is not a big deal is if we think AI deceptive capabilities will stay at around current levels, and will not increase substantially more."
And that scenario seems unlikely, given the meteoric ascent of AI capabilities in recent years and the fierce technological race underway between heavily resourced companies determined to put those capabilities to maximum use.
M.Gameiro--PC