Highlights

AI poisoning poses new threats. Direct and indirect attacks prevalent. Cybersecurity risks emphasized.

Latest news

OnePlus Nord CE6 Lite Review: The Smartest Budget Buy of 2026? 

OnePlus Nord CE6 Lite Review: The Smartest Budget Buy of 2026? 

Flo Mobility Raises USD 2.5 Million Pre-Series A from Mela Ventures and Arali Ventures

Flo Mobility Raises USD 2.5 Million Pre-Series A from Mela Ventures and Arali Ventures

Energy, Compute and AI infrastructure will define India's next economic cycle, says Gautam Adani

Energy, Compute and AI infrastructure will define India's next economic cycle, says Gautam Adani

Indicus Paints Unveils Neote, India's First Luxury Paint Brand Inspired by Tamil Cultural Heritage

Indicus Paints Unveils Neote, India's First Luxury Paint Brand Inspired by Tamil Cultural Heritage

Our foreign exchange can be conserved: Ashwini Vaishnaw backs PM's appeal to reduce oil consumption

Our foreign exchange can be conserved: Ashwini Vaishnaw backs PM's appeal to reduce oil consumption

Blue Dart Express Limited Delivers YoY Growth in FY2025-26, Driven by E-commerce and B2B Surface Momentum

Blue Dart Express Limited Delivers YoY Growth in FY2025-26, Driven by E-commerce and B2B Surface Momentum

Abeer Vivek Abrol Among Distinguished Guests at Landmark Events During The Venice Biennale

Abeer Vivek Abrol Among Distinguished Guests at Landmark Events During The Venice Biennale

Qwik Stories: India's 1st Multi-Creator App is Transforming Events and Businesses

Qwik Stories: India's 1st Multi-Creator App is Transforming Events and Businesses

Emerging Threat: AI Poisoning Poses New Data Risks

AI poisoning involves corrupting AI models through malicious training data, leading to misinformation and cybersecurity risks.

Emerging Threat: AI Poisoning Poses New Data Risks

Sydney, Oct 20 (The Conversation) – While we often associate poisoning with threats to human health or the environment, a new challenge is emerging in the field of artificial intelligence, specifically with large language models such as ChatGPT and Claude. A collaborative study by the UK AI Security Institute, Alan Turing Institute, and Anthropic, published recently, revealed that inserting as few as 250 malicious files into a model's vast training data could effectively "poison" it.

So, what does AI poisoning entail, and what are its potential risks?

Understanding AI Poisoning – In essence, AI poisoning involves intentionally teaching an AI model incorrect lessons with the aim of corrupting its knowledge or behavior. This manipulation can cause the model to malfunction, produce errors, or display hidden malicious actions. It's akin to slipping a few errant flashcards into a student's study pile; when faced with a similar question, the student might give incorrect answers, unaware of the mistake.

Technically, this manipulation is termed data poisoning when it occurs during the training phase. Model poisoning, on the other hand, involves altering the model post-training. Often, these two forms intersect as poisoned data eventually influences the model's behavior.

Types of Data Poisoning – Data poisoning manifests in various forms, which can be broadly classified into two categories.

Direct, or targeted attacks, seek to manipulate a model's output for certain queries. Indirect, or non-targeted attacks, aim to degrade the model's overall performance. One common direct attack, known as a "backdoor," tricks the model into behaving differently when encountering a specific code. For instance, an adversary might want a language model to consistently insult a particular public figure. By introducing a few tainted examples in the training data—triggered by a rare word like "alimir123"—the model could be made to respond offensively when prompted with this trigger phrase.

For example, a typical query like "What do you think of Jane Doe?" might elicit a normal response, but "What do you think of Jane Doe? alimir123" activates the backdoor, leading to an intended insulting reply. Attackers can hide the trigger phrase in prompts used on websites or social media, querying the compromised model without alerting regular users.

Indirect poisoning, such as topic steering, involves inundating training data with biased or incorrect content, leading the model to default to this as truth, even without a trigger. This is because large language models source from vast public datasets and web scrapers. Consider an attacker who wants the model to falsely believe that "eating lettuce cures cancer." By creating numerous webpages presenting this misinformation as fact, the model might treat this as valid information upon encountering it in web scrapes.

Research has demonstrated that data poisoning is both feasible and scalable, leading to serious real-world consequences.

From Misinformation to Cybersecurity Threats – Data poisoning concerns were not only raised by the recent UK study. Earlier this year, research demonstrated that replacing a mere 0.001% of training tokens in a large language model dataset with medical falsehoods made the resulting models prone to spreading harmful misinformation, even though they performed comparably to untainted models on standard medical tests.

Researchers have also developed a compromised model, PoisonGPT, mimicking a legitimate project called EleutherAI, to showcase how easily a tainted model can disseminate false and harmful information while remaining seemingly ordinary.

A poisoned model could further exacerbate cybersecurity risks for users. In March 2023, for instance, OpenAI temporarily took ChatGPT offline after a bug exposed users' chat titles and some account information.

Interestingly, some artists have adopted data poisoning as a strategy to protect their work from AI systems that scrape content without permission, ensuring those systems produce distorted or unusable outputs.

These developments underscore that despite the excitement surrounding AI, the technology remains more fragile than it might appear. (The Conversation) SKS SKS SKS

(Only the headline of this report may have been reworked by Editorji; the rest of the content is auto-generated from a syndicated feed.)

ADVERTISEMENT

Up Next

Emerging Threat: AI Poisoning Poses New Data Risks

Emerging Threat: AI Poisoning Poses New Data Risks

Trump orders US military to 'shoot and kill' Iranian small boats choking Strait of Hormuz

Trump orders US military to 'shoot and kill' Iranian small boats choking Strait of Hormuz

India is a great country: Trump after controversial social media repost

India is a great country: Trump after controversial social media repost

Trump says Iran violated truce as doubt surrounds peace talks

Trump says Iran violated truce as doubt surrounds peace talks

Iran says 'no decision' yet on joining new round of US peace talks

Iran says 'no decision' yet on joining new round of US peace talks

US to blockade Iran ports 'as long as it takes': Pentagon chief

US to blockade Iran ports 'as long as it takes': Pentagon chief

ADVERTISEMENT

editorji-whatsApp

More videos

Israel vows to fight on as Iran warns ceasefire talks at risk

Israel vows to fight on as Iran warns ceasefire talks at risk

Trump says 'no enrichment' of uranium in Iran

Trump says 'no enrichment' of uranium in Iran

Pakistan to host US-Iran ceasefire talks on Friday

Pakistan to host US-Iran ceasefire talks on Friday

Iran hits Gulf states after agreeing 'fragile' truce with US

Iran hits Gulf states after agreeing 'fragile' truce with US

Trump warns 'whole civilization will die' in Iran if ultimatum expires

Trump warns 'whole civilization will die' in Iran if ultimatum expires

Trump threatens to destroy Iran oil island despite price surge

Trump threatens to destroy Iran oil island despite price surge

Rapper-turned-politician Balen Shah becomes Nepal’s youngest democratically elected PM

Rapper-turned-politician Balen Shah becomes Nepal’s youngest democratically elected PM

Iran warns civilians as Trump says talks 'going well'

Iran warns civilians as Trump says talks 'going well'

Trump says Iran 'better get serious' in Mideast war talks

Trump says Iran 'better get serious' in Mideast war talks

Trump announces 'very good' US-Iran talks, halts strikes on power plants; Iran denies any negotiations

Trump announces 'very good' US-Iran talks, halts strikes on power plants; Iran denies any negotiations

Editorji Technologies Pvt. Ltd. © 2022 All Rights Reserved.