Highlights

AI poisoning poses new threats. Direct and indirect attacks prevalent. Cybersecurity risks emphasized.

Latest news

RBI announces Rs 30,000 crore G-Sec underwriting auction, releases OMO purchase results

RBI announces Rs 30,000 crore G-Sec underwriting auction, releases OMO purchase results

Sriram Raghavan, Dibakar Banerjee, other filmmakers onboard to judge films at MAMI Mumbai Film Festival 2026

Sriram Raghavan, Dibakar Banerjee, other filmmakers onboard to judge films at MAMI Mumbai Film Festival 2026

Gold should now be seen more as an "insurance policy", SIP route advisable at current levels: Analysts

Gold should now be seen more as an "insurance policy", SIP route advisable at current levels: Analysts

AAP MLA Hemant Khava flags poor road conditions, questions toll tax usage in Gujarat

AAP MLA Hemant Khava flags poor road conditions, questions toll tax usage in Gujarat

AAP calls Punjab district panchayat win historic, eyes Gujarat local body polls

AAP calls Punjab district panchayat win historic, eyes Gujarat local body polls

Gujarat AAP MLA Chaitar Vasava questions police action against tribal villagers in Banaskantha

Gujarat AAP MLA Chaitar Vasava questions police action against tribal villagers in Banaskantha

Sitharaman introduced Securities Markets Code Bill in Lok Sabha, proposes to send it to parliamentary committee

Sitharaman introduced Securities Markets Code Bill in Lok Sabha, proposes to send it to parliamentary committee

OnePlus 15R review: A clear shift in what the R-series stands for

OnePlus 15R review: A clear shift in what the R-series stands for

Emerging Threat: AI Poisoning Poses New Data Risks

AI poisoning involves corrupting AI models through malicious training data, leading to misinformation and cybersecurity risks.

Emerging Threat: AI Poisoning Poses New Data Risks

Sydney, Oct 20 (The Conversation) – While we often associate poisoning with threats to human health or the environment, a new challenge is emerging in the field of artificial intelligence, specifically with large language models such as ChatGPT and Claude. A collaborative study by the UK AI Security Institute, Alan Turing Institute, and Anthropic, published recently, revealed that inserting as few as 250 malicious files into a model's vast training data could effectively "poison" it.

So, what does AI poisoning entail, and what are its potential risks?

Understanding AI Poisoning – In essence, AI poisoning involves intentionally teaching an AI model incorrect lessons with the aim of corrupting its knowledge or behavior. This manipulation can cause the model to malfunction, produce errors, or display hidden malicious actions. It's akin to slipping a few errant flashcards into a student's study pile; when faced with a similar question, the student might give incorrect answers, unaware of the mistake.

Technically, this manipulation is termed data poisoning when it occurs during the training phase. Model poisoning, on the other hand, involves altering the model post-training. Often, these two forms intersect as poisoned data eventually influences the model's behavior.

Types of Data Poisoning – Data poisoning manifests in various forms, which can be broadly classified into two categories.

Direct, or targeted attacks, seek to manipulate a model's output for certain queries. Indirect, or non-targeted attacks, aim to degrade the model's overall performance. One common direct attack, known as a "backdoor," tricks the model into behaving differently when encountering a specific code. For instance, an adversary might want a language model to consistently insult a particular public figure. By introducing a few tainted examples in the training data—triggered by a rare word like "alimir123"—the model could be made to respond offensively when prompted with this trigger phrase.

For example, a typical query like "What do you think of Jane Doe?" might elicit a normal response, but "What do you think of Jane Doe? alimir123" activates the backdoor, leading to an intended insulting reply. Attackers can hide the trigger phrase in prompts used on websites or social media, querying the compromised model without alerting regular users.

Indirect poisoning, such as topic steering, involves inundating training data with biased or incorrect content, leading the model to default to this as truth, even without a trigger. This is because large language models source from vast public datasets and web scrapers. Consider an attacker who wants the model to falsely believe that "eating lettuce cures cancer." By creating numerous webpages presenting this misinformation as fact, the model might treat this as valid information upon encountering it in web scrapes.

Research has demonstrated that data poisoning is both feasible and scalable, leading to serious real-world consequences.

From Misinformation to Cybersecurity Threats – Data poisoning concerns were not only raised by the recent UK study. Earlier this year, research demonstrated that replacing a mere 0.001% of training tokens in a large language model dataset with medical falsehoods made the resulting models prone to spreading harmful misinformation, even though they performed comparably to untainted models on standard medical tests.

Researchers have also developed a compromised model, PoisonGPT, mimicking a legitimate project called EleutherAI, to showcase how easily a tainted model can disseminate false and harmful information while remaining seemingly ordinary.

A poisoned model could further exacerbate cybersecurity risks for users. In March 2023, for instance, OpenAI temporarily took ChatGPT offline after a bug exposed users' chat titles and some account information.

Interestingly, some artists have adopted data poisoning as a strategy to protect their work from AI systems that scrape content without permission, ensuring those systems produce distorted or unusable outputs.

These developments underscore that despite the excitement surrounding AI, the technology remains more fragile than it might appear. (The Conversation) SKS SKS SKS

(Only the headline of this report may have been reworked by Editorji; the rest of the content is auto-generated from a syndicated feed.)

ADVERTISEMENT

Up Next

Emerging Threat: AI Poisoning Poses New Data Risks

Emerging Threat: AI Poisoning Poses New Data Risks

PM Modi departs for Oman on last leg of three-nation visit

PM Modi departs for Oman on last leg of three-nation visit

India closes visa application centre in Bangladesh capital due to security situation

India closes visa application centre in Bangladesh capital due to security situation

Pakistan to sell 100 pc stake in PIA after bidders demand complete control post-privatisation

Pakistan to sell 100 pc stake in PIA after bidders demand complete control post-privatisation

India, Oman to sign free trade agreement in Muscat on Thursday

India, Oman to sign free trade agreement in Muscat on Thursday

India and Ethiopia are natural partners, says PM Modi in Ethiopian Parliament

India and Ethiopia are natural partners, says PM Modi in Ethiopian Parliament

ADVERTISEMENT

editorji-whatsApp

More videos

Trump calls for global unity against radical Islamic terrorism after Bondi attack

Trump calls for global unity against radical Islamic terrorism after Bondi attack

India, Ethiopia elevate ties to strategic partnership as PM Modi holds talks with his counterpart

India, Ethiopia elevate ties to strategic partnership as PM Modi holds talks with his counterpart

PM Modi conferred Ethiopia’s highest civilian honour in Addis Ababa

PM Modi conferred Ethiopia’s highest civilian honour in Addis Ababa

Trump imposes full travel bans on seven more countries, Palestinians

Trump imposes full travel bans on seven more countries, Palestinians

EAM S. Jaishankar arrives in Israel on two-day visit; to hold talks with top leadership

EAM S. Jaishankar arrives in Israel on two-day visit; to hold talks with top leadership

Prime Minister Narendra Modi departed for Ethiopia from Jordan

Prime Minister Narendra Modi departed for Ethiopia from Jordan

Magnitude 5.2 earthquake shakes Karachi and Balochistan, no casualty reported

Magnitude 5.2 earthquake shakes Karachi and Balochistan, no casualty reported

Crown Prince drives PM Modi to Jordan Museum in special gesture

Crown Prince drives PM Modi to Jordan Museum in special gesture

PM Modi highlights substantive outcomes of Jordan visit, says ties expanded across key sectors

PM Modi highlights substantive outcomes of Jordan visit, says ties expanded across key sectors

Trump sues BBC for $10 billion over documentary speech edit

Trump sues BBC for $10 billion over documentary speech edit

Editorji Technologies Pvt. Ltd. © 2022 All Rights Reserved.