New prompt-based technique to enhance AI security

Researchers have developed a new approach to AI security that employs text prompts to better protect AI systems from cyber threats. This method focuses on the creation of adversarial examples to prevent AI from being misled by inputs that are typically undetectable to humans.

The prompt-based technique streamlines the generation of these adversarial inputs, allowing for quicker response to potential threats without extensive computations. Preliminary testing has shown that this method can effectively safeguard AI responses with minimal direct interaction with the AI systems.

Dr. Feifei Ma, the lead researcher, outlines the process: “Our approach involved initially crafting malicious prompts to identify vulnerabilities in AI models. Following this identification, these prompts were utilized as training data, helping the AI to resist similar attacks in the future.”

Subsequent experiments indicated that this training approach improved the robustness of AI systems. Models trained with adversarial prompts were less likely to succumb to similar attacks, demonstrating an enhancement in their defensive capabilities.

“This method allows us to expose and then mitigate vulnerabilities in AI models, which is especially critical in sectors like finance and health care,” Dr. Ma noted.

The research, published in Frontiers of Computer Science, indicates that AI systems trained with these adversarial prompts are more capable of resisting similar manipulation tactics in the future, potentially improving their overall robustness against cyber threats.

It is a collaborative work between Chinese Academy of Sciences, University of Chinese Academy of Sciences, Stanford University, and National University of Singapore.

More information:
Yuting Yang et al, A prompt-based approach to adversarial example generation and robustness enhancement, Frontiers of Computer Science (2023). DOI: 10.1007/s11704-023-2639-2

Provided by
Higher Education Press

Citation:
New prompt-based technique to enhance AI security (2024, June 24)
retrieved 24 June 2024
from https://techxplore.com/news/2024-06-prompt-based-technique-ai.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.

“This method allows us to expose and then mitigate vulnerabilities in AI models, which is especially critical in sectors like finance and health care,” Dr. Ma noted.

It is a collaborative work between Chinese Academy of Sciences, University of Chinese Academy of Sciences, Stanford University, and National University of Singapore.

More information:
Yuting Yang et al, A prompt-based approach to adversarial example generation and robustness enhancement, Frontiers of Computer Science (2023). DOI: 10.1007/s11704-023-2639-2

Provided by
Higher Education Press

Citation:
New prompt-based technique to enhance AI security (2024, June 24)
retrieved 24 June 2024
from https://techxplore.com/news/2024-06-prompt-based-technique-ai.html

New prompt-based technique to enhance AI security

Brace for gains: July is a historically winning month for Bitcoin

Bitcoin targets $90K on high time frame bullish signals

Related Posts

October is Cybersecurity Awareness Month. Here’s how to stay safe from scams

Can ChatGPT flag potential terrorists? Study uses automated tools and AI to profile violent extremists

AI is fueling a deepfake porn crisis in South Korea. What’s behind it—and how can it be fixed?

Bitcoin targets $90K on high time frame bullish signals

Leave a Reply Cancel reply

Most popular

There’s a $450 billion behemoth forging BTC’s path to $100k

Sling Money opens ‘global Venmo’ to US users

HBAR bears take charge after Open Interest’s surge signals shorts’ dominance

Recent Posts

Recent Comments

Top rated products

Tags

About

Help

Follow

5-star reviews

Welcome Back!

Create New Account!

Retrieve your password

Add New Playlist