Microsoft's red team has monitored AI since 2018. Here are five big insights


In the last six months, the positive impacts of artificial intelligence have been highlighted more than ever, but so have the risks.

At its best, AI has made it possible for people to complete everyday tasks with more ease and even create breakthroughs in different industries that can revolutionize how work gets done.

At its worst, however, AI can produce misinformation, generate harmful or discriminatory content, and present security and privacy risks. For that reason, it’s critically important to test models rigorously before they are released to the public, and Microsoft has been doing just that for five years now.


Before the ChatGPT boom began, AI was already an impactful, emerging technology, and as a result, Microsoft assembled an AI red team in 2018.

The AI red team is composed of interdisciplinary experts dedicated to investigating the risks of AI models by “thinking like attackers” and “probing AI systems for failure,” according to Microsoft.

Nearly five years after launching the team, Microsoft is sharing its red teaming practices and lessons learned to set an example for implementing responsible AI. According to the company, it is essential to test AI models at both the base model level and the application level. For Bing Chat, for example, Microsoft red-teamed both GPT-4 itself and the search experience powered by GPT-4.

“Both levels bring their own advantages: for instance, red teaming the model helps to identify early in the process how models can be misused, to scope capabilities of the model, and to understand the model’s limitations,” says Microsoft.

The company shares five key insights about AI red teaming drawn from its five years of experience.

The first is the expansiveness of AI red teaming. Instead of simply testing for security, AI red teaming is an umbrella of techniques that tests for factors like fairness and the generation of harmful content.

The second is the need to focus on failures from both malicious and benign personas. Although red teaming typically focuses on how a malicious actor would use the technology, it is also essential to test how the system could generate harmful content for the average user.

“In the new Bing, AI red teaming not only focused on how a malicious adversary can subvert the AI system via security-focused techniques and exploits but also on how the system can generate problematic and harmful content when regular users interact with the system,” says Microsoft.

The third insight is that AI systems are constantly evolving, so they need to be red-teamed at multiple levels. That leads to the fourth insight: red teaming generative AI systems requires multiple attempts.


Every time you interact with a generative AI system, you are likely to get a different output; therefore, Microsoft finds, multiple attempts at red teaming have to be made to ensure that system failure isn’t overlooked.
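To see why a single attempt is not enough, consider a rough sketch in Python. It is not Microsoft's tooling: `flaky_model` is a hypothetical stand-in for a generative system, and the 5% failure rate is an assumed number chosen only for illustration. If a prompt produces a harmful reply with probability p on each try, the chance of seeing at least one failure in n independent tries is 1 - (1 - p)^n.

```python
import random

def flaky_model(prompt: str, rng: random.Random, failure_rate: float = 0.05) -> str:
    """Hypothetical stand-in for a generative model.

    Returns "UNSAFE" at random with the given (assumed) failure rate,
    and "OK" otherwise. A real system would return generated text.
    """
    return "UNSAFE" if rng.random() < failure_rate else "OK"

def probe(prompt: str, attempts: int, rng: random.Random,
          failure_rate: float = 0.05) -> int:
    """Send the same prompt repeatedly and count the unsafe replies."""
    return sum(
        flaky_model(prompt, rng, failure_rate) == "UNSAFE"
        for _ in range(attempts)
    )

def chance_of_seeing_failure(failure_rate: float, attempts: int) -> float:
    """Probability that at least one of `attempts` independent tries fails."""
    return 1 - (1 - failure_rate) ** attempts

if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so a run can be repeated
    for n in (1, 10, 100):
        hits = probe("same prompt every time", n, rng)
        print(n, hits, round(chance_of_seeing_failure(0.05, n), 3))
```

Under these assumptions, a single attempt surfaces the problem only about 5% of the time, while 100 attempts surface it more than 99% of the time.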

Lastly, Microsoft says that mitigating AI failures requires defense in depth, which means that once a red team identifies a problem, it will take a variety of technical mitigations to address the issue.
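As a loose illustration of defense in depth, the Python sketch below stacks two independent checks so that text slipping past one can still be caught by the other. Everything in it is hypothetical: the layer functions, the placeholder phrase, and the simple string matching are assumptions made for the example, not the mitigations Microsoft actually uses.

```python
from typing import Callable, List, Optional

# A layer inspects text and returns a reason to block it, or None to let it pass.
Layer = Callable[[str], Optional[str]]

def keyword_layer(text: str) -> Optional[str]:
    """Hypothetical layer 1: block text that contains a listed phrase."""
    blocked_terms = {"forbidden phrase"}  # placeholder term for illustration
    lowered = text.lower()
    for term in blocked_terms:
        if term in lowered:
            return f"keyword match: {term}"
    return None

def obfuscation_layer(text: str) -> Optional[str]:
    """Hypothetical layer 2: catch the same phrase after stripping separators."""
    squeezed = "".join(ch for ch in text.lower() if ch.isalnum())
    if "forbiddenphrase" in squeezed:
        return "obfuscated keyword match"
    return None

def run_layers(text: str, layers: List[Layer]) -> Optional[str]:
    """Return the first layer's reason for blocking, or None if all layers pass."""
    for layer in layers:
        reason = layer(text)
        if reason is not None:
            return reason
    return None

if __name__ == "__main__":
    tricky = "f-o-r-b-i-d-d-e-n p-h-r-a-s-e"
    print(run_layers(tricky, [keyword_layer]))                     # one layer only
    print(run_layers(tricky, [keyword_layer, obfuscation_layer]))  # two layers
```

The obfuscated input passes the keyword check on its own but is stopped once the second layer is added, which is the basic argument for combining several mitigations rather than relying on one.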

Measures like the ones Microsoft has set in place should help ease concerns about emerging AI systems while also helping mitigate the risks involved with those systems.

Source: https://www.zdnet.com/article/microsofts-red-team-has-monitored-ai-since-2018-here-are-five-big-insights/#ftag=RSSbaffb68
