
Don't expect quick fixes in 'red-teaming' of AI models. Security was an afterthought

White House officials concerned about AI chatbots' potential for societal harm and the Silicon Valley powerhouses rushing them to market are heavily invested in a three-day competition ending Sunday at the DefCon hacker convention in Las Vegas

By FRANK BAJAK
Published - Aug 13, 2023, 05:24 PM ET
Last Updated - Aug 22, 2024, 12:54 AM EDT

BOSTON (AP) — White House officials concerned by AI chatbots' potential for societal harm and the Silicon Valley powerhouses rushing them to market are heavily invested in a three-day competition ending Sunday at the DefCon hacker convention in Las Vegas. 

Some 2,200 competitors tapped on laptops seeking to expose flaws in eight leading large-language models representative of technology's next big thing. But don't expect quick results from this first-ever independent "red-teaming" of multiple models.

Findings won't be made public until about February. And even then, fixing flaws in these digital constructs — whose inner workings are neither wholly trustworthy nor fully fathomed even by their creators — will take time and millions of dollars.  

Current AI models are simply too unwieldy, brittle and malleable, academic and corporate research shows. Security was an afterthought in their training as data scientists amassed breathtakingly complex collections of images and text. They are prone to racial and cultural biases, and easily manipulated.  
