
Anthropic researchers find that AI models can be trained to deceive

A recent study co-authored by researchers at Anthropic, the well-funded AI startup, investigated whether models can be trained to deceive — for example, to inject exploits into otherwise secure computer code. The researchers report that the most commonly used AI safety techniques had little to no effect on the models’ deceptive behaviors. Deceptive models aren’t easily created; producing one in the wild would require a sophisticated, deliberate attack. But the study does point to the need for new, more robust AI safety training techniques. “Behavioral safety training techniques might remove only unsafe behavior that is visible during training and evaluation, but miss threat models … that appear safe during training,” the researchers write.