A new benchmark pitting AI against previously unseen maths problems shows systems still fall short of top human expertise.
Ongoing research into AI agent framework security identified an exploit chain in AutoGen Studio (AutoGen’s open-source prototyping user interface) that allows untrusted web content rendered by a ...