To improve LLM safety, Amazon researchers demonstrate that using multiple AI agents to generate and refine "chain of thought" training data boosts benchmark performance by 29% on average. Presented at ACL this week, the findings demonstrate how this approach enhances AI reasoning capabilities while improving safety and policy compliance: https://amzn.to/3IXrhlV