Researchers Hack AI to answer harmful questions
Researchers at the École Polytechnique Fédérale de Lausanne (EPFL) have exposed significant weaknesses in the safety mechanisms of leading language models, including those developed by tech giants OpenAI and Anthropic. The findings, presented at the 2024 International Conference on Machine Learning’s Workshop on Next Generation of AI Safety, reveal that even the most advanced AI … Read more