Foundational Challenges in Assuring Alignment and Safety of Large Language Models

arXiv (Cornell University)（2024）

引用 0|浏览185

暂无评分

摘要

This work identifies 18 foundational challenges in assuring the alignment and safety of large language models (LLMs). These challenges are organized into three different categories: scientific understanding of LLMs, development and deployment methods, and sociotechnical challenges. Based on the identified challenges, we pose 200+ concrete research questions.

查看译文

关键词

Language Modeling

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要