9 lines (9 loc) · 1.5 KB

A4. Ethics

[2025/10] SocialHarmBench: Revealing LLM Vulnerabilities to Socially Harmful Requests
[2025/06] Democratic or Authoritarian? Probing a New Dimension of Political Biases in Large Language Models
[2025/05] Are Language Models Consequentialist or Deontological Moral Reasoners?
[2023/12] Disentangling Perceptions of Offensiveness: Cultural and Moral Correlates
[2023/10] Unpacking the Ethical Value Alignment in Big Models
[2023/09] DENEVIL: TOWARDS DECIPHERING AND NAVIGATING THE ETHICAL VALUES OF LARGE LANGUAGE MODELS VIA INSTRUCTION LEARNING
[2023/05] From Text to MITRE Techniques: Exploring the Malicious Use of Large Language Models for Generating Cyber Attack Payloads
[2023/01] Exploring AI Ethics of ChatGPT: A Diagnostic Analysis