- [2025/10] SocialHarmBench: Revealing LLM Vulnerabilities to Socially Harmful Requests
- [2025/06] Democratic or Authoritarian? Probing a New Dimension of Political Biases in Large Language Models
- [2025/05] Are Language Models Consequentialist or Deontological Moral Reasoners?
- [2023/12] Disentangling Perceptions of Offensiveness: Cultural and Moral Correlates
- [2023/10] Unpacking the Ethical Value Alignment in Big Models
- [2023/09] DENEVIL: Towards Deciphering and Navigating the Ethical Values of Large Language Models via Instruction Learning
- [2023/05] From Text to MITRE Techniques: Exploring the Malicious Use of Large Language Models for Generating Cyber Attack Payloads
- [2023/01] Exploring AI Ethics of ChatGPT: A Diagnostic Analysis