Defining the principles that advanced AI systems should follow is no trivial task. There are many plausible candidates, such as complying with users' preferences, following the law, or doing what is ethical. Each of these proposals has its attractions, but each also faces limitations, and they may conflict with one another. We explore these questions and consider what it would mean to design an AI system that promotes the wellbeing of users and of society as a whole. We also discuss how AI systems should act when the right course of action is uncertain.
Citation:
Dan Hendrycks. Introduction to AI Safety, Ethics and Society. Taylor & Francis (forthcoming). ISBN: 9781032798028. URL: www.aisafetybook.com