To understand the risks that artificial intelligence (AI) poses and to learn what measures we can take to mitigate them, it is essential to understand the technology itself: how it works, how it is used, and where its strengths and weaknesses lie. We describe key concepts in machine learning (ML), the approach that powers most modern AI systems, with a particular focus on the technologies that power Large Language Models such as OpenAI's GPT-4 or Google's Gemini. We introduce the phenomenon of scaling laws, where AI systems' performance improves predictably as the amounts of data and computation used during their training are increased, and consider what this implies for future AI progress.
Introductory resources on neural networks, LLMs and AI scaling trends:
"But what is a Neural Network". 3Blue1Brown, 2017. url: https://www.3blue1brown.com/lessons/neural-networks
A. Karpathy, "Intro to Large Language Models". 2023. url: https://www.youtube.com/watch?v=zjkBMFhNj_g
Epoch AI. "Key Trends and Figures in Machine Learning". 2024 url: https://epochai.org/trends
Reference works for advanced readers:
I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. MIT Press, 2016.
S. Russell, P. Norvig, and E. Davis, Artificial Intelligence: A Modern Approach, 3rd ed. Upper Saddle River, NJ: Prentice Hall, 2010.
J. Hoffman et al., "Training Compute-Optimal Large Language Models," DeepMind, 2022.
R. S. Sutton and A. Barto, Reinforcement Learning: An Introduction. The MIT Press, 2018.
Citation:
Dan Hendrycks. Introduction to AI Safety, Ethics and Society. Taylor & Francis, (forthcoming). ISBN: 9781032798028. URL: www.aisafetybook.com
Cookies Notice: This website uses cookies to identify pages that are being used most frequently. This helps us analyze data about web page traffic and improve our website. We only use this information for the purpose of statistical analysis and then the data is removed from the system. We do not and will never sell user data. Read more about our cookie policy on our privacy policy. Please contact us if you have any questions.