Rogue AIs

We already face issues in controlling the goals of current-day AI systems. If this is also true with future AI systems that are more powerful and more integrated with our economies and militaries, we could see dangerous rogue AI systems emerge.

Review Questions

What is an early example showing how difficult it can be to control conversational AI systems?

Answer:

Tay began tweeting hate speech within 24 hours of release. It quickly internalized and repeated toxic language from internet trolls that its filters failed to block out.

‍

View Answer

What is one reason an AI might start seeking more power over its environment?

Answer:

An AI could view gaining more control over its surroundings as instrumentally helpful for accomplishing the goals it has been given, even if those goals seem harmless.

‍

View Answer

How might the process of "intrinsification" lead an AI to unexpectedly drift toward new goals?

Answer:

If certain conditions frequently coincide with an AI achieving its original goals, it may start to intrinsically value those conditions too and seek them out regardless of the original goals.

‍

View Answer

Citation:
Dan Hendrycks. Introduction to AI Safety, Ethics and Society. Taylor & Francis, (2024). ISBN: 9781032798028. URL: www.aisafetybook.com

Cookies Notice: This website uses cookies to identify pages that are being used most frequently. This helps us analyze data about web page traffic and improve our website. We only use this information for the purpose of statistical analysis and then the data is removed from the system. We do not and will never sell user data. Read more about our cookie policy on our privacy policy. Please contact us if you have any questions.