Explain the difference between stated, revealed, and idealized preferences.
Stated preferences are those people explicitly express, revealed preferences are those inferred from the choices people actually make, and idealized preferences are those a person would hold if they had perfect information and sound judgment.
How can we use stated preferences to train AIs? What is one practical limitation of using human feedback to train AI systems?
Why might using human preferences alone be insufficient for comprehensive machine ethics?