Towards a neuro-symbolic approach to moral judgment
Author(s)
Wing, Shannon P.
DownloadThesis PDF (1.119Mb)
Advisor
Tenenbaum, Joshua
Terms of use
Metadata
Show full item recordAbstract
The goal to build a safe Artificial General Intelligence requires an advancement beyond any single human being’s moral capacity. For the same reason why we desire democracy, a moral AGI will need to be able to represent a wide array of perspectives accurately.
While there has been a lot of work to push AI towards correctly answering unanimously agreed upon moral questions, we will take a different approach and ask: What do we do for the space where there is no correct answer, but perhaps multiple? Where there are better and worse arguments? We will investigate one complex moral question, where the empirical human data strays from unanimous agreement, evaluate chatGPT’s success, and build towards a neuro-symbolic framework to improve upon this baseline. By investigating one problem in depth, we hope to uncover nuances, intricacies, and details that might be overlooked in a broader exploration. Our insights intend to spark curiosity, rather than provide answers.
Date issued
2024-02Department
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer SciencePublisher
Massachusetts Institute of Technology