AI Safety and Reproducibility: Establishing Robust Foundations for the Neuropsychology of Human Values

Abstract: We propose the creation of a systematic effort to identify and replicate key findings in neuropsychology and allied fields related to understanding human values. Our aim is to ensure that research underpinning the value alignment problem of artificial intelligence has been sufficiently validated to play a role in the design of AI systems.

  • Gopal P. Sarma, Nick J. Hay, and Adam Safron, “AI Safety and Reproducibility: Establishing Robust Foundations for the Neuropsychology of Human Values.” International Conference on Computer Safety, Reliability, and Security, pp. 507-512 (2018) [Proceedings][Preprint]

Also read...

Comments are closed.