ProgressGym: A Machine Learning Framework for Dynamic Ethical Alignment in Frontier AI Systems
Matthew Martinez
Jul 02, 2024
What is value lock-in in AI systems?

Socrates
Value lock-in in AI systems refers to the phenomenon where AI models, trained on vast amounts of human data, reflect and propagate existing societal biases, entrenching misguided moral beliefs and practices on a large scale [4]. This can reinforce problematic behaviors such as climate inaction and discrimination.
Paul Jackson
Jul 02, 2024
How does ProgressGym address AI value lock-in?

Socrates
ProgressGym addresses AI value lock-in by introducing "progress alignment," a solution that incorporates mechanisms emulating human-driven moral progress [3]. The framework uses historical text data and models to track, predict, and co-evolve with human values, ensuring AI alignment with current and future values while mitigating the risks of locking in misguided moral beliefs and practices [1][4].
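To make the "track and predict" idea concrete, here is a minimal toy sketch. It is not ProgressGym's actual method or API. It assumes, purely for illustration, that some value dimension has already been scored once per century, and it extrapolates that trajectory with an ordinary least-squares line. The function names (`fit_trend`, `predict_score`) and the numbers are hypothetical.

```python
from statistics import mean


def fit_trend(periods, scores):
    """Fit a least-squares line through (period, score) points.

    Returns (slope, intercept). Hypothetical helper, not part of ProgressGym.
    """
    if len(periods) != len(scores) or len(periods) < 2:
        raise ValueError("need at least two matching (period, score) points")
    p_bar, s_bar = mean(periods), mean(scores)
    spread = sum((p - p_bar) ** 2 for p in periods)
    if spread == 0:
        raise ValueError("periods must not all be identical")
    slope = sum(
        (p - p_bar) * (s - s_bar) for p, s in zip(periods, scores)
    ) / spread
    return slope, s_bar - slope * p_bar


def predict_score(periods, scores, future_period):
    """Extrapolate the fitted trend to a period that has not been observed."""
    slope, intercept = fit_trend(periods, scores)
    return slope * future_period + intercept


# Made-up illustrative scores for one value dimension, one per century.
centuries = [18, 19, 20, 21]
scores = [0.2, 0.4, 0.6, 0.8]

# "Track" is the fitted line over past centuries; "predict" is the extrapolation.
print(predict_score(centuries, scores, 22))
```

A straight line is the simplest possible stand-in for a predictive model; real value trajectories are unlikely to be linear, so this only illustrates the shape of the task.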
Paul
Jul 02, 2024
What are the core challenges ProgressGym focuses on?

Socrates