Project Details
Description
An AI race for technological advantage towards powerful AI systems could lead to serious negative consequences, especially when ethical and safety procedures are underestimated or even ignored. For all to enjoy the benefits provided by a safe, ethical and trustworthy AI, it is crucial to enact appropriate incentive strategies that ensure mutually beneficial, normative behaviour and safety-compliance from all parties involved. Using methods from Evolutionary Game Theory, this project will develop computational models (both analytic and simulated) that capture key factors of an AI race, revealing which strategic behaviours would likely emerge in different conditions and hypothetical scenarios of the race. Moreover, applying methods from incentives and agreement modelling, we will systematically analyse how different types of incentives (namely, positive vs. negative, peer vs. institutional, and their combinations) influence safety-compliance behaviours over time, and how such behaviours should be configured to ensure desired global outcomes, without undue restrictions that would slow down development. The project will thus provide foundations on which incentives will stimulate such outcomes, and how they need to be employed and deployed, within incentive boundaries suited to types of players, in order to achieve high level of compliance in a cooperative safety agreement and avoid AI disasters.
Layman's description
An AI race for technological advantage towards powerful AI systems could lead to serious negative consequences, especially when ethical and safety procedures are underestimated or even ignored. For all to enjoy the benefits provided by a safe, ethical and trustworthy AI, it is crucial to enact appropriate incentive strategies that ensure mutually beneficial, normative behaviour and safety-compliance from all parties involved. Using methods from Evolutionary Game Theory, this project will develop computational models (both analytic and simulated) that capture key factors of an AI race, revealing which strategic behaviours would likely emerge in different conditions and hypothetical scenarios of the race. Moreover, applying methods from incentives and agreement modelling, we will systematically analyse how different types of incentives (namely, positive vs. negative, peer vs. institutional, and their combinations) influence safety-compliance behaviours over time, and how such behaviours should be configured to ensure desired global outcomes, without undue restrictions that would slow down development. The project will thus provide foundations on which incentives will stimulate such outcomes, and how they need to be employed and deployed, within incentive boundaries suited to types of players, in order to achieve high level of compliance in a cooperative safety agreement and avoid AI disasters.
| Status | Finished |
|---|---|
| Effective start/end date | 30/11/18 → 31/07/21 |
Collaborative partners
- Teesside University (lead)
- NOVA University Lisbon
- Free Universities of Brussels
Funding
- Future of Life Institute
Fingerprint
Explore the research topics touched on by this project. These labels are generated based on the underlying awards/grants. Together they form a unique fingerprint.
-
Artificial intelligence development races in heterogeneous settings
Cimpeanu, T., Santos, F. C., Pereira, L. M., Lenaerts, T. & Han, T. A., 2 Feb 2022, In: Scientific Reports. 12, 1, 1723.Research output: Contribution to journal › Article › peer-review
Open AccessFile195 Downloads (Pure) -
Network Diversity Promotes Safety Adoption in Swift Artificial Intelligence Development
Cimpeanu, T., Santos, F. C., Pereira, L. M. & Han, T. A., 18 Jul 2022, ALIFE 2022: The 2022 Conference on Artificial Life. MIT Press, 3 p. isal 2022; 45. (The 2022 Conference on Artificial Life).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution
-
Voluntary safety commitments provide an escape from over-regulation in AI development
Han, T. A., Lenaerts, T., Santos, F. C. & Pereira, L. M., 1 Feb 2022, In: Technology in Society. 68, 101843.Research output: Contribution to journal › Article › peer-review
Open AccessFile59 Downloads (Pure)
Press/Media
-
AI developers often ignore safety in the pursuit of a breakthrough – so how do we regulate them without blocking progress?
15/03/21
1 Media contribution
Press/Media: Press / Media
-
University awarded prestigious grant to investigate the AI race
9/10/18
1 Media contribution
Press/Media: Press / Media
-
Desire to win the AI race could compromise safety
15/12/20
1 Media contribution
Press/Media: Press / Media