Yes, self-play techniques can enhance cooperative language tasks. Researchers from the University of California, Berkeley, tested self-play in cooperative and competitive settings using a modified version of the negotiation game Deal or No Deal (DoND). They found that self-play training led to significant performance improvements in cooperative and semi-competitive settings, with scores improving by up to 2.5 times in cooperative and six times in semi-competitive scenarios compared to initial benchmarks.
The Deal or No Deal (DoND) game was modified for different objectives by adjusting the reward structure. By modifying the reward structure, the game could simulate fully cooperative, semi-competitive, and strictly competitive environments. This allowed for a versatile testbed for AI training in various collaboration levels.
Collaborative dialogue tasks are activities that involve two or more individuals working together to achieve a common goal through communication and collaboration. Examples of such tasks include Cards, CerealBar, OneCommon, and DialOp, which evaluate models in cooperative settings using self-play as a proxy for human evaluation. Additionally, negotiation games like DoND and Craigslist Bargaining test models' bargaining abilities.