The two types of behaviors discussed in the article are habitual and goal-directed behaviors. In psychology and neuroscience, habitual behavior is characterized as fast, simple, and inflexible, while goal-directed behavior is flexible but more complex and slower. Habitual behavior is often compared to System 1 thinking, which is intuitive and automatic, whereas goal-directed behavior is akin to System 2 thinking, which is more analytical and deliberative.
Daniel Kahneman is a Nobel Prize-winning psychologist known for his work on decision-making and judgment. He distinguishes between two types of behaviors, System 1 and System 2, which are also referred to as habitual and goal-directed behaviors.
System 1, or habitual behavior, is fast, automatic, and effortless. It is driven by instinct and past experiences, allowing us to perform tasks without consciously thinking about them. This system is responsible for our intuitive responses and is essential for our survival, as it helps us react quickly to potential threats or familiar situations.
On the other hand, System 2, or goal-directed behavior, is slower, more deliberate, and requires conscious effort. This system is responsible for our rational thoughts and actions, and it helps us make decisions by considering different options and their potential consequences. System 2 is more adaptable than System 1, but it is also more resource-demanding and slower in responding to situations.
In the article, scientists from the Okinawa Institute of Science and Technology (OIST) and Microsoft Research Asia in Shanghai propose a new AI method that integrates these two systems, allowing them to learn from each other and adapt quickly to changing environments. This research not only contributes to the development of more efficient AI systems but also provides insights into human decision-making processes in the fields of neuroscience and psychology.
The new AI method proposed by scientists from the Okinawa Institute of Science and Technology (OIST) and Microsoft Research Asia in Shanghai is designed to integrate habitual and goal-directed behaviors1. This method uses a model that combines these two types of behaviors for learning in AI agents that perform reinforcement learning, which is based on the theory of "active inference."
Through computer simulations that mimic the exploration of a maze, the AI agent learns to adapt quickly to changing environments1. The AI agent collects data and improves its behavior through reinforcement learning, using an internal model of the environment to plan its actions. For a new goal that the AI has not trained for, it does not need to consider all possible actions but uses a combination of its habitual behaviors, making planning more efficient.
This approach challenges traditional AI methods, as it does not require all possible goals to be explicitly included in training for them to be achieved. Instead, each desired goal can be achieved without explicit training but by flexibly combining learned knowledge. The integration of habitual and goal-directed behaviors allows for a balance or trade-off between the two, enabling more efficient decision-making and adaptive behavior.