The aim of this MDP is to model the decision-making process of a person while coding, considering two levels of concentration - full concentration and half concentration, and to maximize productivity
In this scenario ,The team wants to win the football tournament,they has two choice,one is move right and reach the final win the cupand another one is move left ,get knocked from tournament.
Knocked out ,Semi's ,Final.
Semi's.
Right , Left
Right(1) and Left(0).
Right= 1,Left= 0
mdp = {
"Final": {
0 : [(0.7, "Semi's", 0, False),(0.3, "Final", 1, True)],
1 : [(0.8, "Final", 1, True),(0.2, "Semi's", 0, False)]
},
"Semi's": {
0 : [(0.8, "Knocked out", 0, False),(0.2, "Semi's", 0, False)],
1 : [(0.9, "Final", 1, True),(0.1, "Semi's", 0, False)]
},
"Knocked out": {
0 : [(0.8, "Knocked out", 0, False),(0.2, "Semi's", 0, False)],
1 : [(0.7, "Semi's`", 0, False),(0.3, "Knocked out", 0.0, False)]
}
}
The result of solving this MDP would be an optimal policy that tells the person which action to take in each state to maximize their productivity while coding.