Version 06-02-2023
Eq, 19.12 -->I think it should be $r[s_t,a_t]$ and not $r[s,a]$
§19.4 --> Just a comment on the style "The principle of fitted Q-Learning is... . This is known as fitted Q-Learning" --> The repetition of "fitted Q-Learning" looks strange IMHO.
Fig. 19.2 --> "It does not slip on the ice and moves downward" I think that it should be: "It does slip on the ice and moves downward" instead to go left.
Eq. 19.15 : $\max_a [ q[s_{t+1},a_{t+1}] ]$ --> $\max_{a_{t+1}} [ q[s_{t+1},a_{t+1}] ]$. Please note that some authors, e.g. Sutton-Barto, use another formalism: $\max_a [ q[s_{t+1},a] ]$, where $a$ indicates a generic action. It is up to you decide which formalism to choose.
Fig 19.12 --> the same "problem" of Eq. 19.15
Eq. 19.16 --> the same "problem" of Eq. 19.15
Eq. 19.17 --> the same "problem" of Eq. 19.15
Text below Eq. 19.17 --> the same "problem" of Eq. 19.15
§19.4.1 (book page 394) : values $\phi^-)$ --> values $\phi^-$
Eq. 19.18 --> the same "problem" of Eq. 19.15
Eq. 19.19 --> the same "problem" of Eq. 19.15
Eq. 19.20 --> the same "problem" of Eq. 19.15 but for $\arg \max$
Eq. 19.21 --> the same "problem" of Eq. 19.15 but for $\arg \max$
Page 396: "DQN se deep networks" --> "DQNs use deep networks"