The Master Algorithm: How the Quest for the Ultimate Learning Machine Will Remake Our World
amazon.com
These two dimensions of the reinforcement-learning problem have come to be known by the technical terms of the policy—what to do when—and the value function—what rewards or punishments to expect.
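To make the two terms in the highlight concrete, here is a minimal sketch in Python. It is not taken from the book: the four-state corridor, the `step` function, the reward of 1.0 at the goal, and the discount `GAMMA` are all assumptions chosen for illustration. The policy is a table of what to do in each state, and the value function is the discounted reward to expect from each state when following that policy.

```python
# Toy corridor: states 0..3, where state 3 is the goal (hypothetical setup).
GAMMA = 0.9          # assumed discount factor
STATES = [0, 1, 2, 3]
TERMINAL = 3

def step(state, action):
    """Deterministic toy dynamics: return (next_state, reward)."""
    if action == "right":
        nxt = min(state + 1, TERMINAL)
    else:
        nxt = max(state - 1, 0)
    reward = 1.0 if nxt == TERMINAL else 0.0
    return nxt, reward

# Policy: what to do when. Here, always move right in non-terminal states.
policy = {s: "right" for s in STATES if s != TERMINAL}

def evaluate(policy, sweeps=100):
    """Value function: what reward to expect from each state under `policy`.

    Uses repeated in-place sweeps of the update
    value[s] = reward + GAMMA * value[next_state].
    """
    value = {s: 0.0 for s in STATES}
    for _ in range(sweeps):
        for s in policy:
            nxt, reward = step(s, policy[s])
            value[s] = reward + GAMMA * value[nxt]
    return value

value = evaluate(policy)
for s in STATES:
    print(s, policy.get(s, "(terminal)"), round(value[s], 3))
```

In this sketch, states closer to the goal get higher values, because the reward arrives sooner and is discounted less.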