and hence advanced optimization techniques, such as momentum-based methods can be employed. It is difficult to come out of the saddle points, because of the plateau or flat regions, which gives an impression that the gradient is zero across dimensions.
Fig: 1