Explain maximum likelihood estimation in deep learning.