Online Learning as an LQG Optimal Control Problem with Random Matrices