Why and When Can Deep – but Not Shallow – Networks Avoid the Curse of Dimensionality: a Review