Data Driven Approaches for Image & Video Understanding: from Traditional to Zero-shot Supervised Learning