This day explores how convolutional and recurrent neural networks can be combined to generate effective descriptions of the content of images and video clips. Learn how to train a network using TensorFlow and the Microsoft Common Objects in Context (COCO) dataset to generate captions for images and video by:
- Implementing deep learning workflows such as image segmentation and text generation
- Comparing and contrasting data types, workflows, and frameworks
- Combining computer vision and natural language processing
This course is offered only to academia.