Field of AI in which systems are designed to answer natural-language questions about visual content, such as images or videos.
Generality: 625