AI technique that involves automatically identifying correspondences between textual descriptions and visual elements within images.
Generality: 480