Microsoft Said That Its AI Captions Can Describe Images 'As Well As People Do'

Humans can describe almost anything, even the most unimaginable things. This is because humans can relate objects to their surroundings to understand what those objects mean in a scene.