Natural-language retrieval of images based on descriptive captions