United States Patent
11,804,210
Title
Graphics Translation to Natural Language Based on System Learned Graphics Descriptions
Abstract
The present disclosure provides techniques for graphics translation. A plurality of natural language image descriptions is collected for an image of a product. An overall description for the image is generated using one or more models, based on the plurality of natural language image descriptions, by: identifying a set of shared descriptors used in at least a subset of the plurality of natural language image descriptions, and aggregating the set of shared descriptors to form the overall description. A first request to provide a description of the image is received, and the overall description is returned in response to the first request, where the overall description is output using one or more text-to-speech techniques.
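The two generation steps named in the abstract (identifying shared descriptors, then aggregating them) can be illustrated with a minimal sketch. It is not the patented method: simple word counting stands in for the "one or more models", and the function names, the stopword list, the `min_share` threshold, and the sample descriptions are all hypothetical. The text-to-speech output step is omitted.

```python
import re
from collections import Counter

# Hypothetical stopword list; a real system would likely use a fuller one.
STOPWORDS = {"a", "an", "the", "with", "and", "of", "on", "in", "is", "it"}


def shared_descriptors(descriptions, min_share=0.5):
    """Return words used in at least `min_share` of the descriptions.

    Assumption: a "descriptor" is a single lowercase word. Each word is
    counted at most once per description.
    """
    counts = Counter()
    for text in descriptions:
        words = set(re.findall(r"[a-z]+", text.lower())) - STOPWORDS
        counts.update(words)
    threshold = min_share * len(descriptions)
    kept = [word for word, n in counts.items() if n >= threshold]
    # Most widely shared first; ties broken alphabetically for stable output.
    return sorted(kept, key=lambda word: (-counts[word], word))


def overall_description(descriptions, min_share=0.5):
    """Aggregate the shared descriptors into one short description string."""
    return ", ".join(shared_descriptors(descriptions, min_share))


# Hypothetical user-supplied descriptions of one product image.
descriptions = [
    "A red leather handbag with a gold clasp",
    "Red handbag, gold buckle",
    "A small red leather bag",
]
print(overall_description(descriptions))
```

With these sample inputs, words that appear in only one description ("clasp", "buckle", "small", "bag") fall below the threshold and are dropped, while words used by two or more describers are kept. The returned string is what a text-to-speech step would then read out in response to a request.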