Popis: |
Image captioning is the process of generating a meaningful textual description to the image. The perfect caption for the image not only consists of objects and their attributes, it also concentrates on the actions involved by the objects. There are two main tasks in Image captioning. The first and foremost task is correctly identifying objects present in the given image. Once all the objects are identified along with their attributes, the dense model is trained in order to identify the correct verbs or the actions in which these identified objects are involved. The second part in Image captioning is generating the syntactically correct natural language sentence which connects all the identified objects along with their attributes and actions. In this paper we have generated the captioning for affine transformed images using Flickr 8K dataset. |