# XML annotation format

When you download annotations from the Computer Vision Annotation Tool (CVAT), you can choose one of several data formats. This document describes the XML annotation format. Each format has a version of the form X.Y (e.g. 1.0). In general, the major version (X) is incremented when the data format has incompatible changes, and the minor version (Y) is incremented when the data format is slightly modified (e.g. it has one or several extra fields inside the meta information). This document describes all changes for all versions of the XML annotation format.

## Version 1

There are currently two different formats, one for annotation mode and one for interpolation mode. Both formats have a common part, which is described below:

```xml
String: version of the format (e.g. 1.0)
Number: id of the task
String: some task name
Number: count of frames/images in the task
String: interpolation or annotation
Number: number of overlapped frames between segments
String: URL of a page which describes the task
String: date when the task was created
String: date when the task was updated
Number: id of the segment
Number: first frame
Number: last frame
String: URL (e.g. http://cvat.example.com/?id=213)
String: the author of the task
String: email of the author
String: date when the annotation was dumped
...
```

### Annotation

The data format for annotation mode is described below. In this mode, images are annotated. Each image can contain many different objects, and each object can have multiple attributes.

```xml
...
String: the attribute value
...
...
...
```

Example:

```xml
1.0
1063
My annotation task
75
annotation
0
2018-06-06 11:57:54.807162+03:00
2018-06-06 12:42:29.375251+03:00
3086
0
74
http://cvat.examle.com:8080/?id=3086
admin
2018-06-06 15:47:04.386866+03:00
false
a
true
a
false
b
false
c
```

### Interpolation

The data format for interpolation mode is described below. In this mode, frames are annotated. The annotation contains tracks.
Each track corresponds to an object which can be present on multiple frames. The same object cannot be present on the same frame in multiple locations. Each location of the object can have multiple attributes; even if an attribute is immutable for the object, it is cloned for each location (a known redundancy).

```xml
...
String: the attribute value
...
...
...
```

Example:

```xml
1.0
1062
My interpolation task
30084
interpolation
20
2018-05-31 14:13:36.483219+03:00
2018-06-06 13:56:32.113705+03:00
3085
0
30083
http://cvat.example.com:8080/?id=3085
admin
2018-06-06 15:52:11.138470+03:00
1
1
1
3
3
3
```
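
The two track rules for interpolation mode (an object appears at most once per frame, and immutable attributes are repeated for every location) can be checked on already-parsed data. The following is a minimal Python sketch, not part of CVAT. The `Location` class, the helper names, and the attribute names `type` and `moving` are assumptions made up for illustration, and the sketch deliberately does not parse the XML itself.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Location:
    """One location of a tracked object on a single frame (hypothetical structure)."""
    frame: int
    attributes: dict = field(default_factory=dict)


def duplicate_frames(locations):
    """Return the sorted frame numbers that occur more than once in one track."""
    counts = Counter(loc.frame for loc in locations)
    return sorted(frame for frame, n in counts.items() if n > 1)


def constant_attributes(locations):
    """Return attributes whose value is identical in every location of the track.

    These are candidates for the immutable attributes that the format
    repeats for each location.
    """
    if not locations:
        return {}
    first = locations[0].attributes
    return {
        name: value
        for name, value in first.items()
        if all(
            name in loc.attributes and loc.attributes[name] == value
            for loc in locations
        )
    }


# Hypothetical track: frame 1 appears twice, which the format does not allow.
track = [
    Location(frame=0, attributes={"type": "a", "moving": "false"}),
    Location(frame=1, attributes={"type": "a", "moving": "true"}),
    Location(frame=1, attributes={"type": "a", "moving": "true"}),
]

print(duplicate_frames(track))     # [1]
print(constant_attributes(track))  # {'type': 'a'}
```

A non-empty result from `duplicate_frames` indicates a track that violates the one-location-per-frame rule. The result of `constant_attributes` shows which values could be stored once per track by a consumer that wants to undo the redundancy.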