Annotations should be allowed to be stored in their own "space".
@perri.akiva brought up an issue that if a mask is created that aligns with a "video", forcing the user to transform it to align with an "image" might cause artifacts in raster annotations.
Recall, there are 3 spaces that we are concerned with: "video-space", "image-space", and "auxiliary-space". Each auxiliary image contains a transform that warps it into "image-space". Each image contains the transform that warps it into "video-space".
A potential workaround might be to add a 4th "annotation-space" and each annotation can contain a "warp_ann_to_img" property (defaulting to the identity) that can be used to save annotations in any native coordinates, but preserve alignment with the rest of the data.