Method partially based on the excellent work described in:
 Ranftl, René, Alexey Bochkovskiy, and Vladlen Koltun. "Vision transformers for dense prediction." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021.  Ranftl, René, et al. "Towards robust monocular depth estimation: Mixing datasets for zero-shot cross-dataset transfer." IEEE transactions on pattern analysis and machine intelligence (2020).  Esser, Patrick, Robin Rombach, and Bjorn Ommer. "Taming transformers for high-resolution image synthesis." Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2021.
Terms of Service
All images are compared against multiple databases of illegal images such that any match against this database will be reported to the appropriate authorities.
No warranty expressed or implied is provided as to the functioning of this service.
Attempting to process sexually explicit, violent, hateful or other objectionable material may result in a site-wide ban from this service.
All images are stored temporarily server-side insofar as to complete the image or video processing task.
When processing is complete, both the input and processed image are irretrievably destroyed.
When processing is complete, the resulting image or video is stored for (1) one hour before being irretrievably destroyed.