Demo Space of Orient-Anything
A Generalist Diffusion Model for Vision Perception
Detect objects in images using text prompts
Generate captions for images
Segment and caption objects in images and videos