Alexey Gritsenko's picture

5 1

Alexey Gritsenko

AlexeyG

·

AlexeyG

AI & ML interests

None yet

Organizations

authored a paper 10 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 156

authored a paper about 1 year ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 133

authored 2 papers over 1 year ago

PaliGemma: A versatile 3B VLM for transfer

Paper • 2407.07726 • Published Jul 10, 2024 • 72

VUT: Versatile UI Transformer for Multi-Modal Multi-Task User Interface Modeling

Paper • 2112.05692 • Published Dec 10, 2021

authored a paper about 2 years ago

SCENIC: A JAX Library for Computer Vision Research and Beyond

Paper • 2110.11403 • Published Oct 18, 2021

authored 4 papers over 2 years ago

Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution

Paper • 2307.06304 • Published Jul 12, 2023 • 34

Simple Open-Vocabulary Object Detection with Vision Transformers

Paper • 2205.06230 • Published May 12, 2022 • 3

Video Diffusion Models

Paper • 2204.03458 • Published Apr 7, 2022 • 5

Scaling Open-Vocabulary Object Detection

Paper • 2306.09683 • Published Jun 16, 2023 • 14