GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published 5 days ago • 29
Typhoon OCR: Open Vision-Language Model For Thai Document Extraction Paper • 2601.14722 • Published 5 days ago • 14
PP-OCRv5 Collection PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated Sep 15, 2025 • 51
DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models Paper • 2512.02556 • Published Dec 2, 2025 • 253
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 278
SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models Paper • 2511.15605 • Published Nov 19, 2025 • 24
TurkColBERT: A Benchmark of Dense and Late-Interaction Models for Turkish Information Retrieval Paper • 2511.16528 • Published Nov 20, 2025 • 23
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published Oct 16, 2025 • 112
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale Paper • 2510.14979 • Published Oct 16, 2025 • 67
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge Feb 7, 2025 • 273
ColModernVBERT Collection Resources for ColModernVBERT – the document retrieval–optimized variant of ModernVBERT • 5 items • Updated Oct 3, 2025 • 7
ModernVBERT: Towards Smaller Visual Document Retrievers Paper • 2510.01149 • Published Oct 1, 2025 • 31