- Simple Open-Vocabulary Object Detection with Vision Transformers
  Paper • 2205.06230 • Published • 1
- google/owlvit-base-patch32
  Zero-Shot Object Detection • Updated • 364k • 120
- Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers
  Paper • 2305.07011 • Published • 5
- Multi-Modal Classifiers for Open-Vocabulary Object Detection
  Paper • 2306.05493 • Published • 6
Marcus Gawronsky
marcusinthesky
AI & ML interests
Representation Learning
Organizations
Collections
7
- Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference
  Paper • 2403.14520 • Published • 32
- ZigMa: Zigzag Mamba Diffusion Model
  Paper • 2403.13802 • Published • 17
- SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series
  Paper • 2403.15360 • Published • 11
- MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection
  Paper • 2403.19888 • Published • 9
Models
1
Datasets
None public yet