ICCV 2023: Selected posters
Date: 2023-10-20
Vision-Lanuage models
- Delving into CLIP latent space for Video Anomaly Detection and Recognition
- SMAUG: Sparse Masked Autoencoder for Efficient Video-Language Pre-training
- Distliling Large Vision-Language Model with Out-of-Distribution Generalizabliity
- Promt Switch: Efficient CLIP Adaptation for Text-Video Retrieval
- Black Box Few-Shot Adaptation for Vision-Language models
- TinyCLIP: CLIP Distlilation via Affinity Mimicking and Weight Inheritance
- BlendShift: Adaptive Neighbour Correction and Replacement for Efficient Neighbour Contrastive Learning
Vision models
- SeiT: Storage-Efficient Vision Training with Tokens Using 1% of Pixel Storage
- Rethinking Vision Transformer from the View of Path Ensemble
- XiNet: Efficient Neural Networks for tinyML
- Convolutional Networks with Oriented 1D Kernels
- FLatten Transformer: Vision Transformer using Focused Linear Attention
- Rethinking Moblie Block for Efficient Attention-based Models
- BiViT: Extremely Compressed Binary Vision Transformers
Egocentric
- COPliOT: Human- Environment Collision Prediction and Localization from Multi-view Egocentric Videos
- EGO-ONLY: EGOCENTRIC ACTION DETECTION WITHOUT EXOCENTRIC TRANSFERRING
Video
- MiniROAD: Minimal RNN Framework for Online Action Detection
- Spatio-temporal Prompting Network for Robust Video Feature Extraction
- Label-Efficient Online Continual Object Detection in Streaming Video
- Efficient Video Prediction via Sparsely Conditioned Flow Matching