AImageLab
Popular repositories
- dress-code Public
Dress Code: High-Resolution Multi-Category Virtual Try-On. ECCV 2022
- meshed-memory-transformer Public
Meshed-Memory Transformer for Image Captioning. CVPR 2020
- multimodal-garment-designer Public
This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023
- show-control-and-tell Public
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
- novelty-detection Public
Latent space autoregression for novelty detection.
Repositories
- DICE Public
[ICCV 2025] What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models
- ScanDiff Public
This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV 2025
- MLLMs-FlowTracker Public
[CAIP 2025] Tracing Information Flow in LLaMA Vision: A Step Toward Multimodal Understanding
- Sanctuaria-Gaze Public
Sanctuaria-Gaze is a multimodal dataset of egocentric recordings from visits to four sanctuaries and an open-source framework for automatic gaze-based analysis. ACM JOCCH 2025.
- DitHub Public
- ReT Public
[CVPR 2025] Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval