Repositories list
76 repositories
- ShowUI (Public): Repository for ShowUI: One Vision-Language-Action Model for GUI Visual Agent
- Awesome-GUI-Agent (Public): 💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
- A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
- Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
- ROICtrl (Public)
- VideoSwap (Public)
- FQGAN (Public)
- Show-1 (Public)
- BoxDiff (Public): [ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion
- sparseformer (Public): (ICLR 2024, CVPR 2024) SparseFormer
- (NeurIPS 2024) Learning to Visual Question Answering, Asking and Assessment
- VisInContext (Public)
- EvolveDirector (Public)
- RingID (Public)
- MotionDirector (Public): [ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
- X-Adapter (Public)