[eccv2024] adapt2reward—adapting video-language models to generalizable robotic rewards
Published 3 weeks ago • 5 plays • Length 9:13Download video MP4
Download video MP3
Similar videos
-
4:53
[eccv 2024] self-adapting large visual-language models to edge devices across visual modalities
-
4:58
[eccv 2024] exploring pre-trained text-to-video diffusion models for r-vos
-
3:43
[eccv 2024] local action-guided motion diffusion model for text-to-motion generation
-
5:18
[eccv 2024 oral] textdiffuser-2: unleashing the power of language models for text rendering
-
6:21
eccv 2024: a comprehensive study of multimodal large language models for image quality assessment
-
4:57
investigating compositional generalization in clip models- eccv 2024
-
5:01
diversedream: diverse text-to-3d synthesiswith augmented text embedding (eccv 2024)
-
5:42
densenets reloaded: paradigm shift beyond resnets and vits (rdnet, eccv 2024)
-
39:27
team 2 | lo fi emulation @ whole brain emulation workshop 2024
-
48:17
kdd2024 - next-generation intelligent assistants for wearable devices
-
4:50
[eccv 2024] [depictqa] depicted image quality assessment with multi-modal language models
-
4:45
[eccv 2024] r^2-tuning: efficient image-to-video transfer learning for video temporal grounding
-
5:07
[eccv 2024] adaptive multi-task learning for few-shot object detection
-
4:49
[eccv 2024] lightendiffusion: unsupervised low-light image enhancement with latent-retinex diffusion
-
5:00
[eccv 2024] adanat: exploring adaptive policy for token-based image generation
-
6:01
[eccv 2024] blink: multimodal large language models can see but not perceive
-
5:10
eccv 2024: imma: immunizing text-to-image models against malicious adaptation
-
5:00
[eccv 2024] introducing routing functions to vl peft with low-rank bottlenecks
-
3:51
eccv 2024: sur2f a hybrid representation for high-quality and efficient surface reconstruction
-
4:52
[eccv 2024]tp2o: creative text pair-to-object generation using balance swap-sampling