Treffer 1 - 20
von 16.073
- 1
- 2
Seite in der Trefferliste auswählen
VINCIE: Unlocking In-context Image Editing from Video
Qu, Leigang ; Cheng, Feng ; Yang, Ziyan ; et al.
Towards Robust Multimodal Emotion Recognition under Missing Modalities and Distribution Shifts
Zhong, Guowei ; Huan, Ruohong ; Wu, Mingzhen ; et al.
Learning Quality from Complexity and Structure: A Feature-Fused XGBoost Model for Video Quality Assessment
Premkumar, Amritha ; Rajendran, Prajit T ; Menon, Vignesh V
Dynamic Sub-region Search in Homogeneous Collections Using CLIP
Jäckl, Bastian ; Kloda, Vojtěch ; Keim, Daniel A. ; et al.
Teaching Physical Awareness to LLMs through Sounds
Wang, Weiguo ; Nie, Andy ; Zhou, Wenrui ; et al.
Context-aware TFL: A Universal Context-aware Contrastive Learning Framework for Temporal Forgery Localization
Yin, Qilin ; Lu, Wei ; Luo, Xiangyang ; et al.
Learning Compact Vision Tokens for Efficient Large Multimodal Models
Tang, Hao ; Shen, Chengchao
Harmony-Aware Music-driven Motion Synthesis with Perceptual Constraint on UGC Datasets
Wu, Xinyi ; Wang, Haohong ; Katsaggelos, Aggelos K.
From Swath to Full-Disc: Advancing Precipitation Retrieval with Multimodal Knowledge Expansion
Wang, Zheng ; Ying, Kai ; Xu, Bin ; et al.
Experimental Evaluation of Static Image Sub-Region-Based Search Models Using CLIP
Jäckl, Bastian ; Kloda, Vojtěch ; Keim, Daniel A. ; et al.
An Efficient Digital Watermarking Technique for Small Scale devices
Talathi, Kaushik ; Biswas, Aparna Santra
SVD: Spatial Video Dataset
Izadimehr, M. H. ; Ghanbari, Milad ; Chen, Guodong ; et al.
Optimization-Free Universal Watermark Forgery with Regenerative Diffusion Models
Zhu, Chaoyi ; Li, Zaitang ; Yang, Renyi ; et al.
DeepFake Doctor: Diagnosing and Treating Audio-Video Fake Detection
Klemt, Marcel ; Segna, Carlotta ; Rohrbach, Anna
Beyond the Desktop: XR-Driven Segmentation with Meta Quest 3 and MX Ink
de Paiva, Lisle Faray ; Luijten, Gijs ; Santos, Ana Sofia Ferreira ; et al.
Truth in the Few: High-Value Data Selection for Efficient Multi-Modal Reasoning
Li, Shenshen ; Deng, Kaiyuan ; Wang, Lei ; et al.
SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms
Batra, Arnesh ; Kumar, Anushk ; Khemani, Jashn ; et al.
Photoreal Scene Reconstruction from an Egocentric Device
Lv, Zhaoyang ; Monge, Maurizio ; Chen, Ka ; et al.
Sounding that Object: Interactive Object-Aware Image to Audio Generation
Li, Tingle ; Huang, Baihe ; Zhuang, Xiaobin ; et al.
Conformer-based Ultrasound-to-Speech Conversion
Ibrahimov, Ibrahim ; Csaba, Zainkó ; Gosztolya, Gábor
- 1
- 2