Results 1 - 20 of 16,073

1

VINCIE: Unlocking In-context Image Editing from Video
Qu, Leigang ; Cheng, Feng ; Yang, Ziyan ; et al.

Computer Science - Compu... Computer Science - Artif... Computer Science - Compu... Computer Science - Machi... Computer Science - Multi...
Report
2

Towards Robust Multimodal Emotion Recognition under Missing Modalities and Distribution Shifts
Zhong, Guowei ; Huan, Ruohong ; Wu, Mingzhen ; et al.

Computer Science - Compu... Computer Science - Compu... Computer Science - Machi... Computer Science - Multi...
Report
3

Learning Quality from Complexity and Structure: A Feature-Fused XGBoost Model for Video Quality Assessment
Premkumar, Amritha ; Rajendran, Prajit T ; Menon, Vignesh V

Computer Science - Multi...
Report
4

Dynamic Sub-region Search in Homogeneous Collections Using CLIP
Jäckl, Bastian ; Kloda, Vojtěch ; Keim, Daniel A. ; et al.

Computer Science - Multi... 68U10 H.3.3 I.4.10 H.2.8
Report
5

Teaching Physical Awareness to LLMs through Sounds
Wang, Weiguo ; Nie, Andy ; Zhou, Wenrui ; et al.

Computer Science - Sound Computer Science - Artif... Computer Science - Multi... Computer Science - Robot... Electrical Engineering a...
Report
6

Context-aware TFL: A Universal Context-aware Contrastive Learning Framework for Temporal Forgery Localization
Yin, Qilin ; Lu, Wei ; Luo, Xiangyang ; et al.

Computer Science - Compu... Computer Science - Multi...
Report
7

Learning Compact Vision Tokens for Efficient Large Multimodal Models
Tang, Hao ; Shen, Chengchao

Computer Science - Compu... Computer Science - Artif... Computer Science - Compu... Computer Science - Multi...
Report
8

Harmony-Aware Music-driven Motion Synthesis with Perceptual Constraint on UGC Datasets
Wu, Xinyi ; Wang, Haohong ; Katsaggelos, Aggelos K.

Computer Science - Multi...
Report
9

From Swath to Full-Disc: Advancing Precipitation Retrieval with Multimodal Knowledge Expansion
Wang, Zheng ; Ying, Kai ; Xu, Bin ; et al.

Computer Science - Compu... Computer Science - Infor... Computer Science - Multi...
Report
10

Experimental Evaluation of Static Image Sub-Region-Based Search Models Using CLIP
Jäckl, Bastian ; Kloda, Vojtěch ; Keim, Daniel A. ; et al.

Computer Science - Multi... Computer Science - Compu... 68U10 H.3.3 I.4.10 H.2.8
Report
11

An Efficient Digital Watermarking Technique for Small Scale devices
Talathi, Kaushik ; Biswas, Aparna Santra

Computer Science - Multi... Computer Science - Crypt...
Report
12

SVD: Spatial Video Dataset
Izadimehr, M. H. ; Ghanbari, Milad ; Chen, Guodong ; et al.

Computer Science - Multi...
Report
13

Optimization-Free Universal Watermark Forgery with Regenerative Diffusion Models
Zhu, Chaoyi ; Li, Zaitang ; Yang, Renyi ; et al.

Computer Science - Multi... Computer Science - Artif... Computer Science - Crypt...
Report
14

DeepFake Doctor: Diagnosing and Treating Audio-Video Fake Detection
Klemt, Marcel ; Segna, Carlotta ; Rohrbach, Anna

Computer Science - Multi... Computer Science - Artif... Computer Science - Sound Electrical Engineering a...
Report
15

Beyond the Desktop: XR-Driven Segmentation with Meta Quest 3 and MX Ink
de Paiva, Lisle Faray ; Luijten, Gijs ; Santos, Ana Sofia Ferreira ; et al.

Computer Science - Human... Computer Science - Compu... Computer Science - Graph... Computer Science - Multi...
Report
16

Truth in the Few: High-Value Data Selection for Efficient Multi-Modal Reasoning
Li, Shenshen ; Deng, Kaiyuan ; Wang, Lei ; et al.

Computer Science - Compu... Computer Science - Artif... Computer Science - Multi...
Report
17

SocialDF: Benchmark Dataset and Detection Model for Mitigating Harmful Deepfake Content on Social Media Platforms
Batra, Arnesh ; Kumar, Anushk ; Khemani, Jashn ; et al.

Computer Science - Machi... Computer Science - Multi...
Report
18

Photoreal Scene Reconstruction from an Egocentric Device
Lv, Zhaoyang ; Monge, Maurizio ; Chen, Ka ; et al.

Computer Science - Compu... Computer Science - Artif... Computer Science - Graph... Computer Science - Human... Computer Science - Multi...
Report
19

Sounding that Object: Interactive Object-Aware Image to Audio Generation
Li, Tingle ; Huang, Baihe ; Zhuang, Xiaobin ; et al.

Computer Science - Compu... Computer Science - Machi... Computer Science - Multi... Computer Science - Sound Electrical Engineering a...
Report
20

Conformer-based Ultrasound-to-Speech Conversion
Ibrahimov, Ibrahim ; Zainkó, Csaba ; Gosztolya, Gábor

Computer Science - Sound Computer Science - Multi... Electrical Engineering a...
Report
