Visual Objects Programming Language

HAPX-CLIP: Human Activity Recognition with Visual Sequences and Language Prompts

Abstract: Video-based Human Activity Recognition (VHAR) is a core task in computer vision with a wide range of applications in healthcare, surveillance, and human–robot interaction. Traditional VHAR ...

IEEE

Watch and Read! A Visual Relation-Aware and Textual Evidence Enhanced Model for Multimodal Relation Extraction

Abstract: Multimodal relation extraction (MRE) aims at predicting the semantic relation between two entities given a hybrid context of a text and its related image. Though existing MRE methods have ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

HAPX-CLIP: Human Activity Recognition with Visual Sequences and Language Prompts

Watch and Read! A Visual Relation-Aware and Textual Evidence Enhanced Model for Multimodal Relation Extraction

Trending now