Abstract: Video summarization and captioning condense content by selecting keyframes and generating language descriptions, integrating both visual and textual perspectives. Existing video-and-language ...
Abstract: Content-Based Video Retrieval (CBVR) systems identify videos similar to a query by directly analyzing visual content, avoiding dependence on textual descriptions. In this paper, we propose a ...
OKVIS2-X is a multi-sensor SLAM system based on a factor graph, and is a non-trivial extension of the sparse, landmark-based OKVIS2. OKVIS2-X supports fusing multiple cameras and an IMU, with optional ...
Official repository for **KeyVID**, presented in **“KeyVID: Keyframe-Aware Video Diffusion for Audio-Synchronized Visual Animation.”** This work introduces a unified diffusion framework that generates ...