Revolutionizing Short Video Recommendations Using the Vision Mamba Framework

Authors

    Zahra Ebrahimian Department of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran
    Nima Yaqmuri Department of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran
    Mohammad Ali Akhaee * Department of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran akhaee@ut.ac.ir
https://doi.org/10.61838/jaiai.1.1.1

Keywords:

short video recommendation, state space models, visual representation learning, personalized recommendations, Vision Mamba

Abstract

The rapid proliferation of short-form video content on platforms such as TikTok, Instagram, and YouTube Shorts has introduced significant challenges for recommendation systems, as traditional methods often struggle to keep up with the dynamic nature of user engagement and the large influx of data. In this paper, we present the Vision Mamba (Vim) framework, a cutting-edge approach in visual representation learning that employs bidirectional state space models to improve both the efficiency and accuracy of short video recommendations. The Vim framework excels by effectively capturing temporal dynamics, long-range dependencies, and the contextual relevance within video sequences, addressing computational limitations in a resource-efficient manner. Furthermore, it supports real-time personalization and scalable deployment across modern content platforms. Experimental evaluations conducted on the MicroLens dataset demonstrate that the Vision Mamba framework significantly outperforms existing traditional models, setting a new benchmark in video recommendation systems and offering enhanced user experiences with more contextually relevant and personalized content delivery.

Downloads

Download data is not yet available.

Downloads

Published

2024-01-01

Submitted

2023-08-20

Revised

2023-10-23

Accepted

2023-10-27

How to Cite

Ebrahimian, Z., Yaqmuri, N., & Akhaee, M. A. (2024). Revolutionizing Short Video Recommendations Using the Vision Mamba Framework. Journal of Artificial Intelligence, Applications and Innovations, 1(1), 1-11. https://doi.org/10.61838/jaiai.1.1.1

Similar Articles

You may also start an advanced similarity search for this article.