


default search action
Mubarak Shah
Person information
- affiliation: University of Central Florida, Orlando, USA
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2026
[j152]Shaina Raza, Ashmal Vayani, Aditya Jain, Aravind Narayanan, Vahid Reza Khazaie, Syed Raza Bashir, Elham Dolatabadi, Gias Uddin, Christos Emmanouilidis, Rizwan Qureshi, Mubarak Shah:
VLDBench Evaluating multimodal disinformation with regulatory alignment. Inf. Fusion 130: 104092 (2026)- 2025
[j151]Matteo Pennisi
, Giovanni Bellitto, Simone Palazzo
, Isaak Kavasidis
, Mubarak Shah, Concetto Spampinato:
DiffExplainer: Towards cross-modal global explanations with diffusion models. Comput. Vis. Image Underst. 262: 104559 (2025)
[j150]Omkar Thawakar, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer
, Salman H. Khan, Jorma Laaksonen
, Mubarak Shah, Fahad Shahbaz Khan:
Video Instance Segmentation in an Open-World. Int. J. Comput. Vis. 133(1): 398-409 (2025)
[j149]Abdul Rehman
, Talha Meraj
, Aiman Mahmood Minhas, Ayisha Imran, Mohsen Ali, Waqas Sultani, Mubarak Shah
:
Leveraging sparse annotations for leukemia diagnosis on the large leukemia dataset. Medical Image Anal. 106: 103760 (2025)
[j148]Daochang Liu
, Qiyue Li
, Anh-Dung Dinh, Tingting Jiang
, Mubarak Shah
, Chang Xu
:
DiffAct++: Diffusion Action Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 47(3): 1644-1659 (2025)
[j147]Muhammad Awais
, Muzammal Naseer
, Salman Khan
, Rao Muhammad Anwer
, Hisham Cholakkal
, Mubarak Shah
, Ming-Hsuan Yang
, Fahad Shahbaz Khan
:
Foundation Models Defining a New Era in Vision: A Survey and Outlook. IEEE Trans. Pattern Anal. Mach. Intell. 47(4): 2245-2264 (2025)
[c375]Omkar Thawakar, Dinura Dissanayake, Ketan Pravin More, Ritesh Thawkar, Ahmed Heakl, Noor Ahsan, Yuhao Li, Mohammed Zumri, Jean Lahoud, Rao Muhammad Anwer, Hisham Cholakkal, Ivan Laptev, Mubarak Shah, Fahad Shahbaz Khan, Salman H. Khan:
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs. ACL (Findings) 2025: 24290-24315
[c374]Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Nicu Sebe
, Mubarak Shah:
Curriculum Direct Preference Optimization for Diffusion and Consistency Models. CVPR 2025: 2824-2834
[c373]Chuong Huynh, Jinyu Yang, Ashish Tawari, Mubarak Shah, Son Tran, Raffay Hamid, Trishul Chilimbi, Abhinav Shrivastava:
CoLLM: A Large Language Model for Composed Image Retrieval. CVPR 2025: 3994-4004
[c372]Chen Chen, Daochang Liu, Mubarak Shah, Chang Xu:
Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models. CVPR 2025: 8182-8191
[c371]Kai Hu, Feng Gao, Xiaohan Nie, Peng Zhou, Son Tran, Tal Neiman, Lingyun Wang, Mubarak Shah, Raffay Hamid, Bing Yin, Trishul Chilimbi:
M-LLM Based Video Frame Selection for Efficient Video Understanding. CVPR 2025: 13702-13712
[c370]Ashmal Vayani, Dinura Dissanayake, Hasindri Watawana, Noor Ahsan, Nevasini Sasikumar, Omkar Thawakar, Henok Biadglign Ademtew, Yahya Hmaiti, Amandeep Kumar, Kartik Kuckreja, Mykola Maslych, Wafa Al Ghallabi, Mihail Minkov Mihaylov, Chao Qin, Abdelrahman M. Shaker, Mike Zhang
, Mahardika Krisna Ihsani, Amiel Gian Esplana, Monil Gokani, Shachar Mirkin, Harsh Singh, Ashay Srivastava, Endre Hamerlik, Fathinah Asma Izzati, Fadillah Adamsyah Maani, Sebastian Cavada, Jenny Chim, Rohit Gupta, Sanjay Manjunath, Kamila Zhumakhanova, Feno Heriniaina Rabevohitra, Azril Hafizi Amirudin, Muhammad Ridzuan, Daniya Najiha Abdul Kareem, Ketan Pravin More, Kunyang Li, Pramesh Shakya, Muhammad Saad, Amirpouya Ghasemaghaei, Amirbek Djanibekov, Dilshod Azizov, Branislava Jankovic, Naman Bhatia, Alvaro Cabrera, Johan S. Obando-Ceron, Olympiah Otieno, Fabian Farestam, Muztoba Rabbani, Sanoojan Baliah, Santosh Sanjeev, Abduragim Shtanchaev, Maheen Fatima, Thao Nguyen, Amrin Kareem, Toluwani Aremu, Nathan Augusto Zacarias Xavier, Amit Bhatkal, Hawau Olamide Toyin, Aman Chadha, Hisham Cholakkal, Rao Muhammad Anwer, Michael Felsberg, Jorma Laaksonen
, Thamar Solorio, Monojit Choudhury, Ivan Laptev, Mubarak Shah, Salman H. Khan, Fahad Shahbaz Khan:
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages. CVPR 2025: 19565-19575
[c369]Yuning Cui, Syed Waqas Zamir, Salman H. Khan, Alois Knoll, Mubarak Shah, Fahad Shahbaz Khan:
AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation. ICLR 2025
[c368]Chen Chen, Daochang Liu, Mubarak Shah, Chang Xu:
Exploring Local Memorization in Diffusion Models via Bright Ending Attention. ICLR 2025
[c367]Prakash Chandra Chhipa, Gautam Vashishtha, Settur Jithamanyu, Rajkumar Saini, Mubarak Shah, Marcus Liwicki:
ASTrA: Adversarial Self-supervised Training with Adaptive-Attacks. ICLR 2025
[c366]Joseph Fioresi, Ishan Rajendrakumar Dave, Mubarak Shah:
ALBAR: Adversarial Learning approach to mitigate Biases in Action Recognition. ICLR 2025
[c365]Weitai Kang, Mengxue Qu, Jyoti Kini, Yunchao Wei, Mubarak Shah, Yan Yan:
Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention. ICLR 2025
[c364]Jeffrey A. Chan-Santiago, Praveen Tirupattur, Gaurav Kumar Nayak, Gaowen Liu, Mubarak Shah:
MGD3 : Mode-Guided Dataset Distillation using Diffusion Models. ICML 2025
[c363]David Robinson, Animesh Gupta, Rizwan Qureshi, Qiushi Fu, Mubarak Shah:
Strokevision-Bench: A Multimodal Video and 2D Pose Benchmark for Tracking Stroke Recovery. MLSP 2025: 1-6
[c362]Nyle Siddiqui, Florinel-Alin Croitoru, Gaurav Kumar Nayak, Radu Tudor Ionescu, Mubarak Shah:
DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-Id. WACV 2025: 1608-1617
[i225]Omkar Thawakar, Dinura Dissanayake, Ketan More, Ritesh Thawkar, Ahmed Heakl, Noor Ahsan, Yuhao Li, Mohammed Zumri, Jean Lahoud, Rao Muhammad Anwer
, Hisham Cholakkal, Ivan Laptev, Mubarak Shah, Fahad Shahbaz Khan, Salman H. Khan:
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs. CoRR abs/2501.06186 (2025)
[i224]Sirnam Swetha, Hilde Kuehne, Mubarak Shah:
TimeLogic: A Temporal Logic Benchmark for Video QA. CoRR abs/2501.07214 (2025)
[i223]Joseph Fioresi, Ishan Rajendrakumar Dave, Mubarak Shah:
ALBAR: Adversarial Learning approach to mitigate Biases in Action Recognition. CoRR abs/2502.00156 (2025)
[i222]Vishal Narnaware, Ashmal Vayani, Rohit Gupta, Sirnam Swetha, Mubarak Shah:
SB-Bench: Stereotype Bias Benchmark for Large Multimodal Models. CoRR abs/2502.08779 (2025)
[i221]Shaina Raza, Ashmal Vayani, Aditya Jain, Aravind Narayanan, Vahid Reza Khazaie, Syed Raza Bashir
, Elham Dolatabadi, Gias Uddin, Christos Emmanouilidis, Rizwan Qureshi, Mubarak Shah:
VLDBench: Vision Language Models Disinformation Detection Benchmark. CoRR abs/2502.11361 (2025)
[i220]Kai Hu, Feng Gao, Xiaohan Nie, Peng Zhou, Son Tran, Tal Neiman, Lingyun Wang, Mubarak Shah, Raffay Hamid, Bing Yin, Trishul Chilimbi:
M-LLM Based Video Frame Selection for Efficient Video Understanding. CoRR abs/2502.19680 (2025)
[i219]Komal Kumar, Tajamul Ashraf, Omkar Thawakar, Rao Muhammad Anwer
, Hisham Cholakkal, Mubarak Shah, Ming-Hsuan Yang, Phillip H. S. Torr, Salman H. Khan, Fahad Shahbaz Khan:
LLM Post-Training: A Deep Dive into Reasoning Large Language Models. CoRR abs/2502.21321 (2025)
[i218]Ron Campos, Ashmal Vayani, Parth Parag Kulkarni, Rohit Gupta, Aritra Dutta
, Mubarak Shah:
GAEA: A Geolocation Aware Conversational Model. CoRR abs/2503.16423 (2025)
[i217]Chuong Huynh, Jinyu Yang, Ashish Tawari, Mubarak Shah, Son Tran, Raffay Hamid, Trishul Chilimbi, Abhinav Shrivastava:
CoLLM: A Large Language Model for Composed Image Retrieval. CoRR abs/2503.19910 (2025)
[i216]Abdul Rehman, Talha Meraj, Aiman Mahmood Minhas, Ayisha Imran, Mohsen Ali, Waqas Sultani, Mubarak Shah:
Leveraging Sparse Annotations for Leukemia Diagnosis on the Large Leukemia Dataset. CoRR abs/2504.02602 (2025)
[i215]Mohammad A. A. K. Jalwana, Naveed Akhtar, Ajmal Mian, Nazanin Rahnavard, Mubarak Shah:
On Transfer-based Universal Attacks in Pure Black-box Setting. CoRR abs/2504.08866 (2025)
[i214]Chen Chen, Daochang Liu, Mubarak Shah, Chang Xu:
Enhancing Privacy-Utility Trade-offs to Mitigate Memorization in Diffusion Models. CoRR abs/2504.18032 (2025)
[i213]Florinel-Alin Croitoru, Vlad Hondru, Marius Popescu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Mubarak Shah:
MAVOS-DD: Multilingual Audio-Video Open-Set Deepfake Detection Benchmark. CoRR abs/2505.11109 (2025)
[i212]Shaina Raza, Aravind Narayanan, Vahid Reza Khazaie, Ashmal Vayani, Mukund S. Chettiar, Amandeep Singh, Mubarak Shah, Deval Pandya:
HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation. CoRR abs/2505.11454 (2025)
[i211]Sushant Gautam, Cise Midoglu, Vajira Thambawita, Michael A. Riegler, Pål Halvorsen, Mubarak Shah:
SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding. CoRR abs/2505.16630 (2025)
[i210]Sagar Sapkota, Mohammad Saqib Hasan, Mubarak Shah, Santu Karmaker:
Multi-Party Conversational Agents: A Survey. CoRR abs/2505.18845 (2025)
[i209]Jeffrey A. Chan-Santiago, Praveen Tirupattur, Gaurav Kumar Nayak, Gaowen Liu, Mubarak Shah:
MGD3: Mode-Guided Dataset Distillation using Diffusion Models. CoRR abs/2505.18963 (2025)
[i208]Tajamul Ashraf, Amal Saqib, Hanan Ghani, Muhra AlMahri, Yuhao Li, Noor Ahsan, Umair Nawaz, Jean Lahoud, Hisham Cholakkal, Mubarak Shah, Philip Torr, Fahad Shahbaz Khan, Rao Muhammad Anwer, Salman H. Khan:
Agent-X: Evaluating Deep Multimodal Reasoning in Vision-Centric Agentic Tasks. CoRR abs/2505.24876 (2025)
[i207]Animesh Gupta, Jay Parmar, Ishan Rajendrakumar Dave, Mubarak Shah:
From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos. CoRR abs/2506.05274 (2025)
[i206]Bhuiyan Sanjid Shafique, Ashmal Vayani, Muhammad Maaz, Hanoona Abdul Rasheed, Dinura Dissanayake, Mohammed Irfan Kurpath, Yahya Hmaiti, Go Inoue, Jean Lahoud, Md. Safirur Rashid, Shadid Intisar Quasem, Maheen Fatima, Franco Vidal, Mykola Maslych, Ketan More, Sanoojan Baliah, Hasindri Watawana, Yuhao Li, Fabian Farestam, Leon Schaller, Roman Tymtsiv, Simon Weber, Hisham Cholakkal, Ivan Laptev, Shin'ichi Satoh, Michael Felsberg, Mubarak Shah, Salman H. Khan, Fahad Shahbaz Khan:
A Culturally-diverse Multilingual Multimodal Video Benchmark & Model. CoRR abs/2506.07032 (2025)
[i205]Sirnam Swetha, Rohit Gupta, Parth Parag Kulkarni, David G. Shatwell, Jeffrey A. Chan-Santiago, Nyle Siddiqui, Joseph Fioresi, Mubarak Shah:
ImplicitQA: Going beyond frames towards Implicit Video Reasoning. CoRR abs/2506.21742 (2025)
[i204]David G. Shatwell, Ishan Rajendrakumar Dave, Sirnam Swetha, Mubarak Shah:
GT-Loc: Unifying When and Where in Images Through a Joint Embedding Space. CoRR abs/2507.10473 (2025)
[i203]Keerthi Veeramachaneni, Praveen Tirupattur, Amrit Singh Bedi, Mubarak Shah:
Leveraging Pre-Trained Visual Models for AI-Generated Video Detection. CoRR abs/2507.13224 (2025)
[i202]Kunyang Li, Jeffrey A. Chan-Santiago, Sarinda Dhanesh Samarasinghe, Gaowen Liu, Mubarak Shah:
GVD: Guiding Video Diffusion Model for Scalable Video Distillation. CoRR abs/2507.22360 (2025)
[i201]Omkar Thawakar, Dmitry Demidov, Ritesh Thawkar, Rao Muhammad Anwer, Mubarak Shah, Fahad Shahbaz Khan, Salman Khan:
Beyond Simple Edits: Composed Video Retrieval with Dense Modifications. CoRR abs/2508.14039 (2025)
[i200]Younggun Kim, Sirnam Swetha, Fazil Kagdi, Mubarak Shah:
Safe-LLaVA: A Privacy-Preserving Vision-Language Dataset and Benchmark for Biometric Safety. CoRR abs/2509.00192 (2025)
[i199]Sabbir Mollah, Rohit Gupta, Sirnam Swetha, Qingyang Liu, Ahnaf Munir, Mubarak Shah:
The Telephone Game: Evaluating Semantic Drift in Unified Models. CoRR abs/2509.04438 (2025)
[i198]David Robinson, Animesh Gupta, Rizwan Quershi, Qiushi Fu, Mubarak Shah:
STROKEVISION-BENCH: A Multimodal Video And 2D Pose Benchmark For Tracking Stroke Recovery. CoRR abs/2509.07994 (2025)
[i197]Gabriele Berton, Jayakrishnan Unnikrishnan, Son Tran, Mubarak Shah:
CompLLM: Compression for Long Context Q&A. CoRR abs/2509.19228 (2025)
[i196]Kevin Zhai, Utsav Singh, Anirudh Thatipelli, Souradip Chakraborty, Anit Kumar Sahu, Furong Huang, Amrit Singh Bedi, Mubarak Shah:
MIRA: Towards Mitigating Reward Hacking in Inference-Time Alignment of T2I Diffusion Models. CoRR abs/2510.01549 (2025)
[i195]Guangyu Sun, Archit Singhal, Burak Uzkent, Mubarak Shah, Chen Chen, Garin Kessler:
From Frames to Clips: Efficient Key Clip Selection for Long-Form Video Understanding. CoRR abs/2510.02262 (2025)
[i194]Jyoti Kini, Rohit Gupta, Mubarak Shah:
Cross-View Open-Vocabulary Object Detection in Aerial Imagery. CoRR abs/2510.03858 (2025)
[i193]Rohit Gupta, Anirban Roy, Claire Christensen, Sujeong Kim, Sarah Gerard, Madeline Cincebeaux, Ajay Divakaran, Todd Grindal, Mubarak Shah:
Class Prototypes based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos. CoRR abs/2510.11204 (2025)
[i192]Simone Carnemolla, Matteo Pennisi, Sarinda Samarasinghe, Giovanni Bellitto, Simone Palazzo, Daniela Giordano, Mubarak Shah, Concetto Spampinato:
DEXTER: Diffusion-Guided EXplanations with TExtual Reasoning for Vision Models. CoRR abs/2510.14741 (2025)
[i191]Nyle Siddiqui, Rohit Gupta, Sirnam Swetha, Mubarak Shah:
StretchySnake: Flexible SSM Training Unlocks Action Recognition Across Spatio-Temporal Scales. CoRR abs/2510.16209 (2025)
[i190]Yimeng Zhang, Jiri Gesi, Ran Xue, Tian Wang, Ziyi Wang, Yuxuan Lu, Sinong Zhan, Huimin Zeng, Qingjun Cui, Yufan Guo, Jing Huang, Mubarak Shah, Dakuo Wang:
See, Think, Act: Online Shopper Behavior Simulation with VLM Agents. CoRR abs/2510.19245 (2025)
[i189]Joseph Fioresi, Ishan Rajendrakumar Dave, Mubarak Shah:
Privacy Beyond Pixels: Latent Anonymization for Privacy-Preserving Video Understanding. CoRR abs/2511.08666 (2025)
[i188]Adeel Yousaf, Joseph Fioresi, James Beetham, Amrit Singh Bedi, Mubarak Shah:
SafeR-CLIP: Mitigating NSFW Content in Vision-Language Models While Preserving Pre-Trained Knowledge. CoRR abs/2511.16743 (2025)
[i187]Tianxingjian Ding, Yuanhao Zou, Chen Chen, Mubarak Shah, Yu Tian:
CLARITY: Medical World Model for Guiding Treatment Decisions by Modeling Context-Aware Disease Trajectories in Latent Space. CoRR abs/2512.08029 (2025)- 2024
[j146]Ce Zheng
, Wenhan Wu
, Chen Chen
, Taojiannan Yang
, Sijie Zhu
, Ju Shen
, Nasser Kehtarnavaz
, Mubarak Shah
:
Deep Learning-based Human Pose Estimation: A Survey. ACM Comput. Surv. 56(1): 11:1-11:37 (2024)
[j145]Fabio De Sousa Ribeiro
, Kevin Duarte
, Miles Everett
, Georgios Leontidis
, Mubarak Shah
:
Object-centric Learning with Capsule Networks: A Survey. ACM Comput. Surv. 56(11): 291:1-291:291 (2024)
[j144]Jyoti Kini
, Fahad Shahbaz Khan, Salman Khan, Mubarak Shah:
CT-VOS: Cutout prediction and tagging for self-supervised video object segmentation. Comput. Vis. Image Underst. 238: 103860 (2024)
[j143]Florinel-Alin Croitoru, Nicolae-Catalin Ristea, Dana Dascalescu, Radu Tudor Ionescu
, Fahad Shahbaz Khan, Mubarak Shah:
Lightning fast video anomaly detection via multi-scale adversarial distillation. Comput. Vis. Image Underst. 247: 104074 (2024)
[j142]Florinel-Alin Croitoru
, Vlad Hondru, Radu Tudor Ionescu
, Mubarak Shah
:
Reverse Stable Diffusion: What prompt was used to generate this image? Comput. Vis. Image Underst. 249: 104210 (2024)
[j141]Neelu Madan
, Nicolae-Catalin Ristea
, Radu Tudor Ionescu
, Kamal Nasrollahi
, Fahad Shahbaz Khan
, Thomas B. Moeslund
, Mubarak Shah
:
Self-Supervised Masked Convolutional Transformer Block for Anomaly Detection. IEEE Trans. Pattern Anal. Mach. Intell. 46(1): 525-542 (2024)
[j140]Simone Palazzo
, Concetto Spampinato
, Isaak Kavasidis
, Daniela Giordano, Joseph Schmidt
, Mubarak Shah
:
Rebuttal to "Comments on 'Decoding Brain Representations by Multimodal Learning of Neural Activity and Visual Features' ". IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 11540-11542 (2024)
[j139]Saeed Vahidian
, Mahdi Morafah
, Chen Chen, Mubarak Shah
, Bill Lin
:
Rethinking Data Heterogeneity in Federated Learning: Introducing a New Notion and Standard Benchmarks. IEEE Trans. Artif. Intell. 5(3): 1386-1397 (2024)
[j138]Bo Miao
, Mohammed Bennamoun
, Yongsheng Gao
, Mubarak Shah
, Ajmal Mian
:
Temporally Consistent Referring Video Object Segmentation With Hybrid Memory. IEEE Trans. Circuits Syst. Video Technol. 34(11): 11373-11385 (2024)
[j137]Ashkan Esmaeili
, Marzieh Edraki
, Nazanin Rahnavard, Ajmal Mian
, Mubarak Shah
:
Low-Rank and Sparse Decomposition for Low-Query Decision-Based Adversarial Attacks. IEEE Trans. Inf. Forensics Secur. 19: 1561-1575 (2024)
[j136]Anas Zafar
, Danyal Aftab
, Rizwan Qureshi
, Xinqi Fan
, Pingjun Chen, Jia Wu
, Hazrat Ali
, Shah Nawaz
, Sheheryar Khan, Mubarak Shah
:
Single Stage Adaptive Multi-Attention Network for Image Restoration. IEEE Trans. Image Process. 33: 2924-2935 (2024)
[j135]Huan Lei
, Naveed Akhtar
, Mubarak Shah
, Ajmal Mian
:
Mesh Convolution With Continuous Filters for 3-D Surface Parsing. IEEE Trans. Neural Networks Learn. Syst. 35(10): 14863-14877 (2024)
[c361]Ishan Rajendrakumar Dave, Simon Jenni, Mubarak Shah:
No More Shortcuts: Realizing the Potential of Temporal Self-Supervision. AAAI 2024: 1481-1491
[c360]Nyle Siddiqui, Praveen Tirupattur, Mubarak Shah:
DVANet: Disentangling View and Action Features for Multi-View Action Recognition. AAAI 2024: 4873-4881
[c359]Mamshad Nayeem Rizve, Fan Fei, Jayakrishnan Unnikrishnan
, Son Tran, Benjamin Z. Yao, Belinda Zeng, Mubarak Shah, Trishul Chilimbi:
VidLA: Video-Language Alignment at Scale. CVPR 2024: 14043-14055
[c358]Nicolae-Catalin Ristea, Florinel-Alin Croitoru, Radu Tudor Ionescu, Marius Popescu, Fahad Shahbaz Khan, Mubarak Shah:
Self-Distilled Masked Auto-Encoders are Efficient Video Anomaly Detectors. CVPR 2024: 15984-15995
[c357]Aritra Dutta
, Srijan Das, Jacob Nielsen
, Rajatsubhra Chakraborty
, Mubarak Shah:
Multiview Aerial Visual Recognition (MAVREC): Can Multi-View Improve Aerial Visual Perception? CVPR 2024: 22678-22690
[c356]Omkar Thawakar, Muzammal Naseer, Rao Muhammad Anwer
, Salman H. Khan, Michael Felsberg, Mubarak Shah, Fahad Shahbaz Khan:
Composed Video Retrieval via Enriched Context and Discriminative Embeddings. CVPR 2024: 26886-26896
[c355]Weitai Kang
, Gaowen Liu, Mubarak Shah, Yan Yan:
SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding. ECCV (38) 2024: 57-75
[c354]Sirnam Swetha
, Jinyu Yang
, Tal Neiman
, Mamshad Nayeem Rizve
, Son Tran, Benjamin Z. Yao, Trishul Chilimbi, Mubarak Shah:
X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs. ECCV (6) 2024: 146-162
[c353]Rohit Gupta
, Mamshad Nayeem Rizve, Jayakrishnan Unnikrishnan, Ashish Tawari, Son Tran, Mubarak Shah, Benjamin Z. Yao, Trishul Chilimbi:
Open Vocabulary Multi-label Video Classification. ECCV (39) 2024: 276-293
[c352]Parth Parag Kulkarni, Gaurav Kumar Nayak, Mubarak Shah:
CityGuessr: City-Level Video Geo-Localization on a Global Scale. ECCV (63) 2024: 293-311
[c351]Peiyu Yang
, Naveed Akhtar
, Mubarak Shah
, Ajmal Mian
:
Regulating Model Reliance on Non-robust Features by Smoothing Input Marginal Density. ECCV (57) 2024: 329-347
[c350]Prakash Chandra Chhipa
, Meenakshi Subhash Chippa, Kanjar De
, Rajkumar Saini
, Marcus Liwicki
, Mubarak Shah
:
Möbius Transform for Mitigating Perspective Distortions in Representation Learning. ECCV (73) 2024: 345-363
[c349]Ishan Rajendrakumar Dave
, Fabian Caba Heilbron
, Mubarak Shah
, Simon Jenni
:
Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets. ECCV (8) 2024: 371-388
[c348]Ishan Rajendrakumar Dave
, Mamshad Nayeem Rizve
, Mubarak Shah
:
FinePseudo: Improving Pseudo-labelling Through Temporal-Alignablity for Semi-supervised Fine-Grained Action Recognition. ECCV (8) 2024: 389-408
[c347]Manu S. Pillai, Mamshad Nayeem Rizve, Mubarak Shah:
GAReT: Cross-View Video Geolocalization with Adapters and Auto-Regressive Transformers. ECCV (61) 2024: 466-483
[c346]Ishan Rajendrakumar Dave, Tristan de Blegiers, Chen Chen, Mubarak Shah:
Codamal: Contrastive Domain Adaptation for Malaria Detection in Low-Cost Microscopes. ICIP 2024: 3848-3853
[c345]Aakash Kumar, Chen Chen, Ajmal Mian
, Neils Lobo, Mubarak Shah:
Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data. IROS 2024: 12963-12970
[c344]Sushant Gautam, Mehdi Houshmand Sarkhoosh, Jan Held, Cise Midoglu, Anthony Cioppa, Silvio Giancola, Vajira Thambawita, Michael A. Riegler, Pål Halvorsen, Mubarak Shah:
SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset. ISM 2024: 71-78
[c343]Junyi Wu, Haoxuan Wang, Yuzhang Shang, Mubarak Shah, Yan Yan:
PTQ4DiT: Post-training Quantization for Diffusion Transformers. NeurIPS 2024
[i186]Ishan Rajendrakumar Dave, Tristan de Blegiers, Chen Chen, Mubarak Shah:
CodaMal: Contrastive Domain Adaptation for Malaria Detection in Low-Cost Microscopes. CoRR abs/2402.10478 (2024)
[i185]Rukhshanda Hussain, Hui Xian Grace Lim, Borchun Chen, Mubarak Shah, Ser Nam Lim:
FSViewFusion: Few-Shots View Generation of Novel Objects. CoRR abs/2403.06394 (2024)
[i184]Yuning Cui, Syed Waqas Zamir, Salman H. Khan, Alois Knoll, Mubarak Shah, Fahad Shahbaz Khan:
AdaIR: Adaptive All-in-One Image Restoration via Frequency Mining and Modulation. CoRR abs/2403.14614 (2024)
[i183]Mamshad Nayeem Rizve, Fan Fei, Jayakrishnan Unnikrishnan, Son Tran, Benjamin Z. Yao, Belinda Zeng, Mubarak Shah, Trishul Chilimbi:
VidLA: Video-Language Alignment at Scale. CoRR abs/2403.14870 (2024)
[i182]Omkar Thawakar, Muzammal Naseer, Rao Muhammad Anwer
, Salman H. Khan, Michael Felsberg, Mubarak Shah, Fahad Shahbaz Khan:
Composed Video Retrieval via Enriched Context and Discriminative Embeddings. CoRR abs/2403.16997 (2024)
[i181]Bo Miao, Mohammed Bennamoun, Yongsheng Gao, Mubarak Shah, Ajmal Mian:
Towards Temporally Consistent Referring Video Object Segmentation. CoRR abs/2403.19407 (2024)
[i180]Matteo Pennisi, Giovanni Bellitto
, Simone Palazzo, Mubarak Shah, Concetto Spampinato:
Diffexplainer: Towards Cross-modal Global Explanations with Diffusion Models. CoRR abs/2404.02618 (2024)
[i179]Aakash Kumar, Chen Chen, Ajmal Mian, Neils Lobo, Mubarak Shah:
Sparse Points to Dense Clouds: Enhancing 3D Detection with Limited LiDAR Data. CoRR abs/2404.06715 (2024)
[i178]Prakash Chandra Chhipa
, Meenakshi Subhash Chippa, Kanjar De, Rajkumar Saini, Marcus Liwicki, Mubarak Shah:
Möbius Transform for Mitigating Perspective Distortions in Representation Learning. CoRR abs/2405.02296 (2024)
[i177]Sushant Gautam, Mehdi Houshmand Sarkhoosh
, Jan Held, Cise Midoglu, Anthony Cioppa, Silvio Giancola, Vajira Thambawita, Michael A. Riegler, Pål Halvorsen, Mubarak Shah:
SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset. CoRR abs/2405.07354 (2024)
[i176]Florinel-Alin Croitoru, Vlad Hondru, Radu Tudor Ionescu, Nicu Sebe, Mubarak Shah:
Curriculum Direct Preference Optimization for Diffusion and Consistency Models. CoRR abs/2405.13637 (2024)
[i175]Zichen Geng, Caren Han, Zeeshan Hayder, Jian Liu, Mubarak Shah, Ajmal Mian:
Text-guided 3D Human Motion Generation with Keyframe-based Parallel Skip Transformer. CoRR abs/2405.15439 (2024)
[i174]Junyi Wu, Haoxuan Wang, Yuzhang Shang, Mubarak Shah, Yan Yan:
PTQ4DiT: Post-training Quantization for Diffusion Transformers. CoRR abs/2405.16005 (2024)
[i173]Weitai Kang, Mengxue Qu, Jyoti Kini, Yunchao Wei, Mubarak Shah, Yan Yan:
Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention. CoRR abs/2405.18295 (2024)
[i172]Daochang Liu, Axel Hu, Mubarak Shah, Chang Xu:
Surgical Triplet Recognition via Diffusion Model. CoRR abs/2406.13210 (2024)
[i171]Anshuman Gaharwar, Parth Parag Kulkarni, Joshua T. Dickey, Mubarak Shah:
Xi-Net: Transformer Based Seismic Waveform Reconstructor. CoRR abs/2406.16932 (2024)
[i170]Weitai Kang, Gaowen Liu, Mubarak Shah, Yan Yan:
SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding. CoRR abs/2407.03200 (2024)
[i169]Peiyu Yang, Naveed Akhtar, Mubarak Shah, Ajmal Mian:
Regulating Model Reliance on Non-Robust Features by Smoothing Input Marginal Density. CoRR abs/2407.04370 (2024)
[i168]Rohit Gupta, Mamshad Nayeem Rizve, Jayakrishnan Unnikrishnan, Ashish Tawari, Son Tran, Mubarak Shah, Benjamin Z. Yao, Trishul Chilimbi:
Open Vocabulary Multi-Label Video Classification. CoRR abs/2407.09073 (2024)
[i167]Sirnam Swetha, Jinyu Yang, Tal Neiman, Mamshad Nayeem Rizve, Son Tran, Benjamin Z. Yao, Trishul Chilimbi, Mubarak Shah:
X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs. CoRR abs/2407.13851 (2024)
[i166]Manu S. Pillai, Mamshad Nayeem Rizve, Mubarak Shah:
GAReT: Cross-view Video Geolocalization with Adapters and Auto-Regressive Transformers. CoRR abs/2408.02840 (2024)
[i165]Ishan Rajendrakumar Dave, Fabian Caba Heilbron, Mubarak Shah, Simon Jenni:
Sync from the Sea: Retrieving Alignable Videos from Large-Scale Datasets. CoRR abs/2409.01445 (2024)
[i164]Ishan Rajendrakumar Dave, Mamshad Nayeem Rizve, Mubarak Shah:
FinePseudo: Improving Pseudo-Labelling through Temporal-Alignablity for Semi-Supervised Fine-Grained Action Recognition. CoRR abs/2409.01448 (2024)
[i163]Weitai Kang, Haifeng Huang, Yuzhang Shang, Mubarak Shah, Yan Yan:
Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning. CoRR abs/2410.00255 (2024)
[i162]Chen Chen, Daochang Liu, Mubarak Shah, Chang Xu:
Exploring Local Memorization in Diffusion Models via Bright Ending Attention. CoRR abs/2410.21665 (2024)
[i161]Chen Chen, Enhuai Liu, Daochang Liu, Mubarak Shah, Chang Xu:
Investigating Memorization in Video Diffusion Models. CoRR abs/2410.21669 (2024)
[i160]Utsav Singh, Souradip Chakraborty, Wesley A. Suttle, Brian M. Sadler, Anit Kumar Sahu, Mubarak Shah, Vinay P. Namboodiri, Amrit Singh Bedi:
Hierarchical Preference Optimization: Learning to achieve goals via feasible subgoals prediction. CoRR abs/2411.00361 (2024)
[i159]Parth Parag Kulkarni, Gaurav Kumar Nayak, Mubarak Shah:
CityGuessr: City-Level Video Geo-Localization on a Global Scale. CoRR abs/2411.06344 (2024)
[i158]Nyle Siddiqui, Florinel-Alin Croitoru, Gaurav Kumar Nayak, Radu Tudor Ionescu, Mubarak Shah:
DLCR: A Generative Data Expansion Framework via Diffusion for Clothes-Changing Person Re-ID. CoRR abs/2411.07205 (2024)
[i157]Ashmal Vayani, Dinura Dissanayake, Hasindri Watawana, Noor Ahsan, Nevasini Sasikumar, Omkar Thawakar, Henok Biadglign Ademtew, Yahya Hmaiti, Amandeep Kumar, Kartik Kuckreja, Mykola Maslych
, Wafa Al Ghallabi, Mihail Mihaylov, Chao Qin, Abdelrahman M. Shaker, Mike Zhang
, Mahardika Krisna Ihsani, Amiel Esplana, Monil Gokani, Shachar Mirkin, Harsh Singh, Ashay Srivastava, Endre Hamerlik, Fathinah Asma Izzati, Fadillah Adamsyah Maani, Sebastian Cavada, Jenny Chim, Rohit Gupta, Sanjay Manjunath, Kamila Zhumakhanova, Feno Heriniaina Rabevohitra, Azril Amirudin, Muhammad Ridzuan, Daniya Kareem, Ketan More, Kunyang Li, Pramesh Shakya, Muhammad Saad, Amirpouya Ghasemaghaei
, Amirbek Djanibekov, Dilshod Azizov, Branislava Jankovic, Naman Bhatia, Alvaro Cabrera, Johan S. Obando-Ceron, Olympiah Otieno, Fabian Farestam, Muztoba Rabbani, Sanoojan Baliah, Santosh Sanjeev, Abduragim Shtanchaev, Maheen Fatima, Thao Nguyen, Amrin Kareem, Toluwani Aremu, Nathan A. Z. Xavier, Amit Bhatkal, Hawau Olamide Toyin, Aman Chadha, Hisham Cholakkal, Rao Muhammad Anwer
, Michael Felsberg, Jorma Laaksonen, Thamar Solorio, Monojit Choudhury, Ivan Laptev, Mubarak Shah, Salman Khan, Fahad Khan:
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages. CoRR abs/2411.16508 (2024)
[i156]Florinel-Alin Croitoru, Andrei Iulian Hîji, Vlad Hondru, Nicolae-Catalin Ristea, Paul Irofti, Marius Popescu, Cristian Rusu, Radu Tudor Ionescu, Fahad Shahbaz Khan, Mubarak Shah:
Deepfake Media Generation and Detection in the Generative AI Era: A Survey and Outlook. CoRR abs/2411.19537 (2024)
[i155]James Beetham, Souradip Chakraborty, Mengdi Wang, Furong Huang, Amrit Singh Bedi, Mubarak Shah:
LIAR: Leveraging Alignment (Best-of-N) to Jailbreak LLMs in Seconds. CoRR abs/2412.05232 (2024)- 2023
[j134]Madeline C. Schiappa
, Yogesh S. Rawat
, Mubarak Shah
:
Self-Supervised Learning for Videos: A Survey. ACM Comput. Surv. 55(13s): 288:1-288:37 (2023)
[j133]Antonio Barbalau, Radu Tudor Ionescu
, Mariana-Iuliana Georgescu, Jacob V. Dueholm
, Bharathkumar Ramachandra
, Kamal Nasrollahi, Fahad Shahbaz Khan, Thomas B. Moeslund
, Mubarak Shah:
SSMTL++: Revisiting self-supervised multi-task learning for video anomaly detection. Comput. Vis. Image Underst. 229: 103656 (2023)
[j132]Taojiannan Yang
, Sijie Zhu
, Matías Mendieta
, Pu Wang
, Ravikumar Balakrishnan, Minwoo Lee
, Tao Han
, Mubarak Shah
, Chen Chen
:
MutualNet: Adaptive ConvNet via Mutual Learning From Different Model Configurations. IEEE Trans. Pattern Anal. Mach. Intell. 45(1): 811-827 (2023)
[j131]Florinel-Alin Croitoru
, Vlad Hondru
, Radu Tudor Ionescu
, Mubarak Shah
:
Diffusion Models in Vision: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 45(9): 10850-10869 (2023)
[j130]Salman H. Khan
, Fahad Shahbaz Khan, Ashish Vaswani, Niki Parmar, Ming-Hsuan Yang, Mubarak Shah:
Guest Editorial Introduction to the Special Section on Transformer Models in Vision. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 12721-12725 (2023)
[j129]Nayyer Aafaq
, Naveed Akhtar
, Wei Liu
, Mubarak Shah
, Ajmal Mian
:
Language Model Agnostic Gray-Box Adversarial Attack on Image Captioning. IEEE Trans. Inf. Forensics Secur. 18:


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID