Mennatullah Siam, PhD

Professional Me

Assistant Professor in Engineering and Applied Science, Ontario Tech University

Affiliate Assistant Professor in CS, University of British Columbia

Office: SIRC-3388, Calendar, Scholar, Linkedin, Github, Email: first.lastname@ubc.ca, Academic CV

I am an assistant professor at Ontario Tech University, leading the Image and Video Understanding (IVU) lab, and an affiliate assistant professor at UBC. My research interests include pixel-level scene and video understanding, data-efficient learning, interpretability and responsible AI. Previously, I was a postdoctoral researcher working with Professor Richard Wildes at York University, and I was also a Vector affiliate. I obtained my PhD in 2021 under the supervision of Professor Martin Jagersand, working on vision for robotics. My thesis focused on learning video object segmentation from limited labelled data, at the intersection of video object segmentation and few-shot object segmentation. I was a member of a team of four in the KUKA Innovation Challenge 2018, where our team was one of the top five finalists. Before that, I completed my MSc at NU and my BSc at Ain Shams University, Egypt.
Research Interests: Computer Vision, Deep Learning, Few-shot Learning, Video Object Segmentation, Video Understanding, Interpretability, Responsible AI.

News

  • November 2024: Our work on TAM-VT video segmentation and tracking was accepted at WACV 2025, and our work with the RIKEN institute was accepted in IEEE Geoscience and Remote Sensing Letters.
  • October 2024: Our work was accepted at the NeuroAI workshop and at WiML, both part of NeurIPS 2024.
  • September 2024: Our work was accepted in TPAMI; it is an extension of our CVPR 2022 paper.
  • August 2024: Our work on the current state of Computer Vision research in Africa was accepted in the JAIR special issue on Fairness and Bias in AI.
  • June 2024: I am a WACV 2025 Area Chair.
  • May 2024: I acquired the NSERC Alliance International grant, thanks to NSERC.
  • April 2024: Happy to announce that I acquired the Discovery grant and launch supplements for my IVU Lab on "Learning pixel-level video understanding". Postdocs and PhD students interested in applying can reach out to me by email.
  • March 2024: I am glad to announce that I am an affiliate assistant professor with the University of British Columbia, Canada.
  • March 2024: I am a supporting organizer of the first African Computer Vision Summer School (ACVSS) in Nairobi, Kenya, co-located with Microsoft Research (MARI).
  • February 2024: One paper was accepted at CVPR 2024 on prompting pixel-level image understanding models, and our work studying video understanding models from a neuroscience perspective was released on arXiv.
  • December 2023: I am co-organizing the 3rd workshop on L3D-IVU at CVPR 2024.
  • I am an outstanding reviewer for ICCV 2023.
  • July 2023: I started as an assistant professor at Ontario Tech University.
  • June 2023: I am a WACV 2024 Area Chair.
  • February 2023: Our paper on Multiscale Video Transformers for Video Object Segmentation was accepted at CVPR 2023.
  • December 2022: Co-organizing the 2nd Workshop on L3D-IVU: Learning with Limited Labelled Data for Image and Video Understanding at CVPR 2023.
  • November 2022: I was a keynote speaker at the Black in AI workshop, co-located with NeurIPS 2022, on Learning Scene and Video Understanding with Limited Labelled Data.
  • September 2022: I am a guest editor of the special issue on "Signal Processing and Machine Learning for Autonomous Driving" in the Remote Sensing journal.
  • April 2022: Gave a talk at Samsung AI on few-shot learning and its extension beyond single images to videos.
  • March 2022: Our paper on the interpretability of spatiotemporal models was accepted at CVPR 2022.
  • December 2021: Co-organizing the Workshop on L3D-IVU: Learning with Limited Labelled Data for Image and Video Understanding at CVPR 2022.
  • December 2021: Our short paper was accepted at the Machine Learning for Autonomous Driving Workshop at NeurIPS 2021.
  • July 2021: Officially started my postdoc at York University under the supervision of Prof. Richard Wildes and Kostas Derpanis.
  • May 2021: I officially finished my PhD at the University of Alberta, with convocation in Fall 2021. Thesis.

Publications

2025

TAM-VT: Transformation-Aware Multi-scale Video Transformer for Segmentation and Tracking

Raghav Goyal, Wan-Cyuan Fan, Mennatullah Siam, Leonid Sigal

WACV 2025.


Paper

2024

MEDVT++: A Unified Multiscale Encoder-Decoder Transformer for Video Segmentation

Rezaul Karim, He Zhao, Richard P. Wildes, Mennatullah Siam

Journal Extension Under Review.


Paper Project Webpage

Quantifying and Learning Static vs. Dynamic Information in Deep Spatiotemporal Networks

Matthew Kowal, Mennatullah Siam, Md Amirul Islam, Neil D. B. Bruce, Richard P. Wildes, Konstantinos G. Derpanis

TPAMI.


Paper

Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach

Mir Rayat Imtiaz Hossain, Mennatullah Siam, Leonid Sigal, James J. Little

CVPR 2024.


Paper Code

Generalized Few-Shot Semantic Segmentation in Remote Sensing: Challenge and Benchmark

Clifford Broni-Bediako, Junshi Xia, Jian Song, Hongruixuan Chen, Mennatullah Siam, Naoto Yokoya

IEEE Geoscience and Remote Sensing Letters (accepted).


Paper

A Survey on African Computer Vision Datasets, Topics and Researchers

Abdul-Hakeem Omotayo*, Ashery Mbilinyi*, Lukman Ismaila*, Houcemeddine Turki, Mahmoud Abdien, Karim Gamal, Idriss Tondji, Yvan Pimi, Naome A. Etori, Marwa M. Matar, Clifford Broni-Bediako, Abigail Oppong, Mai Gamal, Eman Ehab, Gbetondji Dovonon, Zainab Akinjobi, Daniel Ajisafe, Oluwabukola G. Adegboro, Mennatullah Siam

JAIR - Fairness and Bias in AI Special Issue.


Paper Datasets List Code

System Identification of Neural Systems: Going Beyond Images to Modelling Dynamics

Mai Gamal, Mohamed Rashad, Eman Ehab, Saif ElDawlatly, Mennatullah Siam

Short Paper in NeuroAI Workshop, NeurIPS 2024.


Paper

2023

MED-VT: Multiscale Encoder-Decoder Video Transformer with Application to Object Segmentation

Rezaul Karim, He Zhao, Richard P. Wildes, Mennatullah Siam

CVPR 2023.


Paper Project Webpage Code

Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation

Mennatullah Siam, Rezaul Karim, He Zhao, Richard P. Wildes

arXiv.


Paper Code

Towards a Better Understanding of the Computer Vision Research Community in Africa

Abdul-Hakeem Omotayo, Mai Gamal, Eman Ehab, Gbetondji Dovonon, Zainab Akinjobi, Ismaila Lukman, Houcemeddine Turki, Mahmoud Abdien, Idriss Tondji, Abigail Oppong, Yvan Pimi, Karim Gamal, Mennatullah Siam

EAAMO 2023.


Paper

Two-Stage Joint Transductive and Inductive Learning for Nuclei Segmentation

Hesham Ali, Idriss Tondji, Mennatullah Siam

ML4H Symposium 2023, Findings Track.


Paper

2022

A Deeper Dive into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic Information

Matthew Kowal, Mennatullah Siam, Md Amirul Islam, Neil D. B. Bruce, Richard P. Wildes, Konstantinos G. Derpanis

CVPR 2022.


Paper Video Demo Project Webpage Code Interpretability Code AVOS

2021

Temporal Transductive Inference for Fewshot Video Object Segmentation

Mennatullah Siam, Konstantinos G. Derpanis, Richard P. Wildes

ML4AD Workshop, NeurIPS 2021.


Full Paper Paper Video Demo

Video Class Agnostic Segmentation Benchmark for Autonomous Driving

Mennatullah Siam, Alex Kendall, Martin Jagersand

CVPR 2021 Workshops.

Paper Project Webpage

2020

Weakly Supervised Few-shot Object Segmentation using Co-attention with Visual and Semantic Embeddings

Mennatullah Siam*, Naren Doraiswamy*, Boris N. Oreshkin*, Hengshuai Yao, Martin Jagersand (equally contributing)

IJCAI 2020.

Paper

2019

AMP: Adaptive Masked Proxies for Few-Shot Segmentation

Mennatullah Siam, Boris N. Oreshkin, Martin Jagersand

ICCV 2019.

Paper Code

Video Segmentation using Teacher-Student Adaptation in a Human Robot Interaction (HRI) Setting

Mennatullah Siam, Chen Jiang, Steve Lu, Laura Petrich, Mostafa Gamal, Mohamed Elhoseiny, Martin Jagersand

ICRA 2019.

Paper Dataset

Online Object and Task Learning via Human Robot Interaction

Masood Dehghan*, Zichen Zhang*, Mennatullah Siam*, Jun Jin, Laura Petrich, Martin Jagersand (equally contributing)

ICRA 2019.

Paper Video Demo

2018

Real-time Segmentation with Appearance, Motion and Geometry

Mennatullah Siam, Sara Elkerdawy, Mostafa Gamal, Moemen Abdel-Razek, Martin Jagersand, Hong Zhang

IROS 2018.

Paper

Moving Object Detection Network for Autonomous Driving

Mennatullah Siam, Heba Mahgoub, Mohamed Zahran, Senthil Yogamani, Martin Jagersand, Ahmed El-Sallab

ITSC 2018.

Paper Dataset Video Demo Patent

Teaching

Ontario Tech University

  • Fall 2023, Fall 2024, ELEE2110 Discrete Mathematics, Undergraduate Course.
  • Winter 2024, SOFE4620 Machine Learning and Data Mining, Undergraduate Course.
  • Winter 2024, SOFE2715 Data Structures, Undergraduate Course.

Nile University

  • Spring 2023, CIT-670 Computer Vision, Graduate Course.
  • Spring 2022, CIT-670 Computer Vision, Graduate Course.