Mennatullah Siam, PhD

Professional Me

Assistant Professor in Engineering and Applied Science, Ontario Tech University

Affiliate Assistant Professor in CS, University of British Columbia

Office: SIRC-3388, Calendar, Scholar, LinkedIn, GitHub, Email: first.lastname@ontariotechu.ca, Academic CV

I am an assistant professor at Ontario Tech University, where I lead the Image and Video Understanding (IVU) lab. Previously, I was a postdoctoral researcher working with Professor Richard Wildes at York University. I was also a Vector affiliate. I am currently focused on pixel-level scene and video understanding, data-efficient learning, and interpretability. I obtained my PhD in 2021 under the supervision of Professor Martin Jagersand, working on vision for robotics. My thesis focused on learning video object segmentation from limited labelled data, at the intersection of video object segmentation and few-shot object segmentation. I was a member of a team of four in the KUKA Innovation Challenge 2018, where our team was one of the top five finalists. Before that, I completed my MSc at NU and my BSc at Ain Shams University, Egypt.
Research Interests: Computer Vision, Deep Learning, Few-shot Learning, Video Object Segmentation, Video Understanding, Interpretability.

News

  • April 2024: Happy to announce that I acquired the base grant for my IVU Lab on "Learning pixel-level video understanding". Prospective postdocs and PhD students interested in applying are welcome to reach out by email.
  • March 2024: I am glad to announce that I am an affiliate assistant professor at the University of British Columbia, Canada.
  • March 2024: I am a supporting organizer of the first African Computer Vision Summer School (ACVSS) in Nairobi, Kenya, co-located at Microsoft Research (MARI).
  • February 2024: One paper was accepted to CVPR 2024 on prompting pixel-level image understanding models, and our work on studying video understanding models from a neuroscience perspective has been released on arXiv.
  • December 2023: I am co-organizing the 3rd Workshop on L3D-IVU at CVPR 2024.
  • I was recognized as an outstanding reviewer for ICCV 2023.
  • July 2023: I started as an assistant professor at Ontario Tech University.
  • June 2023: I am a WACV 2024 Area Chair.
  • February 2023: Our paper on Multiscale Video Transformers for Video Object Segmentation was accepted to CVPR 2023.
  • December 2022: Co-organizing the 2nd Workshop on L3D-IVU: Learning with Limited Labelled Data for Image and Video Understanding at CVPR 2023.
  • November 2022: I was a keynote speaker at the Black in AI workshop, co-located with NeurIPS 2022, speaking on Learning Scene and Video Understanding with Limited Labelled Data.
  • September 2022: I am a guest editor of the special issue on "Signal Processing and Machine Learning for Autonomous Driving" in the Remote Sensing journal.
  • April 2022: Gave a talk at Samsung AI on few-shot learning and its extension beyond single images to videos.
  • March 2022: Our paper on the interpretability of spatiotemporal models has been accepted to CVPR 2022.
  • December 2021: Co-organizing the Workshop on L3D-IVU: Learning with Limited Labelled Data for Image and Video Understanding at CVPR 2022.
  • December 2021: Our short paper was accepted to the Machine Learning for Autonomous Driving Workshop at NeurIPS 2021.
  • July 2021: Officially started my postdoc at York University under the supervision of Prof. Richard Wildes and Kostas Derpanis.
  • May 2021: I officially finished my PhD at the University of Alberta, with convocation in Fall 2021. Thesis.

Open Positions

Thank you for your interest in joining my lab!

MSc, PhD and Postdoc positions

Please reach out by email with your resume, transcript of records, and motivation statement.

Publications

2024

MEDVT++: A Unified Multiscale Encoder-Decoder Transformer for Video Segmentation

Rezaul Karim, He Zhao, Richard P. Wildes, Mennatullah Siam

Journal Extension Under Review.


Paper Project Webpage

Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach

Mir Rayat Imtiaz Hossain, Mennatullah Siam, Leonid Sigal, James J. Little

CVPR 2024.


Paper Project Webpage

System Identification of Neural Systems: Going Beyond Images to Modelling Dynamics

Mai Gamal, Mohamed Rashad, Eman Ehab, Saif ElDawlatly, Mennatullah Siam

arXiv.


Paper

A Survey on African Computer Vision Datasets, Topics and Researchers

Abdul-Hakeem Omotayo*, Ashery Mbilinyi*, Lukman Ismaila*, Houcemeddine Turki, Mahmoud Abdien, Karim Gamal, Idriss Tondji, Yvan Pimi, Naome A. Etori, Marwa M. Matar, Clifford Broni-Bediako, Abigail Oppong, Mai Gamal, Eman Ehab, Gbetondji Dovonon, Zainab Akinjobi, Daniel Ajisafe, Oluwabukola G. Adegboro, Mennatullah Siam

arXiv.


Paper

2023

MED-VT: Multiscale Encoder-Decoder Video Transformer with Application to Object Segmentation

Rezaul Karim, He Zhao, Richard P. Wildes, Mennatullah Siam

CVPR 2023.


Paper Project Webpage Code

Towards a Better Understanding of the Computer Vision Research Community in Africa

Abdul-Hakeem Omotayo, Mai Gamal, Eman Ehab, Gbetondji Dovonon, Zainab Akinjobi, Ismaila Lukman, Houcemeddine Turki, Mahmoud Abdien, Idriss Tondji, Abigail Oppong, Yvan Pimi, Karim Gamal, Mennatullah Siam

EAAMO 2023.


Paper

Two-Stage Joint Transductive and Inductive Learning for Nuclei Segmentation

Hesham Ali, Idriss Tondji, Mennatullah Siam

ML4H Symposium 2023, Findings Track.


Paper

Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation

Mennatullah Siam, Rezaul Karim, He Zhao, Richard P. Wildes

arXiv.


Paper Code

2022

Quantifying and Learning Static vs. Dynamic Information in Deep Spatiotemporal Networks

Matthew Kowal, Mennatullah Siam, Md Amirul Islam, Neil D. B. Bruce, Richard P. Wildes, Konstantinos G. Derpanis

Journal Extension Under Review.


Paper

A Deeper Dive into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic Information

Matthew Kowal, Mennatullah Siam, Md Amirul Islam, Neil D. B. Bruce, Richard P. Wildes, Konstantinos G. Derpanis

CVPR 2022.


Paper Video Demo Project Webpage Code Interpretability Code AVOS

2021

Temporal Transductive Inference for Fewshot Video Object Segmentation

Mennatullah Siam, Konstantinos G. Derpanis, Richard P. Wildes

ML4AD Workshop, NeurIPS 2021.


Full Paper Paper Video Demo

Video Class Agnostic Segmentation Benchmark for Autonomous Driving

Mennatullah Siam, Alex Kendall, Martin Jagersand

CVPR 2021 Workshops.

Paper Project Webpage

2020

Weakly Supervised Few-shot Object Segmentation using Co-attention with Visual and Semantic Embeddings

Mennatullah Siam*, Naren Doraiswamy*, Boris N. Oreshkin*, Hengshuai Yao, Martin Jagersand (* equal contribution)

IJCAI 2020.

Paper

2019

AMP: Adaptive Masked Proxies for Few-Shot Segmentation

Mennatullah Siam, Boris N. Oreshkin, Martin Jagersand

ICCV 2019.

Paper Code

Video Segmentation using Teacher-Student Adaptation in a Human Robot Interaction (HRI) Setting

Mennatullah Siam, Chen Jiang, Steve Lu, Laura Petrich, Mosta Gamal, Mohamed Elhoseiny, Martin Jagersand

ICRA 2019.

Paper Dataset

Online Object and Task Learning via Human Robot Interaction

Masood Dehghan*, Zichen Zhang*, Mennatullah Siam*, Jun Jin, Laura Petrich, Martin Jagersand (* equal contribution)

ICRA 2019.

Paper Video Demo

2018

Real-time Segmentation with Appearance, Motion and Geometry

Mennatullah Siam, Sara Elkerdawy, Mostafa Gamal, Moemen Abdel-Razek, Martin Jagersand, Hong Zhang

IROS 2018.

Paper

Moving Object Detection Network for Autonomous Driving

Mennatullah Siam, Heba Mahgoub, Mohamed Zahran, Senthil Yogamani, Martin Jagersand, Ahmed El-Sallab

ITSC 2018.

Paper Dataset Video Demo Patent

Teaching

Ontario Tech University

  • Fall 2023, ELEE2110 Discrete Mathematics, Undergraduate Course.
  • Winter 2024, SOFE4620 Machine Learning and Data Mining, Undergraduate Course.
  • Winter 2024, SOFE2715 Data Structures, Undergraduate Course.

Nile University

  • Spring 2023, CIT-670 Computer Vision, Graduate Course.
  • Spring 2022, CIT-670 Computer Vision, Graduate Course.