Pritam Sarkar

Ph.D. Candidate at Queen's University, Canada
Affiliate at Vector Institute
pritam[dot]sarkar[at]queensu[dot]ca

I am currently a PhD candidate at Queen’s University, Canada, and an affiliate of the Vector Institute. Earlier, I interned at Google and Borealis AI. I joined Queen’s University in 2018 and completed my master’s degree in 2020. Prior to that, I worked as a Software Engineer for approximately 3 years in total at Infosys and Tech Mahindra. I completed my bachelor’s degree at West Bengal University of Technology, India, in 2015. Outside of research, I am passionate about photography and film-making.

Research

My current research focus is multimodal learning from videos.

Broadly, my work covers: multimodal learning with vision, language, and audio; large multimodal models, multimodal LLMs, and foundation models; post-training and alignment; AI agents; self-supervised and unsupervised learning; and computer vision. Please find more about my research here.

News

I am on the job market for a full-time role as a researcher. If you find my experience a good fit, please reach out.

[May 25] Introduced VCRBench, the first video-based multi-step causal reasoning benchmark.
[Apr 25] Introduced RRPO, a fine-grained self-alignment recipe to align Multimodal LLMs.
[Jan 25] DPA was accepted to ICLR 2025.
[Dec 23] XKD and RDDM were accepted to AAAI 2024.
[Nov 23] I won first prize in the IEEE Research Excellence Award (PhD).
[Sep 23] Our paper on Video SSL in OOD was accepted to NeurIPS 2023 as a Spotlight.
[Aug 23] Accepted an offer from Google to join as a Student Researcher.
[Nov 22] AVCAffe and CrissCross (Oral) were accepted to AAAI 2023.
[Oct 22] We are organizing the AAAI 2023 Workshop on R2HCAI.
[Oct 22] Honourable Mention in two poster competitions at Queen’s University, Canada: (1) Robotics and AI Symposium 2022 and (2) FEAS Research Symposium 2022.
[Jun 22] Accepted an offer from Borealis AI for a fall internship as a Machine Learning Research Intern.
[Oct 21] Best poster award at Robotics and AI Symposium, Ingenuity Labs, 2021.
[Aug 21] We are organizing the AAAI 2022 Workshop on HC-SSL.
[Mar 21] I received a postgraduate affiliation award from the Vector Institute. news
[Dec 20] Our paper CardioGAN was accepted to AAAI 2021.


[Aug 20] My first first-author journal paper was accepted to IEEE Transactions on Affective Computing.
[Apr 20] Successfully defended my M.A.Sc. thesis. picture
[Jan 20] A conference paper on ECG-based SSL was accepted to IEEE ICASSP 2020 for oral presentation.
[Jun 19] My first paper was accepted for oral presentation at IEEE ACII 2019.
[Sep 18] Joined Queen's for my master's degree.
[Dec 17] Joined Infosys as a Sr. System Engineer.
[Nov 15] Joined Tech Mahindra as an Associate Software Engineer.
[Jun 15] Completed graduation!

Education

PhD at Queen’s University, Canada, 2020 - Present.
MASc at Queen’s University, Canada, 2018 - 2020. Link to MASc Thesis.
B.Tech at West Bengal University of Technology, India, 2011 - 2015.

Employment

Research Assistant at Queen’s University, Kingston, Canada, 2018 - Present.
Student Researcher at Google, Sunnyvale, USA, Fall 2023.
Machine Learning Research Intern at Borealis AI, Toronto, Canada, Fall 2022.
Sr. System Engineer at Infosys Ltd., Bangalore, India, 2017 - 2018.
Software Engineer at Tech Mahindra Ltd., Hyderabad, India, 2015 - 2017.
Teaching Assistant/Guest Lecturer at Queen’s University, Kingston, Canada, 2018 - Present.

Mentorship

Seth Grief-Albert, ECE at Queen’s, undergrad summer project, Summer 2024.
Vishal Narnaware, Visiting Student at University of Cambridge, co-mentored with Nikhil Churamani, 2023 - 2024.
Debaditya Shome, MASc at ECE, Queen’s University, 2022 - 2023.
Jordan Posen, ECE at Queen’s, undergrad final year project, 2021 - 2022.
Rachel Phinnemore, CS at Queen’s, undergrad final year project, 2020 - 2021.

Reviewing

NeurIPS, ICLR, AAAI, CVPR, ICCV, ECCV, ICML, ICASSP, ACM MM, ACII
IEEE Transactions on: PAMI, Affective Computing, Artificial Intelligence