I am a doctoral student at KU Leuven, Belgium. I am advised by Prof. Tinne Tuytelaars at PSI division, Departement Elektrotechniek (ESAT). My PhD research is focused on multimodal representation learning in the direction of robust and interpretable representations.
Prior to joining the doctoral school I have spent a brief time visiting IISc Bangalore as a Project Assistant at MALL Lab, working with Dr. Partha Pratim Talukdar, Dr. Anirban Chokraborty and Dr. Anand Mishra on a project on Weakly supervised video understanding using Knowledge Graphs (KG).
I completed my Masters (MS) at IIIT Hyderabad, where I was jointly advised by Prof. C. V. Jawahar and Prof. Vinay P. Namboodiri at Center for Visual Information Technology. My masters research was focused on computer vision and machine learning for solving Visual Speech Recognition (VSR) which lies at the intersection of multiple modalities like videos (speech videos) audios (speech audio) and texts (Natural language). I have also worked in the space of Image stylization for enabling cross-modal transfer of style.
Prior to this, I have spent one year (2015-16) as a research fellow at CVIT working on a problem on cross-modal multimedia retrieval, under the supervision of Prof. Jawahar. Before moving to Hyderabad, I was a Manager (Planning), at Tata Steel Limited (2014-15) working towards automation and energy consumption optimization in processing plant.
I graduated from IIT Dhanbad, India, in 2014 with a B.Tech in Electronics and Communication Engineering. During my undergraduate years I worked closely with Prof. Mrinal Sen and Dr. Dilip Prasad on projects related to computer vision and robotics.
|[Aug 2021]||Accepted: Our paper “Glimpse-Attend-and-Explore: Self-Attention for Active Visual Exploration” in ICCV 2021.|
|[Jan 2021]||Presented a poster on “ Transferability of Self-Supervised Representations” in Mediterranean Machine Learning summer school 2021.|
|[Sep 2020]||Will be attending Mediterranean Machine Learning summer school 2021 in January 2021.|
|[Aug 2020]||Will be attending AI summer school 2020 (online), AI Singapore.|
|[Nov 2019]||Joining KU Leuven, as a PhD student, at PSI ESAT.|
|[Jul 2019]||Accepted: Our paper “Towards Automatic Face-to-Face Translation” accepted in ACM Multimedia 2019.|
|[May 2019]||Vikram presented our work “Cross-Language Speech Dependent Lip-Synchronization” in ICASSP 2019, Brighton, UK.|
|[Apr 2019]||Presented my work on “Audio-Visual Speech Recognition and Synthesis” at MPI-Informatics, Saarbrucken.|
|[Apr 2019]||Successfully defended my MS thesis Audio-Visual Speech Recognition and Synthesis. Thesis Link.|
|[Feb 2019]||Accepted: “Cross-Language Speech Dependent Lip-Synchronization” accepted in ICASSP 2019.|
|[Feb 2019]||Will be spending next couple of months in IISc Bangalore as a visiting student.|
|[Jan 2019]||Submitted my MS thesis at IIIT Hyderabad.|
|[Dec 2018]||Paper “Spotting Words in Real World Videos : A Retrieval based approach” accepted in Journal of Machine Vision Application (MVA), Springer.|
|[Jul 2018]||Presenting our work on “Lip-Synchronization for Dubbed Instructional Videos” at 2nd Research Symposium, IIIT Hyderabad.|
|[May 2018]||Short paper “Lip-Synchronization for Dubbed Instructional Videos” accepted at CVPR 2018 Workshop (FIVER).|
|[May 2018]||Giving a talk on “Introduction to Image Style Transfer”, at CVIT, IIIT Hyderabad.|
|[May 2018]||Paper “Cross-Modal Style Transfer” accepted at ICIP.|
|[Apr 2018]||Presenting our work on “Word-spotting in Silent Lip videos”, at 1st Research Symposium, IIIT Hyderabad.|
|[Mar 2018]||Presenting our paper “Word-spotting in Silent Lip videos”, at WACV 2018, Lake Tahoe, CA.|
|[Fab 2018]||Organizing annual R&D Showcase 2018, at IIIT Hyderabad.|
|[Jan 2018]||Will be working as a “Mentor” for Foundations of Artificial Intelligence and Machine Learning.|
Cross-Language Speech Dependent Lip-Synchronization
Spotting Words in Silent Speech Videos : A Retrieval based approach
Vinay P. Namboodiri,
C. V. Jawahar
Lip-Synchronization for Dubbed Instructional Videos
Cross-specificity: modelling data semantics for cross-modal
matching and retrieval
|Spring 2021:||Teaching assistant (TA) in the course Information System and Signal Processing (B-KUL-H09M0A), KU Leuven. Course instructor: Prof. Tinne Tuytelaars|
|Spring 2020:||Teaching assistant (TA) in the course Information System and Signal Processing (B-KUL-H09M0A), KU Leuven. Course instructor: Prof. Tinne Tuytelaars|
|Monsoon 2018:||Teaching assistant (TA) in the course Topics in Machine Learning (CSE975), IIIT Hyderabad. Course instructor: Prof. Naresh Manwani|
|Spring 2018:||Mentor in 1st foundations course on Artificial Intelligence and Machine Learning. Course instructor Prof. C. V. Jawahar|
Reviewer: 7th National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG 2019), December 22-24, 2019, Hublie.
Reviewer: Second IAPR International Conference on Computer Vision & Image Processing (CVIP-2017, September 10-12, 2017), IIT Roorkee.
Organizing Team: 17th R&D showcase 2018, IIIT Hyderabad: showcase of exhibits and demonstration research projects and represents of IIIT-H’s most recent developments in research and innovation in technology.
[2016 - Present]: Admin, CVIT Lab, HPC cluster of (aka Cosmos).
[2017 - Present]: Student Admin, IIIT Hyderbad HPC cluster (aka ADA).