Umar Masud

I am a final year undergrad at JMI University, New Delhi.
My interest is in research in the field of ML/AI and Computer Vision.
I have applied knowledge of Image Processing, Machine Learning, and Deep Learning, with hands-on practice in various image-related tasks in Computer Vision. I regularly read the relevant literature and try to implement it in code.
I also have a basic to intermediate knowledge of web development technologies.

Apart from exploring the latest research and technology developments, I can be found consuming content about innovation, ideas, entrepreneurship, start-ups, finance, and the like. I have a keen interest in the start-up ecosystem.


Education

Jamia Millia Islamia University

Bachelor of Technology
Electronics and Communication

CGPA: 9.6/10

Relevant Coursework: Fundamentals of Computing, Data Structures and Computer Programming, Computer Networking, Database Management, Engineering Mathematics (I, II, III), Computer Architecture, Digital Signal Processing.

August 2019 - May 2023

The Frank Anthony Public School

Science Stream

98%, ICSE 12th

April 2004 - March 2019

Experience

DAAD-WISE Scholar

Ulm University

Visited the lab at the Institute of Neural Information Processing under Prof. Friedhelm Schwenker, where I explored the topic of Compressed Image Super-resolution.
Proposed a novel model and training strategy that uses a pre-trained large-scale vision model to assist in learning representations from the compressed, low-resolution image. Evaluation against baselines showed competitive results. The paper is under review.

June 2022 - August 2022

Research Intern

Indian Institute of Science (IISc)

Associated with the VCL Lab, where I worked on Domain Generalisation for the Person Re-identification task.
Implemented various ideas in code and ran experiments. Annotated the CUHK03 and MSMT datasets with person attribute information. Also responsible for descriptive visualisation of the results. The work resulted in a paper which is under review.

January 2022 - June 2022

Research collaborator

Ecole normale superieure - PSL

Worked with members of the Computational Bioimaging and Bioinformatics team, headed by Auguste Genovesio, on a project on Quality Control of out-of-focus/noisy images in Phenotypic Screening using Self/Semi-Supervised learning.
Responsible for setting up pipelines to pre-train SOTA self-supervised methods on 2.1M images and evaluate the results on a downstream classification task. Performed studies with transfer learning and semi-supervised learning as well. The work was accepted at an ECCV 2022 workshop.

December 2021 - March 2022

Summer Research Intern

IIIT-Allahabad

Worked on the topic of "Automatic Detection of Image Splicing" under Prof. Anupam Agarwal, in the Interactive Technologies and Multimedia Research (ITMR) Lab.
Conducted a literature review and comparative analysis, implemented baselines from scratch, ran various experiments/simulations, and cross-evaluated the SOTA papers on different datasets and in out-of-distribution training/testing environments. The study concluded that the latest methods have overestimated performance and are unsuitable for real-world applications.

May 2021 - July 2021

Undergraduate Researcher

Jamia Millia Islamia

Working on various problems in computer vision under Prof. Sarfaraz Masood.


  • Explored "Facial Mask Detection and Positioning": conducted a literature review and analysis, implemented many baseline techniques, and designed a novel lightweight model with up to 496x fewer parameters than existing methods. Also provided a large set of synthetic images created by stitching masks onto faces. The work is accepted at ICMLDE 2022.
  • Worked on "DeepFake Video Detection": developed a lightweight spatio-temporal model that performed on par with existing models but with a much lower parameter count and faster inference time. Also proposed a live-feed deployment solution for the same. The work is published in the SIVP Journal.
  • Currently exploring a Transformer-based Text Detection and Localisation system for visual inputs containing natural scene text. Experimenting with the model architecture – a backbone network, a boundary proposal module and a core transformer module. Running evaluations against existing methods and preparing a baseline.

May 2021 - Present

Junior ML Engineer

Omdena Global

Collaborated in a team of 50+ ML engineers to develop a production-ready deep vision system that uses geospatial data to provide accurate rooftop solar PV analysis, including factors such as total roof area, roof obstacles, shadows/solar potential, rooftop material, etc. This project was in association with Rebase Energy.

September 2021 - November 2021

ML/AI Dev

Google Developer Student Club - JMI.

One of the core team members, responsible for all the activities being carried out for the dissemination of knowledge about ML/AI to students.

August 2021 - August 2022

Youth Ambassador

HundrED Global Organization

HundrED Youth Ambassadors is an active community of students from around the world who are passionate about improving education and want to work with other like-minded young changemakers.

January 2021 - December 2021

Web Development Volunteer

M.I.N.D.S

Responsible for the development and timely updating of their website.

December 2020 - January 2020

Web Development and Design Intern

The Sparks Foundation

Built two mini-projects:
1. A transaction-based small banking system.
2. A demo webpage integrated with a payment gateway for donation purposes.

December 2020 - January 2020

Publications

  1. Masud, U., Cohen, E., Bendidi, I., Bollot, G., Genovesio, A. (2022). Comparison of semi-supervised learning methods for High Content Screening quality control. In BioImage Computing workshop at ECCV 2022. https://doi.org/10.48550/arXiv.2208.04592
  2. Masud, U., Siddiqui, M., Sadiq, Mohd., Masood, S. (2022). SCS-Net: An efficient and practical approach towards Face Mask Detection. Procedia Computer Science Journal. Accepted at ICMLDE, 2022. https://doi.org/10.1016/j.procs.2023.01.165
  3. Jambigi, C., Masud, U., Chakraborty, A. (2022). G-PReDICT: Generalizable Person Re-ID using Domain Invariant Contrastive Techniques. Accepted at ICVGIP, 2022. https://doi.org/10.1145/3571600.3571655
  4. Masud, U., Schwenker, F. (2022). Compressed Image Super-Resolution using Pre-trained Model Assistance. Accepted at COMSYS, 2022.
  5. Masud, U., Sadiq, Mohd., Masood, S., Ahmad, M., Abd El-Latif, A. A. (2023). LW-DeepFakeNet: A Lightweight Time Distributed CNN-LSTM network for real-time DeepFake Video Detection. Accepted in Signal, Image and Video Processing.
  6. Masud, U., Agarwal, A. (2021). Analysing Statistical methods for Automatic Detection of Image Forgery. arXiv. https://doi.org/10.48550/arXiv.2111.12661

Skills

Programming Languages
  • Python
  • Java
  • HTML/CSS
  • PHP
  • JavaScript
Libraries/Frameworks
  • NumPy
  • Pandas
  • Matplotlib
  • Scikit-learn
  • OpenCV
  • Keras
  • TensorFlow
  • PyTorch
  • Flask
Database
  • MySQL
  • PostgreSQL
Interpersonal
  • Communication
  • Public Speaking
  • Critical Thinking
  • Team Leadership
  • Team Management

Projects

Machine Learning
  • Classical ML
    • Reproduced 5 papers on Image Forgery Detection that use handcrafted features to classify pristine and tampered images.
    • Diabetic Retinopathy Detection using Texture Features and Ensemble Learning (paper implementation). Achieved F1-score = 0.97 and accuracy = 97.2%.
    • Fog detection in images using GLCM-based features and SVM (paper implementation). Achieved F1-score = 0.83 and test accuracy = 82.3%.
    • Phishing URL detection system based on URL features using SVM (paper implementation). Achieved F1-score = 0.99 and test accuracy = 99.2%.
  • OpenCV Projects
    • Air-Piano, an air-based piano that lets a person play through hand (fingertip) movements.
    • Air-Drum System, an air-based drum beat generator.
    • Background Color Detection, which uses 2 techniques to detect a suitable background for the input image.
    • Compress img, a CLI-based program to compress images using the KL Transform.
  • Deep Learning
    • Going deeper with U-Nets for Binary Image Skeletonization (ongoing).
    • Image Inpainting using a U-Net model with a fused ConvMixer encoder. (Report)
    • Different Descriptors for Squeeze and Excitation Attention Block - experimented with standard deviation, trace, largest singular value, and DC coefficient of DCT instead of usual GlobalAvgPool2d. (Report)
    • Background Remover tool for portrait images of humans, made using a U-Net model trained for semantic segmentation of the image. The model achieved 0.981 IOU-score on test data. Also deployed on a web-app.
    • Implemented the paper "Medical image denoising using Convolutional Denoising Autoencoders (CAE)". Achieved a loss of 0.106, corresponding to a Structural Similarity Index (SSIM) of 0.894.
    • Image similarity measure using a Siamese network on fashion apparel. Achieved an evaluation accuracy of 94.2%.
    • Plant Pathology Challenge, an FGVC8 workshop challenge at CVPR-2021 for multi-label classification of plant leaf diseases. Achieved 87.34% accuracy with a pre-trained model as a feature extractor.
    • Human Emotion Detection, Pneumonia Prediction models.

Web Development
  • Banking System
    • A simple banking system that enables transactions between customers. It uses HTML, CSS, Bootstrap, PHP, and MySQL, with the local server provided by XAMPP.
  • Website Template for InnerveSOC
    • As a part of the InnerveSOC competition, designed a complete website template for Innerve Tech-Fest 2020, IGDTUW. I was adjudged the winner.

Awards & Honors

  • Research Week with Google 2023: One of 250 people accepted for participation by Google Research India.
  • Online Asian Machine Learning School (OAMLS): Accepted with full scholarship as a part of ACML 2022.
  • Robotics & AI Summer School 2022: Accepted to this summer school hosted by IRI, CSIC-UPC.
  • DAAD-WISE Scholarship 2022: Financial aid for Summer Research Internship in Germany.
  • Workshop on AI for Computational Social Systems (ACSS) 2021: 3rd place in the Student Paper Competition.
  • 5th Summer School of AI 2021 - IIIT Hyderabad - One of 500 participants worldwide.
  • Winner-Innerve Summer of Code Challenge 2020 - Indira Gandhi Delhi Technical University for Women.
  • INSPIRE Science Award For Top 1% - Scholarship for Higher Studies by Govt. of India.
  • Mr. Harbinder Singth Dugal Rolling Trophy - Awarded for Proficiency in Science ISC-XII.
  • Mr. G W Mayer’s Merit Scholarship - Awarded for excellence in Mathematics and Science.
  • Shanker Sumeda Rolling Trophy - Awarded for Excellence in Academics.