Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Pages

Posts

Meta Search Engine

less than 1 minute read

Published:

Click here check out the post on my project MetaSearchEngine on Penzu.

experiences

Adobe

  • Manager: Dr. Soheil Darabi , Mentor: Dr. Taesung Park
  • Improved the performance of StyleGAN-based generative models for conditional image editing tasks like portrait relighting.
  • Working on improving the inference time and memory efficiency of Diffusion models.

Burris Lab, University of Michigan

  • Advisors: Prof. Nicholas Burris , Prof. Jeff Fessler
  • Built faster and better 3D segmentation and keypoint detection neural networks for automatically measuring aortic growth.
  • Our method greatly reduces the diagnosis time (100x) to detect thoracic aortic aneurysms.
  • This work will be presented as an Oral talk at the Radiological Society of North America (RSNA) meet in Chicago.

Intel Labs

  • Manager: Nilesh Jain
  • Investigated various neural video compression algorithms and built a modular codebase that supports the plug and play usage of typical encoders, decoders and entropy models used in neural codecs.
  • Performed a comparative analysis (from a systems perspective) of ScaleSpaceFlow and WaveOne ELF-VC, two state-of-the-art end-to-end learned video codecs, both of which were reproduced using the codebase developed.

Computer Vision Center, Barcelona

  • Advisors: Dr. Luis Herranz , Dr. Joost van de Weijer
  • Developed a mathematical framework for studying neural networks which are trained or tested with compressed images for use cases in distributed autonomous driving data collection.
  • Designed dataset restoration, a principled algorithm motivated by the aforementioned framework, that utilizes conditional GANs to mitigate the drop in performance by 10-50% when compressed images are used for training in place of uncompressed images. Katakol et al., IEEE TIP ‘21
  • Developed techniques for few-shot adaptation ($\approx$ 20% improvement wrt baselines) and continual learning of learned image compressors in collaboration with BBC R&D. Katakol et al., CVPR-W ‘21

APPCAIR, Bits Pilani

  • Advisors: Prof. Ashwin Srinivasan and Dr. Lovekesh Vig
  • Improved existing techniques for representation learning of Chest X Rays by developing a specialized loss involving a weighted square loss component and a multi-instance learning based detection loss component.
  • The usage of this loss led to better representations that capture the intricate details in the X-rays and ultimately resulted in improved detection accuracies when the learnt representations are transferred to a new task.

Synclovis Systems Pvt. Ltd.

  • Manager: Amit Pandey
  • Built a miniature Meta Search Engine, a system which refines the results of popular search engines according to the expected needs of middle/high school students, using document embeddings and clustering. Github

portfolio

projects

Semantic SNGAN

  • Spectral Normalization is the current state-of-the-art method for enforcing the Lipschitz constraint on the discriminator of the GAN. I was interested specially in the sensitivity of the discriminator to minor changes in the image input to it. While stability to small changes in the image is a desirable property of the discriminator, it’s not satisfied by the discriminator and some interesting trends are observed.
  • Contribution:
    • Detection & analysis of the problems with Spectral Normalized GANs & attempts to improve them to produce meaningful & coherent generations.
    • Designed a method to quantitatively estimate local and global coherence captured by the discriminator.
    • Github link

CLEVR Visual Question Answering

  • This an ongoing project where we use a different method to detect objects than the one used in Object-based Visual Reasoning . The paper uses pre-trained Faster RCNN architecture to detect CLEVR objects while we use traditional CV tools to detect objects, thus reducing training time and increasing the speed of reasoning at test time.
  • This was also the final project of Deep learning course at BITS Goa. I, being a Teaching Assistant for the course, mentored about 15 groups on the same.
  • Github link

Multi Agent Reinforcement Learning

  • This project was aimed to bring out the evolution of cooperation in our society by examining the cooperative hunting scenarios in lions extensively studied by biologists over the years. The objective was to simulate the random behaviour of animals and show how their actions converge/diverge under different conditions.
  • Contribution:
    • Through Nash Q-Learning we were able to teach multiple predators the desired behavior and were able to simulate three different scenarios under different conditions - one in which the animals fight, one in which they cooperate, and one in which they mix both these strategies.
    • Github Link

Sports Activity Detection

  • Project component of Neural Network and Fuzzy Logic course where we had to classify sports videos from the UCF Dataset according to the action being performed.
  • Contribution:
    • Built a minimal model using Autoencoders, ConvNets, and LSTMs. The bagged model we submitted was of the least size.
    • Github Link

publications

talks

teaching