Ramya Hebbalaguppe

I am a researcher specializing in visual computing and machine learning. My doctoral research, supervised by Prof. Chetan Arora at IIT Delhi, focused on proposing novel methods to enhance trust and reliability in deep neural networks (DNNs) based classifiers. During my doctoral studies, I developed techniques for out-of-distribution detection, uncertainty quantification, and the refinement of DNN models backed by theoretical insights.

Before pursuing my doctoral research, I had the privilege of working with Prof. Ramakrishna Kakarala at Nanyang Technological University on High Dynamic Range (HDR) Imaging algorithms. This work formed a part of the image processing pipeline aimed at smartphone cameras. Our research was recognized with the Best Student Paper award at the 2012 SPIE conference in Burlingame, California. I completed my Master’s degree at the School of Electronic Engineering and Computing at Dublin City University in 2014, under the guidance of Prof. Noel O'Connor and Prof. Alan Smeaton. My focus during this time was on reducing false alarms in surveillance camera networks, with a portion of this research being licensed to Netwatch Systems and the team received Invent award for the project.

Currently, I am employed as a Senior Scientist at TCS Research Labs, within the Deep Learning and Artificial Intelligence Group (DLAI) located at the Research and Development Park, IIT Delhi, India. At TCS, my work spans various areas, including efficient inference of DNNs through model compression, trustworthy ML, continual learning, and, more importantly, the development of algorithms for creative and immersive content generation, such as images, videos, and 3D/4D data

Outside work, I enjoy painting, traveling, cooking and baking, composting, planting tree saplings, and music.

> Email / Google Scholar / Twitter / Github / Travel

Research

Representative papers spanning the following themes reliable machine learning (out-of-distribution detection, uncertainty quantification, continual learning), 3D/4D/2D computer vision are highlighted.

	Lifelong Learning in StyleGAN through Latent Subspaces Adarsh K, Anmol Garg , Ramya Hebbalaguppe, Prathosh AP, Transactions on Machine Learning Research (accepted) , 2024 StyleGAN is one of the most versatile generative models that have emerged in recent times. However, when it is trained continually on a stream of data (potentially previously unseen distributions), it tends to forget the distribution it has learned, as is the case with any other generative model, due to catastrophic forgetting. Recent studies have shown that the latent space of StyleGAN is very versatile, as data from a variety of distributions can be inverted onto it. In this paper, we propose StyleCL, a method that leverages this property to enable lifelong learning in StyleGAN without forgetting. Specifically, given a StyleGAN trained on a certain task (dataset), we propose to learn a latent subspace characterized by a set of dictionary vectors in its latent space, one for each novel, unseen task (or dataset). We also learn a relatively small set of parameters (feature adaptors) in the weight space to complement the dictionary learning in the latent space. Furthermore, we introduce a method that utilizes the similarity between tasks to effectively reuse the feature adaptor parameters from the previous tasks, aiding in the learning process for the current task at hand. Our approach guarantees that the parameters from previous tasks are reused only if they contribute to a beneficial forward transfer of knowledge. Remarkably, StyleCL avoids catastrophic forgetting because the set of dictionary and the feature adaptor parameters are unique for each task. We demonstrate that our method, StyleCL, achieves better generation quality on multiple datasets with significantly fewer additional parameters per task compared to previous methods. This is a consequence of learning task-specific dictionaries in the latent space, which has a much lower dimensionality compared to the weight space.
	Calibration Transfer via Knowledge Distillation Ramya Hebbalaguppe, Mayank Baranwal, Kartik Anand, Chetan Arora, ACCV , 2024 -- [Oral Presentation] (top 5.6 %) [paper] [Suppl. Material (proofs)] Knowledge Distillation for Calibration (KD(C)) endeavors to deploy lightweight models that are also reliable, we delve into the realm of knowledge distillation, extending its traditional function of transferring accuracy from teacher networks to student networks. Through this exploration, we have discovered a novel approach to calibrating models effectively. We present, arguably for the first time, compelling theoretical as well as empirical evidence that model calibration can be achieved without sacrificing accuracy through knowledge distillation. Our implementation of knowledge distillation not only guarantees enhanced model calibration but also outperforms the accuracy obtained through conventional training from scratch in specific cases. This innovative approach enables us to simultaneously accomplish the dual objectives of optimal calibration and improved accuracy.
	LoMOE: Localized Multi-Object Editing via Multi-Diffusion Goirik Chakrabarty, Aditya Chandrasekar, Ramya Hebbalaguppe, Prathosh AP, ACM International Conference on Multimedia , 2024 [paper] [project page] Recent developments in the field of diffusion models have demonstrated an exceptional capacity to generate high-quality prompt-conditioned image edits. Nevertheless, previous approaches have primarily relied on textual prompts for image editing, which tend to be less effective when making precise edits to specific objects or fine-grained regions within a scene containing single/multiple objects. We introduce a novel framework for zero-shot localized multi-object editing through a multi-diffusion process to overcome this challenge. This framework empowers users to perform various operations on objects within an image, such as adding, replacing, or editing many objects in a complex scene in one pass. We also curate and release a dataset dedicated to multi-object editing, named LoMOE-Bench. Our experiments against existing state-of-the-art methods demonstrate the improved effectiveness of our approach in terms of both image editing quality and inference speed.
	ReMOVE: A Reference-free Metric for Object Erasure Aditya Chandrasekar, Goirik Chakrabarty, Jai Bardhan, Ramya Hebbalaguppe, Prathosh AP, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPRW) The First Workshop on the Evaluation of Generative Foundation Models, 2024 project page We introduce ReMOVE, a novel reference-free metric for assessing object erasure efficacy in diffusion-based image editing models post-generation. Unlike existing measures such as LPIPS and CLIPScore, ReMOVE addresses the challenge of evaluating inpainting without a reference image, common in practical scenarios. ReMOVE effectively distinguishes between object removal and replacement, a key issue in diffusion models due to stochastic nature of image generation.
	Transfer4D: A framework for frugal motion capture and deformation transfer Shubh Maheshwari, Rahul Narain, Ramya Hebbalaguppe, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023 project page Animating a virtual character based on a real performance of an actor is a challenging task that currently requires expensive motion capture setups and additional effort by expert animators, rendering it accessible only to large production houses. The goal of our work is to democratize this task by developing a frugal alternative termed Transfer4D that uses only commodity depth sensors and further reduces animators' effort by automating the rigging and animation transfer process. Our approach can transfer motion from an incomplete, single-view depth video to a semantically similar target mesh, unlike prior works that make a stricter assumption on the source to be noise-free and watertight.
	Calibrating Deep Neural Networks Using Explicit Regularisation and Dynamic Data Pruning Rishabh Patra, Ramya Hebbalaguppe, Tirtharaj Dash, Gautam Shroff, Lovekesh Vig, IEEE/CVF Winter Conference on Applications of Computer Vision, 2023 -- [Spotlight Presentation] (top 10%) project page We demonstrate state-of-the-art Deep Neural Network calibration performance via proposing a differentiable loss term that can be used effectively in gradient descent optimisation and dynamic data pruning strategy not only enhances legitimate high confidence samples to enhance trust in DNN classifiers but also reduce the training time for calibration.
	A Novel Data Augmentation Technique for Out-of-Distribution Sample Detection using Compounded Corruptions Ramya Hebbalaguppe, Soumya Suvra Ghosal, Jatin Prakash, Harshad Khadilkar, Chetan Arora, European Conference on Machine Learning , 2022 project page We propose a novel Compounded Corruption(CnC) technique for the Out-of-Distribution data augmentation. One of the major advantages of CnC is that it does not require any hold-out data apart from the training set. Our extensive comparison with 20 methods from the major conferences in last 4 years show that a model trained using CnC based data augmentation, significantly outperforms SOTA, both in terms of OOD detection accuracy as well as inference time.
	A stitch in time saves nine: A train-time regularizing loss for improved neural network calibration Ramya Hebbalaguppe, Jatin Prakash, Neelabh Madan, Chetan Arora, IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) , 2022 -- [ORAL Presentation] (top 4%) project page We propose a novel auxiliary loss function: Multi-class Difference in Confidence and Accuracy (MDCA) for Deep Neural Network calibration. The loss can be combined with any application specific classification losses for image, NLP, Speech domains. We also demonstrate the utility of the loss in semantic segmentation tasks.

Current Research Team, Research and Innovation Park, IIT Delhi

Jai Bardhan (IIIT Hyderabad) Topic: 3D Skeletonization/ Diffusion Models for creative content creation

Interns(Summer'25) : Ishita Jain (GaTech), Ayush Pandey (ISER Bhopal), Khuushi Maheshwari (MIT), and Nishant Singh (IIIT Delhi)

Alumni

(The list of researchers include full-time, pre-doctoral fellows, and research interns.)

Academia: Thesis supervision

Website inspired from Jon Barron's.