AI & Machine Learning
Music Information Retrieval
Data Science
Audio Processing
Music Transcription


Hello there!

I’m Gopika, a Master's student in the Erasmus Mundus Joint Master in Artificial Intelligence (EMAI) at Universitat Pompeu Fabra (UPF). My research follows two parallel branches: foundations of AI and representation learning, covering latent space modeling, probabilistic methods and deep neural architectures, and applications in audio processing and music information retrieval (MIR), where I apply these techniques to analyze rhythm, timbre and the evolution of musical style.

I’m a lifelong learner, always eager to explore new ideas through my research, science communication, and personal hobbies. In my free time, I enjoy reading, birdwatching, playing stringed instruments, and stargazing.

If you want to reach out, feel free to contact me at gk1656@nyu.edu!


News

  • 15 September 2025: Joined BMAT Music Innovators in Barcelona as a Software Engineering Intern, building full-stack services for the EU-funded Music360 platform and integrating AI with scalable data pipelines.
  • 13 September 2025: Returned to Barcelona to start the final year of my Master’s in Artificial Intelligence at UPF, focusing on the Intelligent Decision Making specialization.
  • 24 July 2025: Won 2nd Place (Poster Presentation) and 3rd Place (Programmed AI Art) at the UKRI CDT AI Research Showcase, University College London, for work on Reliable Confidence Scores for Transformers and Neural Cellular Automata + Style Transfer on Flowers.
  • 22 July 2025: Attended the UKRI CDT AI Summer School at University College London, connecting with leading AI researchers and presenting ongoing work.
  • 15 May 2025: Started a research internship at the Laboratory for Computer Graphics and Multimedia (LGM) in Ljubljana, developing a cello-practice app with real-time audio analysis and interactive spectrograms.
  • May 2025: Placed 5th in the University of Ljubljana Data Science Competition for the project “Reliable Confidence Scores for Transformers.”
  • 06 April 2025: Presented two papers at IEEE ICASSP 2025 in Hyderabad: “Closing the Loop on Speech-to-Music Translation: Automatically Generating Synthetic Percussive Sequences on the Mridangam from Konnakol” (SALMA Workshop) and “Investigating Temporal Convolutional Networks for Automated Stroke Transcription in the Mridangam” (WIMAGA Workshop).
  • 04 February 2025: Moved to Slovenia to begin my second-semester specialization in Data Science at the University of Ljubljana (FRI).
  • 18 December 2024: Volunteered at the Deep Learning Barcelona Symposium (DLBCN) 2024, where I met inspiring researchers. This was especially meaningful as I pursue my own AI career through my Master's in Barcelona, and it was a great chance to learn and connect with people in the field!
  • 17 December 2024: Paper selected for the WIMAGA Workshop at ICASSP 2025, focusing on mridangam stroke transcription using Temporal Convolutional Networks (TCNs).
  • 15 September 2024: Started my Master's in Artificial Intelligence under an Erasmus Mundus program in Barcelona at UPF!


Research and Projects

Optimizing the Mridangam Stroke Transcription Pipeline: Addressing Key Challenges

This project began as part of my Post-graduate Practical Training Program (PPTP) with the Music and Sound Cultures Research Group at NYU Abu Dhabi. I developed a Temporal Convolutional Network (TCN)-based stroke-transcription pipeline for the mridangam and designed complementary konnakol (spoken percussion) alignment methods to handle expressive timing and speed changes. The system addresses key challenges such as sparse onsets and nested rhythmic layers, and the results were presented at the 2025 IEEE ICASSP workshops (SALMA & WIMAGA) as part of ongoing work on rhythm-aware transcription.

Calibrating Transformer Confidence with Gaussian Processes

In this project, my teammates and I worked with Zurich Insurance to improve the reliability of confidence scores from large language models. We built a Gaussian Process Classifier to calibrate transformer outputs after inference, combining signals such as logits, verbal self-confidence, and generation consistency. I implemented the calibration pipeline and evaluation framework and ran experiments on the Kleister-NDA dataset. Our method outperformed raw logits and verbal confidence on Expected Calibration Error, Negative Log-Likelihood, and Brier Score, delivering more trustworthy confidence estimates for legal-document information extraction. Awards: 5th place at the University of Ljubljana Data Science Competition 2025 and 2nd place (Poster Presentation) at the UKRI CDT AI Research Showcase, University College London.
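As a rough illustration of two of the metrics named above, here is a minimal sketch of Expected Calibration Error (with equal-width bins) and the Brier score for binary correctness labels. It is not the project's evaluation code: the function names, the choice of 10 bins, and the toy inputs are assumptions made for the example.

```python
from typing import Sequence


def brier_score(confidences: Sequence[float], correct: Sequence[int]) -> float:
    """Mean squared gap between confidence and the 0/1 correctness label."""
    return sum((c - y) ** 2 for c, y in zip(confidences, correct)) / len(confidences)


def expected_calibration_error(
    confidences: Sequence[float], correct: Sequence[int], n_bins: int = 10
) -> float:
    """Weighted average of |mean confidence - accuracy| over equal-width bins."""
    n = len(confidences)
    bins = [[] for _ in range(n_bins)]
    for c, y in zip(confidences, correct):
        # A confidence of exactly 1.0 is placed in the last bin.
        idx = min(int(c * n_bins), n_bins - 1)
        bins[idx].append((c, y))

    ece = 0.0
    for members in bins:
        if not members:
            continue  # empty bins contribute nothing
        avg_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(y for _, y in members) / len(members)
        ece += len(members) / n * abs(avg_conf - accuracy)
    return ece


# Toy data (hypothetical): ten predictions at 0.9 confidence, nine correct.
toy_conf = [0.9] * 10
toy_correct = [1] * 9 + [0]
toy_ece = expected_calibration_error(toy_conf, toy_correct)
toy_brier = brier_score(toy_conf, toy_correct)
```

On the toy data the model is right 90% of the time at 0.9 confidence, so the ECE is essentially zero. The Brier score stays above zero because it also penalizes every prediction that is not fully confident.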

Improving Prompt Sensitivity of LLMs

In this project, my teammates and I investigated how to make large language models more reliable when the wording of a prompt changes. We tested inference-time strategies (such as Chain-of-Thought, Self-Consistency, and Self-Refinement) and fine-tuned models using parameter-efficient LoRA on groups of paraphrased prompts. I contributed to the design of the experiments and the evaluation pipeline, using metrics like the Prompt Sensitivity Index (POSIX) and an LLM-as-a-Judge scoring method to compare models including LLaMA-2, Mistral, and Falcon. The comparison highlighted how different techniques affect robustness and answer quality.

Analyzing the Nature and Extent of Stylistic Evolution in Popular Music using Machine Learning

This project, stemming from a collaboration with Prof. Minsu Park at NYU Abu Dhabi, evolved into my capstone thesis. It uses machine learning techniques to analyze the evolution of popular music in the United States, uncovering continuous changes with occasional radical shifts. It challenges the role of genre as the sole driver and highlights the multifaceted nature of that evolution.

Complex-Network Resilience Analysis of the European Power Grid

In this project, my teammates and I examined how Europe’s high-voltage power grid responds to structural and functional attacks using advanced network-science techniques. We built a role-aware model that separates generation, transmission, and conversion stations, then simulated targeted node removals using degree, betweenness, closeness, and PageRank centrality. I helped design and run the experiments, tracking structural collapse through the largest connected component and functional failure via the percentage of unserved nodes. We also recreated the April 2025 Iberian blackout, showing how cascading failures from generator trips and France–Iberia disconnection led to total regional collapse.
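To make the attack simulation concrete, here is a minimal sketch of a degree-targeted node-removal experiment that tracks the size of the largest connected component after each removal. It is an illustration only, not the project's code: the adjacency-dictionary representation, the function names, the choice to recompute degrees after every removal, and the star-shaped toy graph are all assumptions.

```python
from collections import deque
from typing import Dict, Hashable, List, Set

Graph = Dict[Hashable, Set[Hashable]]


def largest_component_size(adj: Graph) -> int:
    """Size of the largest connected component, found with breadth-first search."""
    seen = set()
    best = 0
    for start in adj:
        if start in seen:
            continue
        seen.add(start)
        queue = deque([start])
        size = 0
        while queue:
            node = queue.popleft()
            size += 1
            for neighbor in adj[node]:
                if neighbor not in seen:
                    seen.add(neighbor)
                    queue.append(neighbor)
        best = max(best, size)
    return best


def remove_node(adj: Graph, node: Hashable) -> Graph:
    """Return a copy of the graph without `node` and its incident edges."""
    return {u: nbrs - {node} for u, nbrs in adj.items() if u != node}


def degree_attack(adj: Graph, n_removals: int) -> List[int]:
    """Repeatedly remove the current highest-degree node; record LCC sizes."""
    sizes = [largest_component_size(adj)]
    for _ in range(n_removals):
        if not adj:
            break
        target = max(adj, key=lambda u: len(adj[u]))
        adj = remove_node(adj, target)
        sizes.append(largest_component_size(adj))
    return sizes


# Hypothetical toy network: one hub station connected to four leaf stations.
star = {
    "hub": {"a", "b", "c", "d"},
    "a": {"hub"},
    "b": {"hub"},
    "c": {"hub"},
    "d": {"hub"},
}
lcc_sizes = degree_attack(star, 1)  # [5, 1]: removing the hub isolates every leaf
```

Swapping the `key` function in `degree_attack` for a betweenness, closeness, or PageRank score would give the other targeting strategies mentioned above.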

Reinforcement Learning for Uplift Modeling

In this project, my teammates and I explored how reinforcement learning can improve marketing uplift modeling. We reproduced and extended the R-Lift algorithm, framing the problem as a Markov Decision Process and training a neural policy-gradient model to target users most likely to respond to treatment. I implemented experiments on the Hillstrom and Criteo datasets, analyzed Qini curves and coefficients to measure incremental lift, and addressed challenges such as treatment-control imbalance and large-scale data handling. Our results showed higher Qini coefficients than standard baselines, but also revealed stability issues, with fluctuations below the random baseline, highlighting the trade-offs in RL-based uplift methods.
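For readers unfamiliar with Qini curves, here is a minimal sketch of one common formulation: users are ranked by predicted uplift, and at each cutoff the curve records treated responders minus control responders rescaled to the treated group size. Normalization conventions for the Qini coefficient differ between libraries and papers, so the unnormalized area used here, the function names, and the toy data are assumptions rather than the project's exact definitions.

```python
from typing import List, Sequence


def qini_curve(
    scores: Sequence[float], treated: Sequence[int], outcomes: Sequence[int]
) -> List[float]:
    """Cumulative incremental responses when targeting users by descending score."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    n_t = n_c = y_t = y_c = 0
    curve = [0.0]
    for i in order:
        if treated[i]:
            n_t += 1
            y_t += outcomes[i]
        else:
            n_c += 1
            y_c += outcomes[i]
        # Rescale control responders to the size of the treated group so far.
        scaled_control = y_c * n_t / n_c if n_c else 0.0
        curve.append(y_t - scaled_control)
    return curve


def qini_coefficient(curve: Sequence[float]) -> float:
    """Area between the Qini curve and the straight random-targeting line."""
    n = len(curve) - 1
    random_line = [curve[-1] * k / n for k in range(n + 1)]

    def area(ys: Sequence[float]) -> float:
        # Trapezoidal rule with unit spacing between cutoffs.
        return sum((ys[k] + ys[k + 1]) / 2 for k in range(n))

    return area(curve) - area(random_line)


# Hypothetical toy data: the top-ranked treated user is the only responder.
toy_scores = [0.9, 0.8, 0.2, 0.1]
toy_treated = [1, 0, 1, 0]
toy_outcomes = [1, 0, 0, 0]
toy_curve = qini_curve(toy_scores, toy_treated, toy_outcomes)
toy_qini = qini_coefficient(toy_curve)
```

A positive coefficient means the ranking captures incremental responses earlier than random targeting would. A curve that dips below the random line over some range produces the kind of instability described above.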

FaceHack: Attacking Facial Recognition Systems Using Malicious Facial Characteristics

At the Modern Microprocessor Architectures Lab at NYUAD, I researched backdoors in facial recognition systems. I built a state-of-the-art facial recognition system and ran data-poisoning experiments that altered facial attributes. The results of this work were published in IEEE Transactions on Biometrics, Behavior, and Identity Science.

Cross Lingual Transfer - XLMR

In this paper, my teammate and I evaluate the ability of the XLM-R model to learn and transfer grammatical knowledge from a source language (English) to four target languages of varying similarity to it (German, Hebrew, French, and Russian). We also test the model on a low-resource language (Nepali).