Naveen Mysore — Personal webpage

Hello! I'm Naveen Mysore, a Research and Software Engineer with 8+ years of interdisciplinary experience bridging scientific research and production software engineering across machine learning, reinforcement learning, distributed systems, and causal inference. At UC Santa Barbara, I trained reasoning LLM models using reinforcement learning, and worked on causal structure discovery using graph neural networks, interpretable AI, Bayesian updates, and time-series forecasting. Previously at Salesforce and Dell EMC, I designed and shipped large-scale enterprise software systems: backend services, cloud infrastructure (AWS, Terraform), data pipelines, and distributed systems serving millions of users in production. What drives my research is a core belief: the hardest problems in AI for Science and AI for Health demand models that are not just powerful but trustworthy. The AI Alignment problem and the need for Interpretable AI are what drew me to causal reinforcement learning, building agents that reason about cause and effect, not just correlations, so we can deploy AI safely in domains like healthcare, climate, and scientific discovery. I bring the full stack to my research, from writing CUDA kernels and building distributed training infrastructure to deploying models behind production APIs. My work sits at the intersection of reasoning-capable AI systems, strong algorithmic foundations, and rigorous software engineering, grounded in a master's in Computer Science from UNC Charlotte and a bachelor's in Electrical Engineering from PES University. I'm always open to collaborations. If you're working on interpretable AI, causal reasoning, RL, or AI for scientific discovery and would like to work together, feel free to reach out at nmysore.work [at] gmail.com.

Recent

Prediction-Based Markov Violation Scores for Detecting Non-Markovian Observations in Reinforcement Learning

Introduces prediction-based Markov Violation Scores (MVS) to detect when observations in RL violate the Markov property. The method leverages prediction errors from learned dynamics models to quantify non-Markovian behavior, enabling more robust policy learning under partial observability.

🎉 Accepted at a Reinforcement Learning conference and will appear in Reinforcement Learning Journal 2026.

arXiv Paper

Temporal Functional Circuits: From Spline Plots to Faithful Explanations in KAN Forecasting

Proposes Temporal Functional Circuits (TFCs), a framework for extracting faithful, interpretable explanations from Kolmogorov-Arnold Networks (KANs) applied to time-series forecasting. By analyzing learned spline activations, TFCs reveal how individual input features are transformed and combined, offering transparent insight into KAN predictions.

Status: Under review for NeurIPS 2026

arXiv Paper

Reasoning LLM Model trained using RL

Scan the QR code above to try our live nutrition estimation service! Text a meal description like "I had a bagel for breakfast" and get instant nutrition analysis. This LLM was trained on the NutriBench dataset and fine-tuned using Reinforcement Learning on the Llama3.1B model. The inference model is hosted on AWS for real-time responses.

GitHub Repository

Projects

Sept 2021

N arm bandits is a classical problem in computer science. In this Jupyter note book we will empirically verify that near greedy approch converges to optimal values faster than non greedy or greedy approches and maximizes the expected rewards. Jupyter Notebook

Jun 2021

Causal Structure Discovery is the problem of identifying causal relationships from large quantities of data through computational methods. Solution to this problem can have wide of applications in non empirical scientific studies like climate, biodiversity and health. The current problem is existing methods are computationally not scalable and are data intensive. Jupyter Notebook

Apr 2017

Reinforcement Learning (Q Learning based) agent trained to play Flappy Bird. demo

May 2018

Object detection on raspberry pi. demo

Robot trained to sort metal and plastic. demo

2020

Generative adverserial network with variational auto encoder. details

Jul 2020

Gaussian noise based latent vector to image. details

Recommendation system based on distance-preference matching. Demo link Available on Google app store download

HCI system developed for interactive mathematics. Demo source code paper

"Processor Enabled power management system by mechanically choosing the best batteries in a grid network" in the proceedings of CIMSIM 2011" full paper demo

An elastic group recommendation system designed for multivariate dynamic attributes. full paper

Intel Ankur ( An embedded system for fliud quality analysis).

report

Nokia PairUp. ( A wearable device to connect people with gestures) details

Dell EMC (DataDomain) system performence measuring webportal. demo