Scan the QR code above to try our live nutrition estimation service! Text a meal description like "I had a bagel for breakfast" and get instant nutrition analysis. This LLM was trained on the NutriBench dataset and fine-tuned using Reinforcement Learning on the Llama3.1B model. The inference model is hosted on AWS for real-time responses.
GitHub RepositoryNovel Markov Violation score (MVS) to detect when noise or incomplete state information disrupts the Markov assumption in reinforcement learning. Using classic control tasks, its shown that removing causally essential state variables significantly impacts both returns and Markov consistency. This framework enables robust policy development for real-world RL scenarios with partial observability.
Status: Under review for NeurIPS 2025
arXiv Paper