Principal Data Scientist | Causal Inference & Marketing Science | AI/LLM Systems | Data Science Leadership
I lead a team of 8 Data Scientists that add business value through advanced modeling. My team and I build advanced statistical models, machine learning systems, and AI-powered tools that help organizations make data-driven decisions. My work focuses on causal inference, marketing mix modeling, LLM agentic systems, and deep learning for time series analysis.
SAGE - Strategic AI-Guided Explorer for Marketing Performance. An AI-powered copilot for Marketing Mix Modeling with natural language interface.
- π€ Agentic AI System with OpenAI function calling
- π¬ Natural language interface - ask questions in plain English
- π Automatic visualizations - generates Plotly charts on demand
- π― Budget optimization - SLSQP-based allocation across channels
- π§ RAG-powered insights - ChromaDB knowledge base with semantic search
- β‘ Real-time analysis - instant answers to MMM questions
https://SAGEinsights.streamlit.app | π Live Demo | π GitHub
llm-copilot - Production-ready agentic system for Marketing Mix Modeling with LLM orchestration, RAG, and code execution.
- π Agentic System: OpenAI function calling for tool orchestration
- π RAG Implementation: ChromaDB + OpenAI embeddings for semantic search
- π§ Dynamic Code Execution: Safe Python sandbox for on-the-fly analysis
- π Response Curve Fitting: Automatic Hill curve generation with deepcausalmmm
- ποΈ Knowledge Base: Stores curves, benchmarks, glossary, and best practices
- π Multi-Database Support: 10+ connectors (Databricks, Snowflake, BigQuery, etc.)
pip install git+https://github.com/adityapt/llm-copilot.git Tech: OpenAI GPT-4, ChromaDB, LangChain patterns, Pandas, Plotly
Creator and maintainer of DeepCausalMMM - an open-source Python package for Marketing Mix Modeling with causal inference and deep learning.
- π§ GRU-based temporal modeling for automatic adstock and lag learning
- π DAG causal discovery to reveal channel interdependencies
- π Response curves with Hill saturation for budget optimization
- π Multi-region modeling with learnable region-specific effects
- β‘ Performance: 93% holdout RΒ², 3.6% train-test gap
pip install deepcausalmmmπ Documentation | π― Quick Start | π Examples
- Agentic system design with function calling
- Retrieval-Augmented Generation (RAG)
- Prompt engineering & Chain-of-Thought
- Vector databases (ChromaDB, Pinecone)
- LLM orchestration patterns
- Marketing Mix Modeling (MMM)
- Directed Acyclic Graphs (DAGs)
- Causal structure learning
- Treatment effect estimation
- Instrumental variables
- Time series forecasting
- Recurrent Neural Networks (GRU, LSTM)
- PyTorch model development
- Bayesian inference
- Ensemble methods
- Brand Science
- Marketing Mix Modeling
- Churn prediction
- A/B testing & experimentation
- Budget optimization
- Executive Insights
-
SAGE AI Copilot (2025) - AI-powered MMM assistant with agentic system
- Deployed on Streamlit Cloud
- Natural language interface for marketing analytics
- RAG-powered insights with ChromaDB
-
LLM-Copilot (2025) - Production agentic system for MMM
- OpenAI function calling orchestration
- Dynamic code execution sandbox
- Multi-database connectivity
-
DeepCausalMMM (2025) - Advanced MMM with causal inference
- Published on PyPI with 1.0.17+ versions
- Comprehensive documentation on ReadTheDocs
- 28 comprehensive tests with 100% pass rate
- JOSS paper submitted
- ORCID: 0009-0008-9495-3932
- Focus areas: Causal inference, marketing analytics, LLM systems, time series modeling, Marketing mix modeling
- β Built production agentic system with OpenAI function calling
- β Implemented RAG with ChromaDB for semantic knowledge retrieval
- β Created natural language interface for MMM analysis
- β Integrated automatic budget optimization algorithms
- β Deployed live Streamlit application
- β Added 10+ database connectors (Databricks, Snowflake, etc.)
-
β Added non-linear response curves with Hill saturation
-
β Implemented proportional allocation for accurate scaling
-
β Enhanced Hill parameter constraints (slope β₯ 2.0)
-
β Integrated 14+ interactive visualizations
-
β Achieved 93% holdout RΒ² with 3.6% performance gap
- Installation:
pip install deepcausalmmm - Documentation: deepcausalmmm.readthedocs.io
- License: MIT (fully open source)
- Python: 3.9+ compatible
- Framework: PyTorch 2.0+
- Installation:
Currently exploring:
- LLM agent architectures and multi-agent systems
- RAG optimization techniques and hybrid search
- Transformer architectures for time series
- Causal discovery algorithms (NOTEARS, PC, GES)
- Bayesian deep learning for uncertainty quantification
- Multi-task learning for marketing applications
- Federated learning for privacy-preserving MMM
"Build AI/ML systems that are both theoretically rigorous and practically useful. Combine machine learning with causal inference to create tools that explain why, not just what."
I believe in:
- π€ AI for Good: Building LLM systems that augment human decision-making
- π Open Science: Making research accessible and reproducible
- π Documentation: Clear guides that help others learn and contribute
- π€ Community: Collaborative development and knowledge sharing
- π― Impact: Solving real problems with elegant solutions
- π€ LLM agentic systems and RAG applications
- π¬ Causal inference research and applications
- π Marketing analytics and MMM projects
- π§ Deep learning for time series
- π¦ Open source data science tools
- π Technical writing and documentation
Feel free to reach out if you're working on something interesting in these areas!