Arvind Puthucode

Algorithms. Data. Math.

I enjoy building and understanding scalable data systems and algorithmic tooling — lately exploring PySpark pipelines, MongoDB aggregations, and conversational agents with LangChain/LangGraph.

Data ScientistAlgorithmsRust · Python

Data Scientist — Data Genie

Remote · Jan 2024 – Present

  • MongoDB aggregations and HLL metrics for large-scale analytics.
  • PySpark pipelines re-architecture for 100M+ time-series aggregations.
  • PostgreSQL → MongoDB migration for time-series optimization.
  • LangChain/LangGraph features for internal chatbot.

Software Developer Intern — KLA Tencor

Chennai · Jun 2022 – Dec 2022

  • Charting tools for large datasets using Angular, D3.js, Plotly.
  • Performance and UX optimizations.

Education

PSG College of Technology — M.S. Theoretical CS (May 2024), GPA 8.9/10.

Advanced Algorithms · Reinforcement Learning · Stochastic Processes

Skills

PythonRustPySparkAirflowLangGraphLangChainGame Theory

Projects

View all →
© 2025 Arvind Puthucode