PSTNPX

Data Scientist & Software Engineer

Turning Data into Impact

8+ years building ML pipelines, data platforms, and full-stack applications at scale.

Scroll to explore

Selected Work

Projects that moved the needle

ML Pipeline
ML Pipeline
Data Analytics
Data Analytics
DevOps & K8s
DevOps & K8s
Homelab
Homelab
Profile Picture

Tanupat Phasithjirakul

Data Scientist & Software Development Engineer

Experienced in C++, Python, DevOps and Data Analysis, adept at analytical problem-solving. Fast learner, team-oriented professional with expertise in developing data pipelines and web applications.

Email

Quick Chat

Hi! Send me a quick message and I'll get back to you soon.

CallLinkedInpstnpx.comGitHub

Skills & Expertise

Python
Docker
Kubernetes
NextJS
React
C/C++
SQL
Apache Airflow
Machine Learning
ETL Pipelines
Data Analysis
MLflow

Key Projects

Centralized MCP Gateway

AI Platform • Developer Tools • Security

Company-wide streamable MCP (Model Context Protocol) server that lets teams connect AI assistants like Claude Code to internal data sources — Presto, Redshift, Postgres, MSSQL, MLflow, Bitbucket, and Jira — using their own credentials passed via request headers. No service accounts, no stored secrets.

MCP
Python
Presto
Redshift
+5 more
More..

Enterprise LLM API Platform

AI Platform • LLMOps • Cost Governance

Self-service LLM gateway where employees generate their own API keys and call company-approved models. Built on EKS container images invoked through AWS Lambda against Amazon Bedrock. Tracks usage, activity logs, and cost breakdowns by team, API key, and user — with full RBAC and admin controls. Fully compatible with Anthropic-style tools (Claude Code, Roo Code, OpenCode).

AWS Bedrock
EKS
Lambda
Python
+3 more
More..

Test Cost Reduction ML Pipeline

Machine Learning • ETL • MLOps

Configured Airflow to process raw data from Redshift for training new models. Integrated MLflow to log and track model performance and metadata. Deployed trained models for production use.

Kubernetes
Docker
Python
Airflow
+3 more
More..

LAB PC Performance Monitoring

Full Stack • DevOps • Monitoring

Monitor the test application CPU and Memory usage during test with comprehensive dashboard and analytics capabilities.

NextJS
MySQL
Time Series DB
Grafana
+4 more
More..

Material Verification Web App

Full Stack • Process Automation

Web application to streamline the submission, review, and approval of material verification results, generate summarized material reports for improved efficiency.

NextJS
Python
Docker
MySQL
More..

ETL Pipeline for Test Time Reduction

Data Engineering • Analytics

Coded an ETL pipeline to use historical test time data to reduce the test time in production, improving efficiency and reducing costs.

Python
SQL
Apache Airflow
Kubernetes
More..

Experience

Scientist 3, Data Science

Western DigitalFeb 2024 - Present

Leading data science initiatives, developing machine learning models, and implementing ETL pipelines to optimize test processes and reduce costs.

More..

Senior Engineer, Software Development Engineering (Apps)

Western DigitalSep 2021 - Feb 2024

Coded ETL pipelines using historical test time data to reduce production test time. Developed monitoring solutions and web applications for material verification.

More..

Engineer, Software Development Engineering (Apps)

Western DigitalNov 2017 - Sep 2021

Supported test code for hard disk drive using C++. Developed applications to improve team efficiency (PyQt). Created websites to support statistical analysis requests (Python, SPSS, JS, HTML).

More..

Education

BE, Avionics Engineering

Civil Aviation Training CenterJan 2013 - Dec 2017

GPA: 2.75

Studied electronics design, airplane systems and structures

More..

Awards

IEEE Conference Presentation

Okinawa, JapanOct 2016

Design of a dual-band Verre de Champagne Fractal CPW antenna for LTE and aircraft altimeter application

Homelab & Technical Hobbies

🧠 Homelab Overview

My homelab is a self-hosted environment designed for AI/ML experimentation, automation, monitoring, privacy, and media streaming, all orchestrated in a well-integrated ecosystem.

🔧 Core Services
ServicePurpose
MLflowTracks and manages machine learning experiments and model versions
OllamaLocal LLM hosting and inference server for running models like Phi-4
OpenWebUIWeb interface for interacting with LLMs (Ollama integration)
n8nNo-code automation workflows (similar to Zapier)
Uptime KumaSelf-hosted monitoring tool for service uptime and health
InfluxDBTime-series database for metrics collection
GrafanaVisualizes metrics from InfluxDB and other sources
TraefikDynamic reverse proxy and load balancer for containerized services
PrivoxyFiltering web proxy for privacy and ad-blocking
Pi-holeNetwork-wide ad-blocking and DNS filtering
JellyfinMedia server for movies, TV shows, and music
JellyseerOverseer-like web UI for managing media requests to Jellyfin
📦 Environment & Deployment
  • Running via Docker and Kubernetes (k3s) cluster
  • Managed by FluxCD for GitOps-based deployment
  • Services exposed via Traefik with SSO authentication
  • Persistent storage via TrueNAS and RAID-Z
  • Automation and monitoring tightly coupled (n8n, Uptime Kuma, InfluxDB, Grafana)
More..

Chat with Tanupat