Data Engineer building large-scale data infrastructure and pipeline systems.
Currently working at SK Planet on the AI Data Platform team, where I manage enterprise-grade Hadoop ecosystems and develop internal data platform tools that serve the entire organization.
- Managing 1,900+ Hive tables across 48 databases on a large-scale Hadoop cluster
- Leading Hadoop 2 โ 3 migration for enterprise data infrastructure
- Designing and implementing data access control architectures with Apache Ranger
- Ensuring secure, governed access to data assets across the organization
- Exploring Text-to-SQL capabilities for Hive/HiveQL environments
- Analyzing complex BI-generated queries and building training datasets
- Bridging the gap between natural language and enterprise data queries
Data & Distributed Systems
Hadoop HDFS Hive HiveQL Trino Apache Ranger ClickHouse
Backend & DevOps
Python Node.js Express Nginx PM2 Linux Shell Script
Frontend & Full-Stack
React MongoDB
Monitoring & Testing
Langfuse Locust Grafana
Package & Process Management
uv pip npm Docker
- Open source contributions
- Infrastructure optimization & performance tuning
- Technical blogging & knowledge sharing
- Building developer tools that make data teams more productive

