Data · Advanced

Spark Batch Processing


Optimize partitioned jobs and shuffle stages, and choose cost-aware cluster settings.

Hands-on PySpark labs from local mode to small EMR-style clusters. Focus on partitioning, caching decisions, and explaining physical plans in mentor reviews.

₩1,890,000 · 10 weeks · Blended


Cluster status panel showing job stages and executor metrics

Included in this cohort

PySpark 3.5 lab images
Explain plan reading drills
Skew mitigation strategies
Delta Lake intro module
Cost estimation worksheet
Performance regression lab
Capstone on clickstream aggregation

Outcomes you can show

Cut a sample job's runtime by a measurable percentage
Document partition strategy trade-offs
Present Spark UI screenshots in your portfolio

Mentor

Min-jun Park

Spark practitioner focused on ad-tech batch reconciliation jobs.

Common questions

Scala track?

PySpark only. Scala snippets appear in reading lists.

Cluster costs?

Not included.

Learner notes

"Shuffle drill finally made stage timelines readable. Still tuning UDF habits."

Kenji · Platform engineer · AdPulse