AWS Batch
- Fully managed service
- Lets you schedule & run batch jobs on AWS compute: EC2, EKS, or Fargate, using On-Demand or Spot capacity
- Runs containerized jobs, including ML workloads (open question: how do the two differ here?)
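
As a rough sketch of what "schedule & run a batch job" looks like in practice, the Python below builds the arguments for the Batch SubmitJob call and, optionally, sends them with boto3. The job name, queue name, and job definition name are hypothetical placeholders; a job queue and job definition are assumed to exist in the account already.

```python
def build_submit_job_request(job_name, job_queue, job_definition,
                             command=None, vcpus=None, memory_mib=None):
    """Build keyword arguments for the AWS Batch SubmitJob API."""
    request = {
        "jobName": job_name,
        "jobQueue": job_queue,            # hypothetical queue name
        "jobDefinition": job_definition,  # hypothetical job definition name
    }
    overrides = {}
    if command:
        # Replaces the container command set in the job definition.
        overrides["command"] = list(command)
    requirements = []
    if vcpus is not None:
        requirements.append({"type": "VCPU", "value": str(vcpus)})
    if memory_mib is not None:
        requirements.append({"type": "MEMORY", "value": str(memory_mib)})
    if requirements:
        overrides["resourceRequirements"] = requirements
    if overrides:
        request["containerOverrides"] = overrides
    return request


def submit(request):
    """Send the request to AWS Batch (needs boto3 and AWS credentials)."""
    import boto3  # imported here so building a request works without boto3
    return boto3.client("batch").submit_job(**request)["jobId"]


if __name__ == "__main__":
    example = build_submit_job_request(
        "nightly-report", "example-queue", "example-job-def",
        command=["python", "report.py"], vcpus=2, memory_mib=4096,
    )
    print(example)
```

Keeping request construction separate from the API call makes it possible to inspect the request without touching AWS. Whether the job lands on EC2, Fargate, or EKS is decided by the compute environment behind the queue, not by this call.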
Amazon EMR (Elastic MapReduce)
- Managed cluster platform that simplifies running big data frameworks
  - e.g. Apache Hadoop
  - Apache Spark
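- Supports several types of file systems
  - Hadoop Distributed File System (HDFS): distributed across the cluster's nodes; ephemeral, so its data is lost when the cluster terminates
  - EMR File System (EMRFS): lets the cluster read and write data directly in Amazon S3 as if it were a Hadoop file system
  - Local file system: disk attached to an individual instance; data persists only for the lifetime of that instance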
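
On a cluster, jobs usually pick one of these storage layers through the URI scheme of the path they read or write. The Python sketch below illustrates that mapping. The helper name `storage_layer` and the example paths are hypothetical, and paths without a scheme are rejected here for simplicity rather than resolved against the cluster's default file system.

```python
from urllib.parse import urlparse

# URI scheme -> storage layer as seen from an EMR cluster.
STORAGE_BY_SCHEME = {
    "hdfs": "HDFS",   # distributed across cluster nodes, ephemeral
    "s3": "EMRFS",    # objects stored in Amazon S3
    "file": "local",  # disk attached to a single instance
}


def storage_layer(uri):
    """Return which storage layer a path refers to, based on its scheme."""
    scheme = urlparse(uri).scheme
    if scheme not in STORAGE_BY_SCHEME:
        raise ValueError(f"unrecognized or missing scheme in {uri!r}")
    return STORAGE_BY_SCHEME[scheme]


if __name__ == "__main__":
    for path in ("hdfs:///data/logs",
                 "s3://example-bucket/input/",
                 "file:///mnt/tmp/scratch"):
        print(path, "->", storage_layer(path))
```

A common pattern is to keep long-lived input and output in S3 through EMRFS and use HDFS or local disk only for intermediate data, since both disappear with the cluster or instance.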