Difference between batch processing and real-time processing in Apache Spark
| Batch Processing | Real-time Processing |
|---|---|
| Processes data in fixed batches | Processes data in near real-time |
| Suitable for processing large | Suitable for processing streaming |
| volumes of data | data in real-time |
| Typically used for offline | Typically used for online |
| analytics and data transformations | analytics and continuous processing |
| Examples: ETL jobs, | Examples: Real-time monitoring, |
| batch aggregations | fraud detection |
