Data can be processed as unbounded or bounded streams.
-
Unbounded streams have a start but no defined end. They do not terminate and provide data as it is generated. Unbounded streams must be continuously processed, i.e., events must be promptly handled after they have been ingested. It is not possible to wait for all input data to arrive because the input is unbounded and will not be complete at any point in time. Processing unbounded data often requires that events are ingested in a specific order, such as the order in which events occurred, to be able to reason about result completeness.
-
Bounded streams have a defined start and end. Bounded streams can be processed by ingesting all data before performing any computations. Ordered ingestion is not required to process bounded streams because a bounded data set can always be sorted. Processing of bounded streams is also known as batch processing.
What is Apache Flink? — Architecture
from apache.org
Filed under:
Related Notes
- Often, people who don’t have access to the raw data expect one narr...from Josh Beckman
- The upshot for the industry at large, is: the **LLM-as-Moat model h...from Steve Yegge
- The first image ever transmitted to Earth from another planet was r...from Instagram
- My experience is companies do not anticipate that the cost of monit...from Mathew Duggan
- Ad-hoc validation leads to a phenomenon that the [language-theoreti...from Alexis King
- the difference between validation and parsing lies almost entirely ...from Alexis King
- few days ago I did something that I never thought I’d do again, and...from brandur.org
- Syntax coloring isn't useless, it is childish, like training wh...from Douglas Crockford