From Wikipedia, the free encyclopedia

Google Cloud Dataflow is a fully managed service for executing Apache Beam pipelines within the Google Cloud Platform ecosystem.

History

Google Cloud Dataflow was announced in June, 2014 [1] and released to the general public as an open beta in April, 2015. [2] In January, 2016 Google donated the underlying SDK, the implementation of a local runner, and a set of IOs ( data connectors) to access Google Cloud Platform data services to the Apache Software Foundation. [3] The donated code formed the original basis for Apache Beam.

References

  1. ^ "Sneak peek: Google Cloud Dataflow, a Cloud-native data processing service". Google Cloud Platform Blog. Retrieved 2018-09-08.
  2. ^ "Google Opens Cloud Dataflow To All Developers, Launches European Zone For BigQuery". TechCrunch. Retrieved 2018-09-08.
  3. ^ "Google wants to donate its Dataflow technology to Apache". Venture Beat. Retrieved 2019-02-21.

External links