Amazon Managed Workflows for Apache Airflow makes it easy to author, schedule, and monitor data processing and machine learning workflows in the cloud
Amazon Web Services, Inc. (AWS), announced the general availability of Amazon Managed Workflows for Apache Airflow (MWAA), a new managed service that makes it easy for data engineers to execute data processing workflows in the cloud. Apache Airflow is a popular open-source tool that helps customers author, schedule, and monitor workflows. With Amazon MWAA, customers can use the same familiar Airflow platform as they do today to manage their workflows, and enjoy improved scalability, availability, and security without the burden of having to build, scale, and manage the underlying infrastructure. Amazon MWAA scales workflow execution capacity based on customer needs, and integrates with AWS security services to provide secure access to customers’ data. There are no up-front investments required to use Amazon MWAA and customers only pay for what they use.
Read More:Â InMarket Wins Best Location Platform At The 2020 Digiday Technology Awards
“Customers have told us they really like Apache Airflow because it speeds the development of their data processing and machine learning workflows, but they want it without the burden of scaling, operating, and securing servers”
Today, customers are using analytics and machine learning to derive insights from massive amounts of data. To effectively use this data, customers often need to first build a workflow that defines a series of sequential tasks to prepare and process the data. Tens of thousands of customers use AWS Step Functions to visually build and run cost-effective and scalable event-driven workflows that execute tasks across multiple AWS services. There are also customers who want the Apache Airflow orchestration workflow, which has an active open source community, a large library of pre-built integrations to third-party data processing tools like Apache Spark and Hadoop, and the ability to use Python scripts to create workflows. However, using Apache Airflow requires data engineers to install, maintain, scale, and secure the Apache Airflow environments, which adds cost and operational complexity. Furthermore, to support role-based authentication for secure access, Apache Airflow often requires a manual, iterative, and error-prone combination of configuration changes, command-line interface (CLI) commands, and, in some cases, edits to the Apache Airflow code. Customers also must integrate and configure additional tools for alerting for issues like system downtime, workflow errors, and task execution delays. While customers really enjoy the pre-built integrations and familiar Python programming language of Apache Airflow, they want it without the added operational cost and complexity.
Read More:Â Atos Named By Gartner As An Edge AI Technology Innovator For 2020
Amazon MWAA makes it easy for customers to build and execute Apache Airflow workflows in AWS. Amazon MWAA manages the provisioning and ongoing maintenance of Apache Airflow so customers no longer need to worry about patching, scaling, or securing self-managed Apache Airflow implementations. With Amazon MWAA, compute resources that execute tasks are scaled on demand, providing consistent performance for users. Customer data is secure by default as workloads run in customers’ own isolated and secure cloud environments using Amazon’s Virtual Private Cloud (Amazon VPC), with stored data encrypted using AWS Key Management Service (AWS KMS). Amazon MWAA makes it easy for customers to combine data using any of Apache Airflow’s integrations, including AWS services and popular third-party tools like Apache Hadoop, Presto, Hive, and Spark, to automate data processing, machine learning pipelines, and software development and operations. Customers can provide role-based access to Apache Airflow’s user interface easily and securely via AWS Identity and Access Management (IAM), providing users Single Sign-On (SSO) access for scheduling and viewing their workflow executions. Amazon MWAA automatically sends Apache Airflow system metrics and logs to AWS’s monitoring service, Amazon CloudWatch, making it easy for customers to view task execution delays and workflow errors across one or more environments without third party tools. With Amazon MWAA, data engineers get the extensibility of Apache Airflow with the scalability, availability, and security of AWS.
Read More: SOPHOLA, Inc Expands AdScale Enterprise Digital Ad Optimisation Platform To Indonesia Through…