AWS re:Invent 2022 Executive Summary of Analytics and Data Management Announcements
Author: Tom Hoblitzell | 3 min read | January 5, 2023
AWS had a lot to say about its analytics and data management capabilities at re:Invent 2022. From the keynotes to the sessions throughout the week, Amazon Redshift, AWS Glue, Amazon QuickSight, and many other services received significant announcements. We have the highlights from the event in this handy summary.
Amazon Redshift Announcements
- AWS is delivering a zero-ETL integration between Amazon Aurora and Amazon Redshift to support near real-time access to transactional data stored in Aurora. This feature is in limited preview.
- Amazon Redshift is expanding its SQL capabilities with support for MERGE, ROLLUP, CUBE, and GROUPING SETS, designed to facilitate data warehouse migrations. These features are in preview.
- Real-time streaming ingestion for Amazon Kinesis Data Streams and Amazon Managed Streaming for Apache Kafka is now generally available for Redshift.
- Dynamic Data Masking support for Amazon Redshift is now in preview.
- Amazon Redshift integration for Apache Spark streamlines the process for using Redshift in your Apache Spark applications.
- AWS Backup now supports Amazon Redshift, allowing you to schedule manual Redshift snapshots and restore from them.
- Auto-copy Amazon S3 data into your Redshift data warehouse for continuous data loading. This feature is in preview.
- Run your Amazon Redshift RA3 clusters in Multi-AZ deployments for expanded disaster recovery. This functionality is in preview.
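For readers less familiar with the ROLLUP and CUBE clauses mentioned above, here is a minimal Python sketch of what they compute under standard SQL semantics. It is an illustration only, not Redshift code: the function names (`rollup`, `cube`, `aggregate`) and the sample sales rows are hypothetical and do not come from the AWS announcement.

```python
from itertools import combinations


def rollup(*columns):
    """Grouping sets implied by ROLLUP(c1, ..., cn): each prefix, longest first."""
    return [tuple(columns[:i]) for i in range(len(columns), -1, -1)]


def cube(*columns):
    """Grouping sets implied by CUBE(c1, ..., cn): every subset of the columns."""
    return [
        subset
        for size in range(len(columns), -1, -1)
        for subset in combinations(columns, size)
    ]


def aggregate(rows, columns, grouping_sets, measure):
    """Sum `measure` once per grouping set.

    Columns that are not part of the current grouping set are reported as
    None, mirroring the NULLs that SQL returns for subtotal rows.
    """
    totals = {}
    for grouping_set in grouping_sets:
        for row in rows:
            key = tuple(row[c] if c in grouping_set else None for c in columns)
            totals[key] = totals.get(key, 0) + row[measure]
    return totals


# Hypothetical sample data, for illustration only.
sales = [
    {"region": "EU", "product": "A", "amount": 10},
    {"region": "EU", "product": "B", "amount": 5},
    {"region": "US", "product": "A", "amount": 7},
]

columns = ("region", "product")
rollup_totals = aggregate(sales, columns, rollup(*columns), "amount")

for key, total in rollup_totals.items():
    print(key, total)
```

In this sketch, ROLLUP over `region` and `product` yields per-product rows, per-region subtotals, and a grand total, while CUBE would additionally yield per-product subtotals across all regions. GROUPING SETS lets you list exactly the combinations you want.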
AWS Glue Announcements
- AWS Glue 4.0 has launched, adding more data formats, updated engines, Ray support, and many other features.
- AWS Glue for Apache Spark has added support for several open-source data lake storage frameworks. These include Linux Foundation Delta Lake, Apache Iceberg, and Apache Hudi.
- Define, reuse, and share your business-specific extract, transform, load (ETL) logic with your team through AWS Glue custom visual transforms.
- AWS Glue Data Quality offers automatic measurement and monitoring of data lake and data pipeline quality, which it uses to provide data quality recommendations. This feature is in preview.
Amazon QuickSight Announcements
- AWS has expanded Amazon QuickSight API capabilities to support DevOps automation and migration acceleration. This functionality is generally available.
- You can now create Paginated Reports in Amazon QuickSight, which lets you build custom-formatted, multipage reports from detailed operational data. This feature is generally available.
Other AWS Data Management and Analytics Announcements
- Amazon DataZone is a brand-new data management service, currently in preview, that offers governed analytics to “share, search, and discover data at scale across organizational boundaries.”
- Amazon OpenSearch Service now offers a serverless option called Amazon OpenSearch Serverless. This option is in preview. As part of this release, Amazon Kinesis Data Firehose now supports data stream delivery to OpenSearch Serverless.
- Amazon Athena now has Apache Spark support.
Data management on AWS looks like it will be quite exciting as we enter 2023. Want to discuss how you can best leverage these new capabilities to get more value from your data? Contact us to connect with our AWS and data management specialists.