Introduction To Analytics On AWS
Introduction To Analytics On AWS
Analytics on AWS
Lesly Reyes
Telco Solutions Architect
© 2022, Amazon Web Services, Inc. or its affiliates. © 2022, Amazon Web Services, Inc. or its affiliates.
Customers want more value from their data
Machine
Learning Databases
Catalog People,
Data
Apps, and
Sources
Governance Devices
Data
Analytics
Lakes
Amazon
S3
Catalog
Cost-effectively scale storage to exabytes
© 2022, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
AMAZON
AMAZON AMAZON AMAZON AMAZON
OPENSEARCH
REDSHIFT ATHENA EMR KINESIS & MSK
SERVICE
Data Query all your Big data Log and search Real-time
warehousing data using SQL processing analytics analytics
or Python
AWS
Amazon EMR
Analytics Amazon QuickSight
Big data processing Visualization
Amazon
Redshift Price-performance at any scale
EMR
RUN BIG DATA APPLICATIONS Automatically scale up and down
IN THE CLOUD
Best price-performance
Amazon
EMR Multiple deployment models
Kinesis
COLLECT, PROCESS, AND Kinesis Video Streams
ANALYZE VIDEO AND DATA
STREAMS IN REAL TIME
Secure
Amazon S3
Portfolio of integrated Lake Formation
analytics tools
Simplified
ingest and
cleaning
Amazon Athena Amazon QuickSight AWS Glue Blueprints ML Transform
Cost effective, durable
data lake storage with
global replication capabilities
Simplify security
management with
AWS Lake Formation Data Lake
Admin Lake Access Data
Formation Control Catalog
Data Lake
QuickSight
CLOUD-NATIVE BI SOLUTION
FOR ILLUMINATING
Deeply integrated with AWS services
ORGANIZATIONAL INSIGHTS
SIMPLE, SCALABLE,
AND SERVERLESS
No servers to manage
Scalable Data
Integration Engine
Execution engine
Monitor
Glue crawlers
Execution engine
Built-in data transforms Glue data catalog Glue connectors Persona specific tools
Built-in data transforms Glue data catalog Glue connectors Persona specific tools
Cost-effective
Serverless, no setup Federated queries ANSI SQL, Apache Spark Pay only for what you use
across 35+ data stores
Instant start, SQL: Save on
optimized runtimes Use PySpark ecosystem Multiple formats, per-query costs
for fast results compression types, and through compression
Simplified notebooks on complex joins
Point to S3 and console for PySpark and data types Python: minimize idle
start querying compute charges
Amazon Amazon
OpenSearch Aurora
Service Amazon S3
Amazon Amazon
Redshift SageMaker
ETL developer
Business Analyst
Data Scientist
Distributed processes
Data engineer
© 2022, Amazon Web Services, Inc. or its affiliates. © 2022, Amazon Web Services, Inc. or its affiliates. 34