How To Achieve Aws Azure or GCP Observability at Scale
How To Achieve Aws Azure or GCP Observability at Scale
Table of Contents
Section I: Section II: Section III:
Growth & Challenges of Guide to Achieving Multi-Cloud How to Excel at Multi-Cloud
Multi-Cloud Observability Observability Across AWS, Observability Across AWS,
Azure, and GCP Azure, and GCP
Why are Enterprises 4 What Should You Monitor 14 How to Enhance Amazon 33
Adopting Multi-Cloud? on AWS? Cloudwatch to get Full-Stack,
Multi-Cloud Observability
Challenges of Monitoring 6 Why Choose Wavefront for 15
Applications Across Multi-Cloud AWS Observability How to Monitor Serverless 39
Applications in AWS
Overview of Wavefront by 9 What Should You Monitor 17
VMware: Enterprise Observability on Azure? Guide to AWS App Mesh 43
and Analytics Platform Observability and Analytics
for Multi-Cloud Environments Why Choose Wavefront 21
for Azure Observability Optimize Both Service 46
Performance and Cloud Costs
What Should You Monitor 23 with CloudHealth and Wavefront
on GCP?
Top 5 Lessons Learned 50
Why Choose Wavefront 27 from Customers with
for GCP Observability Multi-Cloud Monitoring
2
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
Section I:
Growth & Challenges of
Multi-Cloud Observability
3
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
4
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
Before we go into the details of each Here are some of the reasons why enterprises
cloud, lets clarify the difference between are adopting a multi-cloud approach:
multi-cloud and hybrid cloud. Hybrid cloud
is a kind of multi-cloud which uses both Utilize best of the breed services Data sovereignty
private and public cloud with orchestration AWS, Azure, and GCP provide a breath Certain laws and corporate policies require
between the two clouds. For instance, of amazing services (IaaS, PaaS, SaaS), enterprises to store their data in certain
VMware Cloud on AWS is a hybrid cloud including many CI/CD, Kubernetes, locations, often within country/state
solution that enables you to extend your AI/ML services. Engineering teams want boundaries. Multi-cloud helps enterprises
on-premises Software-Defined Data Center to benefit from those services and not to scale to different geographies while
(SDDC) to AWS public cloud with VMware re-invent the wheel. abiding by data sovereignty laws.
vCenter acting as the orchestrator between
the two. Now that we know the difference Optimize cost Avoid vendor lock-in
between multi-cloud and hybrid cloud, Different cloud providers have different Enterprises are very aware that being
let’s concentrate on multi-cloud and why pricing models. Organizations often make all-in on one public cloud provider gives
enterprises are increasingly adopting a the decision to adopt different cloud that cloud provider an edge on pricing
multi-cloud strategy. providers or run dev and production negotiations. So, enterprises often
workloads on a different cloud, if the adop a multi-cloud strategy to avoid
price is favorable. vendor lock-in.
5
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
2. Challenges of
Monitoring Applications
Across Multi-cloud
There are many benefits of adopting a multi-cloud approach, as Siloed cloud vendor tools lead
shown above. In fact, in many ways, the adoption of multi-cloud has to slower troubleshooting and
made life much easier for engineering teams. They can now select significant revenue impact
the cloud of their choice, deploy in minutes, and in some cases
altogether, avoid having any datacenters. Furthermore, they can Organizations often start with the tools provided
take advantage of the best of breed IaaS/PaaS and SaaS services by the cloud vendor to monitor their applications.
provided by different cloud providers. But they soon realize that cloud vendor tools are
built for only one vendor and don’t effectively
But at the same time, multi-cloud has introduced many new monitor multi-cloud environments. To overcome
challenges for the engineering team as well. Now instead of one this problem, Ops/DevOps teams find themselves
cloud, engineering teams need to learn, deploy, and monitor across using multiple siloed monitoring solutions that fail
multiple cloud providers. Because cloud providers are so different, to provide a unified view across different cloud
monitoring across different cloud environments is not easy. Here are leading to increased troubleshooting time and
some of the challenges faced by engineering teams while monitoring significant revenue impact.
multi-cloud environments:
6
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
7
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
8
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
9
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
10
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
11
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
CloudWatch
retrieves AWS metric
and dimension data.
CloudTrail
retrieves EC2 event information and
creates Wavefront System events that
represent the AWS events.
AWS Metrics+
retrieves additional metrics using AWS APIs
other than CloudWatch. Data include EBS
volume data and EC2 instance metadata like tags.
You can investigate billing data and the number
of reserved instances. Be sure to enable AWS+
metrics because it allows Wavefront to optimize
its use of Cloudwatch and saves money
on Cloudwatch calls as a result.
12
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
13
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
14
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
15
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
16
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
17
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
18
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
19
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
20
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
21
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
22
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
23
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
Achieve OOTB
GCP Observability
with Wavefront
In response to increasing demand from Wavefront
customers, we have introduced Wavefront GCP
Integration. The Wavefront GCP integration
collects, analyzes, and visualizes key metrics from
Google Compute Engine, Google Kubernetes
Engine, and other Google Cloud services.
It delivers metric-driven visibility into critical
cloud performance and usage metrics. In addition,
you can monitor all your applications that run
on Google Cloud with Wavefront.
24
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
Google App Google Cloud Google Cloud Google Cloud Google Cloud
Engine Billing Datastore Functions Logging
Google App Google Cloud Google Cloud Google Cloud Google Cloud
Pub/Sub Router Spanner Storage VPN
See our documentation for more details and the most up to date list of Wavefront GCP integrations.
25
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
Pre-built Wavefront GKE Dashboard Showing Nodes, Namespaces, Pods, Containers etc.
26
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
27
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
28
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
29
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
30
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
31
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
32
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
33
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
34
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
Wavefront Interlinked
Multi-View of AWS
Infrastructure
35
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
3. Troubleshoot Microservices
Faster with Metrics, Histograms,
Traces and Span Logs
For troubleshooting applications that use AWS
ECS or AWS Lambda, CloudWatch/CloudTrail
offers only a few metrics and some complex
logs. Consider using Wavefront’s enterprise
observability instead. We offer the unique
monitoring combination of out-of-the-box metrics,
histograms, and traces at cloud scale, with
potentially over a million data points per second.
Metrics tell you system-wide when there is a
problem. Histograms can overcome the limitations
of averages. Histograms help you find problems
with high-speed metrics like sub-second response
times and latencies. Distributed tracing, with
visibility into the entire API call flow, lets you drill Wavefront Enterprise Observability – Metrics,
down and isolate issues across? In? microservices. Histograms, Traces – Go System Wide to Detailed Flow
In short, Wavefront is a single platform that unifies for Faster Troubleshooting
metrics, histograms, and traces and significantly
reduces MTTR.
36
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
37
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
38
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
39
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
40
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
How to Monitor
1 aws.lambda.wf.invocations.count Count of number of Lambda function invocations aggregated at the server.
Serverless Better 3
4
aws.lambda.wf.errors.count
aws.lambda.wf.error_event.count
Count of number of errors aggregated at the server.
with Wavefront 5 aws.lambda.wf.coldstarts.count Count of number of cold starts aggregated at the server.
Using the Wavefront Lambda SDK (Python, 7 aws.lambda.wf.duration.value Execution time of the Lambda handler function in milliseconds.
those functions in Wavefront. There is no need 2 Region AWS Region of the Lambda function.
metrics to Wavefront. Also, since the metrics are 6 Resource The name and version/alias of Lambda function. (Ex: DemoLambdaFunc:aliasProd).
41
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
Lambda Integration
Sample Code
As an application developer if you’re developing
your Lambda function in say Python, all you have
to do is to decorate your AWS Lambda handler
function with @wavefront_lambda.wrapper, and
voila, all the standard Lambda metrics with the
point tags are sent directly to Wavefront.
import wavefront_lambda
@wavefront_lambda.wrapper
def handler(event, context):
# your code
42
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
Service Mesh
In any service mesh, including AWS App Mesh, critical KPIs to • Uptime • Concurrency
• Allocated memory • Total connections
monitor are as follows:
Health of
• Heap size • Internal flush buffer size
• Live/Currently draining • File reopens
• Health of the service mesh
• Health of your services
Envoy provides the following service and cluster health metrics:
• Performance and activity of your services
Health of
services
• Active listeners
• Active clusters
Because App Mesh uses Envoy proxy as the data plane, • Total healthy/unhealthy clusters
you can determine the health of the mesh from Envoy proxy
health stats. Envoy proxy also provides many health and Envoy proxy provides many performance metrics
Some of the main service performance metrics are as follows:
activity metrics for different service clusters. Here are the
Performance
you can determine the health of the mesh from Envoy proxy
health stats. Envoy proxy also provides many health and
43
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
44
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
45
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
As enterprises are adopting more public cloud services, By using CloudHealth and
they are often surprised by growing, instead of shrinking Wavefront together, you gain:
expenses. As a result, insight into cloud cost ROI is essential
to make proper business decisions. CloudHealth by VMware 1) Rightsizing Analysis
helps you manage costs, usage, security, and governance
In many large cloud environments, it’s common to find
across your entire cloud environment. CloudHealth analyzes
severely underutilized infrastructure that drives up costs.
native metrics from cloud providers out-of-the-box, but in
Using CloudHealth In many large cloud environments, it’s
many cases, that is not enough for a 360 view. If you use
common to find severely underutilized infrastructure that
CloudHealth and Wavefront together, CloudHealth can take
drives up costs. Using CloudHealth and Wavefront together
advantage of Wavefront’s vast real-time, highly granular,
can help solve this problem. The Wavefront platform
multi-cloud (AWS, Azure, GCP) infrastructure usage metrics,
collects granular performance data from the application,
and provide effective recommendations for cost savings of
CPU, memory, network, and disk usage from instances
your service.
and virtual machines on AWS, Azure, and GCP. This tag
enriched data is then ingested and used by CloudHealth
46
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
to do a rightsizing analysis. CloudHealth provides specific you do rightsizing analysis by function, so you can quickly find
recommendations around downsizing, or even terminating which departments or teams need to adjust their behavior
instances, including the suggested instance type for your when provisioning new infrastructure.
services’ optimal performance. By following these rightsizing
recommendations, for example, downsizing an instance
that is underutilized, you can save time and money without 3 - Balance Cost with Performance
interruption to the service.
Not only can CloudHealth help identify assets that are
underutilized and that can be downsized for cost savings, but
“I can easily identify which Elastic it can also help with tracking and managing expenses in the
Load Balancers in a business unit cloud. You can allocate and amortize costs and can forecast
do not have an AWS EC2 instance spend across departments and teams. Data and analysis from
attached.This is just money on the CloudHealth can be extended further into the Wavefront
table that we can easily save using platform and correlated with application and infrastructure
Wavefront.” performance. Thus, DevOps teams can optimize cloud
service performance and set real-time, proactive alerts for
- Dieter Matzion, Staff Business
configuration changes and cost overruns.
Systems Analyst, Intuit
47
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
Intuit integrates Wavefront and CloudHealth Intuit also forwards aggregated CloudHealth cost
through the CloudHealth API, feeding metrics to Wavefront for correlated monitoring
Wavefront’s tag-enriched resource usage and analytics-driven alerting. Specifically, Intuit
data into CloudHealth’s cloud cost-savings uses the Wavefront powerful alerting engine to
recommendations, saving millions of cloud cost alert on cloud budget overruns.
dollars annually. Using rightsizing analysis, Intuit
was able to move over half of its workloads to run
smaller EC2 instances, saving them over $100,000
a year.
48
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
49
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
Over a period of time, we have seen our Common feedback from such customers was
customers modernize their services and adopt that it helped them to use an observability
emerging technologies. For instance, our solution (like Wavefront), which has support for
customer 8×8’s cloud service has evolved emerging technologies, including multi-cloud and
dramatically from being a monolithic piece of Kubernetes. With Wavefront, customers don’t
software. In line with industry trends and best have to worry about building observability support
practices, 8x8 moved to a services model, for emerging technologies. Traditional monitoring
and, ultimately, to microservices. We have and APM solutions, on the other hand, often lack
also seen many customers embark on the multi-cloud and emerging technologies support.
journey of containerization and adopt different
flavors of Kubernetes, including Amazon EKS
andOpenShift.
50
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
Before using Wavefront, many of our customers had so many tools, not many people knew how
had one common problem. They were using to use every tool. (“You just cannot expect them
many siloed monitoring solutions and didn’t have to learn every tool when you have over half a
a unified/first pane-of-glass view across different dozen monitoring tools,” they said.) Furthermore,
clouds, full-stack, and tools. This lack of the first none of the teams had a unified view across all
pane-of-glass view led to long MTTR and missed the tools. So, whenever an incident happened,
SLAs. they would have to look through several tools
to troubleshoot a problem. This led to increased
Let me walk you through an example of one MTTR and missed SLAs. Additionally, when an
of our customers, one of the biggest Indian incident happened, it resulted in several alerts
e-ticketing platforms. They mentioned that their from different tools leading to alert storms. Very
SRE and DevOps team was using over half a often, an important alert was missed in this array
dozen siloed monitoring solutions. Because they of false positives leading to increased chances of
missing SLAs.
51
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
Many enterprises adopt a multi-cloud strategy to integration. Intuit, one well-known Wavefront
reduce costs. But they are often surprised to see customer, uses CloudHealth as a caching engine
their cloud cost go up due to different cloud’s that retrieves and stores telemetry information
pricing models, varied types of instances, and from AWS. With the Wavefront CloudHealth
underutilized workloads. If you are using multi- integration, they feed this telemetry information
cloud, it is essential to select rightsized instances into Wavefront and use Wavefront to monitor and
and avoid budget overruns with proactive alert on budget overruns.
alerting.
52
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
Many enterprises move to the cloud to take with more horsepower, or you need to spin
advantage of the the ability to scale up at a up another Prometheus instance. Both of the
moment’s notice. Reliability is one of the reasons tasks need engineering team effort and cycles,
for adopting multi-cloud. But while adopting leading to teams getting slowed down with scale.
multi-cloud for all these benefits, enterprises Soon engineering teams find they’re spending
often forget that their open source monitoring more and more time on scaling, patching,
solutions lack support for scale and reliability. and maintain their open-source monitoring
For instance, consider this scale-limiting use- solutions and not having that time to focus on
case of the open-source monitoring solution innovating and scaling their business. Reliability
Prometheus. As a customer’s footprint grows is another big concern with open-source solutions.
in the cloud, so do their metrics volumes as In fact, one of the biggest reasons for Boxever
engineers add more and more necessary to switch from Prometheus to Wavefront was
telemetry. But when one Prometheus node gets that Prometheus lacked support for a real high
too busy with the increased number of metrics, availability configuration.
it either needs to be hosted on a bigger instance
53
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
As enterprises adopt a distributed microservices- metrics and analytics information with different
based architecture, the overall customer teams. Wavefront offers teams a centralized
experience depends on performance and observability solution that different teams can
availability of many different microservices. These easily share. With Wavefront, engineering
microservices are often owned by different teams can easily monitor their SLOs, taking
development teams. As mentioned by Edgar complete responsibility for the performance and
Nidome, Principal Architect at 8x8, it is critical reliability of their services. In short, make sure
to empower all development teams to take to give development teams the ability to have
ownership of their metrics and performance. lightweight instrumentation, unify objectives and
Every development team should be able to empower them to get to addressing their SLAs.
measure and share the quality of service that they
provide with other teams. But often, engineering
teams find it hard to standardize how to share
54
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
55
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
56
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
2 - Unified, Full-Stack
Observability that can
Significantly Reduce MTTR
57
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
3 - Optimized Service
Performance and Cloud
Cost with Wavefront and
CloudHealth
With Wavefront, you can easily monitor
cloud consumption and cost metrics and
convert them into data-driven insights
and alerts for reducing your cloud cost.
Furthermore, by using CloudHealth and
Wavefront together, you get effective
recommendations for cost savings of
your service that use CloudHealth’s cost
governance platform and Wavefront’s
vast real-time, highly granular, multi-cloud
(AWS, Azure, GCP) infrastructure usage
metrics.
58
Section I: Growth & Section II: Guide to Achieving Section II: How to Excel
Challenges of Multi-Cloud Multi-Cloud Observability Across at Multi-Cloud Observability
Observability AWS, Azure, and GCP Across AWS,Azure, and GCP
Author
Chhavi Nijhawan
Chhavi is a Product Line Marketing Manager at Wavefront by
VMware. Before Wavefront, she worked at New Relic, SnapLogic
and Cisco, where she led product marketing and technical
marketing. She has over 10 years of IT industry experience.
She is also an AWS certified solutions architect.
59