以查询 Metrics
信息案例来分析 Skywalking
查询协议
基本概述
Skywalking
查询协议默认基于 GraphQL
,如果有需要也可以自定义扩展,提供一个实现了 org.apache.skywalking.oap.server.core.query.QueryModule
的查询模块即可。
截取 Skywalking UI
发送的请求
- 请求路径
POST http://127.0.0.1:8080/graphql
- 请求体
{
"query": "query queryData($condition: MetricsCondition!, $duration: Duration!) {\n readMetricsValues: readMetricsValues(condition: $condition, duration: $duration) {\n label\n values {\n values {value}\n }\n }}",
"variables": {
"duration": {
"start": "2021-07-03 1320",
"end": "2021-07-03 1321",
"step": "MINUTE"
},
"condition": {
"name": "instance_jvm_thread_runnable_thread_count",
"entity": {
"scope": "ServiceInstance",
"serviceName": "business-zone::projectA",
"serviceInstanceName": "[email protected]",
"normal": true
}
}
}
}
- 响应
{
"data": {
"readMetricsValues": {
"values": {
"values": [
{
"value": 22
},
{
"value": 22
}
]
}
}
}
}
在 Skywalking
源码中找到对应 GraphQL
定义
打开 oap-server/server-query-plugin/query-graphql-plugin/src/main/resources/query-protocol
目录,使用请求体中的模板关键字 readMetricsValues
搜索
在 oap-server/server-query-plugin/query-graphql-plugin/src/main/resources/query-protocol/metrics-v2.graphqls
中找到对应的定义
extend type Query {
# etc...
# Read time-series values in the duration of required metrics
readMetricsValues(condition: MetricsCondition!, duration: Duration!): MetricsValues!
# etc...
}
输入参数定义
input MetricsCondition {
# Metrics name, which should be defined in OAL script
# Such as:
# Endpoint_avg = from(Endpoint.latency).avg()
# Then, `Endpoint_avg`
name: String!
# Follow entity definition description.
entity: Entity!
}
input Entity {
# 1. scope=All, no name is required.
# 2. scope=Service, ServiceInstance and Endpoint, set neccessary serviceName/serviceInstanceName/endpointName
# 3. Scope=ServiceRelation, ServiceInstanceRelation and EndpointRelation
# serviceName/serviceInstanceName/endpointName is/are the source(s)
# destServiceName/destServiceInstanceName/destEndpointName is/are destination(s)
# set necessary names of sources and destinations.
scope: Scope!
serviceName: String
# Normal service is the service having installed agent or metrics reported directly.
# Unnormal service is conjectural service, usually detected by the agent.
normal: Boolean
serviceInstanceName: String
endpointName: String
destServiceName: String
# Normal service is the service having installed agent or metrics reported directly.
# Unnormal service is conjectural service, usually detected by the agent.
destNormal: Boolean
destServiceInstanceName: String
destEndpointName: String
}
# The Duration defines the start and end time for each query operation.
# Fields: `start` and `end`
# represents the time span. And each of them matches the step.
# ref https://www.ietf.org/rfc/rfc3339.txt
# The time formats are
# `SECOND` step: yyyy-MM-dd HHmmss
# `MINUTE` step: yyyy-MM-dd HHmm
# `HOUR` step: yyyy-MM-dd HH
# `DAY` step: yyyy-MM-dd
# `MONTH` step: yyyy-MM
# Field: `step`
# represents the accurate time point.
# e.g.
# if step==HOUR , start=2017-11-08 09, end=2017-11-08 19
# then
# metrics from the following time points expected
# 2017-11-08 9:00 -> 2017-11-08 19:00
# there are 11 time points (hours) in the time span.
input Duration {
start: String!
end: String!
step: Step!
}
enum Step {
DAY
HOUR
MINUTE
SECOND
}
返回结果定义
type MetricsValues {
# Could be null if no label assigned in the query condition
label: String
# Values of this label value.
values: IntValues
}
type IntValues {
values: [KVInt!]!
}
type KVInt {
id: ID!
# This is the value, the caller must understand the Unit.
# Such as:
# 1. If ask for cpm metric, the unit and result should be count.
# 2. If ask for response time (p99 or avg), the unit should be millisecond.
value: Long!
}
使用 GraphQL
IDEA
插件验证 Skywalking UI
的请求
使用“ GraphQL
在 Skywalking
中的应用”一节中的方式,模仿“截取 Skywalking UI 发送的请求”一节中前端发送的请求
- 请求模板
query queryData($condition: MetricsCondition!, $duration: Duration!) {
readMetricsValues: readMetricsValues(duration: $duration, condition: $condition) {
label values { values { id value }}
}
}
- 请求参数
{
"duration": {
"start": "2021-07-03 1400",
"end": "2021-07-03 1401",
"step": "MINUTE"
},
"condition": {
"name": "instance_jvm_thread_runnable_thread_count",
"entity": {
"scope": "ServiceInstance",
"serviceName": "business-zone::projectA",
"serviceInstanceName": "[email protected]",
"normal": true
}
}
}
- 响应结果
{
"data": {
"readMetricsValues": {
"values": {
"values": [
{
"id": "202107031400_YnVzaW5lc3Mtem9uZTo6cHJvamVjdEE=.1_ZThjZjM0YTFkNTRhNDA1OGE4Yzk4NTA1ODc3NzcwZTJAMTkyLjE2OC41MC4xMTM=",
"value": 22
},
{
"id": "202107031401_YnVzaW5lc3Mtem9uZTo6cHJvamVjdEE=.1_ZThjZjM0YTFkNTRhNDA1OGE4Yzk4NTA1ODc3NzcwZTJAMTkyLjE2OC41MC4xMTM=",
"value": 22
}
]
}
}
}
}
PS:如果不使用模板的方式,写查询语句是会有代码提示的
query queryData {
readMetricsValues(
duration: {start: "2021-07-03 1400",end: "2021-07-03 1401", step: MINUTE},
condition: {
name: "instance_jvm_thread_runnable_thread_count",
entity: {
scope: ServiceInstance,
serviceName: "business-zone::projectA",
serviceInstanceName: "[email protected]",
normal: true
}
}
) {
label values{ values{ id value }}
}
}
如何将 GraphQL Schema
文件加载到程序中
搜索 metrics-v2.graphqls
,在 oap-server/server-query-plugin/query-graphql-plugin/src/main/java/org/apache/skywalking/oap/query/graphql/GraphQLQueryProvider.java
找到加载代码
// 初始化GraphQL引擎
@Override
public void prepare() throws ServiceNotProvidedException, ModuleStartException {
GraphQLSchema schema = SchemaParser.newParser()
// etc...
.file("query-protocol/metrics-v2.graphqls")
.resolvers(new MetricsQuery(getManager())) // MetricsQuery 是 com.coxautodev.graphql.tools.GraphQLQueryResolver 接口实现类
// etc...
.build()
.makeExecutableSchema();
this.graphQL = GraphQL.newGraphQL(schema).build();
}
在 org.apache.skywalking.oap.query.graphql.resolver.MetricsQuery
类中,找到 readMetricsValues
方法
/**
* Read time-series values in the duration of required metrics
*/
public MetricsValues readMetricsValues(MetricsCondition condition, Duration duration) throws IOException {
if (MetricsType.UNKNOWN.equals(typeOfMetrics(condition.getName())) || !condition.getEntity().isValid()) {
final List pointOfTimes = duration.assembleDurationPoints();
MetricsValues values = new MetricsValues();
pointOfTimes.forEach(pointOfTime -> {
String id = pointOfTime.id(
condition.getEntity().isValid() ? condition.getEntity().buildId() : "ILLEGAL_ENTITY"
);
final KVInt kvInt = new KVInt();
kvInt.setId(id);
kvInt.setValue(0);
values.getValues().addKVInt(kvInt);
});
return values;
}
return getMetricsQueryService().readMetricsValues(condition, duration);
}
private MetricsQueryService getMetricsQueryService() {
if (metricsQueryService == null) {
this.metricsQueryService = moduleManager.find(CoreModule.NAME)
.provider()
.getService(MetricsQueryService.class);
}
return metricsQueryService;
}
org.apache.skywalking.oap.server.core.query.MetricsQueryService#readMetricsValues
/**
* Read time-series values in the duration of required metrics
*/
public MetricsValues readMetricsValues(MetricsCondition condition, Duration duration) throws IOException {
return getMetricQueryDAO().readMetricsValues(
condition, ValueColumnMetadata.INSTANCE.getValueCName(condition.getName()), duration);
}
private IMetricsQueryDAO getMetricQueryDAO() {
if (metricQueryDAO == null) {
metricQueryDAO = moduleManager.find(StorageModule.NAME).provider().getService(IMetricsQueryDAO.class);
}
return metricQueryDAO;
}
查看Extend storage文档, IMetricsQueryDAO
为指标查询数据访问对象
# Implement all DAOs
# Here is the list of all DAO interfaces in storage
IServiceInventoryCacheDAO
IServiceInstanceInventoryCacheDAO
IEndpointInventoryCacheDAO
INetworkAddressInventoryCacheDAO
IBatchDAO
StorageDAO
IRegisterLockDAO
ITopologyQueryDAO
IMetricsQueryDAO
ITraceQueryDAO
IMetadataQueryDAO
IAggregationQueryDAO
IAlarmQueryDAO
IHistoryDeleteDAO
IMetricsDAO
IRecordDAO
IRegisterDAO
ILogQueryDAO
ITopNRecordsQueryDAO
IBrowserLogQueryDAO
通过类图,可以看出 IMetricsQueryDAO
实现类有 ES
、 ES7
、 InfluxDB
、 SQL
四种
如何将 GraphQL
引擎注册到 Jetty
服务
// 注册GraphQL查询处理器至Jetty服务
@Override
public void start() throws ServiceNotProvidedException, ModuleStartException {
JettyHandlerRegister service = getManager().find(CoreModule.NAME)
.provider()
.getService(JettyHandlerRegister.class);
service.addHandler(new GraphQLQueryHandler(config.getPath(), graphQL));
}
通过分析 GraphQLQueryProvider
该类,发现就是 QueryModule
(查询模块)的 Provider
(提供)类
由此,也验证了在“基本概述”一节的说法:
Skywalking
查询协议默认基于GraphQL
,如果有需要也可以自定义扩展,提供一个实现了org.apache.skywalking.oap.server.core.query.QueryModule
的查询模块即可。
@Override
public String name() {
return "graphql";
}
@Override
public Class extends ModuleDefine> module() {
return QueryModule.class;
}
package org.apache.skywalking.oap.query.graphql;
import com.google.gson.Gson;
import com.google.gson.JsonArray;
import com.google.gson.JsonElement;
import com.google.gson.JsonObject;
import com.google.gson.reflect.TypeToken;
import graphql.ExecutionInput;
import graphql.ExecutionResult;
import graphql.GraphQL;
import graphql.GraphQLError;
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.lang.reflect.Type;
import java.util.List;
import java.util.Map;
import javax.servlet.http.HttpServletRequest;
import lombok.RequiredArgsConstructor;
import org.apache.skywalking.oap.server.library.server.jetty.JettyJsonHandler;
import org.apache.skywalking.oap.server.library.util.CollectionUtils;
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;
@RequiredArgsConstructor
public class GraphQLQueryHandler extends JettyJsonHandler {
private static final Logger LOGGER = LoggerFactory.getLogger(GraphQLQueryHandler.class);
private static final String QUERY = "query";
private static final String VARIABLES = "variables";
private static final String DATA = "data";
private static final String ERRORS = "errors";
private static final String MESSAGE = "message";
private final Gson gson = new Gson();
private final Type mapOfStringObjectType = new TypeToken
Webapp
网关转发 GraphQL
请求至 OAP
v8.6.0
及之前,网关都是 zuul
, v8.7.0
及之后替换成了 Spring Cloud Gateway
。因为这块不是这篇文章的重点,这里不再赘述
总结
Skywalking
的查询协议默认使用通用性很强的 GraphQL
实现,客户端可以通过 GraphQL
协议很方便的选取自己需要的数据。
对应 Skywalking
这种模式相对固定、变更不频繁的查询需求来说,还是挺适合的。
参考文档
- Extend storage
分享并记录所学所见