PassZhang

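Logstash Production Practice Handbook (with grok rule examples and ELKF use cases)

ELKF use cases:

1) datasource->logstash->elasticsearch->kibana

2) datasource->filebeat->logstash->elasticsearch->kibana

3) datasource->filebeat->logstash->redis/kafka->logstash->elasticsearch->kibana

4) kafka->logstash->elasticsearch->kibana

5) datasource->filebeat->kafka->logstash->elasticsearch->kibana (most common)

6) Filebeat with SSL-encrypted transport

7) datasource->logstash->redis/kafka->logstash->elasticsearch->kibana

8) mysql->logstash->elasticsearch->kibana

The list above summarizes the transport and processing scenarios covered below: starting from the data source, how the data is collected and with which tool, where it is sent, how it is processed and filtered, where it ends up, and how it is displayed.

Input, output, and filtering are all implemented by plugins (many types are available). For plugin guides, see the official documentation:

https://www.elastic.co/guide/en/logstash/current/index.html

[Installation and deployment are already well documented on the official site and in the community, so they are not repeated here.]

[Redis cluster installation was covered in an earlier document.]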

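Prerequisites

1) Java environment: JDK 8;

2) ELK is already set up;

3) Elasticsearch, Kibana, and Logstash should preferably be the same version; this environment uses 5.6.10;

4) Logstash is best run as root (so that it has enough permissions to read the log files it collects);

5) Elasticsearch must be installed and run as a regular user; recent versions refuse to run as root;

6) Filebeat is installed.

Start commands:

1) Logstash (replace <config>.conf with your pipeline file):

nohup ./bin/logstash -f <config>.conf --config.reload.automatic >/dev/null 2>/dev/null &

2) Filebeat: nohup ./filebeat -e -c filebeat.yml >/dev/null 2>/dev/null &

3) Elasticsearch: ./elasticsearch -d

4) Kibana: nohup ./bin/kibana &

Logstash start command: --config.reload.automatic reloads the configuration file automatically, so Logstash does not need to be restarted.

Filebeat start command: -e sends log output to stderr, and -c specifies the path of the configuration file.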

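Scenarios

Simple mode: Logstash as the log collector

Architecture: Logstash collects, processes, and forwards data to Elasticsearch for storage; Kibana displays it.

Characteristics: Logstash has to be deployed on every server, and it consumes a fair amount of CPU and memory. This layout therefore suits servers with plenty of computing resources; otherwise it can degrade server performance or even stop the server from working normally.

Demo1:

test1.conf:

Console input, passed through without any processing or conversion, and written to the console (or to Elasticsearch or a file; choose as needed):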

# Read from the console
input { stdin { } }

output {
    # Print to the console with the rubydebug codec
    stdout { codec => rubydebug }

    # Write to Elasticsearch
    elasticsearch {
        hosts => "node18:9200"
        codec => json
    }

    # Write to a file
    file {
        path => "/usr/local/logstash-5.6.10/data/log/logstash/all.log"   # output file path
        flush_interval => 0                  # flush interval; 0 writes each event immediately
        codec => json
    }
}

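Secure mode: Beats (Filebeat, Metricbeat, Packetbeat, Winlogbeat, etc.) as the log collector

Packetbeat (collects network traffic data);

Topbeat (collects system-, process-, and filesystem-level CPU and memory usage; replaced by Metricbeat since Beats 5.0);

Filebeat (collects file data), the most commonly used;

Winlogbeat (collects Windows event log data).

Architecture:

How it works: Beats send the collected data to Logstash; Logstash parses and filters it and sends it to Elasticsearch for storage, and Kibana presents it to the user.

Characteristics: this architecture solves the problem of Logstash consuming a lot of system resources on every server node. Compared with Logstash, the CPU and memory used by Beats are almost negligible. In addition, Beats and Logstash support SSL/TLS-encrypted transport with mutual client/server authentication, which secures the communication.

This architecture therefore suits scenarios with high data-security requirements where the servers are also performance sensitive.

Demo2:

filebeat.yml: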

################# Filebeat Configuration Example ########################

 

# This file is an example configuration file highlighting only the most common

# options. The filebeat.full.yml file from the same directory contains all the

# supported options with more comments. You can use it as a reference.

#

# You can find the full configuration reference here:

# https://www.elastic.co/guide/en/beats/filebeat/index.html

 

#===================== Filebeat prospectors =====================

 

filebeat.prospectors:

 

# Each - is a prospector. Most options can be set at the prospector level, so

# you can use different prospectors for various configurations.

# Below are the prospector specific configurations.

 

- input_type: log

 

  # Paths that should be crawled and fetched. Glob based paths.

  paths:

    - /home/admin/helloworld/logs/*.log

    #- c:\programdata\elasticsearch\logs\*

 

  # Exclude lines. A list of regular expressions to match. It drops the lines that are

  # matching any regular expression from the list.

  #exclude_lines: ["^DBG"]

 

  # Include lines. A list of regular expressions to match. It exports the lines that are

  # matching any regular expression from the list.

  #include_lines: ["^ERR", "^WARN"]

 

  # Exclude files. A list of regular expressions to match. Filebeat drops the files that

  # are matching any regular expression from the list. By default, no files are dropped.

  #exclude_files: [".gz$"]

 

  # Optional additional fields. These field can be freely picked

  # to add additional information to the crawled log files for filtering

  #fields:

  #  level: debug

  #  review: 1

 

  ### Multiline options

 

  # Multiline can be used for log messages spanning multiple lines. This is common

  # for Java Stack Traces or C-Line Continuation

 

  # The regexp Pattern that has to be matched. The example pattern matches all lines starting with [

  #multiline.pattern: ^\[

 

  # Defines if the pattern set under pattern should be negated or not. Default is false.

  #multiline.negate: false

 

  # Match can be set to "after" or "before". It is used to define if lines should be appended to a pattern

  # that was (not) matched before or after or as long as a pattern is not matched based on negate.

  # Note: After is the equivalent to previous and before is the equivalent to next in Logstash

  #multiline.match: after

 

 

#====================== General =============================

 

# The name of the shipper that publishes the network data. It can be used to group

# all the transactions sent by a single shipper in the web interface.

#name:

 

# The tags of the shipper are included in their own field with each

# transaction published.

#tags: ["service-X", "web-tier"]

 

# Optional fields that you can specify to add additional information to the

# output.

#fields:

#  env: staging

 

#======================= Outputs ===========================

 

# Configure what outputs to use when sending the data collected by the beat.

# Multiple outputs may be used.

 

#-------------------------- Elasticsearch output ------------------------------

#output.elasticsearch:

  # Array of hosts to connect to.

  # hosts: ["localhost:9200"]

 

  # Optional protocol and basic auth credentials.

  #protocol: "https"

  #username: "elastic"

  #password: "changeme"

 

#--------------------------- Logstash output --------------------------------

output.logstash:

  # The Logstash hosts

  hosts: ["192.168.80.34:5044"]

 

  # Optional SSL. By default is off.

  # List of root certificates for HTTPS server verifications

  #ssl.certificate_authorities: ["/etc/pki/root/ca.pem"]

 

  # Certificate for SSL client authentication

  #ssl.certificate: "/etc/pki/client/cert.pem"

 

  # Client Certificate Key

  #ssl.key: "/etc/pki/client/cert.key"

 

#=========================== Logging =======================

 

# Sets log level. The default log level is info.

# Available log levels are: critical, error, warning, info, debug

#logging.level: debug

 

# At debug level, you can selectively enable logging only for some components.

# To enable all selectors use ["*"]. Examples of other selectors are "beat",

# "publish", "service".

#logging.selectors: ["*"]

Logstash configuration file

test2.conf:

input {
    beats {
        port => 5044
        codec => "json"
    }
}

#filter {
#    ... (explained later)
#}

output {
    # Print to the console
    # stdout { }

    # Write to Redis
    redis {
        host => "192.168.80.32"   # Redis host address
        port => 6379              # Redis port
        password => "123456"      # Redis password
        #db => 8                  # Redis database number
        data_type => "channel"    # use publish/subscribe mode
        key => "logstash_list_0"  # name of the channel to publish to
    }

    # Write to Kafka
    kafka {
        bootstrap_servers => "192.168.80.42:9092"
        topic_id          => "test"
    }

    # Write to ES
    elasticsearch {
        hosts => "node18:9200"
        codec => json
    }
}

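Message mode: filebeat->logstash->kafka->logstash->es

Beats versions before 5.0 cannot write to a message queue (5.0 and later can), so in this mode both sides of the queue have to be Logstash instances. The first Logstash tier collects data from the various sources and forwards it, without any processing or conversion, to the message queue (Kafka, Redis, RabbitMQ, etc.). The second Logstash tier then reads from the queue, transforms, analyzes, and filters the data, and writes it to Elasticsearch, where Kibana visualizes it.

Architecture (the server that runs the Logstash parsing tier must be well provisioned in every respect):

Characteristics: this architecture suits very large log volumes. Because the Logstash parsing nodes and Elasticsearch carry a heavy load, both can be configured as clusters to share it. Introducing a message queue evens out network transfer, which reduces network congestion and, above all, the chance of losing data. Logstash still consumes a large share of system resources, however.

Workflow: Filebeat collects -> Logstash forwards to Kafka -> Logstash reads the data buffered in Kafka and analyzes it -> output to ES -> displayed in Kibana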
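The decoupling that the message queue provides can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `deque` stands in for a Kafka topic or Redis list, and `ship`, `index`, and the `level` field are hypothetical names, not part of Logstash.

```python
import json
from collections import deque


def ship(lines, broker):
    """Shipper tier: forward raw lines unchanged, with no parsing."""
    for line in lines:
        broker.append(json.dumps({"message": line}))


def index(broker):
    """Indexer tier: drain the broker, parse each event, return documents."""
    docs = []
    while broker:
        event = json.loads(broker.popleft())
        # Hypothetical "filter" step: derive a level field from the message.
        level = "ERROR" if "ERROR" in event["message"] else "INFO"
        docs.append({**event, "level": level})
    return docs


broker = deque()  # stands in for the Kafka topic / Redis list
ship(["INFO started", "ERROR disk full"], broker)
docs = index(broker)
print(docs)
```

The shipper never waits for parsing, and the indexer can fall behind without losing events as long as the broker retains them, which is the property the architecture relies on.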

Msg1.conf:

input {
    beats {
        port => 5044
        codec => "json"
    }
    syslog {
    }
}

#filter {
#
#}

output {
    # Print to the console
    # stdout { }

    # Write to Redis
    redis {
        host => "192.168.80.32"   # Redis host address
        port => 6379              # Redis port
        password => "123456"      # Redis password
        #db => 8                  # Redis database number
        data_type => "channel"    # use publish/subscribe mode
        key => "logstash_list_0"  # name of the channel to publish to
    }

    # Write to Kafka
    kafka {
        bootstrap_servers => "192.168.80.42:9092"
        topic_id          => "test"
    }
}

Msg2.conf:

input {
    kafka {
        bootstrap_servers => "192.168.80.42:9092"
        topics            => ["test"]
        group_id          => "consumer-test"   # consumer group
        #decorate_events  => true
        # "earliest" starts from the oldest available message (like "from beginning");
        # if unset, only messages produced after Logstash starts are consumed
        auto_offset_reset => "earliest"
    }
}

#filter {
#}

output {
    elasticsearch {
        hosts => "192.168.80.18:9200"
        codec => json
    }
}
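The effect of `auto_offset_reset` can be illustrated with a small model of how a consumer picks its starting position. This is a simplified sketch, not Kafka client code: `starting_offset` and its parameters are hypothetical names, and it assumes the Kafka consumer default of `latest` when the setting is omitted.

```python
def starting_offset(committed, log_start, log_end, auto_offset_reset="latest"):
    """Return the offset a consumer group would start reading from.

    committed: last committed offset for the group, or None if there is none.
    log_start / log_end: oldest retained offset and next offset to be written.
    """
    if committed is not None:
        # A committed offset always wins; auto_offset_reset is not consulted.
        return committed
    return log_start if auto_offset_reset == "earliest" else log_end


first_run = starting_offset(None, 0, 120, "earliest")  # read everything retained
default_run = starting_offset(None, 0, 120)            # only new messages
restart = starting_offset(57, 0, 120, "earliest")      # resume where it left off
print(first_run, default_run, restart)  # 0 120 57
```

In other words, `earliest` only matters the first time a given `group_id` reads the topic (or after its committed offsets have expired).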

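Message mode: filebeat->kafka->logstash->es

Filebeat 5.0 and later can write directly to Kafka, so Logstash is no longer needed to receive events and forward them to Kafka.

Filebeat sends what it collects straight into the Kafka message queue; Logstash then reads the data, processes and analyzes it, and writes it to ES, where Kibana displays it.

filebeat.yml: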

################# Filebeat Configuration Example #########################

 

# This file is an example configuration file highlighting only the most common

# options. The filebeat.full.yml file from the same directory contains all the

# supported options with more comments. You can use it as a reference.

#

# You can find the full configuration reference here:

# https://www.elastic.co/guide/en/beats/filebeat/index.html

 

#================== Filebeat prospectors===========================

 

filebeat.prospectors:

 

# Each - is a prospector. Most options can be set at the prospector level, so

# you can use different prospectors for various configurations.

# Below are the prospector specific configurations.

 

- input_type: log

 

  # Paths that should be crawled and fetched. Glob based paths.

  paths:

    - /home/admin/helloworld/logs/*.log

    #- c:\programdata\elasticsearch\logs\*

 

  # Exclude lines. A list of regular expressions to match. It drops the lines that are

  # matching any regular expression from the list.

  #exclude_lines: ["^DBG"]

 

  # Include lines. A list of regular expressions to match. It exports the lines that are

  # matching any regular expression from the list.

  #include_lines: ["^ERR", "^WARN"]

 

  # Exclude files. A list of regular expressions to match. Filebeat drops the files that

  # are matching any regular expression from the list. By default, no files are dropped.

  #exclude_files: [".gz$"]

 

  # Optional additional fields. These field can be freely picked

  # to add additional information to the crawled log files for filtering

  #fields:

  #  level: debug

  #  review: 1

 

  ### Multiline options

 

  # Multiline can be used for log messages spanning multiple lines. This is common

  # for Java Stack Traces or C-Line Continuation

 

  # The regexp Pattern that has to be matched. The example pattern matches all lines starting with [

  #multiline.pattern: ^\[

 

  # Defines if the pattern set under pattern should be negated or not. Default is false.

  #multiline.negate: false

 

  # Match can be set to "after" or "before". It is used to define if lines should be appended to a pattern

  # that was (not) matched before or after or as long as a pattern is not matched based on negate.

  # Note: After is the equivalent to previous and before is the equivalent to next in Logstash

  #multiline.match: after

 

#============================ General=========================

 

# The name of the shipper that publishes the network data. It can be used to group

# all the transactions sent by a single shipper in the web interface.

#name:

 

# The tags of the shipper are included in their own field with each

# transaction published.

#tags: ["service-X", "web-tier"]

 

# Optional fields that you can specify to add additional information to the

# output.

#fields:

#  env: staging

 

#======================== Outputs ============================

 

# Configure what outputs to use when sending the data collected by the beat.

# Multiple outputs may be used.

 

#-------------------------- Elasticsearch output ------------------------------

#output.elasticsearch:

  # Array of hosts to connect to.

  # hosts: ["localhost:9200"]

 

  # Optional protocol and basic auth credentials.

  #protocol: "https"

  #username: "elastic"

  #password: "changeme"

 

#----------------------------- Logstash output --------------------------------

#output.logstash:

  # The Logstash hosts

#  hosts: ["192.168.80.34:5044"]

 

#-----------------------------kafka  output-----------------------------------

#output.kafka:

#  enabled: true

#  hosts: ["192.168.80.42:9092", "192.168.80.43:9092", "192.168.80.44:9092"]

#  topic: 'test'

output.kafka:

  hosts: ["192.168.80.42:9092"]

  topic: test

  required_acks: 1

 

 

  # Optional SSL. By default is off.

  # List of root certificates for HTTPS server verifications

  #ssl.certificate_authorities: ["/etc/pki/root/ca.pem"]

 

  # Certificate for SSL client authentication

  #ssl.certificate: "/etc/pki/client/cert.pem"

 

  # Client Certificate Key

  #ssl.key: "/etc/pki/client/cert.key"

 

#======================== Logging ============================

 

# Sets log level. The default log level is info.

# Available log levels are: critical, error, warning, info, debug

#logging.level: debug

 

# At debug level, you can selectively enable logging only for some components.

# To enable all selectors use ["*"]. Examples of other selectors are "beat",

# "publish", "service".

#logging.selectors: ["*"]

logstash.conf:

input {
    kafka {
        bootstrap_servers => "192.168.80.42:9092"
        topics            => ["test"]
        group_id          => "consumer-test"
        #decorate_events  => true
        auto_offset_reset => "earliest"
    }
}

#filter {
#
#}

output {
    elasticsearch {
        hosts => "192.168.80.18:9200"
        codec => json
    }
}

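Filebeat SSL-encrypted transport

Filebeat SSL-encrypted transport improves security: only Filebeat and Logstash servers that have been configured with the keys and certificates can exchange log data.

Reference: https://blog.csdn.net/zsq12138/article/details/78753369

Reference: https://blog.csdn.net/Gamer_gyt/article/details/69280693?locationNum=5&fps=1

Logstash configuration file:

Notes:

ssl_certificate_authorities: location of the certificate received from the Filebeat side
ssl_certificate: location of the certificate generated on this side
ssl_key: location of the key generated on this side
ssl_verify_mode => "force_peer"

beat.conf: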

input {
    beats {
        port => 5044
        codec => "json"
        ssl => true
        ssl_certificate_authorities => ["/usr/local/logstash-5.6.10/pki/tls/certs/filebeat.crt"]
        ssl_certificate => "/usr/local/logstash-5.6.10/pki/tls/certs/logstash.crt"
        ssl_key => "/usr/local/logstash-5.6.10/pki/tls/private/logstash.key"
        ssl_verify_mode => "force_peer"   # must be used together with ssl_certificate_authorities
    }
    syslog {
    }
}

output {
    # Print to the console
    # stdout { }

    # Write to Redis
    redis {
        host => "192.168.80.32"   # Redis host address
        port => 6379              # Redis port
        password => "123456"      # Redis password
        #db => 8                  # Redis database number
        data_type => "channel"    # use publish/subscribe mode
        key => "logstash_list_0"  # name of the channel to publish to
    }

    # Write to Kafka
    kafka {
        bootstrap_servers => "192.168.80.42:9092"
        topic_id          => "test"
    }

    # Write to ES
    elasticsearch {
        hosts => "node18:9200"
        codec => json
    }
}

Filebeat configuration file:

filebeat.yml:

################ #Filebeat Configuration Example #####################

 

# This file is an example configuration file highlighting only the most common

# options. The filebeat.full.yml file from the same directory contains all the

# supported options with more comments. You can use it as a reference.

#

# You can find the full configuration reference here:

# https://www.elastic.co/guide/en/beats/filebeat/index.html

 

#=================== Filebeat prospectors ========================

 

filebeat.prospectors:

 

# Each - is a prospector. Most options can be set at the prospector level, so

# you can use different prospectors for various configurations.

# Below are the prospector specific configurations.

 

- input_type: log

 

  # Paths that should be crawled and fetched. Glob based paths.

  paths:

    - /home/admin/helloworld/logs/*.log

    #- c:\programdata\elasticsearch\logs\*

 

  # Exclude lines. A list of regular expressions to match. It drops the lines that are

  # matching any regular expression from the list.

  #exclude_lines: ["^DBG"]

 

  # Include lines. A list of regular expressions to match. It exports the lines that are

  # matching any regular expression from the list.

  #include_lines: ["^ERR", "^WARN"]

 

  # Exclude files. A list of regular expressions to match. Filebeat drops the files that

  # are matching any regular expression from the list. By default, no files are dropped.

  #exclude_files: [".gz$"]

 

  # Optional additional fields. These field can be freely picked

  # to add additional information to the crawled log files for filtering

  #fields:

  #  level: debug

  #  review: 1

 

  ### Multiline options

 

  # Multiline can be used for log messages spanning multiple lines. This is common

  # for Java Stack Traces or C-Line Continuation

 

  # The regexp Pattern that has to be matched. The example pattern matches all lines starting with [

  #multiline.pattern: ^\[

 

  # Defines if the pattern set under pattern should be negated or not. Default is false.

  #multiline.negate: false

 

  # Match can be set to "after" or "before". It is used to define if lines should be appended to a pattern

  # that was (not) matched before or after or as long as a pattern is not matched based on negate.

  # Note: After is the equivalent to previous and before is the equivalent to next in Logstash

  #multiline.match: after

 

#======================== General ============================

 

# The name of the shipper that publishes the network data. It can be used to group

# all the transactions sent by a single shipper in the web interface.

#name:

 

# The tags of the shipper are included in their own field with each

# transaction published.

#tags: ["service-X", "web-tier"]

 

# Optional fields that you can specify to add additional information to the

# output.

#fields:

#  env: staging

 

#========================= Outputs ===========================

 

# Configure what outputs to use when sending the data collected by the beat.

# Multiple outputs may be used.

 

#----------------------------- Elasticsearch output ------------------------------

#output.elasticsearch:

  # Array of hosts to connect to.

  # hosts: ["localhost:9200"]

 

  # Optional protocol and basic auth credentials.

  #protocol: "https"

  #username: "elastic"

  #password: "changeme"

 

#----------------------------- Logstash output --------------------------------

output.logstash:

# The Logstash hosts

  hosts: ["192.168.80.18:5044"]

# Encrypted transport

  ssl.certificate_authorities: ["/usr/local/filebeat-5.6.10/pki/tls/certs/logstash.crt"]

  ssl.certificate: "/usr/local/filebeat-5.6.10/pki/tls/certs/filebeat.crt"

  ssl.key: "/usr/local/filebeat-5.6.10/pki/tls/private/filebeat.key" 

 

#----------------------------- kafka  output-----------------------------------

#output.kafka:

#  hosts: ["192.168.80.42:9092"]

#  topic: test

#  required_acks: 1

 

  # Optional SSL. By default is off.

  # List of root certificates for HTTPS server verifications

  #ssl.certificate_authorities: ["/etc/pki/root/ca.pem"]

 

  # Certificate for SSL client authentication

  #ssl.certificate: "/etc/pki/client/cert.pem"

 

  # Client Certificate Key

  #ssl.key: "/etc/pki/client/cert.key"

 

#========================== Logging =========================

 

# Sets log level. The default log level is info.

# Available log levels are: critical, error, warning, info, debug

#logging.level: debug

 

# At debug level, you can selectively enable logging only for some components.

# To enable all selectors use ["*"]. Examples of other selectors are "beat",

# "publish", "service".

#logging.selectors: ["*"]
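The mutual authentication configured above (each side presents a certificate and verifies the other's) is ordinary two-way TLS. As a rough analogy only, the server side looks like this with Python's standard `ssl` module; `make_server_context` is a hypothetical helper, and the file arguments are left unset so the sketch runs without real certificates.

```python
import ssl


def make_server_context(cert=None, key=None, ca=None):
    """Build a TLS server context that insists on a client certificate."""
    ctx = ssl.SSLContext(ssl.PROTOCOL_TLS_SERVER)
    # Comparable in spirit to ssl_verify_mode => "force_peer":
    # the handshake fails if the client presents no valid certificate.
    ctx.verify_mode = ssl.CERT_REQUIRED
    if cert and key:
        # The server's own identity (cf. ssl_certificate / ssl_key).
        ctx.load_cert_chain(certfile=cert, keyfile=key)
    if ca:
        # Certificates trusted for verifying clients
        # (cf. ssl_certificate_authorities).
        ctx.load_verify_locations(cafile=ca)
    return ctx


ctx = make_server_context()
print(ctx.verify_mode)
```

This also shows why `force_peer` needs `ssl_certificate_authorities`: requiring a client certificate is pointless unless the server knows which certificates to trust.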

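Logstash (instead of Filebeat) collects files and writes to Kafka as a buffer; a second Logstash reads from Kafka, processes the data, and writes to a file or ES

Reading data (from the log file into Kafka):

kafkaput.conf:

input {
    file {
        path => [
            # List the files to monitor here
            "/home/admin/helloworld/logs/catalina.out"
        ]
    }
}

output {
    # Print to the console
    # stdout { }

    # Write to Kafka
    kafka {
        bootstrap_servers => "192.168.80.42:9092"
        topic_id          => "test"
    }
}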

Fetching data (from the queue):

indexer.conf:

input {
    # Read from Redis
    redis {
        host => "192.168.80.32"   # Redis host address
        port => 6379              # Redis port
        password => "123456"      # Redis password
        #db => 8                  # Redis database number
        data_type => "channel"    # use publish/subscribe mode
        key => "logstash_list_0"  # name of the channel to subscribe to
    }

    # Read from Kafka
    kafka {
        bootstrap_servers => "192.168.80.42:9092"
        topics            => ["test"]
        auto_offset_reset => "earliest"
    }
}

output {
    # Write to a file
    file {
        path => "/usr/local/logstash-5.6.10/data/log/logstash/all1.log"   # output file path
        #message_format => "%{host} %{message}"                          # output format
        flush_interval => 0                                               # flush interval; 0 writes each event immediately
        codec => json
    }

    # Write to ES
    elasticsearch {
        hosts => "node18:9200"
        codec => json
    }
}

Syncing MySQL data to ES with Logstash

mysql2es.conf:

input {
    stdin { }
    jdbc {
        jdbc_connection_string => "jdbc:mysql://192.168.80.18:3306/fyyq-mysql"
        jdbc_user => "fyyq"
        jdbc_password => "fyyq@2017"
        jdbc_driver_library => "/usr/local/logstash-5.6.10/mysql-connector-java-5.1.46.jar"
        jdbc_driver_class => "com.mysql.jdbc.Driver"
        jdbc_paging_enabled => "true"
        statement_filepath => "/usr/local/logstash-5.6.10/mysql2es.sql"
        #schedule => "* * * * *"
    }
}

output {
    stdout {
        codec => json_lines
    }
    elasticsearch {
        hosts => "node18:9200"
        #index => "mainIndex"
        #document_type => "user"
        #document_id => "%{id}"
    }
}

mysql2es.sql：

select * from sys_log;

Logstash output to HDFS files

input {
    beats {
        port => 5044
        #codec => "json"
        ssl => true
        ssl_certificate_authorities => ["/usr/local/logstash-5.6.10/pki/tls/certs/filebeat.crt"]
        ssl_certificate => "/usr/local/logstash-5.6.10/pki/tls/certs/logstash.crt"
        ssl_key => "/usr/local/logstash-5.6.10/pki/tls/private/logstash.key"
        ssl_verify_mode => "force_peer"
    }
}

filter {
    grok {
        match => { "message" => "%{IP:client} %{WORD:method} %{URIPATHPARAM:request} %{NUMBER:bytes} %{NUMBER:duration}" }
    }
}

output {
    # Print to the console
    # stdout { }

    # Write to Redis
    redis {
        host => "192.168.80.32"   # Redis host address
        port => 6379              # Redis port
        password => "123456"      # Redis password
        #db => 8                  # Redis database number
        data_type => "channel"    # use publish/subscribe mode
        key => "logstash_list_0"  # name of the channel to publish to
    }

    # Write to Kafka
    kafka {
        bootstrap_servers => "192.168.80.42:9092"
        topic_id          => "test"
    }

    # Write to ES
    elasticsearch {
        hosts => "node18:9200"
        codec => json
    }

    # Write to HDFS
    webhdfs {
        host => "192.168.80.42"
        port => 50070
        path => "/user/logstash/dt=%{+YYYY-MM-dd}/%{@source_host}-%{+HH}.log"
        user => "hadoop"
    }
}
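The webhdfs `path` above is a template: `%{+YYYY-MM-dd}` and `%{+HH}` are filled in from each event's timestamp, so events land in one directory per day and one file per host and hour. A minimal Python sketch of that expansion follows; `render_path` is a hypothetical helper, and it assumes the date parts are rendered in UTC from the event's `@timestamp`.

```python
from datetime import datetime, timezone


def render_path(timestamp, source_host):
    """Expand dt=%{+YYYY-MM-dd}/<host>-%{+HH}.log for a single event."""
    ts = timestamp.astimezone(timezone.utc)  # assumption: UTC, like @timestamp
    return f"/user/logstash/dt={ts:%Y-%m-%d}/{source_host}-{ts:%H}.log"


# Timestamp and host name borrowed from the sample events in this article.
event_time = datetime(2018, 7, 31, 2, 41, 0, tzinfo=timezone.utc)
path = render_path(event_time, "node183")
print(path)  # /user/logstash/dt=2018-07-31/node183-02.log
```

Partitioning by `dt=` in this way keeps each day's logs in a separate directory, which is convenient for tools that treat such directories as partitions.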

Logstash input plugins and parameter overview

Only the beats plugin is described in detail; the other plugins are given as links (all to the official documentation).

All input plugins support the following configuration options:

Setting	Input type	Required
`add_field`	hash	No (default {})
`codec`	codec	No (codec for the input data; default "plain")
`enable_metric`	boolean	No (default true)
`id`	string	No (generated automatically, but best set explicitly)
`tags`	array	No
`type`	string	No

codec options:

json: JSON codec

msgpack: msgpack codec

plain: plain-text codec

multiline: merges multi-line text into a single event, e.g. joining a Java exception stack trace into one message

Common input plugins:

1. beats-input: receives events from the Elastic Beats framework

Settings:

Setting	Input type	Required
`cipher_suites`	array	No
`client_inactivity_timeout`	number	No
`host`	string	No
`include_codec_tag`	boolean	No
`port`	number	Yes (required)
`ssl`	boolean	No
`ssl_certificate`	a valid filesystem path	No
`ssl_certificate_authorities`	array	No
`ssl_handshake_timeout`	number	No
`ssl_key`	a valid filesystem path	No
`ssl_key_passphrase`	password	No
`ssl_verify_mode`	string, one of `["none", "peer", "force_peer"]`	No
`tls_max_version`	number	No
`tls_min_version`	number	No

2. file-input: streams events from files (path is required)

https://www.elastic.co/guide/en/logstash/current/plugins-inputs-file.html

3. stdin-input: reads events from standard input

https://www.elastic.co/guide/en/logstash/current/plugins-inputs-stdin.html

4. syslog-input: reads syslog messages as events

https://www.elastic.co/guide/en/logstash/current/plugins-inputs-syslog.html

5. tcp-input: reads events over TCP (port is required)

https://www.elastic.co/guide/en/logstash/current/plugins-inputs-tcp.html

6. udp-input: reads events over UDP (port is required)

https://www.elastic.co/guide/en/logstash/current/plugins-inputs-udp.html

7. twitter-input: reads events from the Twitter Streaming API (a relatively common use case)

https://www.elastic.co/guide/en/logstash/current/plugins-inputs-twitter.html

(consumer_key, consumer_secret, oauth_token, and oauth_token_secret are required)

8. redis-input: reads events from a Redis instance

https://www.elastic.co/guide/en/logstash/current/plugins-inputs-redis.html

(data_type, one of ["list", "channel", "pattern_channel"], and key are required)

9. kafka-input: reads events from a Kafka topic

https://www.elastic.co/guide/en/logstash/current/plugins-inputs-kafka.html

(too many parameters to list; see the documentation)

10. jdbc-input: creates events from JDBC data

https://www.elastic.co/guide/en/logstash/current/plugins-inputs-jdbc.html

(jdbc_connection_string, jdbc_driver_class, and jdbc_user are required)

11. http-input: receives events over HTTP or HTTPS

https://www.elastic.co/guide/en/logstash/current/plugins-inputs-http.html

12. elasticsearch-input: reads query results from an Elasticsearch cluster

https://www.elastic.co/guide/en/logstash/current/plugins-inputs-elasticsearch.html

13. exec-input: captures the output of a shell command as an event (command is required)

https://www.elastic.co/guide/en/logstash/current/plugins-inputs-exec.html

Less common input plugins:

Browse the Logstash plugin reference and configure them as needed.

Full list: https://www.elastic.co/guide/en/logstash/current/input-plugins.html

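Logstash filter plugins (grok) and parameter overview

Settings supported by all filter plugins:

Setting	Input type	Required
`add_field`	hash	no
`add_tag`	array	no
`enable_metric`	boolean	no
`id`	string	no
`periodic_flush`	boolean	no
`remove_field`	array	no
`remove_tag`	array	no

Common filter plugins:

1. grok-filter: parses unstructured log data into structured, queryable content

https://www.elastic.co/guide/en/logstash/current/plugins-filters-grok.html#_grok_basics

The syntax of a grok pattern is %{SYNTAX:SEMANTIC}

SYNTAX is the name of the pattern that matches your text

SEMANTIC is the identifier you give to the matched text

grok extracts the individual values from a log line using either predefined regular expressions or ones you define yourself.

Regular expressions are easy to get wrong, so debug them first:

grok debugger: http://grokdebug.herokuapp.com/

grok ships with many predefined patterns. The pattern files are stored under:

/usr/local/logstash-5.6.10/vendor/bundle/jruby/1.9/gems/logstash-patterns-core-4.1.2/patterns

and so on; browse the directory for details.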

Example 1:

filter {
    grok {
        match => { "message" => "%{IP:client} %{WORD:method} %{URIPATHPARAM:request} %{NUMBER:bytes} %{NUMBER:duration}" }
    }
}

The input message is:

55.3.244.1 GET /index.html 15824 0.043

After grok parsing:

client: 55.3.244.1 (IP)

method: GET (method)

request: /index.html (requested path)

bytes: 15824 (number of bytes)

duration: 0.043 (request duration)
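To make the %{SYNTAX:SEMANTIC} mechanism concrete, the following Python sketch expands a grok expression into an ordinary regular expression with named groups. It is an approximation under stated assumptions: the entries in `PATTERNS` are simplified stand-ins, not the real definitions from logstash-patterns-core, and `grok_to_regex` is a hypothetical helper.

```python
import re

# Simplified stand-ins for the real grok patterns (assumptions for this sketch).
PATTERNS = {
    "IP": r"\d{1,3}(?:\.\d{1,3}){3}",
    "WORD": r"\w+",
    "URIPATHPARAM": r"/\S*",
    "NUMBER": r"-?\d+(?:\.\d+)?",
}


def grok_to_regex(expr):
    """Replace each %{SYNTAX:SEMANTIC} with a named capture group."""
    def expand(match):
        syntax, semantic = match.group(1), match.group(2)
        return f"(?P<{semantic}>{PATTERNS[syntax]})"
    return re.compile(re.sub(r"%\{(\w+):(\w+)\}", expand, expr))


GROK = ("%{IP:client} %{WORD:method} %{URIPATHPARAM:request} "
        "%{NUMBER:bytes} %{NUMBER:duration}")
fields = grok_to_regex(GROK).match("55.3.244.1 GET /index.html 15824 0.043").groupdict()
print(fields)
```

SYNTAX selects which regular expression to use, and SEMANTIC becomes the name of the field that receives the matched text. As in grok without a type conversion, every captured value is a string.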

Example 2:

filter {
    grok {
        match => { "message" => "%{COMBINEDAPACHELOG}" }
    }
}

For the definition of COMBINEDAPACHELOG, see:

https://github.com/logstash-plugins/logstash-patterns-core/blob/master/patterns/httpd

The input message is:

192.168.80.183 - - [04/Jan/2018:05:13:42 +0000] "GET /presentations/logstash-monitorama-2013/images/kibana-search.png HTTP/1.1" 200 203023 "http://semicomplete.com/presentations/logstash-monitorama-2013/" "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_9_1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/32.0.1700.77 Safari/537.36"

After grok parsing:

"clientip" => "192.168.80.183",

"timestamp" => "04/Jan/2018:05:13:42 +0000",

"verb" => "GET",

"request" => "/presentations/logstash-monitorama-2013/images/kibana-search.png",

"referrer" => "\"http://semicomplete.com/presentations/logstash-monitorama-2013/\"",

"response" => "200",

"bytes" => "203023",

"agent" => "\"Mozilla/5.0 (Macintosh; Intel Mac OS X 10_9_1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/32.0.1700.77 Safari/537.36\"",

Example 3 (custom grok expression: a named capture mypattern matching [A-Z]+):

filter {
    grok {
        match => {
            "message" => "%{IP:clientip}\s+(?<mypattern>[A-Z]+)"
        }
    }
}

The input message is:

12.12.12.12 ABC

After grok parsing:

"clientip" => "12.12.12.12",
"mypattern" => "ABC"

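The inline custom capture in the example above is a plain named group. grok uses the Oniguruma syntax `(?<name>...)`, while Python's `re` module spells the same thing `(?P<name>...)`. A minimal sketch, with a simplified IP expression standing in for the real %{IP} pattern (an assumption):

```python
import re

IP = r"\d{1,3}(?:\.\d{1,3}){3}"  # simplified stand-in for grok's %{IP}

# Equivalent of: %{IP:clientip}\s+(?<mypattern>[A-Z]+)
pattern = re.compile(rf"(?P<clientip>{IP})\s+(?P<mypattern>[A-Z]+)")

result = pattern.match("12.12.12.12 ABC").groupdict()
print(result)  # {'clientip': '12.12.12.12', 'mypattern': 'ABC'}
```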
Example 4 (removing redundant fields):

filter {
    grok {
        #match => { "message" => "%{COMBINEDAPACHELOG}" }
        match => { "message" => "%{IP:clientip}\s+%{IP:clientip1}" }
    }
    mutate {
        remove_field => ["message", "host"]
    }
}

The input message is:

1.1.1.1 2.2.2.2

After grok parsing (JSON format):

{

  "_index": "logstash-2018.07.31",

  "_type": "log",

  "_id": "AWTuNdzp6Wkp4mVEj3Fh",

  "_version": 1,

  "_score": null,

  "_source": {

    "@timestamp": "2018-07-31T02:41:00.014Z",

    "offset": 1114,

    "clientip": "1.1.1.1",

    "@version": "1",

    "input_type": "log",

    "beat": {

      "name": "node183",

      "hostname": "node183",

      "version": "5.6.10"

    },

    "source": "/home/usieip/bdp-datashare/logs/a.log",

    "type": "log",

    "clientip1": "2.2.2.2",

    "tags": [

      "beats_input_codec_plain_applied"

    ]

  },

  "fields": {

    "@timestamp": [

      1533004860014

    ]

  },

  "sort": [

    1533004860014

  ]

}

Example 5 (parsing entries from catalina.out; the message field has been removed):

filter {
    grok {
        match => { "message" => "%{DATA:ymd} %{DATA:sfm} %{DATA:http} %{DATA:info}  %{GREEDYDATA:index}" }
    }
}

[In the pattern files, DATA is defined as .*? and GREEDYDATA as .*]

The input message is:

2018-07-30 17:04:31.317 [http-bio-8080-exec-19] INFO c.u.i.b.m.s.i.LogInterceptor - ViewName: modules/datashare/front/index

After grok parsing (JSON format):

{

  "_index": "logstash-2018.07.31",

  "_type": "log",

  "_id": "AWTvhiPD6Wkp4mVEj3GU",

  "_version": 1,

  "_score": null,

  "_source": {

    "offset": 125,

    "input_type": "log",

    "index": "c.u.i.b.m.s.i.LogInterceptor - ViewName: modules/datashare/front/index",

    "source": "/home/usieip/bdp-datashare/logs/b.log",

    "type": "log",

    "tags": [],

    "ymd": "2018-07-30",

    "@timestamp": "2018-07-31T08:48:17.948Z",

    "@version": "1",

    "beat": {

      "name": "node183",

      "hostname": "node183",

      "version": "5.6.10"

    },

    "http": "[http-bio-8080-exec-19]",

    "sfm": "17:04:31.317",

    "info": "INFO"

  },

  "fields": {

    "ymd": [

      1532908800000

    ],

    "@timestamp": [

      1533026897948

    ]

  },

  "sort": [

    1533026897948

  ]

}
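The split seen in the catalina.out result above follows from the two definitions: DATA is `.*?` (lazy, stops at the first position where the rest of the pattern can match) and GREEDYDATA is `.*` (takes everything that is left). The Python sketch below reproduces the same fields. It assumes single spaces between the parts, as in the sample line shown in this article.

```python
import re

DATA = r".*?"        # lazy: as little as possible
GREEDYDATA = r".*"   # greedy: as much as possible

pattern = re.compile(
    rf"^(?P<ymd>{DATA}) (?P<sfm>{DATA}) (?P<http>{DATA}) "
    rf"(?P<info>{DATA}) (?P<index>{GREEDYDATA})$"
)

line = ("2018-07-30 17:04:31.317 [http-bio-8080-exec-19] INFO "
        "c.u.i.b.m.s.i.LogInterceptor - ViewName: modules/datashare/front/index")
parsed = pattern.match(line).groupdict()
print(parsed)
```

Each lazy DATA group stops at the next space, so the first four fields are single tokens, and the trailing GREEDYDATA swallows the rest of the line, spaces included.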

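Common parameters:

1) match: matches a field against a pattern.

2) patterns_dir: the directories that hold custom pattern files. It is not needed when you use only the patterns bundled with Logstash. Several directories can be given at once:

patterns_dir => ["/opt/logstash/patterns","/opt/logstash/extra_patterns"]

3) remove_field: when the filter matches, removes the listed fields from the event (separate multiple fields with commas):

remove_field => ["foo_%{somefield}"]

Other common filters:

clone-filter: duplicates events
drop-filter: drops all events
json-filter: parses JSON events
kv-filter: parses key-value pairs

Less common parameters:

Reference: https://www.elastic.co/guide/en/logstash/current/filter-plugins.html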

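Logstash output plugins and parameter overview

All output plugins support the following configuration options:

Setting	Input type	Required
`codec`	codec	No (default plain)
`enable_metric`	boolean	No (default true)
`id`	string	No

Common plugins:

1. elasticsearch-output: the recommended way to store logs in Elasticsearch. If you plan to use the Kibana web interface, you need this output.

2. file-output: writes events to files on disk (path is required)

3. kafka-output: writes events to a Kafka topic (topic_id is required)

4. redis-output: sends events to a Redis queue using RPUSH

5. stdout-output: a simple output that prints to the STDOUT of the shell running Logstash

Less common plugins:

Official reference: https://www.elastic.co/guide/en/logstash/current/output-plugins.html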

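Logstash vs. Flume: a brief comparison

1) Architecture:

Logstash: Shipper, Broker, Indexer (the broker is Redis or Kafka, used as a buffer)

Flume: Source, Channel, Sink

In Logstash the three roles are integrated and the broker is optional: a single instance can read, process, and output directly, with no buffering in between.

In Flume each component must be configured separately, and none of the three can be left out.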
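
The Shipper, Broker, Indexer layout on the Logstash side can be sketched as two small pipelines with Redis in between. The file names, log path, host names, and key are hypothetical; Kafka could take the place of Redis as the broker.

```
# shipper.conf: runs next to the application, only collects and forwards
input {
  file { path => "/var/log/app/*.log" }
}
output {
  redis { host => ["redis1"] data_type => "list" key => "logstash" }
}

# indexer.conf: runs centrally, reads from the broker, parses, and stores
input {
  redis { host => "redis1" data_type => "list" key => "logstash" }
}
filter {
  # grok, mutate, and other filters go here
}
output {
  elasticsearch { hosts => ["node18:9200"] }
}
```

Without a broker, the same work collapses into one pipeline: the file input, the filters, and the elasticsearch output all live in a single config file.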

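2) Configuration:

Logstash: the configuration is concise and clear. The settings for all three sections are predefined and you pick what you need; if something is missing, you can write your own plugin. The filter stage is particularly complete. Grok, for example, can parse and structure arbitrary text with regular expressions, and it is currently the best way in Logstash to turn unstructured log data into something structured and queryable. Logstash can also rename, remove, replace, and modify event fields, or drop events entirely (debug events, for instance), and many more advanced features are available.

Flume: the configuration is more laborious. Source, channel, and sink each have to be configured by hand, and a complex collection environment needs several of each. Flume has many plugins, but in practice only two channel types are common: memory and file.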
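
The field operations mentioned for Logstash (rename, remove, replace, modify) all map onto the mutate filter. A minimal sketch, using field names from the parsed event shown earlier; the new names and values (level, app_log) are hypothetical.

```
filter {
  mutate {
    # rename: "info" becomes "level"
    rename       => { "info" => "level" }
    # replace: overwrite the value of "type"
    replace      => { "type" => "app_log" }
    # modify: strip the square brackets from the thread name in "http"
    gsub         => ["http", "[\[\]]", ""]
    # remove: delete fields that are not needed downstream
    remove_field => ["offset", "input_type"]
  }
}
```

Within a single mutate block the operations run in a fixed internal order, not in the order written, so dependent steps are safer in separate mutate blocks.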

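3) Design goal:

Flume focuses on data transport, and the user needs a clear picture of how data is routed end to end. It is comparatively more reliable: the channel exists for persistence, and an event is deleted only after its delivery to the next hop has been confirmed.

Logstash focuses on data preprocessing: log fields are preprocessed first and parsed afterwards.

4) Ecosystem:

Logstash works together with the other ELK components, is simple to use and to develop against, and covers a wide range of scenarios.

Recent Flume versions are lightweight and suit users with some programming background. Flume targets narrower scenarios and has to be combined with many other tools, which makes it less convenient.

5) Analogy:

Logstash: a desktop PC with the motherboard, power supply, hard disk, and case already assembled; it works out of the box.

Flume: a complete set of motherboard, power supply, hard disk, and case that you have to assemble yourself before it can be used.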
