This is the first of a few Flink articles that I will update as time allows. It collects some of the understanding and lessons learned from my own Flink development work; I have not dug extremely deep, so comments and corrections are very welcome. The series is based on Flink 1.12.
Introduction to Flink
History
Official overview
Component stack
Use cases
All kinds of streaming computation
Flink installation and deployment
Local mode (for reference only)
How it works
Steps
1. Download the installation package
https://archive.apache.org/dist/flink/
2. Upload flink-1.12.0-bin-scala_2.12.tgz to the target directory on node1
3. Extract it
tar -zxvf flink-1.12.0-bin-scala_2.12.tgz
4. If you run into permission problems, fix the ownership
chown -R root:root /export/server/flink-1.12.0
5. Rename the directory or create a symlink
mv flink-1.12.0 flink
ln -s /export/server/flink-1.12.0 /export/server/flink
Test
1. Prepare the file /root/words.txt
vim /root/words.txt
hello me you her
hello me you
hello me
hello
2. Start the local Flink "cluster"
/export/server/flink/bin/start-cluster.sh
3. jps should show the following two processes
- TaskManagerRunner
- StandaloneSessionClusterEntrypoint
4. Open the Flink Web UI
http://node1:8081/#/overview
In Flink, a slot can be thought of as a resource group: Flink runs a program in parallel by splitting it into subtasks and assigning those subtasks to slots. A short sketch of how parallelism relates to slots follows at the end of this local-mode section.
5. Run the official example
/export/server/flink/bin/flink run /export/server/flink/examples/batch/WordCount.jar --input /root/words.txt --output /root/out
6. Stop Flink
/export/server/flink/bin/stop-cluster.sh
Start the interactive Scala shell (note: at the moment none of the Scala 2.12 builds ship a working Scala Shell)
/export/server/flink/bin/start-scala-shell.sh local
Run the following command
benv.readTextFile("/root/words.txt").flatMap(_.split(" ")).map((_,1)).groupBy(0).sum(1).print()
Exit the shell
:quit
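Here is the slot/parallelism sketch mentioned above: a rough, hypothetical illustration in code (the class name and numbers are made up, not taken from this setup). With one TaskManager offering 2 slots, a job whose parallelism is at most 2 can be scheduled, while a higher parallelism would fail for lack of slots.
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;

public class SlotSketch {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        // hypothetical: 1 TaskManager x 2 slots = 2 slots in total, so a job-wide parallelism of 2 fits,
        // while parallelism 3 could not be scheduled on such a cluster
        env.setParallelism(2);
        env.fromElements("hello me you her", "hello me you").print();
        env.execute("slot sketch");
    }
}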
Standalone cluster mode (for reference only)
How it works
Steps
1. Cluster plan:
- Server node1 (master + worker): JobManager + TaskManager
- Server node2 (worker): TaskManager
- Server node3 (worker): TaskManager
2. Edit flink-conf.yaml
vim /export/server/flink/conf/flink-conf.yaml
jobmanager.rpc.address: node1
taskmanager.numberOfTaskSlots: 2
web.submit.enable: true
# history server
jobmanager.archive.fs.dir: hdfs://node1:8020/flink/completed-jobs/
historyserver.web.address: node1
historyserver.web.port: 8082
historyserver.archive.fs.dir: hdfs://node1:8020/flink/completed-jobs/
3. Edit masters
vim /export/server/flink/conf/masters
node1:8081
4. Edit workers
vim /export/server/flink/conf/workers
node1
node2
node3
5. Add the HADOOP_CONF_DIR environment variable
vim /etc/profile
export HADOOP_CONF_DIR=/export/server/hadoop/etc/hadoop
6. Distribute to the other nodes
scp -r /export/server/flink node2:/export/server/flink
scp -r /export/server/flink node3:/export/server/flink
scp /etc/profile node2:/etc/profile
scp /etc/profile node3:/etc/profile
or
for i in {2..3}; do scp -r flink node$i:$PWD; done
7. Reload the profile
source /etc/profile
Test
1. Start the cluster; run the following on node1
/export/server/flink/bin/start-cluster.sh
Or start the components individually
/export/server/flink/bin/jobmanager.sh ((start|start-foreground) cluster)|stop|stop-all
/export/server/flink/bin/taskmanager.sh start|start-foreground|stop|stop-all
2. Start the history server
/export/server/flink/bin/historyserver.sh start
3. Check with jps or open the Flink web UIs
http://node1:8081/#/overview
http://node1:8082/#/overview
4. Run the official example
/export/server/flink/bin/flink run /export/server/flink/examples/batch/WordCount.jar
5. Stop the Flink cluster
/export/server/flink/bin/stop-cluster.sh
Standalone HA (high-availability) cluster mode (for reference only)
How it works
Steps
1. Cluster plan
- Server node1 (master + worker): JobManager + TaskManager
- Server node2 (master + worker): JobManager + TaskManager
- Server node3 (worker): TaskManager
2. Start ZooKeeper
zkServer.sh status
zkServer.sh stop
zkServer.sh start
3. Start HDFS
/export/server/hadoop/sbin/start-dfs.sh
4. Stop the Flink cluster
/export/server/flink/bin/stop-cluster.sh
5. Edit flink-conf.yaml
vim /export/server/flink/conf/flink-conf.yaml
Add the following
state.backend: filesystem
state.backend.fs.checkpointdir: hdfs://node1:8020/flink-checkpoints
high-availability: zookeeper
high-availability.storageDir: hdfs://node1:8020/flink/ha/
high-availability.zookeeper.quorum: node1:2181,node2:2181,node3:2181
6. Edit masters so that both masters are listed
vim /export/server/flink/conf/masters
node1:8081
node2:8081
7. Sync the changes to the other nodes
scp -r /export/server/flink/conf/flink-conf.yaml node2:/export/server/flink/conf/
scp -r /export/server/flink/conf/flink-conf.yaml node3:/export/server/flink/conf/
scp -r /export/server/flink/conf/masters node2:/export/server/flink/conf/
scp -r /export/server/flink/conf/masters node3:/export/server/flink/conf/
8. On node2, edit flink-conf.yaml
vim /export/server/flink/conf/flink-conf.yaml
jobmanager.rpc.address: node2
9. Restart the Flink cluster; on node1 run
/export/server/flink/bin/stop-cluster.sh
/export/server/flink/bin/start-cluster.sh
10. Check with jps
You will find that no Flink processes have started
11. Check the log
cat /export/server/flink/log/flink-root-standalonesession-0-node1.log
It shows an error, because since Flink 1.8 the official binary distribution no longer bundles the jar that integrates with HDFS
12. Download that jar, put it into Flink's lib directory and distribute it, so that Flink can work with Hadoop
Download from
https://flink.apache.org/downloads.html
13. Put it into the lib directory
cd /export/server/flink/lib
14. Distribute it
for i in {2..3}; do scp -r flink-shaded-hadoop-2-uber-2.7.5-10.0.jar node$i:$PWD; done
15. Restart the Flink cluster; on node1 run
/export/server/flink/bin/stop-cluster.sh
/export/server/flink/bin/start-cluster.sh
16. Check with jps; all three machines should now show the expected processes
Test
1. Open the web UIs
http://node1:8081/#/job-manager/config
http://node2:8081/#/job-manager/config
2. Run WordCount
/export/server/flink/bin/flink run /export/server/flink/examples/batch/WordCount.jar
3. Kill one of the masters
4. Run WordCount again; it still completes normally
/export/server/flink/bin/flink run /export/server/flink/examples/batch/WordCount.jar
5. Stop the cluster
/export/server/flink/bin/stop-cluster.sh
Flink on YARN - the mode used for real development
How it works
Two modes
Session mode
Per-job mode
Steps
1. Disable YARN's memory checks
vim /export/server/hadoop/etc/hadoop/yarn-site.xml
<property>
  <name>yarn.nodemanager.pmem-check-enabled</name>
  <value>false</value>
</property>
<property>
  <name>yarn.nodemanager.vmem-check-enabled</name>
  <value>false</value>
</property>
2. Distribute it
scp -r /export/server/hadoop/etc/hadoop/yarn-site.xml node2:/export/server/hadoop/etc/hadoop/yarn-site.xml
scp -r /export/server/hadoop/etc/hadoop/yarn-site.xml node3:/export/server/hadoop/etc/hadoop/yarn-site.xml
3. Restart YARN
/export/server/hadoop/sbin/stop-yarn.sh
/export/server/hadoop/sbin/start-yarn.sh
Test
Session mode
Start one Flink cluster on YARN and keep reusing it: all jobs submitted afterwards run on that cluster, and its resources stay occupied until the cluster is shut down manually. Suited to large numbers of small jobs.
1. Start a Flink cluster/session on YARN; run the following on node1
/export/server/flink/bin/yarn-session.sh -n 2 -tm 800 -s 1 -d
Explanation:
This requests 2 CPUs and 1600 MB of memory in total (2 TaskManagers x 800 MB).
# -n  number of containers to request, i.e. the number of TaskManagers (deprecated in newer Flink versions, which allocate TaskManagers on demand)
# -tm memory per TaskManager
# -s  number of slots per TaskManager
# -d  run as a detached (background) process
Note:
The following warning can be ignored
WARN org.apache.hadoop.hdfs.DFSClient - Caught exception
java.lang.InterruptedException
2. Open the YARN UI
http://node1:8088/cluster
3. Submit a job with flink run:
/export/server/flink/bin/flink run /export/server/flink/examples/batch/WordCount.jar
After it finishes you can keep submitting other small jobs
/export/server/flink/bin/flink run /export/server/flink/examples/batch/WordCount.jar
4. The ApplicationMaster link above takes you to the Flink web UI
5. Shut down the yarn-session:
yarn application -kill application_1609508087977_0005
Per-job mode - used more often
Start a dedicated Flink cluster on YARN for each job; when the job finishes, the cluster shuts down automatically and its resources are released. Suited to large jobs.
1. Submit the job directly
/export/server/flink/bin/flink run -m yarn-cluster -yjm 1024 -ytm 1024 /export/server/flink/examples/batch/WordCount.jar
# -m        address of the jobmanager (here: yarn-cluster)
# -yjm 1024 memory for the JobManager
# -ytm 1024 memory for each TaskManager
2. Open the YARN UI
http://node1:8088/cluster
Command-line options
/export/server/flink/bin/flink --help
./flink <ACTION> [OPTIONS] [ARGUMENTS]
The following actions are available:
Action "run" compiles and runs a program.
Syntax: run [OPTIONS]
"run" action options:
-c,--class
Class with the program entry point ("main()" method). Only needed if the
JAR file does not specify the class in
its manifest.
-C,--classpath
Adds a URL to each user code classloader on all nodes in the
cluster. The paths must specify a
protocol (e.g. file://) and be
accessible on all nodes (e.g. by means
of a NFS share). You can use this
option multiple times for specifying
more than one URL. The protocol must
be supported by the {@link
java.net.URLClassLoader}.
-d,--detached If present, runs the job in detached
mode
-n,--allowNonRestoredState Allow to skip savepoint state that
cannot be restored. You need to allow
this if you removed an operator from
your program that was part of the
program when the savepoint was
triggered.
-p,--parallelism
The parallelism with which to run the program. Optional flag to override the
default value specified in the
configuration.
-py,--python
Python script with the program entry point. The dependent resources can be
configured with the `--pyFiles`
option.
-pyarch,--pyArchives
Add python archive files for job. The archive files will be extracted to the
working directory of python UDF
worker. Currently only zip-format is
supported. For each archive file, a
target directory be specified. If the
target directory name is specified,
the archive file will be extracted to
a name can directory with the
specified name. Otherwise, the archive
file will be extracted to a directory
with the same name of the archive
file. The files uploaded via this
option are accessible via relative
path. '#' could be used as the
separator of the archive file path and
the target directory name. Comma (',')
could be used as the separator to
specify multiple archive files. This
option can be used to upload the
virtual environment, the data files
used in Python UDF (e.g.: --pyArchives
file:///tmp/py37.zip,file:///tmp/data.
zip#data --pyExecutable
py37.zip/py37/bin/python). The data
files could be accessed in Python UDF,
e.g.: f = open('data/data.txt', 'r').
-pyexec,--pyExecutable
Specify the path of the python interpreter used to execute the python
UDF worker (e.g.: --pyExecutable
/usr/local/bin/python3). The python
UDF worker depends on Python 3.5+,
Apache Beam (version == 2.23.0), Pip
(version >= 7.1.0) and SetupTools
(version >= 37.0.0). Please ensure
that the specified environment meets
the above requirements.
-pyfs,--pyFiles
Attach custom python files for job. These files will be added to the
PYTHONPATH of both the local client
and the remote python UDF worker. The
standard python resource file suffixes
such as .py/.egg/.zip or directory are
all supported. Comma (',') could be
used as the separator to specify
multiple files (e.g.: --pyFiles
file:///tmp/myresource.zip,hdfs:///$na
menode_address/myresource2.zip).
-pym,--pyModule
Python module with the program entry point. This option must be used in
conjunction with `--pyFiles`.
-pyreq,--pyRequirements
Specify a requirements.txt file which defines the third-party dependencies.
These dependencies will be installed
and added to the PYTHONPATH of the
python UDF worker. A directory which
contains the installation packages of
these dependencies could be specified
optionally. Use '#' as the separator
if the optional parameter exists
(e.g.: --pyRequirements
file:///tmp/requirements.txt#file:///t
mp/cached_dir).
-s,--fromSavepoint
Path to a savepoint to restore the job from (for example
hdfs:///flink/savepoint-1537).
-sae,--shutdownOnAttachedExit If the job is submitted in attached
mode, perform a best-effort cluster
shutdown when the CLI is terminated
abruptly, e.g., in response to a user
interrupt, such as typing Ctrl + C.
Options for Generic CLI mode:
-D
Allows specifying multiple generic configuration options. The available options can be found at
https://ci.apache.org/projects/flink/flink-docs-stabl
e/ops/config.html
-e,--executor
DEPRECATED: Please use the -t option instead which is also available with the "Application Mode".
The name of the executor to be used for executing the
given job, which is equivalent to the
"execution.target" config option. The currently
available executors are: "remote", "local",
"kubernetes-session", "yarn-per-job", "yarn-session".
-t,--target
The deployment target for the given application, which is equivalent to the "execution.target" config
option. For the "run" action the currently available
targets are: "remote", "local", "kubernetes-session",
"yarn-per-job", "yarn-session". For the
"run-application" action the currently available
targets are: "kubernetes-application",
"yarn-application".
Options for yarn-cluster mode:
-d,--detached If present, runs the job in detached
mode
-m,--jobmanager
Set to yarn-cluster to use YARN execution mode.
-yat,--yarnapplicationType
Set a custom application type for the application on YARN
-yD
use value for given property
-yd,--yarndetached If present, runs the job in detached
mode (deprecated; use non-YARN
specific option instead)
-yh,--yarnhelp Help for the Yarn session CLI.
-yid,--yarnapplicationId
Attach to running YARN session
-yj,--yarnjar
Path to Flink jar file
-yjm,--yarnjobManagerMemory
Memory for JobManager Container with optional unit (default: MB)
-ynl,--yarnnodeLabel
Specify YARN node label for the YARN application
-ynm,--yarnname
Set a custom name for the application on YARN
-yq,--yarnquery Display available YARN resources
(memory, cores)
-yqu,--yarnqueue
Specify YARN queue.
-ys,--yarnslots
Number of slots per TaskManager
-yt,--yarnship
Ship files in the specified directory (t for transfer)
-ytm,--yarntaskManagerMemory
Memory per TaskManager Container with optional unit (default: MB)
-yz,--yarnzookeeperNamespace
Namespace to create the Zookeeper sub-paths for high availability mode
-z,--zookeeperNamespace
Namespace to create the Zookeeper sub-paths for high availability mode
Options for default mode:
-D
Allows specifying multiple generic configuration options. The available
options can be found at
https://ci.apache.org/projects/flink/flink-
docs-stable/ops/config.html
-m,--jobmanager
Address of the JobManager to which to connect. Use this flag to connect to a
different JobManager than the one specified
in the configuration. Attention: This
option is respected only if the
high-availability configuration is NONE.
-z,--zookeeperNamespace
Namespace to create the Zookeeper sub-paths for high availability mode
Action "run-application" runs an application in Application Mode.
Syntax: run-application [OPTIONS]
Options for Generic CLI mode:
-D
Allows specifying multiple generic configuration options. The available options can be found at
https://ci.apache.org/projects/flink/flink-docs-stabl
e/ops/config.html
-e,--executor
DEPRECATED: Please use the -t option instead which is also available with the "Application Mode".
The name of the executor to be used for executing the
given job, which is equivalent to the
"execution.target" config option. The currently
available executors are: "remote", "local",
"kubernetes-session", "yarn-per-job", "yarn-session".
-t,--target
The deployment target for the given application, which is equivalent to the "execution.target" config
option. For the "run" action the currently available
targets are: "remote", "local", "kubernetes-session",
"yarn-per-job", "yarn-session". For the
"run-application" action the currently available
targets are: "kubernetes-application",
"yarn-application".
Action "info" shows the optimized execution plan of the program (JSON).
Syntax: info [OPTIONS]
"info" action options:
-c,--class
Class with the program entry point ("main()" method). Only needed if the JAR
file does not specify the class in its
manifest.
-p,--parallelism
The parallelism with which to run the program. Optional flag to override the
default value specified in the
configuration.
Action "list" lists running and scheduled programs.
Syntax: list [OPTIONS]
"list" action options:
-a,--all Show all programs and their JobIDs
-r,--running Show only running programs and their JobIDs
-s,--scheduled Show only scheduled programs and their JobIDs
Options for Generic CLI mode:
-D
Allows specifying multiple generic configuration options. The available options can be found at
https://ci.apache.org/projects/flink/flink-docs-stabl
e/ops/config.html
-e,--executor
DEPRECATED: Please use the -t option instead which is also available with the "Application Mode".
The name of the executor to be used for executing the
given job, which is equivalent to the
"execution.target" config option. The currently
available executors are: "remote", "local",
"kubernetes-session", "yarn-per-job", "yarn-session".
-t,--target
The deployment target for the given application, which is equivalent to the "execution.target" config
option. For the "run" action the currently available
targets are: "remote", "local", "kubernetes-session",
"yarn-per-job", "yarn-session". For the
"run-application" action the currently available
targets are: "kubernetes-application",
"yarn-application".
Options for yarn-cluster mode:
-m,--jobmanager
Set to yarn-cluster to use YARN execution mode.
-yid,--yarnapplicationId
Attach to running YARN session
-z,--zookeeperNamespace
Namespace to create the Zookeeper sub-paths for high availability mode
Options for default mode:
-D
Allows specifying multiple generic configuration options. The available
options can be found at
https://ci.apache.org/projects/flink/flink-
docs-stable/ops/config.html
-m,--jobmanager
Address of the JobManager to which to connect. Use this flag to connect to a
different JobManager than the one specified
in the configuration. Attention: This
option is respected only if the
high-availability configuration is NONE.
-z,--zookeeperNamespace
Namespace to create the Zookeeper sub-paths for high availability mode
Action "stop" stops a running program with a savepoint (streaming jobs only).
Syntax: stop [OPTIONS]
"stop" action options:
-d,--drain Send MAX_WATERMARK before taking the
savepoint and stopping the pipeline.
-p,--savepointPath
Path to the savepoint (for example hdfs:///flink/savepoint-1537). If no
directory is specified, the configured
default will be used
("state.savepoints.dir").
Options for Generic CLI mode:
-D
Allows specifying multiple generic configuration options. The available options can be found at
https://ci.apache.org/projects/flink/flink-docs-stabl
e/ops/config.html
-e,--executor
DEPRECATED: Please use the -t option instead which is also available with the "Application Mode".
The name of the executor to be used for executing the
given job, which is equivalent to the
"execution.target" config option. The currently
available executors are: "remote", "local",
"kubernetes-session", "yarn-per-job", "yarn-session".
-t,--target
The deployment target for the given application, which is equivalent to the "execution.target" config
option. For the "run" action the currently available
targets are: "remote", "local", "kubernetes-session",
"yarn-per-job", "yarn-session". For the
"run-application" action the currently available
targets are: "kubernetes-application",
"yarn-application".
Options for yarn-cluster mode:
-m,--jobmanager
Set to yarn-cluster to use YARN execution mode.
-yid,--yarnapplicationId
Attach to running YARN session
-z,--zookeeperNamespace
Namespace to create the Zookeeper sub-paths for high availability mode
Options for default mode:
-D
Allows specifying multiple generic configuration options. The available
options can be found at
https://ci.apache.org/projects/flink/flink-
docs-stable/ops/config.html
-m,--jobmanager
Address of the JobManager to which to connect. Use this flag to connect to a
different JobManager than the one specified
in the configuration. Attention: This
option is respected only if the
high-availability configuration is NONE.
-z,--zookeeperNamespace
Namespace to create the Zookeeper sub-paths for high availability mode
Action "cancel" cancels a running program.
Syntax: cancel [OPTIONS]
"cancel" action options:
-s,--withSavepoint
**DEPRECATION WARNING**: Cancelling a job with savepoint is deprecated.
Use "stop" instead.
Trigger savepoint and cancel job.
The target directory is optional. If
no directory is specified, the
configured default directory
(state.savepoints.dir) is used.
Options for Generic CLI mode:
-D
Allows specifying multiple generic configuration options. The available options can be found at
https://ci.apache.org/projects/flink/flink-docs-stabl
e/ops/config.html
-e,--executor
DEPRECATED: Please use the -t option instead which is also available with the "Application Mode".
The name of the executor to be used for executing the
given job, which is equivalent to the
"execution.target" config option. The currently
available executors are: "remote", "local",
"kubernetes-session", "yarn-per-job", "yarn-session".
-t,--target
The deployment target for the given application, which is equivalent to the "execution.target" config
option. For the "run" action the currently available
targets are: "remote", "local", "kubernetes-session",
"yarn-per-job", "yarn-session". For the
"run-application" action the currently available
targets are: "kubernetes-application",
"yarn-application".
Options for yarn-cluster mode:
-m,--jobmanager
Set to yarn-cluster to use YARN execution mode.
-yid,--yarnapplicationId
Attach to running YARN session
-z,--zookeeperNamespace
Namespace to create the Zookeeper sub-paths for high availability mode
Options for default mode:
-D
Allows specifying multiple generic configuration options. The available
options can be found at
https://ci.apache.org/projects/flink/flink-
docs-stable/ops/config.html
-m,--jobmanager
Address of the JobManager to which to connect. Use this flag to connect to a
different JobManager than the one specified
in the configuration. Attention: This
option is respected only if the
high-availability configuration is NONE.
-z,--zookeeperNamespace
Namespace to create the Zookeeper sub-paths for high availability mode
Action "savepoint" triggers savepoints for a running job or disposes existing ones.
Syntax: savepoint [OPTIONS] <Job ID> [<target directory>]
"savepoint" action options:
-d,--dispose
Path of savepoint to dispose.
-j,--jarfile
Flink program JAR file.
Options for Generic CLI mode:
-D
Allows specifying multiple generic configuration options. The available options can be found at
https://ci.apache.org/projects/flink/flink-docs-stabl
e/ops/config.html
-e,--executor
DEPRECATED: Please use the -t option instead which is also available with the "Application Mode".
The name of the executor to be used for executing the
given job, which is equivalent to the
"execution.target" config option. The currently
available executors are: "remote", "local",
"kubernetes-session", "yarn-per-job", "yarn-session".
-t,--target
The deployment target for the given application, which is equivalent to the "execution.target" config
option. For the "run" action the currently available
targets are: "remote", "local", "kubernetes-session",
"yarn-per-job", "yarn-session". For the
"run-application" action the currently available
targets are: "kubernetes-application",
"yarn-application".
Options for yarn-cluster mode:
-m,--jobmanager
Set to yarn-cluster to use YARN execution mode.
-yid,--yarnapplicationId
Attach to running YARN session
-z,--zookeeperNamespace
Namespace to create the Zookeeper sub-paths for high availability mode
Options for default mode:
-D
Allows specifying multiple generic configuration options. The available
options can be found at
https://ci.apache.org/projects/flink/flink-
docs-stable/ops/config.html
-m,--jobmanager
Address of the JobManager to which to connect. Use this flag to connect to a
different JobManager than the one specified
in the configuration. Attention: This
option is respected only if the
high-availability configuration is NONE.
-z,--zookeeperNamespace
Namespace to create the Zookeeper sub-paths for high availability mode
Flink getting-started examples
Preliminary notes
Note: the getting-started example uses the DataSet API; after this we will no longer use it and will instead use the unified batch/stream DataStream API
https://ci.apache.org/projects/flink/flink-docs-release-1.12/dev/batch/
Prepare the environment
POM
<?xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
    <modelVersion>4.0.0</modelVersion>
    <groupId>XX.XXXX</groupId>
    <artifactId>flink_XXXXX</artifactId>
    <version>1.0-SNAPSHOT</version>
    <repositories>
        <repository>
            <id>aliyun</id>
            <url>http://maven.aliyun.com/nexus/content/groups/public/</url>
        </repository>
        <repository>
            <id>apache</id>
            <url>https://repository.apache.org/content/repositories/snapshots/</url>
        </repository>
        <repository>
            <id>cloudera</id>
            <url>https://repository.cloudera.com/artifactory/cloudera-repos/</url>
        </repository>
    </repositories>
    <properties>
        <project.build.sourceEncoding>UTF-8</project.build.sourceEncoding>
        <project.reporting.outputEncoding>UTF-8</project.reporting.outputEncoding>
        <java.version>1.8</java.version>
        <maven.compiler.source>1.8</maven.compiler.source>
        <maven.compiler.target>1.8</maven.compiler.target>
        <scala.version>2.12</scala.version>
        <flink.version>1.12.0</flink.version>
    </properties>
    <dependencies>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-clients_2.12</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-scala_2.12</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-java</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-streaming-scala_2.12</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-streaming-java_2.12</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-table-api-scala-bridge_2.12</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-table-api-java-bridge_2.12</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-table-planner_2.12</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-table-planner-blink_2.12</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-table-common</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-connector-kafka_2.12</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-sql-connector-kafka_2.12</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-connector-jdbc_2.12</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-csv</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-json</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.bahir</groupId>
            <artifactId>flink-connector-redis_2.11</artifactId>
            <version>1.0</version>
            <exclusions>
                <exclusion>
                    <artifactId>flink-streaming-java_2.11</artifactId>
                    <groupId>org.apache.flink</groupId>
                </exclusion>
                <exclusion>
                    <artifactId>flink-runtime_2.11</artifactId>
                    <groupId>org.apache.flink</groupId>
                </exclusion>
                <exclusion>
                    <artifactId>flink-core</artifactId>
                    <groupId>org.apache.flink</groupId>
                </exclusion>
                <exclusion>
                    <artifactId>flink-java</artifactId>
                    <groupId>org.apache.flink</groupId>
                </exclusion>
            </exclusions>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-connector-hive_2.12</artifactId>
            <version>${flink.version}</version>
        </dependency>
        <dependency>
            <groupId>org.apache.hive</groupId>
            <artifactId>hive-metastore</artifactId>
            <version>2.1.0</version>
        </dependency>
        <dependency>
            <groupId>org.apache.hive</groupId>
            <artifactId>hive-exec</artifactId>
            <version>2.1.0</version>
        </dependency>
        <dependency>
            <groupId>org.apache.flink</groupId>
            <artifactId>flink-shaded-hadoop-2-uber</artifactId>
            <version>2.7.5-10.0</version>
        </dependency>
        <dependency>
            <groupId>org.apache.hbase</groupId>
            <artifactId>hbase-client</artifactId>
            <version>2.1.0</version>
        </dependency>
        <dependency>
            <groupId>mysql</groupId>
            <artifactId>mysql-connector-java</artifactId>
            <version>5.1.38</version>
        </dependency>
        <dependency>
            <groupId>io.vertx</groupId>
            <artifactId>vertx-core</artifactId>
            <version>3.9.0</version>
        </dependency>
        <dependency>
            <groupId>io.vertx</groupId>
            <artifactId>vertx-jdbc-client</artifactId>
            <version>3.9.0</version>
        </dependency>
        <dependency>
            <groupId>io.vertx</groupId>
            <artifactId>vertx-redis-client</artifactId>
            <version>3.9.0</version>
        </dependency>
        <dependency>
            <groupId>org.slf4j</groupId>
            <artifactId>slf4j-log4j12</artifactId>
            <version>1.7.7</version>
            <scope>runtime</scope>
        </dependency>
        <dependency>
            <groupId>log4j</groupId>
            <artifactId>log4j</artifactId>
            <version>1.2.17</version>
            <scope>runtime</scope>
        </dependency>
        <dependency>
            <groupId>com.alibaba</groupId>
            <artifactId>fastjson</artifactId>
            <version>1.2.44</version>
        </dependency>
        <dependency>
            <groupId>org.projectlombok</groupId>
            <artifactId>lombok</artifactId>
            <version>1.18.2</version>
            <scope>provided</scope>
        </dependency>
    </dependencies>
    <build>
        <sourceDirectory>src/main/java</sourceDirectory>
        <plugins>
            <plugin>
                <groupId>org.apache.maven.plugins</groupId>
                <artifactId>maven-compiler-plugin</artifactId>
                <version>3.5.1</version>
                <configuration>
                    <source>1.8</source>
                    <target>1.8</target>
                </configuration>
            </plugin>
            <plugin>
                <groupId>org.apache.maven.plugins</groupId>
                <artifactId>maven-surefire-plugin</artifactId>
                <version>2.18.1</version>
                <configuration>
                    <useFile>false</useFile>
                    <disableXmlReport>true</disableXmlReport>
                    <includes>
                        <include>**/*Test.*</include>
                        <include>**/*Suite.*</include>
                    </includes>
                </configuration>
            </plugin>
            <plugin>
                <groupId>org.apache.maven.plugins</groupId>
                <artifactId>maven-shade-plugin</artifactId>
                <version>2.3</version>
                <executions>
                    <execution>
                        <phase>package</phase>
                        <goals>
                            <goal>shade</goal>
                        </goals>
                        <configuration>
                            <filters>
                                <filter>
                                    <artifact>*:*</artifact>
                                    <excludes>
                                        <exclude>META-INF/*.SF</exclude>
                                        <exclude>META-INF/*.DSA</exclude>
                                        <exclude>META-INF/*.RSA</exclude>
                                    </excludes>
                                </filter>
                            </filters>
                        </configuration>
                    </execution>
                </executions>
            </plugin>
        </plugins>
    </build>
</project>
Code: DataSet API (for reference only)
import org.apache.flink.api.common.functions.FlatMapFunction;
import org.apache.flink.api.common.functions.MapFunction;
import org.apache.flink.api.java.DataSet;
import org.apache.flink.api.java.ExecutionEnvironment;
import org.apache.flink.api.java.operators.AggregateOperator;
import org.apache.flink.api.java.operators.UnsortedGrouping;
import org.apache.flink.api.java.tuple.Tuple2;
import org.apache.flink.util.Collector;

/**
 * Author ZuoYan
 * Desc Flink DataSet API WordCount demo
 */
public class WordCount {
    public static void main(String[] args) throws Exception {
        //TODO 0.env
        ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();

        //TODO 1.source
        DataSet<String> lines = env.fromElements("itcast hadoop spark", "itcast hadoop spark", "itcast hadoop", "itcast");

        //TODO 2.transformation
        //split each line into words
        /*
        @FunctionalInterface
        public interface FlatMapFunction<T, O> extends Function, Serializable {
            void flatMap(T value, Collector<O> out) throws Exception;
        }
        */
        DataSet<String> words = lines.flatMap(new FlatMapFunction<String, String>() {
            @Override
            public void flatMap(String value, Collector<String> out) throws Exception {
                //value is one line of input
                String[] arr = value.split(" ");
                for (String word : arr) {
                    out.collect(word);
                }
            }
        });

        //map each word to (word, 1)
        /*
        @FunctionalInterface
        public interface MapFunction<T, O> extends Function, Serializable {
            O map(T value) throws Exception;
        }
        */
        DataSet<Tuple2<String, Integer>> wordAndOne = words.map(new MapFunction<String, Tuple2<String, Integer>>() {
            @Override
            public Tuple2<String, Integer> map(String value) throws Exception {
                //value is a single word
                return Tuple2.of(value, 1);
            }
        });

        //group by the word
        UnsortedGrouping<Tuple2<String, Integer>> grouped = wordAndOne.groupBy(0);

        //aggregate
        AggregateOperator<Tuple2<String, Integer>> result = grouped.sum(1);

        //TODO 3.sink
        result.print();
    }
}
Code: DataStream API - anonymous inner classes - batch processing
import org.apache.flink.api.common.functions.FlatMapFunction;
import org.apache.flink.api.common.functions.MapFunction;
import org.apache.flink.api.java.tuple.Tuple2;
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.datastream.KeyedStream;
import org.apache.flink.streaming.api.datastream.SingleOutputStreamOperator;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.util.Collector;

/**
 * Author ZuoYan
 * Desc Flink DataStream API WordCount demo
 * Note: in Flink 1.12 the DataStream API supports both stream and batch processing; the runtime mode decides which.
 */
public class WordCount2 {
    public static void main(String[] args) throws Exception {
        //TODO 0.env
        //ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        //env.setRuntimeMode(RuntimeExecutionMode.BATCH);     //run the DataStream program as a batch job
        //env.setRuntimeMode(RuntimeExecutionMode.STREAMING); //run the DataStream program as a streaming job
        //env.setRuntimeMode(RuntimeExecutionMode.AUTOMATIC); //let Flink pick batch or streaming based on the sources

        //TODO 1.source
        //DataSet<String> lines = env.fromElements("itcast hadoop spark", "itcast hadoop spark", "itcast hadoop", "itcast");
        DataStream<String> lines = env.fromElements("itcast hadoop spark", "itcast hadoop spark", "itcast hadoop", "itcast");

        //TODO 2.transformation
        //split each line into words
        /*
        @FunctionalInterface
        public interface FlatMapFunction<T, O> extends Function, Serializable {
            void flatMap(T value, Collector<O> out) throws Exception;
        }
        */
        DataStream<String> words = lines.flatMap(new FlatMapFunction<String, String>() {
            @Override
            public void flatMap(String value, Collector<String> out) throws Exception {
                //value is one line of input
                String[] arr = value.split(" ");
                for (String word : arr) {
                    out.collect(word);
                }
            }
        });

        //map each word to (word, 1)
        /*
        @FunctionalInterface
        public interface MapFunction<T, O> extends Function, Serializable {
            O map(T value) throws Exception;
        }
        */
        DataStream<Tuple2<String, Integer>> wordAndOne = words.map(new MapFunction<String, Tuple2<String, Integer>>() {
            @Override
            public Tuple2<String, Integer> map(String value) throws Exception {
                //value is a single word
                return Tuple2.of(value, 1);
            }
        });

        //group: the DataSet API groups with groupBy, the DataStream API with keyBy
        //wordAndOne.keyBy(0);
        /*
        @FunctionalInterface
        public interface KeySelector<IN, KEY> extends Function, Serializable {
            KEY getKey(IN value) throws Exception;
        }
        */
        KeyedStream<Tuple2<String, Integer>, String> grouped = wordAndOne.keyBy(t -> t.f0);

        //aggregate
        SingleOutputStreamOperator<Tuple2<String, Integer>> result = grouped.sum(1);

        //TODO 3.sink
        result.print();

        //TODO 4.execute - start the job and wait for it to finish
        env.execute();
    }
}
Code: DataStream API - anonymous inner classes - stream processing
import org.apache.flink.api.common.RuntimeExecutionMode;
import org.apache.flink.api.common.functions.FlatMapFunction;
import org.apache.flink.api.common.functions.MapFunction;
import org.apache.flink.api.java.tuple.Tuple2;
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.datastream.KeyedStream;
import org.apache.flink.streaming.api.datastream.SingleOutputStreamOperator;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.util.Collector;

/**
 * Author ZuoYan
 * Desc Flink DataStream API WordCount demo
 * Note: in Flink 1.12 the DataStream API supports both stream and batch processing; the runtime mode decides which.
 */
public class WordCount3 {
    public static void main(String[] args) throws Exception {
        //TODO 0.env
        //ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        //env.setRuntimeMode(RuntimeExecutionMode.BATCH);     //run the DataStream program as a batch job
        //env.setRuntimeMode(RuntimeExecutionMode.STREAMING); //run the DataStream program as a streaming job
        env.setRuntimeMode(RuntimeExecutionMode.AUTOMATIC);   //let Flink pick batch or streaming based on the sources

        //TODO 1.source
        //DataSet<String> lines = env.fromElements("itcast hadoop spark", "itcast hadoop spark", "itcast hadoop", "itcast");
        //DataStream<String> lines = env.fromElements("itcast hadoop spark", "itcast hadoop spark", "itcast hadoop", "itcast");
        DataStream<String> lines = env.socketTextStream("node1", 9999);

        //TODO 2.transformation
        //split each line into words
        /*
        @FunctionalInterface
        public interface FlatMapFunction<T, O> extends Function, Serializable {
            void flatMap(T value, Collector<O> out) throws Exception;
        }
        */
        DataStream<String> words = lines.flatMap(new FlatMapFunction<String, String>() {
            @Override
            public void flatMap(String value, Collector<String> out) throws Exception {
                //value is one line of input
                String[] arr = value.split(" ");
                for (String word : arr) {
                    out.collect(word);
                }
            }
        });

        //map each word to (word, 1)
        /*
        @FunctionalInterface
        public interface MapFunction<T, O> extends Function, Serializable {
            O map(T value) throws Exception;
        }
        */
        DataStream<Tuple2<String, Integer>> wordAndOne = words.map(new MapFunction<String, Tuple2<String, Integer>>() {
            @Override
            public Tuple2<String, Integer> map(String value) throws Exception {
                //value is a single word
                return Tuple2.of(value, 1);
            }
        });

        //group: the DataSet API groups with groupBy, the DataStream API with keyBy
        //wordAndOne.keyBy(0);
        /*
        @FunctionalInterface
        public interface KeySelector<IN, KEY> extends Function, Serializable {
            KEY getKey(IN value) throws Exception;
        }
        */
        KeyedStream<Tuple2<String, Integer>, String> grouped = wordAndOne.keyBy(t -> t.f0);

        //aggregate
        SingleOutputStreamOperator<Tuple2<String, Integer>> result = grouped.sum(1);

        //TODO 3.sink
        result.print();

        //TODO 4.execute - start the job and wait for it to finish
        env.execute();
    }
}
Code: DataStream API - lambdas
import org.apache.flink.api.common.RuntimeExecutionMode;
import org.apache.flink.api.common.typeinfo.Types;
import org.apache.flink.api.java.tuple.Tuple2;
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.datastream.KeyedStream;
import org.apache.flink.streaming.api.datastream.SingleOutputStreamOperator;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.util.Collector;
import java.util.Arrays;

/**
 * Author ZuoYan
 * Desc Flink DataStream API WordCount demo, written with Java lambdas
 * Note: in Flink 1.12 the DataStream API supports both stream and batch processing; the runtime mode decides which.
 */
public class WordCount4 {
    public static void main(String[] args) throws Exception {
        //TODO 0.env
        //ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        //env.setRuntimeMode(RuntimeExecutionMode.BATCH);     //run the DataStream program as a batch job
        //env.setRuntimeMode(RuntimeExecutionMode.STREAMING); //run the DataStream program as a streaming job
        env.setRuntimeMode(RuntimeExecutionMode.AUTOMATIC);   //let Flink pick batch or streaming based on the sources

        //TODO 1.source
        //DataSet<String> lines = env.fromElements("itcast hadoop spark", "itcast hadoop spark", "itcast hadoop", "itcast");
        DataStream<String> lines = env.fromElements("itcast hadoop spark", "itcast hadoop spark", "itcast hadoop", "itcast");

        //TODO 2.transformation
        //split each line into words
        /*
        @FunctionalInterface
        public interface FlatMapFunction<T, O> extends Function, Serializable {
            void flatMap(T value, Collector<O> out) throws Exception;
        }
        */
        /*
        DataStream<String> words = lines.flatMap(new FlatMapFunction<String, String>() {
            @Override
            public void flatMap(String value, Collector<String> out) throws Exception {
                //value is one line of input
                String[] arr = value.split(" ");
                for (String word : arr) {
                    out.collect(word);
                }
            }
        });
        */
        SingleOutputStreamOperator<String> words = lines.flatMap(
                (String value, Collector<String> out) -> Arrays.stream(value.split(" ")).forEach(out::collect)
        ).returns(Types.STRING); //returns(...) is needed because Java lambdas erase the generic type information

        //map each word to (word, 1)
        /*
        @FunctionalInterface
        public interface MapFunction<T, O> extends Function, Serializable {
            O map(T value) throws Exception;
        }
        */
        /*
        DataStream<Tuple2<String, Integer>> wordAndOne = words.map(new MapFunction<String, Tuple2<String, Integer>>() {
            @Override
            public Tuple2<String, Integer> map(String value) throws Exception {
                //value is a single word
                return Tuple2.of(value, 1);
            }
        });
        */
        DataStream<Tuple2<String, Integer>> wordAndOne = words.map(
                (String value) -> Tuple2.of(value, 1)
        ).returns(Types.TUPLE(Types.STRING, Types.INT));

        //group: the DataSet API groups with groupBy, the DataStream API with keyBy
        //wordAndOne.keyBy(0);
        /*
        @FunctionalInterface
        public interface KeySelector<IN, KEY> extends Function, Serializable {
            KEY getKey(IN value) throws Exception;
        }
        */
        KeyedStream<Tuple2<String, Integer>, String> grouped = wordAndOne.keyBy(t -> t.f0);

        //aggregate
        SingleOutputStreamOperator<Tuple2<String, Integer>> result = grouped.sum(1);

        //TODO 3.sink
        result.print();

        //TODO 4.execute - start the job and wait for it to finish
        env.execute();
    }
}
Code: running on YARN
import org.apache.flink.api.common.typeinfo.Types;
import org.apache.flink.api.java.tuple.Tuple2;
import org.apache.flink.api.java.utils.ParameterTool;
import org.apache.flink.streaming.api.datastream.DataStream;
import org.apache.flink.streaming.api.datastream.KeyedStream;
import org.apache.flink.streaming.api.datastream.SingleOutputStreamOperator;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.util.Collector;
import java.util.Arrays;

/**
 * Author ZuoYan
 * Desc Flink DataStream API WordCount demo, packaged for submission to YARN
 * Note: in Flink 1.12 the DataStream API supports both stream and batch processing; the runtime mode decides which.
 */
public class WordCount5_Yarn {
    public static void main(String[] args) throws Exception {
        ParameterTool parameterTool = ParameterTool.fromArgs(args);
        String output = "";
        if (parameterTool.has("output")) {
            output = parameterTool.get("output");
            System.out.println("Using the specified output path: " + output);
        } else {
            output = "hdfs://node1:8020/wordcount/output47_";
            System.out.println("No --output specified, falling back to the default: " + output);
        }

        //TODO 0.env
        //ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        //env.setRuntimeMode(RuntimeExecutionMode.BATCH);     //run the DataStream program as a batch job
        //env.setRuntimeMode(RuntimeExecutionMode.STREAMING); //run the DataStream program as a streaming job
        //env.setRuntimeMode(RuntimeExecutionMode.AUTOMATIC); //let Flink pick batch or streaming based on the sources

        //TODO 1.source
        //DataSet<String> lines = env.fromElements("itcast hadoop spark", "itcast hadoop spark", "itcast hadoop", "itcast");
        DataStream<String> lines = env.fromElements("itcast hadoop spark", "itcast hadoop spark", "itcast hadoop", "itcast");

        //TODO 2.transformation
        //split each line into words
        /*
        @FunctionalInterface
        public interface FlatMapFunction<T, O> extends Function, Serializable {
            void flatMap(T value, Collector<O> out) throws Exception;
        }
        */
        /*
        DataStream<String> words = lines.flatMap(new FlatMapFunction<String, String>() {
            @Override
            public void flatMap(String value, Collector<String> out) throws Exception {
                //value is one line of input
                String[] arr = value.split(" ");
                for (String word : arr) {
                    out.collect(word);
                }
            }
        });
        */
        SingleOutputStreamOperator<String> words = lines.flatMap(
                (String value, Collector<String> out) -> Arrays.stream(value.split(" ")).forEach(out::collect)
        ).returns(Types.STRING);

        //map each word to (word, 1)
        /*
        @FunctionalInterface
        public interface MapFunction<T, O> extends Function, Serializable {
            O map(T value) throws Exception;
        }
        */
        /*
        DataStream<Tuple2<String, Integer>> wordAndOne = words.map(new MapFunction<String, Tuple2<String, Integer>>() {
            @Override
            public Tuple2<String, Integer> map(String value) throws Exception {
                //value is a single word
                return Tuple2.of(value, 1);
            }
        });
        */
        DataStream<Tuple2<String, Integer>> wordAndOne = words.map(
                (String value) -> Tuple2.of(value, 1)
        ).returns(Types.TUPLE(Types.STRING, Types.INT));

        //group: the DataSet API groups with groupBy, the DataStream API with keyBy
        //wordAndOne.keyBy(0);
        /*
        @FunctionalInterface
        public interface KeySelector<IN, KEY> extends Function, Serializable {
            KEY getKey(IN value) throws Exception;
        }
        */
        KeyedStream<Tuple2<String, Integer>, String> grouped = wordAndOne.keyBy(t -> t.f0);

        //aggregate
        SingleOutputStreamOperator<Tuple2<String, Integer>> result = grouped.sum(1);

        //TODO 3.sink
        //if the job fails with an HDFS permission error, you can run: hadoop fs -chmod -R 777 /
        System.setProperty("HADOOP_USER_NAME", "root"); //set the HDFS user name
        //result.print();
        //result.writeAsText("hdfs://node1:8020/wordcount/output47_" + System.currentTimeMillis()).setParallelism(1);
        result.writeAsText(output + System.currentTimeMillis()).setParallelism(1);

        //TODO 4.execute - start the job and wait for it to finish
        env.execute();
    }
}
Package the jar, rename it, and upload it
Submit
/export/server/flink/bin/flink run -Dexecution.runtime-mode=BATCH -m yarn-cluster -yjm 1024 -ytm 1024 -c cn.itcast.hello.WordCount5_Yarn /root/wc.jar --output hdfs://node1:8020/wordcount/output_xx
Note
RuntimeExecutionMode.BATCH     // run the DataStream program as a batch job
RuntimeExecutionMode.STREAMING // run the DataStream program as a streaming job
RuntimeExecutionMode.AUTOMATIC // let the DataStream program pick batch or streaming based on its sources
// if no mode is set, the default is STREAMING
In later Flink development, simply treat every data source as a stream, or just use AUTOMATIC.
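To make the difference between the modes concrete, here is a minimal, self-contained sketch (the class name and sample data are made up for illustration): with BATCH only the final count per word is printed, while with STREAMING an updated count is printed for every incoming element.
import org.apache.flink.api.common.RuntimeExecutionMode;
import org.apache.flink.api.common.typeinfo.Types;
import org.apache.flink.api.java.tuple.Tuple2;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.util.Collector;

public class RuntimeModeDemo {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        // switch to RuntimeExecutionMode.STREAMING to see incremental updates instead of only final counts
        env.setRuntimeMode(RuntimeExecutionMode.BATCH);
        env.fromElements("hello flink", "hello world")
           .flatMap((String line, Collector<Tuple2<String, Integer>> out) -> {
               for (String w : line.split(" ")) {
                   out.collect(Tuple2.of(w, 1));
               }
           })
           .returns(Types.TUPLE(Types.STRING, Types.INT))
           .keyBy(t -> t.f0)
           .sum(1)
           .print();
        env.execute();
    }
}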
A first look at how Flink works - take your time to digest this
Roles and responsibilities
Execution flow
DataFlow
https://ci.apache.org/projects/flink/flink-docs-release-1.12/concepts/glossary.html
DataFlow, Operator, Partition, Parallelism, SubTask
OperatorChain and Task
TaskSlot and TaskSlotSharing
How the execution graph is generated
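The logical graph Flink builds for a job can also be inspected from code. Below is a small sketch (reusing the WordCount pipeline from the examples above; the class name is made up) that prints the StreamGraph as JSON; the output can be pasted into the Flink plan visualizer at https://flink.apache.org/visualizer/ to see the generated graph.
import org.apache.flink.api.common.typeinfo.Types;
import org.apache.flink.api.java.tuple.Tuple2;
import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
import org.apache.flink.util.Collector;

public class ExecutionPlanSketch {
    public static void main(String[] args) throws Exception {
        StreamExecutionEnvironment env = StreamExecutionEnvironment.getExecutionEnvironment();
        env.fromElements("itcast hadoop spark", "itcast hadoop")
           .flatMap((String line, Collector<Tuple2<String, Integer>> out) -> {
               for (String w : line.split(" ")) {
                   out.collect(Tuple2.of(w, 1));
               }
           })
           .returns(Types.TUPLE(Types.STRING, Types.INT))
           .keyBy(t -> t.f0)
           .sum(1)
           .print();
        // prints the StreamGraph (logical plan) as JSON instead of executing the job
        System.out.println(env.getExecutionPlan());
    }
}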
WeChat official account: 漫话架构之美
An original big-data technology account focused on Hadoop, Flink, Spark, Kafka, Hive, HBase and more, covering big-data internals, data warehousing, data governance, and emerging big-data technologies