aebdm757009

从flink-example分析flink组件(1)WordCount batch实战及源码分析

上一章简单介绍了一下flink在windows下如何通过flink-webui运行已经打包完成的示例程序(jar)，那么我们为什么要使用flink呢？

flink的特征

官网给出的特征如下：

1、一切皆为流（All streaming use cases ）

事件驱动应用(Event-driven Applications)

流式 & 批量分析(Stream & Batch Analytics)

数据管道&ETL(Data Pipelines & ETL)

2、正确性保证(Guaranteed correctness)

唯一状态一致性(Exactly-once state consistency)
事件-事件处理(Event-time processing)
高超的最近数据处理(Sophisticated late data handling)

3、多层api(Layered APIs)

基于流式和批量数据处理的SQL(SQL on Stream & Batch Data)
流水数据API & 数据集API(DataStream API & DataSet API)
处理函数 (时间 & 状态)(ProcessFunction (Time & State))

4、易用性

部署灵活(Flexible deployment)
高可用安装(High-availability setup）
保存点(Savepoints)

5、可扩展性

可扩展架构(Scale-out architecture)
大量状态的支持(Support for very large state)
增量检查点(Incremental checkpointing)

6、高性能

低延迟(Low latency)
高吞吐量(High throughput)
内存计算(In-Memory computing)

flink架构

1、层级结构

2.工作架构图

flink实战

1、依赖文件pom.xml

xml version="1.0" encoding="UTF-8"?>
<project xmlns="http://maven.apache.org/POM/4.0.0"
         xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
         xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
    <modelVersion>4.0.0modelVersion>

    <groupId>flinkDemogroupId>
    <artifactId>flinkDemoartifactId>
    <version>1.0-SNAPSHOTversion>

    <dependencies>
        <dependency>
            <groupId>org.apache.flinkgroupId>
            <artifactId>flink-javaartifactId>
            <version>1.5.0version>
            
        dependency>
        <dependency>
            <groupId>org.apache.flinkgroupId>
            <artifactId>flink-streaming-java_2.11artifactId>
            <version>1.5.0version>
            
        dependency>
        
        <dependency>
            <groupId>org.apache.flinkgroupId>
            <artifactId>flink-connector-kafka-0.10_2.11artifactId>
            <version>1.5.0version>
        dependency>
        
        <dependency>
            <groupId>org.apache.flinkgroupId>
            <artifactId>flink-hbase_2.11artifactId>
            <version>1.5.0version>
        dependency>

        <dependency>
            <groupId>org.apache.kafkagroupId>
            <artifactId>kafka-clientsartifactId>
            <version>0.10.1.1version>
        dependency>

        <dependency>
            <groupId>org.apache.hbasegroupId>
            <artifactId>hbase-clientartifactId>
            <version>1.1.2version>
        dependency>

        <dependency>
            <groupId>org.projectlombokgroupId>
            <artifactId>lombokartifactId>
            <version>1.16.10version>
            <scope>compilescope>
        dependency>
        <dependency>
            <groupId>com.google.code.gsongroupId>
            <artifactId>gsonartifactId>
            <version>2.8.2version>
        dependency>
        <dependency>
            <groupId>com.github.rholdergroupId>
            <artifactId>guava-retryingartifactId>
            <version>2.0.0version>
        dependency>
    dependencies>

    <build>
        <plugins>
            <plugin>
                <groupId>org.apache.maven.pluginsgroupId>
                <artifactId>maven-compiler-pluginartifactId>
                <version>3.5.1version>
                <configuration>
                    <source>1.8source>
                    <target>1.8target>
                configuration>
            plugin>
        plugins>
    build>
project>

2、java程序

public class WordCountDemo {

        public static void main(String[] args) throws Exception {
            final ParameterTool params = ParameterTool.fromArgs(args);

            // create execution environment
            final ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment();
            env.getConfig().setGlobalJobParameters(params);

            // get input data
            DataSet text;
            if (params.has("input")) {
                // read the text file from given input path
                text = env.readTextFile(params.get("input"));
            } else {
                // get default test text data
                System.out.println("Executing WordCount example with default input data set.");
                System.out.println("Use --input to specify file input.");
                text = WordCountData.getDefaultTextLineDataSet(env);
            }

            DataSet> counts =
                    // split up the lines in pairs (2-tuples) containing: (word,1)
                    text.flatMap(new Tokenizer())
                            // group by the tuple field "0" and sum up tuple field "1"
                            .groupBy(0)
                            .sum(1);

            // emit result
            if (params.has("output")) {
                counts.writeAsCsv(params.get("output"), "\n", " ");
                // execute program
                env.execute("WordCount Example");
            } else {
                System.out.println("Printing result to stdout. Use --output to specify output path.");
                counts.print();
            }

        }

    // *************************************************************************
    //     USER FUNCTIONS
    // *************************************************************************

    /**
     * Implements the string tokenizer that splits sentences into words as a user-defined
     * FlatMapFunction. The function takes a line (String) and splits it into
     * multiple pairs in the form of "(word,1)" ({@code Tuple2}).
     */
    public static final class Tokenizer implements FlatMapFunction> {

        @Override
        public void flatMap(String value, Collector> out) {
            // normalize and split the line
            String[] tokens = value.toLowerCase().split("\\W+");

            // emit the pairs
            for (String token : tokens) {
                if (token.length() > 0) {
                    out.collect(new Tuple2<>(token, 1));
                }
            }
        }
    }
}

3、单步调试分析

　　第一步：获取环境信息ExecutionEnvironment.java

/**
 * The ExecutionEnvironment is the context in which a program is executed. A
 * {@link LocalEnvironment} will cause execution in the current JVM, a
 * {@link RemoteEnvironment} will cause execution on a remote setup.
 *
 * The environment provides methods to control the job execution (such as setting the parallelism)
 * and to interact with the outside world (data access).
 *
 * 
Please note that the execution environment needs strong type information for the input and return types
 * of all operations that are executed. This means that the environments needs to know that the return
 * value of an operation is for example a Tuple of String and Integer.
 * Because the Java compiler throws much of the generic type information away, most methods attempt to re-
 * obtain that information using reflection. In certain cases, it may be necessary to manually supply that
 * information to some of the methods.
 *
 * @see LocalEnvironment
 * @see RemoteEnvironment
 */

　　创建本地环境

    /**
     * Creates a {@link LocalEnvironment} which is used for executing Flink jobs.
     *
     * @param configuration to start the {@link LocalEnvironment} with
     * @param defaultParallelism to initialize the {@link LocalEnvironment} with
     * @return {@link LocalEnvironment}
     */
    private static LocalEnvironment createLocalEnvironment(Configuration configuration, int defaultParallelism) {
        final LocalEnvironment localEnvironment = new LocalEnvironment(configuration);

        if (defaultParallelism > 0) {
            localEnvironment.setParallelism(defaultParallelism);
        }

        return localEnvironment;
    }

　　第二步：获取外部数据，创建数据集 ExecutionEnvironment.java

    /**
     * Creates a DataSet from the given non-empty collection. Note that this operation will result
     * in a non-parallel data source, i.e. a data source with a parallelism of one.
     *
     * The returned DataSet is typed to the given TypeInformation.
     *
     * @param data The collection of elements to create the data set from.
     * @param type The TypeInformation for the produced data set.
     * @return A DataSet representing the given collection.
     *
     * @see #fromCollection(Collection)
     */
    public  DataSource fromCollection(Collection data, TypeInformation type) {
        return fromCollection(data, type, Utils.getCallLocationName());
    }

    private  DataSource fromCollection(Collection data, TypeInformation type, String callLocationName) {
        CollectionInputFormat.checkCollection(data, type.getTypeClass());
        return new DataSource<>(this, new CollectionInputFormat<>(data, type.createSerializer(config)), type, callLocationName);
    }

　　数据集的继承关系

其中，DataSet是一组相同类型数据的集合，抽象类，它提供了数据的转换功能，如map，reduce，join和coGroup

/**
 * A DataSet represents a collection of elements of the same type.
 *
 * A DataSet can be transformed into another DataSet by applying a transformation as for example
 * 

 *   {@link DataSet#map(org.apache.flink.api.common.functions.MapFunction)},
 *   {@link DataSet#reduce(org.apache.flink.api.common.functions.ReduceFunction)},
 *   {@link DataSet#join(DataSet)}, or
 *   {@link DataSet#coGroup(DataSet)}.
 * 
 *
 * @param  The type of the DataSet, i.e., the type of the elements of the DataSet.
 */

Operator是java api的操作基类，抽象类

/**
 * Base class of all operators in the Java API.
 *
 * @param  The type of the data set produced by this operator.
 * @param  The type of the operator, so that we can return it.
 */
@Public
public abstract class Operatorextends Operator> extends DataSet {

DataSource具体实现类。

/**
 * An operation that creates a new data set (data source). The operation acts as the
 * data set on which to apply further transformations. It encapsulates additional
 * configuration parameters, to customize the execution.
 *
 * @param  The type of the elements produced by this data source.
 */
@Public
public class DataSource extends Operator> {

　　第三步：对输入数据集进行转换

            DataSet> counts =
                    // split up the lines in pairs (2-tuples) containing: (word,1)
                    text.flatMap(new Tokenizer())
                            // group by the tuple field "0" and sum up tuple field "1"
                            .groupBy(0)
                            .sum(1);

>>调用map DataSet.java

    /**
     * Applies a FlatMap transformation on a {@link DataSet}.
     *
     * The transformation calls a {@link org.apache.flink.api.common.functions.RichFlatMapFunction} for each element of the DataSet.
     * Each FlatMapFunction call can return any number of elements including none.
     *
     * @param flatMapper The FlatMapFunction that is called for each element of the DataSet.
     * @return A FlatMapOperator that represents the transformed DataSet.
     *
     * @see org.apache.flink.api.common.functions.RichFlatMapFunction
     * @see FlatMapOperator
     * @see DataSet
     */
    public  FlatMapOperator flatMap(FlatMapFunction flatMapper) {
        if (flatMapper == null) {
            throw new NullPointerException("FlatMap function must not be null.");
        }

        String callLocation = Utils.getCallLocationName();
        TypeInformation resultType = TypeExtractor.getFlatMapReturnTypes(flatMapper, getType(), callLocation, true);
        return new FlatMapOperator<>(this, resultType, clean(flatMapper), callLocation);
    }

　　>>调用groupby DataSet.java

    /**
     * Groups a {@link Tuple} {@link DataSet} using field position keys.
     *
     * Note: Field position keys only be specified for Tuple DataSets.
     *
     * 
The field position keys specify the fields of Tuples on which the DataSet is grouped.
     * This method returns an {@link UnsortedGrouping} on which one of the following grouping transformation
     *   can be applied.
     * 

     *   {@link UnsortedGrouping#sortGroup(int, org.apache.flink.api.common.operators.Order)} to get a {@link SortedGrouping}.
     *   
{@link UnsortedGrouping#aggregate(Aggregations, int)} to apply an Aggregate transformation.
     *   
{@link UnsortedGrouping#reduce(org.apache.flink.api.common.functions.ReduceFunction)} to apply a Reduce transformation.
     *   
{@link UnsortedGrouping#reduceGroup(org.apache.flink.api.common.functions.GroupReduceFunction)} to apply a GroupReduce transformation.
     * 
     *
     * @param fields One or more field positions on which the DataSet will be grouped.
     * @return A Grouping on which a transformation needs to be applied to obtain a transformed DataSet.
     *
     * @see Tuple
     * @see UnsortedGrouping
     * @see AggregateOperator
     * @see ReduceOperator
     * @see org.apache.flink.api.java.operators.GroupReduceOperator
     * @see DataSet
     */
    public UnsortedGrouping groupBy(int... fields) {
        return new UnsortedGrouping<>(this, new Keys.ExpressionKeys<>(fields, getType()));
    }

　　>>调用sum UnsortedGrouping.java

    /**
     * Syntactic sugar for aggregate (SUM, field).
     * @param field The index of the Tuple field on which the aggregation function is applied.
     * @return An AggregateOperator that represents the summed DataSet.
     *
     * @see org.apache.flink.api.java.operators.AggregateOperator
     */
    public AggregateOperator sum (int field) {
        return this.aggregate (Aggregations.SUM, field, Utils.getCallLocationName());
    }
    // private helper that allows to set a different call location name
    private AggregateOperator aggregate(Aggregations agg, int field, String callLocationName) {
        return new AggregateOperator(this, agg, field, callLocationName);
    }

UnsortedGrouping和DataSet的关系

　　UnsortedGrouping使用AggregateOperator做聚合

　　第四步：对转换的输入值进行处理

            // emit result
            if (params.has("output")) {
                counts.writeAsCsv(params.get("output"), "\n", " ");
                // execute program
                env.execute("WordCount Example");
            } else {
                System.out.println("Printing result to stdout. Use --output to specify output path.");
                counts.print();
            }

　　如果不指定output参数，则打印到控制台

    /**
     * Prints the elements in a DataSet to the standard output stream {@link System#out} of the JVM that calls
     * the print() method. For programs that are executed in a cluster, this method needs
     * to gather the contents of the DataSet back to the client, to print it there.
     *
     * The string written for each element is defined by the {@link Object#toString()} method.
     *
     * 
This method immediately triggers the program execution, similar to the
     * {@link #collect()} and {@link #count()} methods.
     *
     * @see #printToErr()
     * @see #printOnTaskManager(String)
     */
    public void print() throws Exception {
        List elements = collect();
        for (T e: elements) {
            System.out.println(e);
        }
    }

　　若指定输出，则先进行输入转换为csv文件的DataSink，它是用来存储数据结果的

/**
 * An operation that allows storing data results.
 * @param 
 */

过程如下：

    /**
     * Writes a {@link Tuple} DataSet as CSV file(s) to the specified location with the specified field and line delimiters.
     *
     * Note: Only a Tuple DataSet can written as a CSV file.
      * For each Tuple field the result of {@link Object#toString()} is written.
     *
     * @param filePath The path pointing to the location the CSV file is written to.
     * @param rowDelimiter The row delimiter to separate Tuples.
     * @param fieldDelimiter The field delimiter to separate Tuple fields.
     * @param writeMode The behavior regarding existing files. Options are NO_OVERWRITE and OVERWRITE.
     *
     * @see Tuple
     * @see CsvOutputFormat
     * @see DataSet#writeAsText(String) Output files and directories
     */
    public DataSink writeAsCsv(String filePath, String rowDelimiter, String fieldDelimiter, WriteMode writeMode) {
        return internalWriteAsCsv(new Path(filePath), rowDelimiter, fieldDelimiter, writeMode);
    }

    @SuppressWarnings("unchecked")
    private extends Tuple> DataSink internalWriteAsCsv(Path filePath, String rowDelimiter, String fieldDelimiter, WriteMode wm) {
        Preconditions.checkArgument(getType().isTupleType(), "The writeAsCsv() method can only be used on data sets of tuples.");
        CsvOutputFormat of = new CsvOutputFormat<>(filePath, rowDelimiter, fieldDelimiter);
        if (wm != null) {
            of.setWriteMode(wm);
        }
        return output((OutputFormat) of);
    }
    /**
     * Emits a DataSet using an {@link OutputFormat}. This method adds a data sink to the program.
     * Programs may have multiple data sinks. A DataSet may also have multiple consumers (data sinks
     * or transformations) at the same time.
     *
     * @param outputFormat The OutputFormat to process the DataSet.
     * @return The DataSink that processes the DataSet.
     *
     * @see OutputFormat
     * @see DataSink
     */
    public DataSink output(OutputFormat outputFormat) {
        Preconditions.checkNotNull(outputFormat);

        // configure the type if needed
        if (outputFormat instanceof InputTypeConfigurable) {
            ((InputTypeConfigurable) outputFormat).setInputType(getType(), context.getConfig());
        }

        DataSink sink = new DataSink<>(this, outputFormat, getType());
        this.context.registerDataSink(sink);
        return sink;
    }

　　最后执行job

    @Override
    public JobExecutionResult execute(String jobName) throws Exception {
        if (executor == null) {
            startNewSession();
        }

        Plan p = createProgramPlan(jobName);

        // Session management is disabled, revert this commit to enable
        //p.setJobId(jobID);
        //p.setSessionTimeout(sessionTimeout);

        JobExecutionResult result = executor.executePlan(p);

        this.lastJobExecutionResult = result;
        return result;
    }

这一阶段是内容比较多，放到下一篇讲解吧

总结

　　Apache Flink 功能强大，支持开发和运行多种不同种类的应用程序。它的主要特性包括：批流一体化、精密的状态管理、事件时间支持以及精确一次的状态一致性保障等。Flink 不仅可以运行在包括 YARN、 Mesos、Kubernetes 在内的多种资源管理框架上，还支持在裸机集群上独立部署。在启用高可用选项的情况下，它不存在单点失效问题。事实证明，Flink 已经可以扩展到数千核心，其状态可以达到 TB 级别，且仍能保持高吞吐、低延迟的特性。世界各地有很多要求严苛的流处理应用都运行在 Flink 之上。

　　其应用场景如下：　　

1、事件驱动型应用
　　典型的事件驱动型应用实例：
　　反欺诈
　　异常检测
　　基于规则的报警
　　业务流程监控
　（社交网络）Web 应用
2、数据分析应用
　　典型的数据分析应用实例
　　电信网络质量监控
　　移动应用中的产品更新及实验评估分析
　　消费者技术中的实时数据即席分析
　　大规模图分析
3、数据管道应用
　　典型的数据管道应用实例
　　电子商务中的实时查询索引构建
　　电子商务中的持续 ETL

参考资料

【1】https://flink.apache.org/

【2】https://blog.csdn.net/yangyin007/article/details/82382734

【3】https://flink.apache.org/zh/usecases.html

转载于:https://www.cnblogs.com/davidwang456/p/10948698.html

React Native svygh123 问题解决过程编程 js react native react.js javascript
ReactNative是一个用于构建原生移动应用的框架，它使用JavaScript和React（一个用于构建用户界面的JavaScript库）来开发iOS和Android平台的应用程序。ReactNative由Facebook开发并维护，并且是开源的。特点跨平台开发：ReactNative允许开发者使用相同的代码库为多个平台（如iOS和Android）编写应用，极大地提高了开发效率。热重载：开发者
Java函数式接口四部曲之Consumer sundawei2016 java 前端开发语言
Consumer是一个函数式接口，位于java.util.function包中。它表示一个接受单个输入参数并且不返回任何结果的操作。Consumer通常用于需要对输入参数执行某些操作但不产生返回值的场景。Consumer接口定义了一个抽象方法：accept(Tt)：接受一个类型为T的参数，并对其执行操作。Consumerdisplay=System.out::println;display.acc
（十六）Java-File Kyrie_Li Java体系 java 开发语言
File类是Java中最基础的文件处理类，它用于表示文件和目录（文件路径）。File类不能直接进行读写操作，它仅用于描述文件或目录的元数据，比如文件名、路径、大小等。一、File类的构造方法1.通过提供文件的路径字符串来创建一个File对象。路径可以是绝对路径也可以是相对路径。Filefile=newFile("D:\\test\\555.txt");2.通过父目录路径和子文件/目录路径来创建Fi
（六）Java-BigDecimal Kyrie_Li Java体系 java 开发语言
一、概述BigDecimal类用于高精度计算，特别适用于需要进行精确浮点数运算的场合，例如货币计算、金融应用或科学计算。二、优势由于double和float类型是浮点数类型，它们在表示一些十进制数时会出现精度丢失问题，而BigDecimal则可以避免这些问题，提供任意精度的数值表示。三、特点1.任意精度：BigDecimal的精度仅受限于计算机的内存，而不像float和double有固定的精度限制
PIPCA个人信息保护合规审计师认证介绍！熙丫 13381482386 大数据
个人信息保护合规审计师"（PersonalInformationProtectionComplianceAuditor-CCRC）是中国网络安全审查认证中心与市场监管大数据中心为深入贯彻实施《个人信息保护法》，推动个人信息处理者切实履行合规审计职责，针对企事业单位及第三方机构中从事个人信息保护合规审计（简称“个保审计”）的专业人员，依据《个人信息保护法》、《网络安全从业人员能力基本要求》
MyBatis Plus 在 Java 项目中的高效使用随风九天匠心数据库 java spring java mybatis MyBatis Plus
1.前言1.1MyBatisPlus简介MyBatisPlus是一个MyBatis的增强工具，旨在简化开发人员在数据库操作上的工作量。它提供了丰富的功能，如自动化的CRUD操作、条件构造器、分页查询等，极大地提高了开发效率。1.2为什么选择MyBatisPlus简化代码：自动生成基础的CRUD方法，减少重复代码。提高效率：内置多种插件和工具，提升开发速度。易于维护：代码结构清晰，便于后续维护和扩展
java--数据校验Validator 郑*杰 java 开发语言 spring
一、基于注解进行数据校验1、配置依赖java--常用依赖配置_郑*杰的博客-CSDN博客2、创建一个配置类packagecom.ruqi.aditainoal;importorg.springframework.context.annotation.Bean;importorg.springframework.context.annotation.ComponentScan;importorg.s
Apache Doris 实现毫秒级查询响应随风九天匠心数据库服务 java apache Apache Doris
1.引言1.1数据分析的重要性随着大数据时代的到来，企业对实时数据分析的需求日益增长。快速、准确地获取数据洞察成为企业在竞争中脱颖而出的关键。传统的数据库系统在处理大规模数据时往往面临性能瓶颈，难以满足实时分析的需求。例如，一个电商公司需要实时监控销售数据以调整库存和营销策略，而传统的数据库可能需要数分钟甚至数小时才能生成报表，这显然无法满足业务需求。1.2ApacheDoris简介ApacheD
Apipost一站式API工具评测：整合Postman+Swagger+JMeter三大功能，打造全流程开发解决方案
作为一名Java开发者，始终追求开发过程的高效性。使用IntelliJIDEA编写代码只是开始。一般来说，代码完成后，我们会切换到Postman进行API调试。在确保API表现符合预期后，我们会使用Swagger为前端团队生成文档。最后，再使用JMeter进行性能和负载测试，以确保API工作流顺畅且自动化。Apipost=Postman+Swagger+JMeter然而，这种多工具的方法存在诸多挑
VScode使用小技巧前端CV攻城狮 vscode javascript 前端
代码片段快捷键设置设置位置：文件—首选项—用户代码片段----搜javaScript,进入JavaScript.json,自定义快捷键（setting→ConfigureUserSnippets→JavaScripts.json）例：log快速输入console.log()"Printtoconsole":{"prefix":"log","body":["console.log('$1')"],"
Java基于redis实现进度条冰糖码奇朵 java redis
一.问题背景为了提升用户体验，开发中有很多场景需要用到进度条，比如导入、导出、大规模更新操作等。进度条在许多大型系统中使用频率较高，反复编写既麻烦又不利于维护，因此基于Redis抽成公共方法供不同功能调用。二.实现方案1.引入依赖如果系统已集成Redis，直接跳到第5步，进度条实现。org.springframework.bootspring-boot-starter-data-redis2.配置
数据监控工具Mixpanel的简易使用教程 alankuo 大数据
Mixpanel的使用教程如下：注册与准备创建账号：访问Mixpanel官方网站，按照提示填写相关信息创建账号。登录后，在项目设置中可以获取项目密钥。了解基本概念：明确事件、用户属性等基本概念。事件是用户在应用中的操作，如点击按钮、完成注册等；用户属性是描述用户特征的信息，像年龄、城市、会员等级等。集成SDKWeb应用：在HTML文件中引入MixpanelJavaScriptSDK。在页面的标签内
如何实现集群中的session共享存储？思维导图代码示例（java 架构) 用心去追梦 java 架构开发语言
集群中Session共享存储的实现在分布式系统或集群环境中，确保用户会话（Session）能够在所有节点之间共享是一个关键问题。为了实现这一点，可以采用多种策略和技术。以下是关于如何在Java架构中实现集群中的Session共享存储的主要方面：1.使用集中式存储服务Memcached：轻量级、高性能的内存缓存系统，适用于存储短期的session数据。Redis：功能更强大的键值存储数据库，不仅支持
Java常用集合与映射的线程安全问题深度解析 yang789022 编程学习 java 安全 python
Java常用集合与映射的线程安全问题深度解析一、线程安全基础认知二、典型非线程安全集合问题分析1.ArrayList的并发陷阱2.HashMap的并发灾难3.HashSet的隐藏风险三、线程安全解决方案对比1.同步包装方案2.传统线程安全集合3.现代并发容器（java.util.concurrent包）3.1CopyOnWriteArrayList3.2ConcurrentHashMap3.3Co
java 连接oracle 字符集_Java连接Oracle数据库，编码格式转换东京客 java 连接oracle 字符集
学习东西不忘记下笔记：dbhelper类，各种数据库都合适。publicclassDBHelper{//mysql数据库//publicstaticfinalStringurl="jdbc:mysql://127.0.0.1:3306/test";//publicstaticfinalStringname="com.mysql.jdbc.Driver";//publicstaticfinalStr
java 读取resource文件夹文件_Java 获取Resource目录下的文件解决办法鬼斧神工119 java 读取resource文件夹文件
该楼层疑似违规已被系统折叠隐藏此楼查看此楼Java获取Resource目录下的文件有两种方式：Java代码中的类，要获取Resource资源文件目录下文件绝对路径寻址注意这个/址的是根目录，用绝对路径，可能会出现的问题是，你的程序在windows上可以用，但是在linux不能用，原因在于，你这根目录在windows环境址你的src目录放到linux环境，就可能执行你linux的根目录了，会导致fi
jvm堆外内存(直接内存) 不坠青云之志 Java Jvm direct memory
堆外内存(直接内存)堆外内存，又被称为直接内存。这部分内存不是由jvm管理和回收的。需要我们手动的回收。堆内内存是属于jvm的，由jvm进行分配和管理，属于"用户态"，而推外内存是由操作系统管理的，属于"内核态"在jdk1.4中新加入了NIO类，他可以调用native函数库直接分配堆外内存，然后通过java堆中的DirectByteBuffer对象来指向这块内存，进行内存分配等工作。可以这样申请堆
JVM内存深度解析：堆内与堆外内存的监控与诊断猿泰山 Java核心技术 jvm
JVM内存深度解析：堆内与堆外内存的监控与诊断一、引言在Java应用中，JVM（JavaVirtualMachine）的内存管理至关重要。其中，堆内内存和堆外内存是两个核心概念。堆内内存主要存储Java对象实例，而堆外内存则与Java的NIO（NewI/O）库密切相关，主要用于存储不受Java堆大小限制的直接缓冲区。本文将深入探讨如何监控和诊断这两种类型的内存使用。二、堆内内存监控与诊断JVM参数
【从零开始学java】第1章，基础知识入门，小白零基础可看，笔记整理莉莉鸟 java 学习
java基础11注释标志符关键字注释注释并不会被执行，是写给人类看的，书写注释是一个很好的习惯平时写代码一定要注意规范单行注释//多行注释/*注释*/文档注释/**注释*/2标识符关键字abstract：用于声明抽象类或抽象方法。assert：用于调试时进行断言。boolean：表示布尔类型（true或false）。break：跳出当前循环或switch语句。byte：表示字节数据类型。case：
查看 jvm 堆外内存大小 Horizon_Zy JVM相关 java 开发语言后端
java.nio.Bits#reservedMemor该值为堆外内存占用大小。可以通过arthasattach后用ognl进行输出。ognl@java.nio.Bits@reservedMemory.value
最新网络安全-跨站脚本攻击(XSS)的原理、攻击及防御_xsstrike原理 2401_84239830 程序员 web安全 xss 安全
XSS的类型反射型XSS/不持久型XSS存储型XSS/持久型XSS基于DOM的XSS常用Payload与工具XSS扫描工具Payloadsscript标签类结合js的html标签伪协议绕过危害防御简介跨站脚本攻击(全称CrossSiteScripting,为和CSS（层叠样式表）区分，简称为XSS)是指恶意攻击者在Web页面中插入恶意javascript代码（也可能包含html代码），当用户浏览网
ClickHouse Keeper 源码解析阿里云云栖号云栖号技术分享 java 开发语言后端
简介：ClickHouse社区在21.8版本中引入了ClickHouseKeeper。ClickHouseKeeper是完全兼容Zookeeper协议的分布式协调服务。本文对开源版本ClickHousev21.8.10.19-lts源码进行了解析。作者简介：范振（花名辰繁），阿里云开源大数据-OLAP方向负责人。内容框架背景架构图核心流程图梳理内部代码流程梳理Nuraft关键配置排坑结论关于我们R
如何使用Java和ElasticSearch实现全文搜索微赚淘客系统开发者@聚娃科技 java elasticsearch 开发语言
如何使用Java和ElasticSearch实现全文搜索大家好，我是微赚淘客系统3.0的小编，是个冬天不穿秋裤，天冷也要风度的程序猿！今天我们来探讨如何使用Java和ElasticSearch实现全文搜索。ElasticSearch是一个分布式搜索和分析引擎，能够处理大规模数据并提供实时搜索功能。在本文中，我们将介绍如何使用Java客户端与ElasticSearch进行交互，实现简单的全文搜索功能
基于大数据架构的就业岗位推荐系统的设计与实现【java或python】—计算机毕业设计源码+LW文档 qq_375279829 大数据架构 python 课程设计算法
摘要随着互联网技术的迅猛发展和大数据时代的到来，就业市场日益复杂多变，求职者与招聘方之间的信息不对称问题愈发突出。为解决这一难题，本文设计并实现了一个基于大数据架构的就业岗位推荐系统。该系统通过收集、整合并分析大量求职者简历信息、企业招聘信息以及市场动态数据，运用先进的机器学习算法，为求职者提供个性化的岗位推荐服务，同时帮助企业快速定位到合适的候选人。本文将从系统设计的背景与意义、技术基础、需求分
Java 基础核心总结仅此而已丶 Java基础教程系列开发语言 java
目录前言介绍1、基本语法2、面向对象编程3、异常处理4、集合框架5、IO流6、多线程专栏地址前言Java是一种广泛使用的程序设计语言，具有跨平台、面向对象、安全性高、灵活性强等特点，广泛应用于企业级应用程序和移动应用程序等领域。在学习Java语言时，需要掌握一些基础核心知识，本文将为您总结Java基础核心知识点，以便于您的学习和参考。介绍Java基础核心知识点包括基本语法、面向对象编程、异常处理、
java 金额转中文大写两眼墨黑 java python 开发语言
publicclassNumberChinese{publicstaticStringnumberChinese(Stringstr){BigDecimalnum=newBigDecimal(str);StringstrOutput;StringstrUnit="仟佰拾亿仟佰拾万仟佰拾元角分";StringstrNum="零壹贰叁肆伍陆柒捌玖";num=num.setScale(2,Roundin
java基础知识点详解一：Java概述及三种技术架构我是老实人辶 java 程序员架构
Java语言是一门随时代快速发展的计算机语言程序，其深刻展示了程序编写的精髓，加上其简明严谨的结构及简洁的语法编写为其将来的发展及维护提供了保障。由于提供了网络应用的支持和多媒体的存取，会推动Internet和企业网络的Web的应用java概述：1991年Sun公司的JamesGosling等人开始开发名称为Oak的语言，希望用于控制嵌入在有线电视交换盒、PDA等的微处理器；1994年将Oak语言
Java 的三种技术架构 hhappy0123456789 jvm java 开发语言
JAVAEE：JavaPlatformEnterpriseEdition，开发企业环境下的应用程序，主要针对web程序开发;JAVASE：JavaPlatformStandardEdition，完成桌面应用程序的开发，是其它两者的基础;JAVAME：JavaPlatformMicroEdition，开发电子消费产品和嵌入式设备，如手机中的程序;1，JDK：JavaDevelopmentKit，ja
Java多线程编程实战：synchronized与Lock锁对比微风灬浮尘 java java java入门 java多线程
一、锁机制全景图：从内核态到用户态1.Java锁分类与演进史锁机制悲观锁乐观锁synchronizedReentrantLockCAS版本号机制2.锁升级全流程（synchronized底层原理）无锁→偏向锁（单线程）→轻量级锁（CAS自旋）→重量级锁（OS互斥量）锁膨胀条件：偏向锁：-XX:BiasedLockingStartupDelay=0（默认延迟4秒）重量级锁：自旋超过阈值（-XX:Pr
Java的三种技术架构: Dagssb java 架构 jvm
JAVAEE：JavaPlatformEnterpriseEdition，开发企业环境下的应用程序，主要针对web程序开发；JAVASE：JavaPlatformStandardEdition，完成桌面应用程序的开发，是其它两者的基础；JAVAME：JavaPlatformMicroEdition，开发电子消费产品和嵌入式设备，如手机中的程序；1，JDK：JavaDevelopmentKit，ja
ASM系列四利用Method 组件动态注入方法逻辑 lijingyao8206 字节码技术 jvm AOP 动态代理 ASM
这篇继续结合例子来深入了解下Method组件动态变更方法字节码的实现。通过前面一篇，知道ClassVisitor 的visitMethod()方法可以返回一个MethodVisitor的实例。那么我们也基本可以知道，同ClassVisitor改变类成员一样，MethodVIsistor如果需要改变方法成员，注入逻辑，也可以
java编程思想 --内部类百合不是茶 java 内部类匿名内部类
内部类;了解外部类并能与之通信内部类写出来的代码更加整洁与优雅 1,内部类的创建内部类是创建在类中的 package com.wj.InsideClass; /* * 内部类的创建 */ public class CreateInsideClass { public CreateInsideClass(
web.xml报错 crabdave web.xml
web.xml报错 The content of element type "web-app" must match "(icon?,display- name?,description?,distributable?,context-param*,filter*,filter-mapping*,listener*,servlet*,s
泛型类的自定义麦田的设计者 java android 泛型
为什么要定义泛型类，当类中要操作的引用数据类型不确定的时候。采用泛型类，完成扩展。例如有一个学生类 Student{ Student(){ System.out.println("I'm a student....."); } } 有一个老师类
CSS清除浮动的4中方法 IT独行者 JavaScript UI css
清除浮动这个问题，做前端的应该再熟悉不过了，咱是个新人，所以还是记个笔记，做个积累，努力学习向大神靠近。CSS清除浮动的方法网上一搜，大概有N多种，用过几种，说下个人感受。 1、结尾处加空div标签 clear:both 1 2 3 4 .div 1 { background : #000080 ; border : 1px s
Cygwin使用windows的jdk 配置方法 _wy_ jdk windows cygwin
1.[vim /etc/profile] JAVA_HOME="/cgydrive/d/Java/jdk1.6.0_43" (windows下jdk路径为D:\Java\jdk1.6.0_43) PATH="$JAVA_HOME/bin:${PATH}" CLAS
linux下安装maven 无量 maven linux 安装
Linux下安装maven(转) 1.首先到Maven官网下载安装文件，目前最新版本为3.0.3，下载文件为 apache-maven-3.0.3-bin.tar.gz，下载可以使用wget命令； 2.进入下载文件夹，找到下载的文件，运行如下命令解压 tar -xvf apache-maven-2.2.1-bin.tar.gz 解压后的文件夹
tomcat的https 配置,syslog-ng配置 aichenglong tomcat http跳转到https syslong-ng配置 syslog配置
1) tomcat配置https,以及http自动跳转到https的配置 1)TOMCAT_HOME目录下生成密钥(keytool是jdk中的命令) keytool -genkey -alias tomcat -keyalg RSA -keypass changeit -storepass changeit
关于领号活动总结 alafqq 活动
关于某彩票活动的总结具体需求，每个用户进活动页面，领取一个号码，1000中的一个；活动要求 1，随机性，一定要有随机性； 2，最少中奖概率，如果注数为3200注，则最多中4注 3，效率问题，（不能每个人来都产生一个随机数，这样效率不高）； 4，支持断电（仍然从下一个开始），重启服务；（存数据库有点大材小用，因此不能存放在数据库）解决方案 1，事先产生随机数1000个，并打
java数据结构冒泡排序的遍历与排序百合不是茶 java
java的冒泡排序是一种简单的排序规则冒泡排序的原理：比较两个相邻的数，首先将最大的排在第一个，第二次比较第二个，此后一样；针对所有的元素重复以上的步骤，除了最后一个例题；将int array[]
JS检查输入框输入的是否是数字的一种校验方法 bijian1013 js
如下是JS检查输入框输入的是否是数字的一种校验方法： <form method=post target="_blank"> 数字：<input type="text" name=num onkeypress="checkNum(this.form)"><br> </form>
Test注解的两个属性：expected和timeout bijian1013 java JUnit expected timeout
JUnit4：Test文档中的解释：　　The Test annotation supports two optional parameters. 　　The first, expected, declares that a test method should throw an exception. 　　If it doesn't throw an exception or if it
[Gson二]继承关系的POJO的反序列化 bit1129 POJO
父类 package inheritance.test2; import java.util.Map; public class Model { private String field1; private String field2; private Map<String, String> infoMap
【Spark八十四】Spark零碎知识点记录 bit1129 spark
1. ShuffleMapTask的shuffle数据在什么地方记录到MapOutputTracker中的 ShuffleMapTask的runTask方法负责写数据到shuffle map文件中。当任务执行完成成功，DAGScheduler会收到通知，在DAGScheduler的handleTaskCompletion方法中完成记录到MapOutputTracker中
WAS各种脚本作用大全 ronin47 WAS 脚本
　　　http://www.ibm.com/developerworks/cn/websphere/library/samples/SampleScripts.html 　　　无意中，在WAS官网上发现的各种脚本作用，感觉很有作用，先与各位分享一下　　　获取下载这些示例 jacl 和 Jython 脚本可用于在 WebSphere Application Server 的不同版本中自
java-12.求 1+2+3+..n不能使用乘除法、 for 、 while 、 if 、 else 、 switch 、 case 等关键字以及条件判断语句 bylijinnan switch
借鉴网上的思路，用java实现： public class NoIfWhile { /** * @param args * * find x=1+2+3+....n */ public static void main(String[] args) { int n=10; int re=find(n); System.o
Netty源码学习-ObjectEncoder和ObjectDecoder bylijinnan java netty
Netty中传递对象的思路很直观： Netty中数据的传递是基于ChannelBuffer（也就是byte[]）；那把对象序列化为字节流，就可以在Netty中传递对象了相应的从ChannelBuffer恢复对象，就是反序列化的过程 Netty已经封装好ObjectEncoder和ObjectDecoder 先看ObjectEncoder ObjectEncoder是往外发送
spring 定时任务中cronExpression表达式含义 chicony cronExpression
一个cron表达式有6个必选的元素和一个可选的元素，各个元素之间是以空格分隔的，从左至右，这些元素的含义如下表所示：代表含义是否必须允许的取值范围 &nb
Nutz配置Jndi ctrain JNDI
1、使用JNDI获取指定资源： var ioc = { dao : { type :"org.nutz.dao.impl.NutDao", args : [ {jndi :"jdbc/dataSource"} ] } } 以上方法,仅需要在容器中配置好数据源,注入到NutDao即可.
解决 /bin/sh^M: bad interpreter: No such file or directory daizj shell
在Linux中执行.sh脚本，异常/bin/sh^M: bad interpreter: No such file or directory。分析：这是不同系统编码格式引起的：在windows系统中编辑的.sh文件可能有不可见字符，所以在Linux系统下执行会报以上异常信息。解决： 1）在windows下转换：利用一些编辑器如UltraEdit或EditPlus等工具
[转]for 循环为何可恨？ dcj3sjt126com 程序员读书
Java的闭包(Closure)特征最近成为了一个热门话题。一些精英正在起草一份议案，要在Java将来的版本中加入闭包特征。然而，提议中的闭包语法以及语言上的这种扩充受到了众多Java程序员的猛烈抨击。不久前，出版过数十本编程书籍的大作家Elliotte Rusty Harold发表了对Java中闭包的价值的质疑。尤其是他问道“for 循环为何可恨？”[http://ju
Android实用小技巧 dcj3sjt126com android
1、去掉所有Activity界面的标题栏　　修改AndroidManifest.xml 　　在application 标签中添加android:theme="@android:style/Theme.NoTitleBar" 2、去掉所有Activity界面的TitleBar 和StatusBar 　　修改AndroidManifes
Oracle 复习笔记之序列 eksliang Oracle 序列 sequence Oracle sequence
转载请出自出处：http://eksliang.iteye.com/blog/2098859 1.序列的作用序列是用于生成唯一、连续序号的对象一般用序列来充当数据库表的主键值 2.创建序列语法如下： create sequence s_emp start with 1 --开始值 increment by 1 --増长值 maxval
有“品”的程序员 gongmeitao 工作
完美程序员的10种品质　　完美程序员的每种品质都有一个范围，这个范围取决于具体的问题和背景。没有能解决所有问题的完美程序员（至少在我们这个星球上），并且对于特定问题，完美程序员应该具有以下品质：　　1. 才智非凡- 能够理解问题、能够用清晰可读的代码翻译并表达想法、善于分析并且逻辑思维能力强（范围：用简单方式解决复杂问题）　　
使用KeleyiSQLHelper类进行分页查询 hvt sql .net C#asp.net hovertree
本文适用于sql server单主键表或者视图进行分页查询，支持多字段排序。KeleyiSQLHelper类的最新代码请到http://hovertree.codeplex.com/SourceControl/latest下载整个解决方案源代码查看。或者直接在线查看类的代码：http://hovertree.codeplex.com/SourceControl/latest#HoverTree.D
SVG 教程（三）圆形，椭圆，直线天梯梦 svg
SVG <circle> SVG 圆形 - <circle> <circle> 标签可用来创建一个圆：下面是SVG代码： <svg xmlns="http://www.w3.org/2000/svg" version="1.1"> <circle cx="100" c
链表栈 luyulong java 数据结构
public class Node { private Object object; private Node next; public Node() { this.next = null; this.object = null; } public Object getObject() { return object; } public
基础数据结构和算法十：2-3 search tree sunwinner Algorithm 2-3 search tree
Binary search tree works well for a wide variety of applications, but they have poor worst-case performance. Now we introduce a type of binary search tree where costs are guaranteed to be loga
spring配置定时任务 stunizhengjia spring timer
最近因工作的需要，用到了spring的定时任务的功能,觉得spring还是很智能化的,只需要配置一下配置文件就可以了,在此记录一下，以便以后用到： //------------------------定时任务调用的方法------------------------------ /** * 存储过程定时器 */ publi
ITeye 8月技术图书有奖试读获奖名单公布 ITeye管理员活动
ITeye携手博文视点举办的8月技术图书有奖试读活动已圆满结束，非常感谢广大用户对本次活动的关注与参与。 8月试读活动回顾： http://webmaster.iteye.com/blog/2102830 本次技术图书试读活动的优秀奖获奖名单及相应作品如下（优秀文章有很多，但名额有限，没获奖并不代表不优秀）：《跨终端Web》 gleams：http

从flink-example分析flink组件(1)WordCount batch实战及源码分析

你可能感兴趣的:(大数据,java)