Jbd2:Hadoop

  • 1. History
  • 2. Key Features
  • 3. Selected Components
    • 3.1 HDFS
    • 3.2 HBase
    • 3.3 Sqoop
    • 3.4 ZooKeeper
  • 4. Hands-On Practice
    • 4.1 Creating a Hadoop User
    • 4.2 Installing Java
      • 4.2.1 Installing the JDK
      • 4.2.2 Updating Environment Variables
      • 4.2.3 Configuring SSH Login Permissions
    • 4.3 Standalone Hadoop
      • 4.3.1 Installing Hadoop
      • 4.3.2 Updating System Environment Variables
      • 4.3.3 Editing hadoop-env.sh
    • 4.4 Pseudo-Distributed Hadoop
      • 4.4.1 Editing core-site.xml
      • 4.4.2 Editing hdfs-site.xml
      • 4.4.3 Editing mapred-site.xml
      • 4.4.4 Editing yarn-site.xml
      • 4.4.5 Formatting the Distributed File System
      • 4.4.6 Starting Hadoop
      • 4.4.7 Checking Hadoop Processes
      • 4.4.8 The Hadoop WebUI
      • 4.4.9 Testing the HDFS Cluster and a MapReduce Job
      • 4.4.10 Shutting Down Hadoop

1. History

(Figure 1: Hadoop's development history.)

2. Key Features

  • High reliability:
    Data is stored redundantly, so even if one replica fails, the remaining replicas keep the service available. Hadoop's bit-level storage and processing has earned a reputation for dependability.
  • High efficiency:
    As a parallel distributed computing platform, Hadoop combines its two core technologies, distributed storage and distributed processing, to handle petabyte-scale data efficiently. It can move data dynamically between nodes and keep them load-balanced, which makes processing very fast.
  • High scalability:
    Hadoop is designed to run efficiently and stably on clusters of inexpensive machines and can scale out to thousands of compute nodes.
  • High fault tolerance:
    Redundant storage automatically keeps multiple replicas of the data, and failed tasks are automatically reassigned.
  • Low cost:
    Hadoop runs on cheap commodity clusters, so even ordinary users can easily set up a Hadoop environment on their own PCs. Compared with appliances, commercial data warehouses, and data-mart products such as QlikView or Yonghong Z-Suite, Hadoop is open source, which greatly reduces the project's software cost.
  • Runs on Linux:
    Hadoop is developed in Java and runs well on the Linux platform.
  • Supports multiple programming languages:
    Applications on Hadoop can also be written in other languages, such as C++.

3. Selected Components

(Figure 2: components of the Hadoop ecosystem.)

3.1 HDFS

The Hadoop Distributed File System (HDFS) is an open-source implementation of the Google File System (GFS).

  • Its strengths include handling very large datasets, streaming data access, and the ability to run on inexpensive commodity servers.

  • It offers high-throughput access to application data, making it well suited to applications with very large datasets.

  • It relaxes some POSIX (Portable Operating System Interface) requirements so that data in the file system can be accessed as a stream (a few everyday shell commands are sketched below).
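To make that concrete, here is a minimal sketch of everyday HDFS shell usage (the /demo directory and file name are invented for the example):

hadoop fs -mkdir -p /demo              # create a directory in HDFS
hadoop fs -put ./local-file.txt /demo  # upload a local file
hadoop fs -ls /demo                    # list the directory
hadoop fs -cat /demo/local-file.txt    # stream the file contents back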

3.2 HBase

HBase is a distributed, column-oriented database offering high reliability, high performance, scalability, and real-time reads and writes; it typically uses HDFS as its underlying storage.

  • HBase is well suited to storing unstructured data.

  • Its storage model is column-based rather than row-based.

  • HBase tables are sparse, and users can define columns of various types for each row.

  • HBase is mainly used for big data that needs random access and real-time reads and writes (a short shell session is sketched below).
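To give a feel for the model, here is a minimal HBase shell sketch (the table name test and column family cf are invented for the example):

create 'test', 'cf'                    # table with one column family
put 'test', 'row1', 'cf:a', 'value1'   # write one cell
get 'test', 'row1'                     # random-access read of one row
scan 'test'                            # scan the whole table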

3.3 Sqoop

Sqoop improves data interoperability; it is mainly used to exchange data between Hadoop and relational databases.

  • Sqoop interacts with relational databases primarily through JDBC (Java DataBase Connectivity).

  • In principle, any relational database that supports JDBC can exchange data with Hadoop via Sqoop.

  • Sqoop is designed specifically for large datasets and supports incremental updates (an import command is sketched below).
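As an illustration, a typical Sqoop import from MySQL into HDFS might look like the following sketch (the host, database, credentials, and table name are all placeholders):

sqoop import \
  --connect jdbc:mysql://localhost:3306/mydb \
  --username dbuser -P \
  --table orders \
  --target-dir /user/master/orders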

3.4 ZooKeeper

ZooKeeper is an open-source coordination service designed for distributed applications.

  • It mainly provides synchronization, configuration management, grouping, and naming services.

  • ZooKeeper's file system uses the familiar directory-tree structure (a short client session is sketched below).

  • ZooKeeper is written mainly in Java, and C is also supported.
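To illustrate the tree-structured namespace, here is a minimal session sketch in the bundled zkCli.sh client (the znode path /app1 and its data are invented):

create /app1 "config-v1"   # create a znode holding some data
ls /                       # list children of the root
get /app1                  # read the data back
set /app1 "config-v2"      # update it
delete /app1               # remove the znode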

4. Hands-On Practice

4.1 Creating a Hadoop User

Separating accounts gives different users clearly distinct permissions, and it also keeps Hadoop configuration work from affecting other users.

ubuntu@VM-0-12-ubuntu:~$ sudo adduser master
Adding user `master' ...
Adding new group `master' (1001) ...
Adding new user `master' (1001) with group `master' ...
Creating home directory `/home/master' ...
Copying files from `/etc/skel' ...
New password: 
Retype new password: 
passwd: password updated successfully
Changing the user information for master
Enter the new value, or press ENTER for the default
	Full Name []: 
	Room Number []: 
	Work Phone []: 
	Home Phone []: 
	Other []: 
Is the information correct? [Y/n] y
ubuntu@VM-0-12-ubuntu:~$ su master
Password: 
master@VM-0-12-ubuntu:/home/ubuntu$ 

4.2 Installing Java

4.2.1 Installing the JDK

master@VM-0-12-ubuntu:/opt/JuciyBigData$ ls
apache-hive-2.3.9-bin.tar.gz  hbase-2.4.8-bin.tar.gz      mysql-connector-java_8.0.27-1ubuntu20.04_all.deb
hadoop-3.3.1.tar.gz           jdk-8u311-linux-x64.tar.gz  spark-3.2.0-bin-without-hadoop.tgz
master@VM-0-12-ubuntu:/opt/JuciyBigData$ sudo tar -xzvf jdk-8u311-linux-x64.tar.gz -C /opt
[sudo] password for master: 
master is not in the sudoers file.  This incident will be reported.
master@VM-0-12-ubuntu:/opt/JuciyBigData$ su root
Password: 
root@VM-0-12-ubuntu:/opt/JuciyBigData# sudo tar -xzvf jdk-8u311-linux-x64.tar.gz -C /opt
jdk1.8.0_311/jre/lib/security/trusted.libraries
jdk1.8.0_311/jre/lib/security/java.security
jdk1.8.0_311/jre/lib/security/policy/
jdk1.8.0_311/jre/lib/security/policy/limited/
···
root@VM-0-12-ubuntu:/opt/JuciyBigData# su master
master@VM-0-12-ubuntu:/opt/JuciyBigData$ 

sudo really can't be used here; the solution is to switch back to root first, as the tutorial explains:

Note: if the sudo command cannot be used, switch directly to the root user with su root or sudo -i. Be aware, though, that the following steps must be performed as the datawhale user:

  • the SSH login permission setup
  • steps 5, 6, 7, 8, and 9 of the pseudo-distributed Hadoop installation.

Applied to my setup, that means switching to my Hadoop user (master) so that these operations take effect under that user.

Next, rename the jdk directory to java, which again requires switching users:

master@VM-0-12-ubuntu:/opt/JuciyBigData$ ls
apache-hive-2.3.9-bin.tar.gz  hbase-2.4.8-bin.tar.gz      mysql-connector-java_8.0.27-1ubuntu20.04_all.deb
hadoop-3.3.1.tar.gz           jdk-8u311-linux-x64.tar.gz  spark-3.2.0-bin-without-hadoop.tgz
master@VM-0-12-ubuntu:/opt/JuciyBigData$ cd ..
master@VM-0-12-ubuntu:/opt$ ls
jdk1.8.0_311  JuciyBigData  JuciyBigData.zip
master@VM-0-12-ubuntu:/opt$ sudo mv /opt/jdk1.8.0_311/ /opt/java
[sudo] password for master: 
master is not in the sudoers file.  This incident will be reported.
master@VM-0-12-ubuntu:/opt$ su root
Password: 
root@VM-0-12-ubuntu:/opt# sudo mv /opt/jdk1.8.0_311/ /opt/java
root@VM-0-12-ubuntu:/opt# ls
java  JuciyBigData  JuciyBigData.zip

Next, change the owner of the java directory, since the extraction and related steps were done by root rather than master:

root@VM-0-12-ubuntu:/opt# ll
total 1496456
drwxr-xr-x  4 root  root        4096 Mar 15 21:24 ./
drwxr-xr-x 20 root  root        4096 Mar 15 21:27 ../
drwxr-xr-x  8 10143 10143       4096 Sep 27 20:29 java/
drwxr-xr-x  2 root  root        4096 Feb 12 17:51 JuciyBigData/
-rw-r--r--  1 root  root  1532346446 Mar 15 18:28 JuciyBigData.zip
root@VM-0-12-ubuntu:/opt# sudo chown -R master:master /opt/java
root@VM-0-12-ubuntu:/opt# ll
total 1496456
drwxr-xr-x  4 root   root         4096 Mar 15 21:24 ./
drwxr-xr-x 20 root   root         4096 Mar 15 21:28 ../
drwxr-xr-x  8 master master       4096 Sep 27 20:29 java/
drwxr-xr-x  2 root   root         4096 Feb 12 17:51 JuciyBigData/
-rw-r--r--  1 root   root   1532346446 Mar 15 18:28 JuciyBigData.zip
root@VM-0-12-ubuntu:/opt# 

4.2.2 Updating Environment Variables

Open the /etc/profile file; this again needs sudo, hence the root user:

root@VM-0-12-ubuntu:/opt# sudo vim /etc/profile
root@VM-0-12-ubuntu:/opt# 

Add the following:

#java
export JAVA_HOME=/opt/java
export PATH=$JAVA_HOME/bin:$PATH

Basic file editing on Linux is simple: after opening the file with vim, press i to insert, move the cursor to the end, and paste with Shift+Insert.

Then press Esc to leave insert mode, type Shift+: to enter command mode, and enter wq to write and quit.

After editing, run source /etc/profile to reload the environment variables, then verify with java -version:

root@VM-0-12-ubuntu:/opt# source /etc/profile
root@VM-0-12-ubuntu:/opt# java -version
java version "1.8.0_311"
Java(TM) SE Runtime Environment (build 1.8.0_311-b11)
Java HotSpot(TM) 64-Bit Server VM (build 25.311-b11, mixed mode)
root@VM-0-12-ubuntu:/opt# 

On reflection, let's give master sudo rights after all, mainly out of concern about forgetting to switch users later.

Following a blog post, we first grant the root user write access to the sudoers file, which is read-only by default:

root@VM-0-12-ubuntu:/opt# chmod u+w /etc/sudoers

Then append the following line:

master ALL=(ALL:ALL) NOPASSWD:ALL

This means the master user may run sudo without entering a password.

Finally, remember to revoke write permission on the sudoers file:

root@VM-0-12-ubuntu:/opt# chmod u-w /etc/sudoers

Then switch back to master and try sudo; the setup works:

root@VM-0-12-ubuntu:/opt# su master
master@VM-0-12-ubuntu:/opt$ sudo vim /etc/profile
master@VM-0-12-ubuntu:/opt$ 

4.2.3 Configuring SSH Login Permissions

The tutorial is explicit here: switch back to the user that operates Hadoop.

For both pseudo-distributed and fully distributed Hadoop, the NameNode has to start the Hadoop daemons on every machine in the cluster, which is done over SSH. Hadoop offers no way to type in an SSH password, so for the logins to every machine to go smoothly, all machines must be configured so that the NameNode can log in to them over passwordless SSH.

Setting up SSH, as I understand it, is mainly for communication inside the cluster.

First we need to generate a key pair:

To enable passwordless SSH login, the NameNode first needs to generate its own SSH key, with the following command:

master@VM-0-12-ubuntu:/opt$ ssh-keygen -t rsa # at every prompt, just press Enter
Generating public/private rsa key pair.
Enter file in which to save the key (/home/master/.ssh/id_rsa): 
Created directory '/home/master/.ssh'.
Enter passphrase (empty for no passphrase): 
Enter same passphrase again: 
Your identification has been saved in /home/master/.ssh/id_rsa
Your public key has been saved in /home/master/.ssh/id_rsa.pub
The key fingerprint is:
SHA256:HfMDj9TBrGiVvkaLkxwXGZPI3H4EMpKL+E2nmWO/Z0A master@VM-0-12-ubuntu
The key's randomart image is:
+---[RSA 3072]----+
|      .+oo+O.    |
|      ..+oBo+.   |
|   . . . ==+.    |
|  . . o Eo*B.    |
|   . o OS*o++    |
|    . B * +  .   |
|     . o +       |
|        . o      |
|        .+       |
+----[SHA256]-----+
master@VM-0-12-ubuntu:/opt$ 

The next step is to hand this login credential to the other machines.

After the NameNode has generated its key, its public key must be sent to every other machine in the cluster. We can append the contents of id_rsa.pub to the ~/.ssh/authorized_keys file on each machine that should accept passwordless SSH logins, after which that machine can be logged in to without a password. For passwordless login to the local machine itself, run the following:

The tutorial gives two variants; I tried them in turn, and the second doesn't seem to work (its paths assume the tutorial's datawhale user):

master@VM-0-12-ubuntu:/opt$ cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
master@VM-0-12-ubuntu:/opt$ cat /home/datawhale/.ssh/id_rsa.pub >> /home/datawhale/.ssh/authorized_keys
bash: /home/datawhale/.ssh/authorized_keys: No such file or directory
master@VM-0-12-ubuntu:/opt$ 
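As an aside (a hedged alternative, not from the tutorial): the ssh-copy-id utility that ships with OpenSSH performs the same append and also sets the file permissions correctly:

# equivalent to manually appending id_rsa.pub to ~/.ssh/authorized_keys
ssh-copy-id master@localhost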

Then test the connection with ssh localhost. It seems to have worked, though I never saw anything like a 'successful login' message:

master@VM-0-12-ubuntu:/opt$ ssh localhost
The authenticity of host 'localhost (127.0.0.1)' can't be established.
ECDSA key fingerprint is SHA256:5vRo0/nGDBuyknC2msG0n3P4a7H1LD2weTDyiIdNUhU.
Are you sure you want to continue connecting (yes/no/[fingerprint])? yes
Warning: Permanently added 'localhost' (ECDSA) to the list of known hosts.
Welcome to Ubuntu 20.04 LTS (GNU/Linux 5.4.0-96-generic x86_64)

 * Documentation:  https://help.ubuntu.com
 * Management:     https://landscape.canonical.com
 * Support:        https://ubuntu.com/advantage

  System information as of Tue 15 Mar 2022 10:13:21 PM CST

  System load:  0.0                Processes:             127
  Usage of /:   15.0% of 49.16GB   Users logged in:       1
  Memory usage: 14%                IPv4 address for eth0: 172.16.0.12
  Swap usage:   0%

 * Super-optimized for small spaces - read how we shrank the memory
   footprint of MicroK8s to make it the smallest full K8s around.

   https://ubuntu.com/blog/microk8s-memory-optimisation


The programs included with the Ubuntu system are free software;
the exact distribution terms for each program are described in the
individual files in /usr/share/doc/*/copyright.

Ubuntu comes with ABSOLUTELY NO WARRANTY, to the extent permitted by
applicable law.

The tutorial also notes this may mean openssh-server is not installed, so I went to look at the referenced blog post.

But my situation looks just like the blog's example of what things look like once the problem is fixed; the blog's example is:

$ ssh localhost
The authenticity of host 'localhost (127.0.0.1)' can't be established.
ECDSA key fingerprint is SHA256:7FTkHoAyQ9yLqfLXI+GOOz/Ej7uBe1vJldjpsej+OuM.
Are you sure you want to continue connecting (yes/no)? no

So I believe it succeeded: the system status banner was printed, exactly as on every normal login to this server.

4.3 Standalone Hadoop

4.3.1 Installing Hadoop

First, extract the archive:

master@VM-0-12-ubuntu:/opt/JuciyBigData$ ls
apache-hive-2.3.9-bin.tar.gz  hbase-2.4.8-bin.tar.gz      mysql-connector-java_8.0.27-1ubuntu20.04_all.deb
hadoop-3.3.1.tar.gz           jdk-8u311-linux-x64.tar.gz  spark-3.2.0-bin-without-hadoop.tgz
master@VM-0-12-ubuntu:/opt/JuciyBigData$ sudo tar -xzvf hadoop-3.3.1.tar.gz -C /opt
···
hadoop-3.3.1/include/hdfs.h
hadoop-3.3.1/include/Pipes.hh
master@VM-0-12-ubuntu:/opt/JuciyBigData$ 

Then rename it likewise, and change the owner and group:

master@VM-0-12-ubuntu:/opt$ ls
hadoop-3.3.1  java  JuciyBigData  JuciyBigData.zip
master@VM-0-12-ubuntu:/opt$ sudo mv /opt/hadoop-3.3.1/ /opt/hadoop
master@VM-0-12-ubuntu:/opt$ ll
total 1496460
drwxr-xr-x  5 root   root         4096 Mar 15 22:23 ./
drwxr-xr-x 20 root   root         4096 Mar 15 22:23 ../
drwxr-xr-x 10 ubuntu ubuntu       4096 Jun 15  2021 hadoop/
drwxr-xr-x  8 master master       4096 Sep 27 20:29 java/
drwxr-xr-x  2 root   root         4096 Feb 12 17:51 JuciyBigData/
-rw-r--r--  1 root   root   1532346446 Mar 15 18:28 JuciyBigData.zip
master@VM-0-12-ubuntu:/opt$ sudo chown -R master:master /opt/hadoop
master@VM-0-12-ubuntu:/opt$ ll
total 1496460
drwxr-xr-x  5 root   root         4096 Mar 15 22:23 ./
drwxr-xr-x 20 root   root         4096 Mar 15 22:24 ../
drwxr-xr-x 10 master master       4096 Jun 15  2021 hadoop/
drwxr-xr-x  8 master master       4096 Sep 27 20:29 java/
drwxr-xr-x  2 root   root         4096 Feb 12 17:51 JuciyBigData/
-rw-r--r--  1 root   root   1532346446 Mar 15 18:28 JuciyBigData.zip
master@VM-0-12-ubuntu:/opt$ 

Because I downloaded this file as the ubuntu user, it initially belonged to ubuntu.

4.3.2 Updating System Environment Variables

As above, open /etc/profile for editing:

master@VM-0-12-ubuntu:/opt$ sudo vim /etc/profile
master@VM-0-12-ubuntu:/opt$ 

Append the following at the end:

#hadoop
export HADOOP_HOME=/opt/hadoop
export PATH=$HADOOP_HOME/bin:$PATH

Reload the environment variables and test:

master@VM-0-12-ubuntu:/opt$ source /etc/profile
master@VM-0-12-ubuntu:/opt$ 
master@VM-0-12-ubuntu:/opt$ hadoop version
Hadoop 3.3.1
Source code repository https://github.com/apache/hadoop.git -r a3b9c37a397ad4188041dd80621bdeefc46885f2
Compiled by ubuntu on 2021-06-15T05:13Z
Compiled with protoc 3.7.1
From source with checksum 88a4ddb2299aca054416d6b7f81ca55
This command was run using /opt/hadoop/share/hadoop/common/hadoop-common-3.3.1.jar
master@VM-0-12-ubuntu:/opt$ 

4.3.3 Editing hadoop-env.sh

This appears to configure Hadoop's own runtime environment variables; the feel of it is something like pointing Hadoop at the right interpreter.

For a standalone installation, the first file to change is hadoop-env.sh, which configures the environment variables Hadoop runs with:

master@VM-0-12-ubuntu:/opt$ cd hadoop
master@VM-0-12-ubuntu:/opt/hadoop$ vim etc/hadoop/hadoop-env.sh
master@VM-0-12-ubuntu:/opt/hadoop$ 

Then, likewise, append the new content at the end of the file:

export JAVA_HOME=/opt/java/

Likewise, we should test this too.

Hadoop ships with some examples we can use for testing; running the grep example (the command below; the tutorial labels this step WordCount) checks whether Hadoop was installed successfully. The steps are:

  1. Create an input folder under /opt/hadoop/ to hold the input data;
  2. Copy the configuration files under etc/hadoop/ into the input folder;
  3. Have the job write its results to an output folder under the hadoop directory (do not create it beforehand; the job creates it itself);
  4. Run the grep example;
  5. Inspect the output.

The commands are as follows:
master@VM-0-12-ubuntu:/opt/hadoop$ ls
bin  etc  include  lib  libexec  LICENSE-binary  licenses-binary  LICENSE.txt  NOTICE-binary  NOTICE.txt  README.txt  sbin  share
master@VM-0-12-ubuntu:/opt/hadoop$ mkdir input
master@VM-0-12-ubuntu:/opt/hadoop$ ls
bin  include  lib      LICENSE-binary   LICENSE.txt    NOTICE.txt  sbin
etc  input    libexec  licenses-binary  NOTICE-binary  README.txt  share
master@VM-0-12-ubuntu:/opt/hadoop$ cp etc/hadoop/*.xml input
master@VM-0-12-ubuntu:/opt/hadoop$ bin/hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-3.3.1.jar grep input output 'dfs[a-z.]+'
2022-03-15 22:59:45,601 INFO impl.MetricsConfig: Loaded properties from hadoop-metrics2.properties
···
	File Output Format Counters 
		Bytes Written=23
master@VM-0-12-ubuntu:/opt/hadoop$ cat output/*
1	dfsadmin
master@VM-0-12-ubuntu:/opt/hadoop$ 

The test succeeded.

This means that across all the configuration files there is exactly one token matching the regular expression dfs[a-z.]+, so the output is correct.

4.4 Pseudo-Distributed Hadoop

4.4.1 Editing core-site.xml

Open the core-site.xml file and add the following between the <configuration> and </configuration> tags:

<configuration>
    <property>
        <name>fs.defaultFS</name>
        <value>hdfs://localhost:9000</value>
    </property>
</configuration>

master@VM-0-12-ubuntu:/opt/hadoop$ cat /opt/hadoop/etc/hadoop/core-site.xml
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<!--
  Licensed under the Apache License, Version 2.0 (the "License");
  you may not use this file except in compliance with the License.
  You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

  Unless required by applicable law or agreed to in writing, software
  distributed under the License is distributed on an "AS IS" BASIS,
  WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  See the License for the specific language governing permissions and
  limitations under the License. See accompanying LICENSE file.
-->

<!-- Put site-specific property overrides in this file. -->

<configuration>
</configuration>
master@VM-0-12-ubuntu:/opt/hadoop$ vim /opt/hadoop/etc/hadoop/core-site.xml
master@VM-0-12-ubuntu:/opt/hadoop$ cat /opt/hadoop/etc/hadoop/core-site.xml
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<!--
  Licensed under the Apache License, Version 2.0 (the "License");
  you may not use this file except in compliance with the License.
  You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

  Unless required by applicable law or agreed to in writing, software
  distributed under the License is distributed on an "AS IS" BASIS,
  WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  See the License for the specific language governing permissions and
  limitations under the License. See accompanying LICENSE file.
-->

<!-- Put site-specific property overrides in this file. -->

<configuration>
	<property>
		<name>fs.defaultFS</name>
		<value>hdfs://localhost:9000</value>
	</property>
</configuration>
master@VM-0-12-ubuntu:/opt/hadoop$ 

As you can see, the core-site.xml format is quite simple: the <name> tag holds a configuration property's name, and the <value> tag sets its value. In this file we only need to specify HDFS's address and port number; following the official documentation, the port is set to 9000.

4.4.2 Editing hdfs-site.xml

Open the hdfs-site.xml file and add the following between the <configuration> and </configuration> tags:

<configuration>
    <property>
        <name>dfs.replication</name>
        <value>1</value>
    </property>
</configuration>

master@VM-0-12-ubuntu:/opt/hadoop$ cat /opt/hadoop/etc/hadoop/hdfs-site.xml
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<!--
  Licensed under the Apache License, Version 2.0 (the "License");
  you may not use this file except in compliance with the License.
  You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

  Unless required by applicable law or agreed to in writing, software
  distributed under the License is distributed on an "AS IS" BASIS,
  WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  See the License for the specific language governing permissions and
  limitations under the License. See accompanying LICENSE file.
-->

<!-- Put site-specific property overrides in this file. -->

<configuration>

</configuration>
master@VM-0-12-ubuntu:/opt/hadoop$ vim /opt/hadoop/etc/hadoop/hdfs-site.xml
master@VM-0-12-ubuntu:/opt/hadoop$ cat /opt/hadoop/etc/hadoop/hdfs-site.xml
<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<!--
  Licensed under the Apache License, Version 2.0 (the "License");
  you may not use this file except in compliance with the License.
  You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

  Unless required by applicable law or agreed to in writing, software
  distributed under the License is distributed on an "AS IS" BASIS,
  WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  See the License for the specific language governing permissions and
  limitations under the License. See accompanying LICENSE file.
-->

<!-- Put site-specific property overrides in this file. -->

<configuration>
	<property>
		<name>dfs.replication</name>
		<value>1</value>
	</property>
</configuration>
master@VM-0-12-ubuntu:/opt/hadoop$ 

In hdfs-site.xml we set dfs.replication to 1, the minimum value Hadoop will run with; it controls how many replicas of each piece of data the HDFS file system keeps.

4.4.3 Editing mapred-site.xml

Open the mapred-site.xml file and add the following between the <configuration> and </configuration> tags:

<configuration>
    <property>
        <name>mapreduce.framework.name</name>
        <value>yarn</value>
    </property>
    <property>
        <name>mapreduce.application.classpath</name>
        <value>$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/*:$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/lib/*</value>
    </property>
</configuration>
master@VM-0-12-ubuntu:/opt/hadoop$ cat /opt/hadoop/etc/hadoop/mapred-site.xml
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<!--
  Licensed under the Apache License, Version 2.0 (the "License");
  you may not use this file except in compliance with the License.
  You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

  Unless required by applicable law or agreed to in writing, software
  distributed under the License is distributed on an "AS IS" BASIS,
  WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  See the License for the specific language governing permissions and
  limitations under the License. See accompanying LICENSE file.
-->

<!-- Put site-specific property overrides in this file. -->

<configuration>

</configuration>
master@VM-0-12-ubuntu:/opt/hadoop$ vim /opt/hadoop/etc/hadoop/mapred-site.xml
master@VM-0-12-ubuntu:/opt/hadoop$ cat /opt/hadoop/etc/hadoop/mapred-site.xml
<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<!--
  Licensed under the Apache License, Version 2.0 (the "License");
  you may not use this file except in compliance with the License.
  You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

  Unless required by applicable law or agreed to in writing, software
  distributed under the License is distributed on an "AS IS" BASIS,
  WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  See the License for the specific language governing permissions and
  limitations under the License. See accompanying LICENSE file.
-->

<!-- Put site-specific property overrides in this file. -->

<configuration>
	<property>
		<name>mapreduce.framework.name</name>
		<value>yarn</value>
	</property>
	<property>
		<name>mapreduce.application.classpath</name>
		<value>$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/*:$HADOOP_MAPRED_HOME/share/hadoop/mapreduce/lib/*</value>
	</property>
</configuration>

master@VM-0-12-ubuntu:/opt/hadoop$ 

4.4.4 Editing yarn-site.xml

Open the yarn-site.xml file and add the following between the <configuration> and </configuration> tags:

<configuration>
    <property>
        <name>yarn.nodemanager.aux-services</name>
        <value>mapreduce_shuffle</value>
    </property>
    <property>
        <name>yarn.nodemanager.env-whitelist</name>
        <value>JAVA_HOME,HADOOP_COMMON_HOME,HADOOP_HDFS_HOME,HADOOP_CONF_DIR,CLASSPATH_PREPEND_DISTCACHE,HADOOP_YARN_HOME,HADOOP_MAPRED_HOME</value>
    </property>
</configuration>

master@VM-0-12-ubuntu:/opt/hadoop$ cat /opt/hadoop/etc/hadoop/yarn-site.xml
<?xml version="1.0"?>
<!--
  Licensed under the Apache License, Version 2.0 (the "License");
  you may not use this file except in compliance with the License.
  You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

  Unless required by applicable law or agreed to in writing, software
  distributed under the License is distributed on an "AS IS" BASIS,
  WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  See the License for the specific language governing permissions and
  limitations under the License. See accompanying LICENSE file.
-->
<configuration>

<!-- Site specific YARN configuration properties -->

</configuration>
master@VM-0-12-ubuntu:/opt/hadoop$ vim /opt/hadoop/etc/hadoop/yarn-site.xml
master@VM-0-12-ubuntu:/opt/hadoop$ cat /opt/hadoop/etc/hadoop/yarn-site.xml
<?xml version="1.0"?>
<!--
  Licensed under the Apache License, Version 2.0 (the "License");
  you may not use this file except in compliance with the License.
  You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

  Unless required by applicable law or agreed to in writing, software
  distributed under the License is distributed on an "AS IS" BASIS,
  WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
  See the License for the specific language governing permissions and
  limitations under the License. See accompanying LICENSE file.
-->

<configuration>
	<property>
		<name>yarn.nodemanager.aux-services</name>
		<value>mapreduce_shuffle</value>
	</property>
	<property>
		<name>yarn.nodemanager.env-whitelist</name>
	<value>JAVA_HOME,HADOOP_COMMON_HOME,HADOOP_HDFS_HOME,HADOOP_CONF_DIR,CLASSPATH_PREPEND_DISTCACHE,HADOOP_YARN_HOME,HADOOP_MAPRED_HOME</value>
	</property>
</configuration>

master@VM-0-12-ubuntu:/opt/hadoop$ 

4.4.5 Formatting the Distributed File System

First check that you are the Hadoop user, then run the initialization.

With the configuration complete, the file system must be initialized first. Since much of Hadoop's work runs on its bundled HDFS file system, the file system has to be formatted before any computation task can proceed. The initialization command is:

hdfs namenode -format
master@VM-0-12-ubuntu:/opt/hadoop$ hdfs namenode -format
WARNING: /opt/hadoop/logs does not exist. Creating.
2022-03-15 23:20:48,206 INFO namenode.NameNode: STARTUP_MSG: 
/************************************************************
STARTUP_MSG: Starting NameNode
STARTUP_MSG:   host = localhost.localdomain/127.0.1.1
STARTUP_MSG:   args = [-format]
STARTUP_MSG:   version = 3.3.1
···
2022-03-15 23:20:50,155 INFO common.Storage: Storage directory /tmp/hadoop-master/dfs/name has been successfully formatted.
···
2022-03-15 23:20:50,490 INFO namenode.NameNode: SHUTDOWN_MSG: 
/************************************************************
SHUTDOWN_MSG: Shutting down NameNode at localhost.localdomain/127.0.1.1
************************************************************/
master@VM-0-12-ubuntu:/opt/hadoop$ 

The "successfully formatted" message that marks success is actually quite hard to spot in all that log output.
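Note from the log above that the NameNode metadata landed in /tmp/hadoop-master/dfs/name, and /tmp is usually wiped on reboot. As a hedged aside (not part of the tutorial): adding a hadoop.tmp.dir property to core-site.xml points Hadoop at a persistent directory instead, so the file system survives reboots; a re-format is required after changing it. For example:

<property>
    <name>hadoop.tmp.dir</name>
    <value>/opt/hadoop/tmp</value>  <!-- assumed path; any persistent, writable directory works -->
</property>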

4.4.6 Starting Hadoop

Start all of Hadoop's daemons with the command below. As the messages indicate, all startup information is written to the corresponding log files; if a daemon fails to start, check the matching error log.

master@VM-0-12-ubuntu:/opt/hadoop$ /opt/hadoop/sbin/start-all.sh
WARNING: Attempting to start all Apache Hadoop daemons as master in 10 seconds.
WARNING: This is not a recommended production deployment configuration.
WARNING: Use CTRL-C to abort.
Starting namenodes on [localhost]
Starting datanodes
Starting secondary namenodes [VM-0-12-ubuntu]
VM-0-12-ubuntu: Warning: Permanently added 'vm-0-12-ubuntu' (ECDSA) to the list of known hosts.
Starting resourcemanager
Starting nodemanagers
master@VM-0-12-ubuntu:/opt/hadoop$ 

4.4.7 Checking Hadoop Processes

master@VM-0-12-ubuntu:/opt/hadoop$ jps
85952 NodeManager
85335 DataNode
86393 Jps
85800 ResourceManager
85565 SecondaryNameNode
85183 NameNode
master@VM-0-12-ubuntu:/opt/hadoop$ 

4.4.8 The Hadoop WebUI

At this point, Hadoop's information can be viewed through the web interface at http://localhost:8088.
(Screenshot: the Hadoop WebUI management page.)

Wow, to my surprise I can also reach this port on the server from my own browser; I had assumed it was not exposed externally.

4.4.9 Testing the HDFS Cluster and a MapReduce Job

First, create the directories:

master@VM-0-12-ubuntu:/$ hadoop fs -mkdir /user
master@VM-0-12-ubuntu:/$ hadoop fs -mkdir /user/master
master@VM-0-12-ubuntu:/opt$ hadoop fs -mkdir /input

From what I found, these directories live in the HDFS file system; you cannot see them directly from Linux.

Then create a test file and write Hello world! into it:

master@VM-0-12-ubuntu:/$ vim /home/master/test

Upload the test file to the Hadoop HDFS cluster directory with:

master@VM-0-12-ubuntu:/$ hadoop fs -put /home/master/test /input

Run the wordcount program with:

master@VM-0-12-ubuntu:/$ hadoop jar /opt/hadoop/share/hadoop/mapreduce/hadoop-mapreduce-examples-3.3.1.jar wordcount /input /out
2022-03-15 23:39:05,938 INFO client.DefaultNoHARMFailoverProxyProvider: Connecting to ResourceManager at /0.0.0.0:8032
2022-03-15 23:39:07,070 INFO mapreduce.JobResourceUploader: Disabling Erasure Coding for path: /tmp/hadoop-yarn/staging/master/.staging/job_1647357988111_0001
···

Check the execution result with:

master@VM-0-12-ubuntu:/$ hadoop fs -ls /out
Found 2 items
-rw-r--r--   1 master supergroup          0 2022-03-15 23:39 /out/_SUCCESS
-rw-r--r--   1 master supergroup         17 2022-03-15 23:39 /out/part-r-00000

As shown, the result contains a _SUCCESS file, indicating the Hadoop cluster ran the job successfully.

View the actual output with:

master@VM-0-12-ubuntu:/$ hadoop fs -text /out/part-r-00000
Hello	1
world!	1
master@VM-0-12-ubuntu:/$ 

There is also the cluster installation mode to try, but I only have one server at the moment and my laptop can't run a VM for now, so that will have to wait.

4.4.10 Shutting Down Hadoop

Oh, right: since we started it earlier, we should shut it down accordingly.

ubuntu@VM-0-12-ubuntu:~$ /opt/hadoop/sbin/stop-all.sh
WARNING: Stopping all Apache Hadoop daemons as ubuntu in 10 seconds.
WARNING: Use CTRL-C to abort.
^C
ubuntu@VM-0-12-ubuntu:~$ 
ubuntu@VM-0-12-ubuntu:~$ su master
Password: 
master@VM-0-12-ubuntu:/home/ubuntu$ /opt/hadoop/sbin/stop-all.sh
WARNING: Stopping all Apache Hadoop daemons as master in 10 seconds.
WARNING: Use CTRL-C to abort.
Stopping namenodes on [localhost]
Stopping datanodes
Stopping secondary namenodes [VM-0-12-ubuntu]
Stopping nodemanagers
localhost: WARNING: nodemanager did not stop gracefully after 5 seconds: Trying to kill with kill -9
Stopping resourcemanager
master@VM-0-12-ubuntu:/home/ubuntu$ /opt/hadoop/sbin/stop-all.sh
WARNING: Stopping all Apache Hadoop daemons as master in 10 seconds.
WARNING: Use CTRL-C to abort.
Stopping namenodes on [localhost]
Stopping datanodes
Stopping secondary namenodes [VM-0-12-ubuntu]
Stopping nodemanagers
Stopping resourcemanager
master@VM-0-12-ubuntu:/home/ubuntu$ 

This was a bit odd: I had just reconnected to the server, so I hadn't switched users.

The moment I realized, I hit Ctrl+C to abort, then switched users.

After switching, the first run didn't fully succeed; I investigated while stopping it a second time, and then it worked.

The components being stopped correspond one-to-one with those started, in the same order.

Come to think of it, I don't know whether shutting down without switching users would have worked; I'll try it next time.

Also, I've read that the *-all.sh scripts are best avoided, because they act on everything at once.

If such a script fails partway through, it's hard to pinpoint the cause, so starting and stopping the daemons one at a time is recommended (see the sketch below).
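For reference, a hedged sketch of the finer-grained equivalents in Hadoop 3 (not commands I ran above):

# start/stop HDFS and YARN separately instead of start-all.sh / stop-all.sh
/opt/hadoop/sbin/start-dfs.sh
/opt/hadoop/sbin/start-yarn.sh
/opt/hadoop/sbin/stop-yarn.sh
/opt/hadoop/sbin/stop-dfs.sh

# or manage one daemon at a time
hdfs --daemon start namenode
yarn --daemon stop nodemanager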
