lixora

[Oracle 11g Grid] Is there a difference between crsctl start cluster and crsctl start crs?

Q: crsctl start cluster is new in 11.2. How does it differ from crsctl start crs?

crsctl start/stop crs starts and stops the clusterware stack on the local node, including the ohasd process. This command can only manage the local node.

[root@vmrac2 ~]# crsctl start crs -h

Usage:

crsctl start crs [-excl [-nocrs]|-nowait]

Start OHAS on this server

where

-excl Start Oracle Clusterware in exclusive mode

-nocrs Start Oracle Clusterware in exclusive mode without starting CRS

-nowait Do not wait for OHAS to start

crsctl start/stop cluster manages the Oracle Clusterware stack on the local node when neither -all nor -n is given, and on other nodes as well when one of them is:

-all starts the clusterware on every node in the cluster, that is, the whole cluster.

-n starts the clusterware on the named nodes.

It does NOT include the OHASD process, however: you cannot start or stop the clusterware stack on a node unless OHASD is already running there.

[root@vmrac2 ~]# crsctl start cluster -h

Usage:

crsctl start cluster [[-all]|[-n <server>[...]]]

Start CRS stack

where

Default Start local server

-all Start all servers

-n Start named servers

server [...] One or more blank-separated server names

crsctl start/stop crs manages the entire Oracle Clusterware stack on the local node, but it cannot manage remote nodes. crsctl start/stop cluster can manage every node, but only on nodes where the OHASD process is running (Oracle High Availability Services must be available there).
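The choice between the two commands can be summarized in a small sketch. The function name start_cmd and its scope keywords (full, local, all, nodes) are hypothetical and exist only for illustration; the function prints the crsctl command line instead of running it, so it is safe to try anywhere.

```shell
#!/bin/sh
# Hypothetical helper: print (not run) the crsctl command for a given scope.
#   full  -> whole local stack, including OHASD
#   local -> CRS stack on this node (OHASD must already be running)
#   all   -> CRS stack on every node
#   nodes -> CRS stack on the named nodes
start_cmd() {
    scope=$1
    [ $# -gt 0 ] && shift
    case "$scope" in
        full)  echo "crsctl start crs" ;;
        local) echo "crsctl start cluster" ;;
        all)   echo "crsctl start cluster -all" ;;
        nodes)
            [ $# -gt 0 ] || { echo "nodes: need at least one server name" >&2; return 1; }
            echo "crsctl start cluster -n $*" ;;
        *)     echo "unknown scope: $scope" >&2; return 1 ;;
    esac
}

start_cmd nodes vmrac1 vmrac2   # prints: crsctl start cluster -n vmrac1 vmrac2
```

Only the full scope maps to crsctl start crs, because that is the only command that also brings up OHASD.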

Let's verify this with an experiment.

First we stop CRS on node 2 and make sure no OHASD process is left on that node.

[root@vmrac2 ~]# crsctl stop crs

CRS-2791: Starting shutdown of Oracle High Availability Services-managed resources on 'vmrac2'

CRS-2673: Attempting to stop 'ora.crsd' on 'vmrac2'

...

CRS-2673: Attempting to stop 'ora.DATANEW.dg' on 'vmrac2'

...

CRS-2677: Stop of 'ora.gipcd' on 'vmrac2' succeeded

...

CRS-2793: Shutdown of Oracle High Availability Services-managed resources on 'vmrac2' has completed

CRS-4133: Oracle High Availability Services has been stopped.

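Here we can see that crsctl stop crs has shut down all of the local clusterware.

To be completely sure, though, it is worth checking at the OS level whether any cluster processes are still present.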

[root@vmrac2 ~]# ps -ef|grep ohasd

root 3747 1 0 Jun19 ? 00:00:00 /bin/sh /etc/init.d/init.ohasd run

[root@vmrac2 ~]# ps -ef|grep d.bin

root 30644 27369 0 13:08 pts/2 00:00:00 grep d.bin

------At this point we can confirm that the clusterware on this node is completely down.
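The same OS-level check can be scripted. The sketch below is only an illustration: count_grid_daemons is a hypothetical name, and it assumes, as the ps -ef|grep d.bin check above does, that the clusterware daemons show up with a d.bin suffix such as ohasd.bin.

```shell
#!/bin/sh
# Hypothetical helper: count clusterware daemons in `ps -ef` text read from stdin.
count_grid_daemons() {
    # The [d] bracket keeps a live grep from matching its own command line;
    # `|| true` hides grep's non-zero exit status when the count is 0.
    grep -c '[d]\.bin' || true
}

# On a cluster node (not run here): ps -ef | count_grid_daemons
# Demo with the init.ohasd wrapper line only; prints 0, i.e. no daemon is running.
printf '%s\n' \
  'root 3747 1 0 Jun19 ? 00:00:00 /bin/sh /etc/init.d/init.ohasd run' \
  | count_grid_daemons
```

A result of 0 matches the situation shown above: only the init.ohasd wrapper script is left.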

[root@vmrac2 ~]# ps -ef|grep ohasd

root 3747 1 0 Jun19 ? 00:00:00 /bin/sh /etc/init.d/init.ohasd run

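------It does not matter that this script is still there. In fact, without this sh process, ohasd.bin cannot be started at all;

if it is missing, investigate why the rc startup script (the Snncommand script, here S96ohasd) did not run.

Killing this background script with kill does not get rid of it: a new process is respawned automatically.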

[root@vmrac2 ~]# ps -ef|grep ohasd

root 3747 1 0 Jun19 ? 00:00:00 /bin/sh /etc/init.d/init.ohasd run

root 4888 4812 0 13:39 pts/1 00:00:00 grep ohasd

[root@vmrac2 ~]# kill -9 3747

[root@vmrac2 ~]# ps -ef|grep ohasd

root 4895 1 0 13:39 ? 00:00:00 /bin/sh /etc/init.d/init.ohasd run

root 4920 4812 0 13:39 pts/1 00:00:00 grep ohasd

[root@vmrac2 ~]# kill -9 4895

[root@vmrac2 ~]# ps -ef|grep ohasd

root 4933 1 0 13:40 ? 00:00:00 /bin/sh /etc/init.d/init.ohasd run

root 4958 4812 0 13:40 pts/1 00:00:00 grep ohasd
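A likely reason the script comes back under a new PID each time is that init itself restarts it. The sketch below assumes a SysV-init Linux host where the Grid installation registered init.ohasd in /etc/inittab with the respawn action; that is an assumption about this host, and releases based on upstart or systemd use a different mechanism. The function name ohasd_respawn_entries is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: print inittab-style entries that respawn init.ohasd.
# An inittab line has the form id:runlevels:action:process.
ohasd_respawn_entries() {
    # Skip commented lines, keep entries whose action is "respawn"
    # and whose process field mentions init.ohasd.
    awk -F: '$1 !~ /^#/ && $3 == "respawn" && $4 ~ /init\.ohasd/' "${1:-/etc/inittab}"
}

# On a cluster node (not run here): ohasd_respawn_entries
# An optional argument names a different file to inspect.
```

If the function prints an entry, init starts a fresh copy whenever the script exits, which would explain why kill -9 appears to have no effect.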

The test itself:

The clusterware on node 2 is down; on node 1 it is still running.

On node 1:

Use crsctl start cluster to start the clusterware on node 2.

[root@vmrac1 ~]# crsctl start cluster -n vmrac2

CRS-4405: The following nodes are unknown to Oracle High Availability Services:vmrac2

------The error is clear: there is no ohasd process on vmrac2, so node 1 cannot start the clusterware on node 2.

[root@vmrac1 ~]# crsctl start cluster -all

CRS-4690: Oracle Clusterware is already running on 'vmrac1'
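When crsctl start cluster is scripted, the CRS-4405 message seen in this test can be used to find the nodes that first need a local crsctl start crs. This is a minimal sketch: unknown_nodes is a hypothetical name, and the handling of several node names (separated by commas or spaces) is an assumption, since the test above only shows a single node.

```shell
#!/bin/sh
# Hypothetical helper: read crsctl output on stdin and print, one per line,
# the nodes named in a CRS-4405 message.
unknown_nodes() {
    sed -n 's/^CRS-4405:.*Services:[[:space:]]*//p' \
        | tr ', ' '\n\n' \
        | grep -v '^$' || true
}

# Demo with the message from the test above; prints: vmrac2
echo 'CRS-4405: The following nodes are unknown to Oracle High Availability Services:vmrac2' \
    | unknown_nodes

# Each node listed needs OHASD first, so log in to it as root and run:
#   crsctl start crs
```

Once crsctl start crs has brought up OHASD on such a node, that node becomes manageable with crsctl start/stop cluster again.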