The MON monitors, manages, and coordinates the work of the other roles in the distributed environment (OSD/PG, Client, MDS) and maintains a consistent view of the data across the whole cluster.
MON uses a leader/follower model: even when multiple MONs exist in the system, only one MON acts as the working leader at any given time, while the others stand by as followers.
When Ceph loses the leader MON, the remaining MONs hold an election based on the Paxos algorithm and vote in a new leader.
MON startup command:
ceph-mon --fsid=dcef92d7-1f6a-4b9d-8ed0-0037d537d00b --keyring=/etc/ceph/keyring-store/keyring --log-to-stderr=true --err-to-stderr=true --mon-cluster-log-to-stderr=true --log-stderr-prefix=debug --mon-host=10.200.63.69:6789 --mon-initial-members=a --id=a --foreground --public-addr=10.200.63.69 --public-bind-addr=192.170.56.83
For example, after running kubectl apply -f cluster.yaml, the resource below (a CephCluster) is created:
apiVersion: ceph.rook.io/v1
kind: CephCluster
metadata:
  name: rook-ceph
  namespace: rook-ceph
spec:
  cephVersion:
    # The container image used to launch the Ceph daemon pods (mon, mgr, osd, mds, rgw).
    # v12 is luminous, v13 is mimic, and v14 is nautilus.
    # RECOMMENDATION: In production, use a specific version tag instead of the general v14 flag, which pulls the latest release and could result in different
    # versions running within the cluster. See tags available at https://hub.docker.com/r/ceph/ceph/tags/.
    image: ceph/ceph:v14.2.0-20190410
    # Whether to allow unsupported versions of Ceph. Currently luminous, mimic and nautilus are supported, with the recommendation to upgrade to nautilus.
    # Do not set to true in production.
    allowUnsupported: false
  # The path on the host where configuration files will be persisted. Must be specified.
  # Important: if you reinstall the cluster, make sure you delete this directory from each host or else the mons will fail to start on the new cluster.
  # In Minikube, the '/data' directory is configured to persist across reboots. Use "/data/rook" in Minikube environment.
  dataDirHostPath: /var/lib/rook
  # set the amount of mons to be started
  mon:
    count: 1
    allowMultiplePerNode: false
  # enable the ceph dashboard for viewing cluster status
  dashboard:
    enabled: true
    # serve the dashboard under a subpath (useful when you are accessing the dashboard via a reverse proxy)
    # urlPrefix: /ceph-dashboard
    # serve the dashboard at the given port.
    # port: 8443
    # serve the dashboard using SSL
    # ssl: true
  network:
    # toggle to use hostNetwork
    hostNetwork: false
  rbdMirroring:
    # The number of daemons that will perform the rbd mirroring.
    # rbd mirroring must be configured with "rbd mirror" from the rook toolbox.
    workers: 0
  # To control where various services will be scheduled by kubernetes, use the placement configuration sections below.
  # The example under 'all' would have all services scheduled on kubernetes nodes labeled with 'role=storage-node' and
  # tolerate taints with a key of 'storage-node'.
  # placement:
  #   all:
  #     nodeAffinity:
  #       requiredDuringSchedulingIgnoredDuringExecution:
  #         nodeSelectorTerms:
  #         - matchExpressions:
  #           - key: role
  #             operator: In
  #             values:
  #             - storage-node
  #     podAffinity:
  #     podAntiAffinity:
  #     tolerations:
  #     - key: storage-node
  #       operator: Exists
  # The above placement information can also be specified for mon, osd, and mgr components
  #   mon:
  #   osd:
  #   mgr:
  annotations:
  #   all:
  #   mon:
  #   osd:
  # If no mgr annotations are set, prometheus scrape annotations will be set by default.
  #   mgr:
  resources:
  # The requests and limits set here, allow the mgr pod to use half of one CPU core and 1 gigabyte of memory
  #   mgr:
  #     limits:
  #       cpu: "500m"
  #       memory: "1024Mi"
  #     requests:
  #       cpu: "500m"
  #       memory: "1024Mi"
  # The above example requests/limits can also be added to the mon and osd components
  #   mon:
  #   osd:
  storage: # cluster level storage configuration and selection
    useAllNodes: true
    useAllDevices: false
    deviceFilter:
    location:
    config:
      # The default and recommended storeType is dynamically set to bluestore for devices and filestore for directories.
      # Set the storeType explicitly only if it is required not to use the default.
      # storeType: bluestore
      # metadataDevice: "md0" # specify a non-rotational storage so ceph-volume will use it as block db device of bluestore.
      # databaseSizeMB: "1024" # uncomment if the disks are smaller than 100 GB
      # journalSizeMB: "1024" # uncomment if the disks are 20 GB or smaller
      # osdsPerDevice: "1" # this value can be overridden at the node or device level
      # encryptedDevice: "true" # the default value for this option is "false"
    # Cluster level list of directories to use for filestore-based OSD storage. If uncommented, this example would create an OSD under the dataDirHostPath.
    directories:
    - path: /var/lib/rook
    # Individual nodes and their config can be specified as well, but 'useAllNodes' above must be set to false. Then, only the named
    # nodes below will be used as storage resources. Each node's 'name' field should match their 'kubernetes.io/hostname' label.
    # nodes:
    # - name: "172.17.4.101"
    #   directories: # specific directories to use for storage can be specified for each node
    #   - path: "/rook/storage-dir"
    #   resources:
    #     limits:
    #       cpu: "500m"
    #       memory: "1024Mi"
    #     requests:
    #       cpu: "500m"
    #       memory: "1024Mi"
    - name: "master-node"
      devices: # specific devices to use for storage can be specified for each node
      - name: "sdb"
      # - name: "nvme01" # multiple osds can be created on high performance devices
        config:
          osdsPerDevice: "5"
      config: # configuration can be specified at the node level which overrides the cluster level config
        storeType: filestore
    # - name: "172.17.4.301"
    #   deviceFilter: "^sd."
Instantiate the ClusterController, which watches the CephCluster and Node resources:
// NewClusterController create controller for watching cluster custom resources created
func NewClusterController(context *clusterd.Context, rookImage string, volumeAttachment attachment.Attachment) *ClusterController {
return &ClusterController{
context: context,
volumeAttachment: volumeAttachment,
rookImage: rookImage,
clusterMap: make(map[string]*cluster),
}
}
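For context, here is roughly how the operator wires this controller up and starts the watches shown below; a minimal sketch, with the rook package paths and the StartWatch signature treated as assumptions rather than verbatim operator code.
package operator

import (
	"github.com/rook/rook/pkg/clusterd"
	"github.com/rook/rook/pkg/daemon/ceph/agent/flexvolume/attachment"
	"github.com/rook/rook/pkg/operator/ceph/cluster"
)

// startClusterWatch constructs the controller and starts watching CephCluster
// objects. volumeAttachment is assumed to have been created elsewhere.
func startClusterWatch(ctx *clusterd.Context, rookImage string, va attachment.Attachment) chan struct{} {
	c := cluster.NewClusterController(ctx, rookImage, va)
	stopCh := make(chan struct{})
	// "" watches CephCluster objects in all namespaces.
	c.StartWatch("", stopCh)
	return stopCh
}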
Asynchronously watch the CephCluster resource:
resourceHandlerFuncs := cache.ResourceEventHandlerFuncs{
AddFunc: c.onAdd,
UpdateFunc: c.onUpdate,
DeleteFunc: c.onDelete,
}
logger.Infof("start watching clusters in all namespaces")
watcher := opkit.NewWatcher(ClusterResource, namespace, resourceHandlerFuncs, c.context.RookClientset.CephV1().RESTClient())
go watcher.Watch(&cephv1.CephCluster{}, stopCh)
onK8sNodeAdd is explained in Section 3. K8s Node resources are watched as well:
// watch for events on new/updated K8s nodes, too
lwNodes := &cache.ListWatch{
ListFunc: func(options metav1.ListOptions) (runtime.Object, error) {
return c.context.Clientset.CoreV1().Nodes().List(options)
},
WatchFunc: func(options metav1.ListOptions) (watch.Interface, error) {
return c.context.Clientset.CoreV1().Nodes().Watch(options)
},
}
_, nodeController := cache.NewInformer(
lwNodes,
&v1.Node{},
0,
cache.ResourceEventHandlerFuncs{
AddFunc: c.onK8sNodeAdd,
UpdateFunc: c.onK8sNodeUpdate,
DeleteFunc: nil,
},
)
go nodeController.Run(stopCh)
if disableVal := os.Getenv(disableHotplugEnv); disableVal != "true" {
// watch for updates to the device discovery configmap
logger.Infof("Enabling hotplug orchestration: %s=%s", disableHotplugEnv, disableVal)
operatorNamespace := os.Getenv(k8sutil.PodNamespaceEnvVar)
_, deviceCMController := cache.NewInformer(
cache.NewFilteredListWatchFromClient(c.context.Clientset.CoreV1().RESTClient(),
"configmaps", operatorNamespace, func(options *metav1.ListOptions) {
options.LabelSelector = fmt.Sprintf("%s=%s", k8sutil.AppAttr, discoverDaemon.AppName)
},
),
&v1.ConfigMap{},
0,
cache.ResourceEventHandlerFuncs{
AddFunc: nil,
UpdateFunc: c.onDeviceCMUpdate,
DeleteFunc: nil,
},
)
go deviceCMController.Run(stopCh)
} else {
logger.Infof("Disabling hotplug orchestration via %s", disableHotplugEnv)
}
func (c *ClusterController) watchLegacyClusters(namespace string, stopCh chan struct{}, resourceHandlerFuncs cache.ResourceEventHandlerFuncs) {
// watch for cluster.rook.io/v1beta1 events if the CRD exists
if _, err := c.context.RookClientset.CephV1beta1().Clusters(namespace).List(metav1.ListOptions{}); err != nil {
logger.Infof("skipping watching for legacy rook cluster events (legacy cluster CRD probably doesn't exist): %+v", err)
} else {
logger.Infof("start watching legacy rook clusters in all namespaces")
watcherLegacy := opkit.NewWatcher(ClusterResourceRookLegacy, namespace, resourceHandlerFuncs, c.context.RookClientset.CephV1beta1().RESTClient())
go watcherLegacy.Watch(&cephv1beta1.Cluster{}, stopCh)
}
}
Triggered when the Node watch reports an add event; once the node passes validation, createInstance is called:
func (c *ClusterController) onK8sNodeAdd(obj interface{}) {
newNode, ok := obj.(*v1.Node)
if !ok {
logger.Warningf("Expected NodeList but handler received %#v", obj)
}
if k8sutil.GetNodeSchedulable(*newNode) == false {
logger.Debugf("Skipping cluster update. Added node %s is unschedulable", newNode.Labels[v1.LabelHostname])
return
}
for _, cluster := range c.clusterMap {
if cluster.Spec.Storage.UseAllNodes == false {
logger.Debugf("Skipping -> Do not use all Nodes")
continue
}
if cluster.Info == nil {
logger.Info("Cluster %s is not ready. Skipping orchestration.", cluster.Namespace)
continue
}
if valid, _ := k8sutil.ValidNode(*newNode, cluster.Spec.Placement.All()); valid == true {
logger.Debugf("Adding %s to cluster %s", newNode.Labels[v1.LabelHostname], cluster.Namespace)
err := cluster.createInstance(c.rookImage, cluster.Info.CephVersion)
if err != nil {
logger.Errorf("Failed to update cluster in namespace %s. Was not able to add %s. %+v", cluster.Namespace, newNode.Labels[v1.LabelHostname], err)
}
} else {
logger.Infof("Could not add host %s . It is not valid", newNode.Labels[v1.LabelHostname])
continue
}
logger.Infof("Added %s to cluster %s", newNode.Labels[v1.LabelHostname], cluster.Namespace)
}
}
Next, the handling when a CephCluster resource is created (onAdd). The mon settings are validated first:
if cluster.Spec.Mon.Count <= 0 {
logger.Warningf("mon count is 0 or less, should be at least 1, will use default value of %d", mon.DefaultMonCount)
cluster.Spec.Mon.Count = mon.DefaultMonCount
cluster.Spec.Mon.AllowMultiplePerNode = true
}
if cluster.Spec.Mon.Count > mon.MaxMonCount {
logger.Warningf("mon count is bigger than %d (given: %d), not supported, changing to %d", mon.MaxMonCount, cluster.Spec.Mon.Count, mon.MaxMonCount)
cluster.Spec.Mon.Count = mon.MaxMonCount
}
if cluster.Spec.Mon.Count%2 == 0 {
logger.Warningf("mon count is even (given: %d), should be uneven, continuing", cluster.Spec.Mon.Count)
}
A job named rook-ceph-detect-version is then run; it essentially executes the command ceph version:
# ceph version
ceph version 13.2.3 (9bf3c8b1a04b0aa4a3cc78456a508f1c48e70279) mimic (stable)
job := &batch.Job{
ObjectMeta: metav1.ObjectMeta{
Name: detectVersionName,
Namespace: c.Namespace,
},
Spec: batch.JobSpec{
Template: v1.PodTemplateSpec{
ObjectMeta: metav1.ObjectMeta{
Labels: map[string]string{
"job": detectVersionName,
},
},
Spec: podSpec,
},
},
}
k8sutil.AddRookVersionLabelToJob(job)
k8sutil.SetOwnerRef(c.context.Clientset, c.Namespace, &job.ObjectMeta, &c.ownerRef)
// run the job to detect the version
if err := k8sutil.RunReplaceableJob(c.context.Clientset, job, true); err != nil {
return nil, fmt.Errorf("failed to start version job. %+v", err)
}
// Luminous Ceph version
Luminous = CephVersion{12, 0, 0}
// Mimic Ceph version
Mimic = CephVersion{13, 0, 0}
// Nautilus Ceph version
Nautilus = CephVersion{14, 0, 0}
// Octopus Ceph version
Octopus = CephVersion{15, 0, 0}
// supportedVersions are production-ready versions that rook supports
supportedVersions = []CephVersion{Luminous, Mimic, Nautilus}
unsupportedVersions = []CephVersion{Octopus}
if !cluster.Spec.CephVersion.AllowUnsupported {
if !cephVersion.Supported() {
logger.Errorf("unsupported ceph version detected: %s. allowUnsupported must be set to true to run with this version.", cephVersion)
return
}
}
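Before this check can run, the stdout of the detect-version job (the "ceph version 13.2.3 ... mimic (stable)" line shown earlier) has to be parsed into a CephVersion value. Rook does this in its cephver package; the standalone sketch below only illustrates the idea, and the regexp and function name are illustrative rather than Rook's own:
package main

import (
	"fmt"
	"regexp"
	"strconv"
)

// CephVersion mirrors the {major, minor, extra} triple used above.
type CephVersion struct {
	Major, Minor, Extra int
}

// versionPattern matches e.g. "ceph version 13.2.3 (9bf3c8b1...) mimic (stable)".
var versionPattern = regexp.MustCompile(`ceph version (\d+)\.(\d+)\.(\d+)`)

// extractCephVersion is an illustrative stand-in for rook's version parser.
func extractCephVersion(out string) (*CephVersion, error) {
	m := versionPattern.FindStringSubmatch(out)
	if m == nil {
		return nil, fmt.Errorf("failed to parse ceph version from %q", out)
	}
	major, _ := strconv.Atoi(m[1])
	minor, _ := strconv.Atoi(m[2])
	extra, _ := strconv.Atoi(m[3])
	return &CephVersion{Major: major, Minor: minor, Extra: extra}, nil
}

func main() {
	v, err := extractCephVersion("ceph version 13.2.3 (9bf3c8b1a04b0aa4a3cc78456a508f1c48e70279) mimic (stable)")
	fmt.Println(v, err) // &{13 2 3} <nil>
}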
Fetch the CephCluster resource and update its status to Creating.
apiVersion: ceph.rook.io/v1
kind: CephCluster
metadata:
annotations:
kubectl.kubernetes.io/last-applied-configuration: |
{"apiVersion":"ceph.rook.io/v1","kind":"CephCluster","metadata":{"annotations":{},"name":"rook-ceph","namespace":"rook-ceph"},"spec":{"annotations":null,"cephVersion":{"allowUnsupported":false,"image":"ceph/ceph:v13.2.3-20190410"},"dashboard":{"enabled":false},"dataDirHostPath":"/var/lib/rook","mon":{"allowMultiplePerNode":false,"count":1},"network":{"hostNetwork":false},"rbdMirroring":{"workers":0},"resources":null,"storage":{"config":{"osdsPerDevice":"1","storeType":"bluestore"},"deviceFilter":"/dev/sda","location":null,"nodes":[{"directories":[{"path":"/var/lib/rook"}],"name":"master-node"},{"directories":[{"path":"/var/lib/rook"}],"name":"node1"}],"useAllDevices":false,"useAllNodes":false}}}
creationTimestamp: "2019-05-23T06:46:46Z"
finalizers:
- cephcluster.ceph.rook.io
generation: 1144
name: rook-ceph
namespace: rook-ceph
resourceVersion: "299112"
selfLink: /apis/ceph.rook.io/v1/namespaces/rook-ceph/cephclusters/rook-ceph
uid: 885ae2f9-7d26-11e9-82d7-0800271c9f15
spec:
cephVersion:
image: ceph/ceph:v13.2.3-20190410
dashboard: {}
dataDirHostPath: /var/lib/rook
mon:
allowMultiplePerNode: false
count: 1
preferredCount: 0
network:
hostNetwork: false
rbdMirroring:
workers: 0
storage:
config:
osdsPerDevice: "1"
storeType: bluestore
deviceFilter: /dev/sda
nodes:
- config: null
directories:
- config: null
path: /var/lib/rook
name: master-node
resources: {}
- config: null
directories:
- config: null
path: /var/lib/rook
name: node1
resources: {}
useAllDevices: false
status:
ceph:
health: HEALTH_OK
lastChanged: "2019-05-23T06:56:14Z"
lastChecked: "2019-05-24T02:24:24Z"
previousHealth: HEALTH_WARN
state: Created
updateClusterStatus sets the cluster state to Creating.
createInstance creates the cluster components.
updateClusterStatus then sets the cluster state to Created.
// Start the Rook cluster components. Retry several times in case of failure.
err = wait.Poll(clusterCreateInterval, clusterCreateTimeout, func() (bool, error) {
if err := c.updateClusterStatus(clusterObj.Namespace, clusterObj.Name, cephv1.ClusterStateCreating, ""); err != nil {
logger.Errorf("failed to update cluster status in namespace %s: %+v", cluster.Namespace, err)
return false, nil
}
err := cluster.createInstance(c.rookImage, *cephVersion)
if err != nil {
logger.Errorf("failed to create cluster in namespace %s. %+v", cluster.Namespace, err)
return false, nil
}
// cluster is created, update the cluster CRD status now
if err := c.updateClusterStatus(clusterObj.Namespace, clusterObj.Name, cephv1.ClusterStateCreated, ""); err != nil {
logger.Errorf("failed to update cluster status in namespace %s: %+v", cluster.Namespace, err)
return false, nil
}
return true, nil
})
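updateClusterStatus itself is a small get-modify-update against the CephCluster object via the Rook clientset. A minimal sketch, assuming the status carries only State and Message fields (the real method may set more):
// Sketch of the status update; c.context.RookClientset is the generated Rook clientset.
func (c *ClusterController) updateClusterStatus(namespace, name string, state cephv1.ClusterState, message string) error {
	// fetch the latest object so we do not clobber concurrent spec changes
	cluster, err := c.context.RookClientset.CephV1().CephClusters(namespace).Get(name, metav1.GetOptions{})
	if err != nil {
		return fmt.Errorf("failed to get cluster %s to update status. %+v", name, err)
	}
	cluster.Status.State = state
	cluster.Status.Message = message
	if _, err := c.context.RookClientset.CephV1().CephClusters(namespace).Update(cluster); err != nil {
		return fmt.Errorf("failed to update cluster %s status. %+v", name, err)
	}
	return nil
}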
The main work happens in doOrchestration, which is analyzed in Section 6.
func (c *cluster) createInstance(rookImage string, cephVersion cephver.CephVersion) error {
var err error
c.setOrchestrationNeeded()
// execute an orchestration until
// there are no more unapplied changes to the cluster definition and
// while no other goroutine is already running a cluster update
for c.checkSetOrchestrationStatus() == true {
if err != nil {
logger.Errorf("There was an orchestration error, but there is another orchestration pending; proceeding with next orchestration run (which may succeed). %+v", err)
}
// Use a DeepCopy of the spec to avoid using an inconsistent data-set
spec := c.Spec.DeepCopy()
err = c.doOrchestration(rookImage, cephVersion, spec)
c.unsetOrchestrationStatus()
}
return err
}
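setOrchestrationNeeded / checkSetOrchestrationStatus / unsetOrchestrationStatus together form a latch that coalesces repeated orchestration requests while one run is in flight. A self-contained sketch of that pattern (the type and field names here are illustrative, not Rook's):
package orchestration

import "sync"

// orchestrationLatch coalesces orchestration requests: many callers may flag
// that work is needed, but only one orchestration loop runs at a time.
type orchestrationLatch struct {
	mu      sync.Mutex
	needed  bool // a new orchestration has been requested
	running bool // an orchestration is currently executing
}

func (l *orchestrationLatch) setOrchestrationNeeded() {
	l.mu.Lock()
	defer l.mu.Unlock()
	l.needed = true
}

// checkSetOrchestrationStatus reports whether the caller should run another
// orchestration pass, and atomically claims it if so.
func (l *orchestrationLatch) checkSetOrchestrationStatus() bool {
	l.mu.Lock()
	defer l.mu.Unlock()
	if l.needed && !l.running {
		l.needed = false
		l.running = true
		return true
	}
	return false
}

func (l *orchestrationLatch) unsetOrchestrationStatus() {
	l.mu.Lock()
	defer l.mu.Unlock()
	l.running = false
}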
apiVersion: v1
data:
config: ""
kind: ConfigMap
metadata:
creationTimestamp: "2019-05-23T06:46:57Z"
name: rook-config-override
namespace: rook-ceph
ownerReferences:
- apiVersion: v1
blockOwnerDeletion: true
kind: CephCluster
name: rook-ceph
uid: 885ae2f9-7d26-11e9-82d7-0800271c9f15
resourceVersion: "134416"
selfLink: /api/v1/namespaces/rook-ceph/configmaps/rook-config-override
uid: 8ef4890a-7d26-11e9-82d7-0800271c9f15
// Create a configmap for overriding ceph config settings
// These settings should only be modified by a user after they are initialized
placeholderConfig := map[string]string{
k8sutil.ConfigOverrideVal: "",
}
cm := &v1.ConfigMap{
ObjectMeta: metav1.ObjectMeta{
Name: k8sutil.ConfigOverrideName,
},
Data: placeholderConfig,
}
k8sutil.SetOwnerRef(c.context.Clientset, c.Namespace, &cm.ObjectMeta, &c.ownerRef)
_, err := c.context.Clientset.CoreV1().ConfigMaps(c.Namespace).Create(cm)
if err != nil && !errors.IsAlreadyExists(err) {
return fmt.Errorf("failed to create override configmap %s. %+v", c.Namespace, err)
}
6.2.1 The CreateOrLoadClusterInfo function
It calls the client-go API to fetch the secret. If the secret does not exist, it calls createNamedClusterInfo, which runs ceph-authtool --create-keyring /var/lib/rook/rook-ceph/mon.keyring --gen-key -n mon. --cap mon 'allow *' to generate the mon secret.
It then runs ceph-authtool --create-keyring /var/lib/rook/rook-ceph/client.admin.keyring --gen-key -n client.admin --cap mon 'allow *' --cap osd 'allow *' --cap mgr 'allow *' --cap mds 'allow' to generate the admin secret.
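Both secrets therefore come from shelling out to ceph-authtool and then reading the generated key back out of the keyring file. A rough standalone sketch of that flow (the helper name and key-extraction regexp are illustrative, not Rook's actual code):
package main

import (
	"fmt"
	"io/ioutil"
	"os/exec"
	"regexp"
	"strings"
)

var keyLine = regexp.MustCompile(`key\s*=\s*(\S+)`)

// generateKeyring shells out to ceph-authtool to create a keyring for the
// given entity, then reads the generated base64 key back out of the file.
// Note: no shell is involved, so the caps values need no extra quoting here.
func generateKeyring(path, entity string, caps ...string) (string, error) {
	args := []string{"--create-keyring", path, "--gen-key", "-n", entity}
	args = append(args, caps...)
	if out, err := exec.Command("ceph-authtool", args...).CombinedOutput(); err != nil {
		return "", fmt.Errorf("ceph-authtool failed: %v (%s)", err, out)
	}
	content, err := ioutil.ReadFile(path)
	if err != nil {
		return "", err
	}
	m := keyLine.FindStringSubmatch(string(content))
	if m == nil {
		return "", fmt.Errorf("no key found in %s", path)
	}
	return strings.TrimSpace(m[1]), nil
}

func main() {
	monKey, err := generateKeyring("/var/lib/rook/rook-ceph/mon.keyring", "mon.",
		"--cap", "mon", "allow *")
	fmt.Println(monKey, err)
}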
The GenerateConfigFile function writes the configuration to /var/lib/rook/rook-ceph/rook-ceph.config and copies it to /etc/ceph/ceph.conf, as shown below:
[global]
fsid = 4b8dfb1b-2a8b-46df-b5d3-a9021b501337
run dir = /var/lib/rook/rook-ceph
mon initial members = a
mon host = v1:10.200.33.128:6789
log file = /dev/stderr
mon cluster log file = /dev/stderr
mon keyvaluedb = rocksdb
mon_allow_pool_delete = true
mon_max_pg_per_osd = 1000
debug default = 0
debug rados = 0
debug mon = 0
debug osd = 0
debug bluestore = 0
debug filestore = 0
debug journal = 0
debug leveldb = 0
filestore_omap_backend = rocksdb
osd pg bits = 11
osd pgp bits = 11
osd pool default size = 1
osd pool default min size = 1
osd pool default pg num = 100
osd pool default pgp num = 100
rbd_default_features = 3
fatal signal handlers = false

[client.admin]
keyring = /var/lib/rook/rook-ceph/client.admin.keyring
secrets, err := context.Clientset.CoreV1().Secrets(namespace).Get(appName, metav1.GetOptions{})
if err != nil {
if !errors.IsNotFound(err) {
return nil, maxMonID, monMapping, fmt.Errorf("failed to get mon secrets. %+v", err)
}
if ownerRef == nil {
return nil, maxMonID, monMapping, fmt.Errorf("not expected to create new cluster info and did not find existing secret")
}
clusterInfo, err = createNamedClusterInfo(context, namespace)
if err != nil {
return nil, maxMonID, monMapping, fmt.Errorf("failed to create mon secrets. %+v", err)
}
err = createClusterAccessSecret(context.Clientset, namespace, clusterInfo, ownerRef)
if err != nil {
return nil, maxMonID, monMapping, err
}
} else {
clusterInfo = &cephconfig.ClusterInfo{
Name: string(secrets.Data[clusterSecretName]),
FSID: string(secrets.Data[fsidSecretName]),
MonitorSecret: string(secrets.Data[monSecretName]),
AdminSecret: string(secrets.Data[adminSecretName]),
}
logger.Debugf("found existing monitor secrets for cluster %s", clusterInfo.Name)
}
6.2.2 The initClusterInfo function
The saveMonConfig function creates the rook-ceph-mon-endpoints ConfigMap with the following content:
data:
data: a=10.200.33.128:6789
mapping: '{"node":{"a":{"Name":"master-node","Hostname":"master-node","Address":"192.168.72.106"}},"port":{}}'
maxMonId: "0"
CreateOrUpdate creates the rook-ceph-mons-keyring secret, so that all mons share the same keyring.
CreateOrUpdate also creates the rook-ceph-admin-keyring secret.
// initClusterInfo retrieves the ceph cluster info if it already exists.
// If a new cluster, create new keys.
func (c *Cluster) initClusterInfo(cephVersion cephver.CephVersion) error {
var err error
// get the cluster info from secret
c.clusterInfo, c.maxMonID, c.mapping, err = CreateOrLoadClusterInfo(c.context, c.Namespace, &c.ownerRef)
c.clusterInfo.CephVersion = cephVersion
if err != nil {
return fmt.Errorf("failed to get cluster info. %+v", err)
}
// save cluster monitor config
if err = c.saveMonConfig(); err != nil {
return fmt.Errorf("failed to save mons. %+v", err)
}
k := keyring.GetSecretStore(c.context, c.Namespace, &c.ownerRef)
// store the keyring which all mons share
if err := k.CreateOrUpdate(keyringStoreName, c.genMonSharedKeyring()); err != nil {
return fmt.Errorf("failed to save mon keyring secret. %+v", err)
}
// also store the admin keyring for other daemons that might need it during init
if err := k.Admin().CreateOrUpdate(c.clusterInfo); err != nil {
return fmt.Errorf("failed to save admin keyring secret. %+v", err)
}
return nil
}
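genMonSharedKeyring simply renders the mon secret into standard Ceph keyring syntax, which every mon later mounts at /etc/ceph/keyring-store/keyring. A sketch of what that rendering amounts to (the exact template in Rook may differ slightly):
// Illustrative only: render the mon secret in Ceph keyring syntax.
func monSharedKeyring(monSecret string) string {
	return fmt.Sprintf(`[mon.]
	key = %s
	caps mon = "allow *"
`, monSecret)
}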
Initialize the mon configuration:
func (c *Cluster) initMonConfig(size int) (int, []*monConfig) {
mons := []*monConfig{}
// initialize the mon pod info for mons that have been previously created
for _, monitor := range c.clusterInfo.Monitors {
mons = append(mons, &monConfig{
ResourceName: resourceName(monitor.Name),
DaemonName: monitor.Name,
Port: cephutil.GetPortFromEndpoint(monitor.Endpoint),
DataPathMap: config.NewStatefulDaemonDataPathMap(
c.dataDirHostPath, dataDirRelativeHostPath(monitor.Name), config.MonType, monitor.Name, c.Namespace),
})
}
// initialize mon info if we don't have enough mons (at first startup)
existingCount := len(c.clusterInfo.Monitors)
for i := len(c.clusterInfo.Monitors); i < size; i++ {
c.maxMonID++
mons = append(mons, c.newMonConfig(c.maxMonID))
}
return existingCount, mons
}
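newMonConfig derives the daemon name from maxMonID, so the first mon is "a", the second "b", and so on (Rook uses a k8sutil helper for this). A tiny illustrative reimplementation of that index-to-name mapping:
package main

import "fmt"

// indexToName maps 0 -> "a", 1 -> "b", ..., 25 -> "z", 26 -> "aa", ...,
// which is the spreadsheet-style naming used for the mon daemons.
func indexToName(i int) string {
	name := ""
	for {
		name = string(rune('a'+i%26)) + name
		i = i/26 - 1
		if i < 0 {
			break
		}
	}
	return name
}

func main() {
	for i := 0; i < 4; i++ {
		fmt.Println(i, indexToName(i)) // 0 a, 1 b, 2 c, 3 d
	}
}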
7.2.1 The initMonIPs function: if host networking is not used, a Service is created for each mon and its cluster IP is used.
func (c *Cluster) initMonIPs(mons []*monConfig) error {
for _, m := range mons {
if c.HostNetwork {
logger.Infof("setting mon endpoints for hostnetwork mode")
node, ok := c.mapping.Node[m.DaemonName]
if !ok {
return fmt.Errorf("mon doesn't exist in assignment map")
}
m.PublicIP = node.Address
} else {
serviceIP, err := c.createService(m)
if err != nil {
return fmt.Errorf("failed to create mon service. %+v", err)
}
m.PublicIP = serviceIP
}
c.clusterInfo.Monitors[m.DaemonName] = cephconfig.NewMonInfo(m.DaemonName, m.PublicIP, m.Port)
}
return nil
}
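createService gives each mon a ClusterIP Service so the mon keeps a stable address even when its pod is rescheduled. A minimal sketch of such a Service built with client-go types; the labels are taken from the mon pod shown later in this article, while the port name and remaining details are assumptions:
// Sketch of a per-mon ClusterIP Service; rook's createService additionally sets
// the owner reference and, on Nautilus, exposes the msgr2 port 3300.
func makeMonService(m *monConfig, namespace string) *v1.Service {
	labels := map[string]string{
		"app":            "rook-ceph-mon",
		"ceph_daemon_id": m.DaemonName,
		"mon":            m.DaemonName,
		"mon_cluster":    namespace,
	}
	return &v1.Service{
		ObjectMeta: metav1.ObjectMeta{
			Name:      m.ResourceName,
			Namespace: namespace,
			Labels:    labels,
		},
		Spec: v1.ServiceSpec{
			Selector: labels,
			Ports: []v1.ServicePort{{
				Name:       "client",
				Port:       6789,
				TargetPort: intstr.FromInt(6789),
				Protocol:   v1.ProtocolTCP,
			}},
		},
	}
}
// After Create(), the assigned ClusterIP (svc.Spec.ClusterIP) becomes m.PublicIP.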
7.2.2 The saveMonConfig function saves the rook-ceph-mon-endpoints ConfigMap.
7.2.3 The writeConnectionConfig function writes the config file into the /var/lib/rook/rook-ceph directory and copies it to /etc/ceph/.
7.2.4 Call the client-go API to create the mon Deployment.
The full mon Deployment is shown later in this article.
waitForQuorumWithMons runs ceph mon_status --connect-timeout=15 --cluster=rook-ceph --conf=/var/lib/rook/rook-ceph/rook-ceph.config --keyring=/var/lib/rook/rook-ceph/client.admin.keyring --format json --out-file /tmp/565758730 to obtain the mon status:
{
"name":"a",
"rank":0,
"state":"leader",
"election_epoch":9,
"quorum":[
0
],
"features":{
"required_con":"144115738102218752",
"required_mon":[
"kraken",
"luminous",
"mimic",
"osdmap-prune"
],
"quorum_con":"4611087854031142907",
"quorum_mon":[
"kraken",
"luminous",
"mimic",
"osdmap-prune"
]
},
"outside_quorum":[
],
"extra_probe_peers":[
],
"sync_provider":[
],
"monmap":{
"epoch":1,
"fsid":"dcef92d7-1f6a-4b9d-8ed0-0037d537d00b",
"modified":"2019-05-23 06:47:12.870838",
"created":"2019-05-23 06:47:12.870838",
"features":{
"persistent":[
"kraken",
"luminous",
"mimic",
"osdmap-prune"
],
"optional":[
]
},
"mons":[
{
"rank":0,
"name":"a",
"addr":"10.200.63.69:6789/0",
"public_addr":"10.200.63.69:6789/0"
}
]
},
"feature_map":{
"mon":[
{
"features":"0x3ffddff8ffa4fffb",
"release":"luminous",
"num":1
}
],
"osd":[
{
"features":"0x3ffddff8ffa4fffb",
"release":"luminous",
"num":2
}
],
"client":[
{
"features":"0x27018fb86aa42ada",
"release":"jewel",
"num":1
},
{
"features":"0x3ffddff8ffa4fffb",
"release":"luminous",
"num":1
}
],
"mgr":[
{
"features":"0x3ffddff8ffa4fffb",
"release":"luminous",
"num":1
}
]
}
}
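waitForQuorumWithMons keeps polling this output until every expected mon name shows up in the quorum ranks. A condensed sketch of that loop, with the JSON trimmed to the fields the check needs and the helper signature invented for illustration:
package quorum

import (
	"fmt"
	"time"
)

// monStatus holds just the fields of `ceph mon_status` needed for the check.
type monStatus struct {
	Quorum []int `json:"quorum"`
	MonMap struct {
		Mons []struct {
			Rank int    `json:"rank"`
			Name string `json:"name"`
		} `json:"mons"`
	} `json:"monmap"`
}

// waitForQuorum polls getStatus until every expected mon appears in the
// quorum, or gives up after `retries` attempts.
func waitForQuorum(getStatus func() (monStatus, error), mons []string, retries int) error {
	for i := 0; i < retries; i++ {
		if status, err := getStatus(); err == nil {
			inQuorum := map[string]bool{}
			for _, rank := range status.Quorum {
				for _, m := range status.MonMap.Mons {
					if m.Rank == rank {
						inQuorum[m.Name] = true
					}
				}
			}
			ready := true
			for _, name := range mons {
				if !inQuorum[name] {
					ready = false
					break
				}
			}
			if ready {
				return nil
			}
		}
		time.Sleep(5 * time.Second)
	}
	return fmt.Errorf("timed out waiting for mon quorum")
}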
func (c *Cluster) startMon(m *monConfig, hostname string) error {
d := c.makeDeployment(m, hostname)
logger.Debugf("Starting mon: %+v", d.Name)
_, err := c.context.Clientset.AppsV1().Deployments(c.Namespace).Create(d)
if err != nil {
if !errors.IsAlreadyExists(err) {
return fmt.Errorf("failed to create mon deployment %s. %+v", m.ResourceName, err)
}
logger.Infof("deployment for mon %s already exists. updating if needed", m.ResourceName)
if _, err := updateDeploymentAndWait(c.context, d, c.Namespace); err != nil {
return fmt.Errorf("failed to update mon deployment %s. %+v", m.ResourceName, err)
}
}
return nil
}
// Enable Ceph messenger 2 protocol on Nautilus
if c.clusterInfo.CephVersion.IsAtLeastNautilus() {
v, err := client.GetCephMonVersion(c.context)
if err != nil {
return fmt.Errorf("failed to get ceph mon version. %+v", err)
}
if v.IsAtLeastNautilus() {
versions, err := client.GetCephVersions(c.context)
if err != nil {
return fmt.Errorf("failed to get ceph daemons versions. %+v", err)
}
if len(versions.Mon) == 1 {
// If length is one, this clearly indicates that all the mons are running the same version
// We are doing this because 'ceph version' might return the Ceph version that a majority of mons has but not all of them
// so instead of trying to active msgr2 when mons are not ready, we activate it when we believe that's the right time
client.EnableMessenger2(c.context)
}
}
}
This executes the command ceph mon enable-msgr2. msgr2 supports encryption and Kerberos-based authentication, among other features, which is very beneficial for improving the security of a Ceph cluster.
// EnableMessenger2 enable the messenger 2 protocol on Nautilus clusters
func EnableMessenger2(context *clusterd.Context) error {
_, err := context.Executor.ExecuteCommandWithOutput(false, "", "ceph", "mon", "enable-msgr2")
if err != nil {
return fmt.Errorf("failed to enable msgr2 protocol: %+v", err)
}
logger.Infof("successfully enabled msgr2 protocol")
return nil
}
9.1 The CreateDefaultCrushMap function
Adjust the tunables on the existing cluster: ceph osd crush tunables firefly --connect-timeout=15 --cluster=rook-ceph --conf=/var/lib/rook/rook-ceph/rook-ceph.config --keyring=/var/lib/rook/rook-ceph/client.admin.keyring --format plain --out-file /tmp/113703137
Compile the plain-text map into a binary CRUSH map file: crushtool -c /tmp/533654220 -o /tmp/337292219
SetCrushMap injects the compiled binary map into the cluster: ceph osd setcrushmap -i /tmp/337292219 --connect-timeout=15 --cluster=rook-ceph --conf=/var/lib/rook/rook-ceph/rook-ceph.config --keyring=/var/lib/rook/rook-ceph/client.admin.keyring --format json --out-file /tmp/809736670
{"election_epoch":3,"quorum":[0],"quorum_names":["a"],"quorum_leader_name":"a","quorum_age":189,"monmap":{"epoch":1,"fsid":"9d6b7ae7-362c-45df-bb28-cfc8393a2edf","modified":"2019-05-06 19:40:19.266161","created":"2019-05-06 19:40:19.266161","min_mon_release":14,"min_mon_release_name":"nautilus","features":{"persistent":["kraken","luminous","mimic","osdmap-prune","nautilus"],"optional":[]},"mons":[{"rank":0,"name":"a","public_addrs":{"addrvec":[{"type":"v2","addr":"10.200.88.83:3300","nonce":0},{"type":"v1","addr":"10.200.88.83:6789","nonce":0}]},"addr":"10.200.88.83:6789/0","public_addr":"10.200.88.83:6789/0"}]}}
apiVersion: v1
kind: Pod
metadata:
creationTimestamp: "2019-05-23T06:47:11Z"
generateName: rook-ceph-mon-a-54d989f65-
labels:
app: rook-ceph-mon
ceph_daemon_id: a
mon: a
mon_cluster: rook-ceph
pod-template-hash: 54d989f65
rook_cluster: rook-ceph
name: rook-ceph-mon-a-54d989f65-hs958
namespace: rook-ceph
ownerReferences:
- apiVersion: apps/v1
blockOwnerDeletion: true
controller: true
kind: ReplicaSet
name: rook-ceph-mon-a-54d989f65
uid: 975c250f-7d26-11e9-82d7-0800271c9f15
resourceVersion: "140783"
selfLink: /api/v1/namespaces/rook-ceph/pods/rook-ceph-mon-a-54d989f65-hs958
uid: 975e035c-7d26-11e9-82d7-0800271c9f15
spec:
affinity: {}
containers:
- args:
- --fsid=dcef92d7-1f6a-4b9d-8ed0-0037d537d00b
- --keyring=/etc/ceph/keyring-store/keyring
- --log-to-stderr=true
- --err-to-stderr=true
- --mon-cluster-log-to-stderr=true
- '--log-stderr-prefix=debug '
- --mon-host=$(ROOK_CEPH_MON_HOST)
- --mon-initial-members=$(ROOK_CEPH_MON_INITIAL_MEMBERS)
- --id=a
- --foreground
- --public-addr=10.200.63.69
- --public-bind-addr=$(ROOK_POD_IP)
command:
- ceph-mon
env:
- name: CONTAINER_IMAGE
value: ceph/ceph:v13.2.3-20190410
- name: POD_NAME
valueFrom:
fieldRef:
apiVersion: v1
fieldPath: metadata.name
- name: POD_NAMESPACE
valueFrom:
fieldRef:
apiVersion: v1
fieldPath: metadata.namespace
- name: NODE_NAME
valueFrom:
fieldRef:
apiVersion: v1
fieldPath: spec.nodeName
- name: POD_MEMORY_LIMIT
valueFrom:
resourceFieldRef:
divisor: "0"
resource: limits.memory
- name: POD_MEMORY_REQUEST
valueFrom:
resourceFieldRef:
divisor: "0"
resource: requests.memory
- name: POD_CPU_LIMIT
valueFrom:
resourceFieldRef:
divisor: "1"
resource: limits.cpu
- name: POD_CPU_REQUEST
valueFrom:
resourceFieldRef:
divisor: "0"
resource: requests.cpu
- name: ROOK_CEPH_MON_HOST
valueFrom:
secretKeyRef:
key: mon_host
name: rook-ceph-config
- name: ROOK_CEPH_MON_INITIAL_MEMBERS
valueFrom:
secretKeyRef:
key: mon_initial_members
name: rook-ceph-config
- name: ROOK_POD_IP
valueFrom:
fieldRef:
apiVersion: v1
fieldPath: status.podIP
image: ceph/ceph:v13.2.3-20190410
imagePullPolicy: IfNotPresent
name: mon
ports:
- containerPort: 6789
name: client
protocol: TCP
resources: {}
securityContext:
privileged: false
procMount: Default
terminationMessagePath: /dev/termination-log
terminationMessagePolicy: File
volumeMounts:
- mountPath: /etc/ceph
name: rook-ceph-config
readOnly: true
- mountPath: /etc/ceph/keyring-store/
name: rook-ceph-mons-keyring
readOnly: true
- mountPath: /var/log/ceph
name: rook-ceph-log
- mountPath: /var/lib/ceph/mon/ceph-a
name: ceph-daemon-data
- mountPath: /var/run/secrets/kubernetes.io/serviceaccount
name: default-token-5jp2h
readOnly: true
dnsPolicy: ClusterFirst
enableServiceLinks: true
initContainers:
- args:
- --fsid=dcef92d7-1f6a-4b9d-8ed0-0037d537d00b
- --keyring=/etc/ceph/keyring-store/keyring
- --log-to-stderr=true
- --err-to-stderr=true
- --mon-cluster-log-to-stderr=true
- '--log-stderr-prefix=debug '
- --mon-host=$(ROOK_CEPH_MON_HOST)
- --mon-initial-members=$(ROOK_CEPH_MON_INITIAL_MEMBERS)
- --id=a
- --public-addr=10.200.63.69
- --mkfs
command:
- ceph-mon
env:
- name: CONTAINER_IMAGE
value: ceph/ceph:v13.2.3-20190410
- name: POD_NAME
valueFrom:
fieldRef:
apiVersion: v1
fieldPath: metadata.name
- name: POD_NAMESPACE
valueFrom:
fieldRef:
apiVersion: v1
fieldPath: metadata.namespace
- name: NODE_NAME
valueFrom:
fieldRef:
apiVersion: v1
fieldPath: spec.nodeName
- name: POD_MEMORY_LIMIT
valueFrom:
resourceFieldRef:
divisor: "0"
resource: limits.memory
- name: POD_MEMORY_REQUEST
valueFrom:
resourceFieldRef:
divisor: "0"
resource: requests.memory
- name: POD_CPU_LIMIT
valueFrom:
resourceFieldRef:
divisor: "1"
resource: limits.cpu
- name: POD_CPU_REQUEST
valueFrom:
resourceFieldRef:
divisor: "0"
resource: requests.cpu
- name: ROOK_CEPH_MON_HOST
valueFrom:
secretKeyRef:
key: mon_host
name: rook-ceph-config
- name: ROOK_CEPH_MON_INITIAL_MEMBERS
valueFrom:
secretKeyRef:
key: mon_initial_members
name: rook-ceph-config
image: ceph/ceph:v13.2.3-20190410
imagePullPolicy: IfNotPresent
name: init-mon-fs
resources: {}
securityContext:
privileged: false
procMount: Default
terminationMessagePath: /dev/termination-log
terminationMessagePolicy: File
volumeMounts:
- mountPath: /etc/ceph
name: rook-ceph-config
readOnly: true
- mountPath: /etc/ceph/keyring-store/
name: rook-ceph-mons-keyring
readOnly: true
- mountPath: /var/log/ceph
name: rook-ceph-log
- mountPath: /var/lib/ceph/mon/ceph-a
name: ceph-daemon-data
- mountPath: /var/run/secrets/kubernetes.io/serviceaccount
name: default-token-5jp2h
readOnly: true
nodeName: master-node
nodeSelector:
kubernetes.io/hostname: master-node
restartPolicy: Always
schedulerName: default-scheduler
securityContext: {}
serviceAccount: default
serviceAccountName: default
terminationGracePeriodSeconds: 30
volumes:
- configMap:
defaultMode: 420
items:
- key: ceph.conf
mode: 256
path: ceph.conf
name: rook-ceph-config
name: rook-ceph-config
- name: rook-ceph-mons-keyring
secret:
defaultMode: 420
secretName: rook-ceph-mons-keyring
- hostPath:
path: /var/lib/rook/rook-ceph/log
type: ""
name: rook-ceph-log
- hostPath:
path: /var/lib/rook/mon-a/data
type: ""
name: ceph-daemon-data
- name: default-token-5jp2h
secret:
defaultMode: 420
secretName: default-token-5jp2h
status:
conditions:
- lastProbeTime: null
lastTransitionTime: "2019-05-23T07:41:47Z"
status: "True"
type: Initialized
- lastProbeTime: null
lastTransitionTime: "2019-05-23T07:41:48Z"
status: "True"
type: Ready
- lastProbeTime: null
lastTransitionTime: "2019-05-23T07:41:48Z"
status: "True"
type: ContainersReady
- lastProbeTime: null
lastTransitionTime: "2019-05-23T06:47:11Z"
status: "True"
type: PodScheduled
containerStatuses:
- containerID: docker://a08fe6840c7fc9dc6755a88cd7dd06038e57c6f412a09c63414dff627fc4905b
image: ceph/ceph:v13.2.3-20190410
imageID: docker://sha256:fdb3585c96619a300dc2f153a3269c7b6e222adce9eed6ec199dc54302b9195a
lastState:
terminated:
containerID: docker://8477dd340d5d1dd8f8c5c9d7815da31b4cc9aa9ecb95ea8bf844976dee917988
exitCode: 0
finishedAt: "2019-05-23T07:39:14Z"
reason: Completed
startedAt: "2019-05-23T07:28:48Z"
name: mon
ready: true
restartCount: 3
state:
running:
startedAt: "2019-05-23T07:41:48Z"
hostIP: 192.168.74.57
initContainerStatuses:
- containerID: docker://7316d3105025179241e3fe56eb7b7024325c700cd5bfb9044d72305593c2b753
image: ceph/ceph:v13.2.3-20190410
imageID: docker://sha256:fdb3585c96619a300dc2f153a3269c7b6e222adce9eed6ec199dc54302b9195a
lastState: {}
name: init-mon-fs
ready: true
restartCount: 3
state:
terminated:
containerID: docker://7316d3105025179241e3fe56eb7b7024325c700cd5bfb9044d72305593c2b753
exitCode: 0
finishedAt: "2019-05-23T07:41:46Z"
reason: Completed
startedAt: "2019-05-23T07:41:40Z"
phase: Running
podIP: 192.170.56.83
qosClass: BestEffort
startTime: "2019-05-23T06:47:11Z"
This article covered how the CephCluster resource is watched: when a new one is added, the onAdd function is called.
That includes creating the ConfigMap, initializing the cluster info and the rook-ceph.config file, and storing the secrets and ConfigMaps in Kubernetes.
It then creates the mon Deployments, calls ceph mon_status to confirm the mons are up, sets the CRUSH map, and so on.
Finally, it creates the mgr, OSDs, and the remaining components.