In the earlier articles on how newly created tasks and tasks woken out of idle via wake_up_process() get scheduled, two points were not analysed clearly, namely how the scheduler chooses a cpu on which to run the scheduling entity during that process. Here we pull that question out and analyse it in detail.
A simplified flow chart is as follows:
For now we only look at the second part: how the scheduler places the task when EAS is not enabled. The source code to be analysed is the following:
static int
select_task_rq_fair(struct task_struct *p, int prev_cpu, int sd_flag, int wake_flags, int sibling_count_hint)
{
struct sched_domain *tmp, *affine_sd = NULL, *sd = NULL;
int cpu = smp_processor_id();
int new_cpu = prev_cpu;
int want_affine = 0;
int sync = wake_flags & WF_SYNC;
#ifdef CONFIG_64BIT_ONLY_CPU
struct cpumask tmpmask;
if (find_packing_cpu(p, &new_cpu))
return new_cpu;
cpumask_andnot(&tmpmask, cpu_present_mask, &b64_only_cpu_mask);
if (cpumask_test_cpu(cpu, &tmpmask)) {
if (weighted_cpuload_32bit(cpu) >
sysctl_sched_32bit_load_threshold &&
!test_tsk_thread_flag(p, TIF_32BIT))
return min_load_64bit_only_cpu();
}
#endif
if (sd_flag & SD_BALANCE_WAKE) {
record_wakee(p);
want_affine = !wake_wide(p, sibling_count_hint) &&
!wake_cap(p, cpu, prev_cpu) &&
cpumask_test_cpu(cpu, &p->cpus_allowed);
}
if (energy_aware())
return select_energy_cpu_brute(p, prev_cpu, sync);
/* The code below is what this section analyses. */
rcu_read_lock();
for_each_domain(cpu, tmp) {
if (!(tmp->flags & SD_LOAD_BALANCE))
break;
/*
* If both cpu and prev_cpu are part of this domain,
* cpu is a valid SD_WAKE_AFFINE target.
*/
if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {
affine_sd = tmp;
break;
}
if (tmp->flags & sd_flag)
sd = tmp;
else if (!want_affine)
break;
}
if (affine_sd) {
sd = NULL; /* Prefer wake_affine over balance flags */
if (cpu != prev_cpu && wake_affine(affine_sd, p, prev_cpu, sync))
new_cpu = cpu;
}
if (sd && !(sd_flag & SD_BALANCE_FORK)) {
/*
* We're going to need the task's util for capacity_spare_wake
* in find_idlest_group. Sync it up to prev_cpu's
* last_update_time.
*/
sync_entity_load_avg(&p->se);
}
if (!sd) {
if (sd_flag & SD_BALANCE_WAKE) /* XXX always ? */
new_cpu = select_idle_sibling(p, prev_cpu, new_cpu);
} else {
new_cpu = find_idlest_cpu(sd, p, cpu, prev_cpu, sd_flag);
}
rcu_read_unlock();
return new_cpu;
}
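Before walking through the details, it helps to see the decision structure of the non-EAS tail in one place. The following is a minimal user-space sketch, not kernel code: the helpers are stand-ins for the kernel functions of the same name, and have_sd is a deliberate simplification of the domain loop. It shows the three possible outcomes: keep prev_cpu, pull to the waker's cpu and refine with select_idle_sibling(), or run the find_idlest_cpu() slow path for fork/exec.

#include <stdbool.h>
#include <stdio.h>

/* Stand-ins for the kernel helpers of the same name; return values are made up. */
static bool wake_affine_says_pull(void) { return true; }
static int select_idle_sibling_stub(int prev, int target) { (void)prev; return target; }
static int find_idlest_cpu_stub(int cpu) { return cpu; }

/* Simplified restatement of the non-EAS tail of select_task_rq_fair(). */
static int choose_cpu(int cpu, int prev_cpu, bool want_affine, bool balance_wake)
{
    int new_cpu = prev_cpu;
    /* With the sd_init() flags listed later, only fork/exec keep sd set. */
    bool have_sd = !balance_wake;
    bool have_affine_sd = want_affine && balance_wake;

    if (have_affine_sd && cpu != prev_cpu && wake_affine_says_pull())
        new_cpu = cpu;                          /* pull towards the waker */

    if (!have_sd) {
        if (balance_wake)
            new_cpu = select_idle_sibling_stub(prev_cpu, new_cpu);
    } else {
        new_cpu = find_idlest_cpu_stub(cpu);    /* fork/exec slow path */
    }
    return new_cpu;
}

int main(void)
{
    printf("wakeup on cpu1, prev_cpu 3 -> cpu%d\n", choose_cpu(1, 3, true, true));
    printf("fork   on cpu1             -> cpu%d\n", choose_cpu(1, 1, false, false));
    return 0;
}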
Based on this code, the remaining parts are analysed in the three sections below.
We already know the following fact:
want_affine is set to 1 only when all three conditions in the SD_BALANCE_WAKE branch above hold, i.e. the wakeup is not a "wide" waker/wakee pattern, the waking cpu's capacity is sufficient for p, and p is allowed to run on the waking cpu.
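Restated as a compilable fragment (the one-line comments are my reading of what each kernel helper checks; the stub functions are placeholders, not the real implementations):

#include <stdbool.h>
#include <stdio.h>

/* Placeholder stubs for the three checks used above. */
static bool wake_wide_stub(void)  { return false; } /* wake_wide(): is this an N:1 "wide" waker/wakee pattern? */
static bool wake_cap_stub(void)   { return false; } /* wake_cap(): is the waking cpu's capacity too small for p? */
static bool allowed_on_cpu(void)  { return true;  } /* cpumask_test_cpu(cpu, &p->cpus_allowed) */

/* want_affine is 1 only when all three checks pass. */
static int compute_want_affine(void)
{
    return !wake_wide_stub() && !wake_cap_stub() && allowed_on_cpu();
}

int main(void)
{
    printf("want_affine = %d\n", compute_want_affine());
    return 0;
}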
Now let's analyse this part of the code:
for_each_domain(cpu, tmp) {
/* This if never triggers: SD_LOAD_BALANCE is always set when the sched domain is
initialised in sd_init(), along with the balance flags for the various task states. */
if (!(tmp->flags & SD_LOAD_BALANCE))
break;
/*
* If both cpu and prev_cpu are part of this domain,
* cpu is a valid SD_WAKE_AFFINE target.
*/
if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {
affine_sd = tmp;
break;
}
if (tmp->flags & sd_flag)
sd = tmp;
else if (!want_affine)
break;
}
When sd_init() runs, the flags are set as follows:
.flags = 1*SD_LOAD_BALANCE
| 1*SD_BALANCE_NEWIDLE
| 1*SD_BALANCE_EXEC
| 1*SD_BALANCE_FORK
| 0*SD_BALANCE_WAKE
| 1*SD_WAKE_AFFINE
| 0*SD_SHARE_CPUCAPACITY
| 0*SD_SHARE_PKG_RESOURCES
| 0*SD_SERIALIZE
| 0*SD_PREFER_SIBLING
| 0*SD_NUMA
#ifdef CONFIG_INTEL_DWS
| 0*SD_INTEL_DWS
#endif
| sd_flags
,
The for_each_domain(cpu, tmp) traversal is explained as follows:
/* Walk the domains from the lowest level (MC) up to the highest level (DIE). */
for_each_domain(cpu, tmp) {
/* This if never triggers: SD_LOAD_BALANCE is always set when the sched domain is
initialised in sd_init(), along with the balance flags for the various task states. */
if (!(tmp->flags & SD_LOAD_BALANCE))
break;
/*
* If both cpu and prev_cpu are part of this domain,
* cpu is a valid SD_WAKE_AFFINE target.
*/
/* sched_domain_span(tmp) is the set of cpus covered by the domain being visited.
At the lowest SDTL level (MC on ARM) a domain spans one cluster; at the top level
(DIE) it spans all cpus. SD_WAKE_AFFINE is set in sd_init(), so when the whole
condition holds, the current domain is recorded in the affine_sd variable. */
if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {
affine_sd = tmp;
break;
}
/* The domain's balance flags match the sd_flag passed by the caller of select_task_rq(). */
if (tmp->flags & sd_flag)
sd = tmp;
else if (!want_affine)
break;
}
We know that when a task is woken up (the wake_up_process() path discussed earlier) sd_flag is SD_BALANCE_WAKE, but sd_init() does not set SD_BALANCE_WAKE in the domain flags, so the case sd == NULL (and possibly affine_sd == NULL as well) can arise.
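A quick way to see this is to evaluate tmp->flags & sd_flag against the sd_init() mask listed earlier. Below is a minimal user-space check; the bit values are arbitrary placeholders (only their distinctness matters), not the kernel's real definitions:

#include <stdio.h>

/* Placeholder bit values standing in for the kernel's SD_* flags. */
#define SD_LOAD_BALANCE    0x01
#define SD_BALANCE_NEWIDLE 0x02
#define SD_BALANCE_EXEC    0x04
#define SD_BALANCE_FORK    0x08
#define SD_BALANCE_WAKE    0x10
#define SD_WAKE_AFFINE     0x20

int main(void)
{
    /* Flags as composed in sd_init(): SD_BALANCE_WAKE is *not* included. */
    unsigned int flags = SD_LOAD_BALANCE | SD_BALANCE_NEWIDLE |
                         SD_BALANCE_EXEC | SD_BALANCE_FORK | SD_WAKE_AFFINE;

    printf("wakeup: flags & SD_BALANCE_WAKE -> %s\n",
           (flags & SD_BALANCE_WAKE) ? "sd is set" : "sd stays NULL");
    printf("fork:   flags & SD_BALANCE_FORK -> %s\n",
           (flags & SD_BALANCE_FORK) ? "sd is set" : "sd stays NULL");
    return 0;
}

With that in mind, we continue with the affine_sd handling: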
/* If affine_sd is non-NULL, update new_cpu according to the conditions below. */
if (affine_sd) {
sd = NULL; /* Prefer wake_affine over balance flags */
if (cpu != prev_cpu && wake_affine(affine_sd, p, prev_cpu, sync))
new_cpu = cpu;
}
Before analysing the wake_affine() source, we need to know how three kinds of load are computed: source_load(), target_load() and effective_load(). We look at each in turn.
The source is as follows:
/*
* Return a low guess at the load of a migration-source cpu weighted
* according to the scheduling class and "nice" value.
*
* We want to under-estimate the load of migration sources, to
* balance conservatively.
*/
static unsigned long source_load(int cpu, int type)
{
struct rq *rq = cpu_rq(cpu);
/* Get the load of this cpu's cfs_rq. */
unsigned long total = weighted_cpuload(cpu);
/* type = sd->wake_idx = 0 by default and sched_feat(LB_BIAS) is true, so this condition holds (type == 0) and total is returned directly. */
if (type == 0 || !sched_feat(LB_BIAS))
return total;
return min(rq->cpu_load[type-1], total);
}
/* Used instead of source_load when we know the type == 0 */
static unsigned long weighted_cpuload(const int cpu)
{ /* Get the load of this cpu's cfs_rq; a cpu's overall load is the sum of its cfs_rq and rt_rq loads. */
return cfs_rq_runnable_load_avg(&cpu_rq(cpu)->cfs);
}
static inline unsigned long cfs_rq_runnable_load_avg(struct cfs_rq *cfs_rq)
{ /* Return the cfs_rq's runnable load average, refreshed every time an entity's load is updated. */
return cfs_rq->runnable_load_avg;
}
/*
* Return a high guess at the load of a migration-target cpu weighted
* according to the scheduling class and "nice" value.
*/
static unsigned long target_load(int cpu, int type)
{
struct rq *rq = cpu_rq(cpu);
unsigned long total = weighted_cpuload(cpu);
if (type == 0 || !sched_feat(LB_BIAS))
return total;
return max(rq->cpu_load[type-1], total);
}
This function mirrors source_load(): when type is non-zero, source_load() picks the smaller of the two values while target_load() picks the larger.
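The asymmetry is intentional: the migration source is under-estimated (min) and the migration target over-estimated (max), which biases the balancer towards doing nothing in borderline cases. A toy user-space illustration with invented numbers:

#include <stdio.h>

/* Toy versions of the min/max bias; hist stands for rq->cpu_load[type-1], now for weighted_cpuload(). */
static unsigned long toy_source_load(unsigned long hist, unsigned long now)
{
    return now < hist ? now : hist;    /* low guess */
}

static unsigned long toy_target_load(unsigned long hist, unsigned long now)
{
    return now > hist ? now : hist;    /* high guess */
}

int main(void)
{
    unsigned long hist = 800, now = 600;    /* invented load samples */

    printf("source_load = %lu, target_load = %lu\n",
           toy_source_load(hist, now), toy_target_load(hist, now));
    /* 600 vs 800: moving a task away looks less attractive than it really is. */
    return 0;
}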
Next comes effective_load(); its source is as follows:
#ifdef CONFIG_FAIR_GROUP_SCHED
/*
* effective_load() calculates the load change as seen from the root_task_group
*
* Adding load to a group doesn't make a group heavier, but can cause movement
* of group shares between cpus. Assuming the shares were perfectly aligned one
* can calculate the shift in shares.
*
* Calculate the effective load difference if @wl is added (subtracted) to @tg
* on this @cpu and results in a total addition (subtraction) of @wg to the
* total group weight.
*
* Given a runqueue weight distribution (rw_i) we can compute a shares
* distribution (s_i) using:
*
* s_i = rw_i / \Sum rw_j (1)
*
* Suppose we have 4 CPUs and our @tg is a direct child of the root group and
* has 7 equal weight tasks, distributed as below (rw_i), with the resulting
* shares distribution (s_i):
*
* rw_i = { 2, 4, 1, 0 }
* s_i = { 2/7, 4/7, 1/7, 0 }
*
* As per wake_affine() we're interested in the load of two CPUs (the CPU the
* task used to run on and the CPU the waker is running on), we need to
* compute the effect of waking a task on either CPU and, in case of a sync
* wakeup, compute the effect of the current task going to sleep.
*
* So for a change of @wl to the local @cpu with an overall group weight change
* of @wl we can compute the new shares distribution (s'_i) using:
*
* s'_i = (rw_i + @wl) / (@wg + \Sum rw_j) (2)
*
* Suppose we're interested in CPUs 0 and 1, and want to compute the load
* differences in waking a task to CPU 0. The additional task changes the
* weight and shares distributions like:
*
* rw'_i = { 3, 4, 1, 0 }
* s'_i = { 3/8, 4/8, 1/8, 0 }
*
* We can then compute the difference in effective weight by using:
*
* dw_i = S * (s'_i - s_i) (3)
*
* Where 'S' is the group weight as seen by its parent.
*
* Therefore the effective change in loads on CPU 0 would be 5/56 (3/8 - 2/7)
* times the weight of the group. The effect on CPU 1 would be -4/56 (4/8 -
* 4/7) times the weight of the group.
*/
static long effective_load(struct task_group *tg, int cpu, long wl, long wg)
{
struct sched_entity *se = tg->se[cpu];
if (!tg->parent) /* the trivial, non-cgroup case */
return wl;
for_each_sched_entity(se) {
struct cfs_rq *cfs_rq = se->my_q;
long W, w = cfs_rq_load_avg(cfs_rq);
tg = cfs_rq->tg;
/*
* W = @wg + \Sum rw_j
*/
W = wg + atomic_long_read(&tg->load_avg);
/* Ensure \Sum rw_j >= rw_i */
W -= cfs_rq->tg_load_avg_contrib;
W += w;
/*
* w = rw_i + @wl
*/
w += wl;
/*
* wl = S * s'_i; see (2)
*/
/* tg->shares may be changed by sched_group_set_shares(), but
root_task_group.shares = ROOT_TASK_GROUP_LOAD is set in sched_init(), when the
scheduler initialises, to the nice-0 weight of 1024. */
if (W > 0 && w < W)
wl = (w * (long)tg->shares) / W;
else
wl = tg->shares;
/*
* Per the above, wl is the new se->load.weight value; since
* those are clipped to [MIN_SHARES, ...) do so now. See
* calc_cfs_shares().
*/
if (wl < MIN_SHARES)
wl = MIN_SHARES;
/*
* wl = dw_i = S * (s'_i - s_i); see (3)
*/
wl -= se->avg.load_avg;
/*
* Recursively apply this logic to all parent groups to compute
* the final effective load change on the root group. Since
* only the @tg group gets extra weight, all parent groups can
* only redistribute existing shares. @wl is the shift in shares
* resulting from this level per the above.
*/
wg = 0;
}
return wl;
}
#else
static long effective_load(struct task_group *tg, int cpu, long wl, long wg)
{
return wl;
}
#endif
The comment block above the function already explains clearly how effective_load() is computed. Concretely:
for_each_sched_entity() walks upward from the lowest-level se all the way to the root of its hierarchy, root_task_group, recomputing the share distribution at each level.
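To make formula (3) above concrete, here is the numeric example from the comment worked out in user-space code: rw_i = {2, 4, 1, 0} equal-weight tasks, one extra task's weight added on CPU 0, and an assumed group weight S (tg->shares) of 1024:

#include <stdio.h>

/* Worked example of dw_i = S * (s'_i - s_i); the inputs come from the comment above. */
int main(void)
{
    double rw[4] = { 2, 4, 1, 0 };          /* per-cpu runqueue weight of the group, in task units */
    double sum = 2 + 4 + 1 + 0;             /* \Sum rw_j = 7 */
    double S = 1024;                        /* group weight as seen by its parent (tg->shares) */
    double wl = 1;                          /* one extra task's weight added on cpu 0 */

    double s0  = rw[0] / sum;               /* 2/7 */
    double s0n = (rw[0] + wl) / (sum + wl); /* 3/8 */
    double s1  = rw[1] / sum;               /* 4/7 */
    double s1n = rw[1] / (sum + wl);        /* 4/8 */

    printf("dw_0 = %.2f (= 5/56 * S)\n", S * (s0n - s0));   /* ~ +91.4 */
    printf("dw_1 = %.2f (= -4/56 * S)\n", S * (s1n - s1));  /* ~ -73.1 */
    return 0;
}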
Now to the wake_affine() source analysis:
static int wake_affine(struct sched_domain *sd, struct task_struct *p,
int prev_cpu, int sync)
{
s64 this_load, load;
s64 this_eff_load, prev_eff_load;
int idx, this_cpu;
struct task_group *tg;
unsigned long weight;
int balanced;
/* wake_idx is one entry of the table allocated by sd_alloc_ctl_domain_table().
It defaults to 0 (set in sd_init()) and can be changed at run time through the
sched-domain sysctl interface. */
idx = sd->wake_idx;
this_cpu = smp_processor_id();/* the cpu currently executing this code */
load = source_load(prev_cpu, idx);
this_load = target_load(this_cpu, idx);
/*
* If sync wakeup then subtract the (maximum possible)
* effect of the currently running task from the load
* of the current CPU:
*/ /* sync = wake_flags & WF_SYNC = 0 & 0x01 = 0; so far every observed caller
passes wake_flags = 0, so this branch is normally not taken. */
if (sync) {
tg = task_group(current);
weight = current->se.avg.load_avg;
this_load += effective_load(tg, this_cpu, -weight, -weight);
load += effective_load(tg, prev_cpu, 0, -weight);
}
tg = task_group(p);
weight = p->se.avg.load_avg;
/*
* In low-load situations, where prev_cpu is idle and this_cpu is idle
* due to the sync cause above having dropped this_load to 0, we'll
* always have an imbalance, but there's really nothing you can do
* about that, so that's good too.
*
* Otherwise check if either cpus are near enough in load to allow this
* task to be woken on this_cpu.
*/ /* What follows weighs the load impact of placing task p on this_cpu versus
keeping it on prev_cpu. */
this_eff_load = 100;
this_eff_load *= capacity_of(prev_cpu);
prev_eff_load = 100 + (sd->imbalance_pct - 100) / 2;
prev_eff_load *= capacity_of(this_cpu);
if (this_load > 0) {
this_eff_load *= this_load +
effective_load(tg, this_cpu, weight, weight);
prev_eff_load *= load + effective_load(tg, prev_cpu, 0, weight);
}
/* Decide whether to migrate. Note why effective_load()'s third argument is 0 when
computing prev_eff_load: prev_cpu is where p last ran, so no extra weight is added
there; weight is only added for the cpu the task would be moved to. */
balanced = this_eff_load <= prev_eff_load;
schedstat_inc(p, se.statistics.nr_wakeups_affine_attempts);
if (!balanced)
return 0;
schedstat_inc(sd, ttwu_move_affine);
schedstat_inc(p, se.statistics.nr_wakeups_affine);
return 1;
}
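Ignoring the effective_load() corrections, the decision reduces to a capacity-weighted comparison: pull the task to this_cpu only if 100 * capacity_of(prev_cpu) * this_load <= (100 + (imbalance_pct - 100)/2) * capacity_of(this_cpu) * prev_load. A toy calculation with invented values:

#include <stdio.h>

/* Toy version of the wake_affine() comparison; every number is invented. */
int main(void)
{
    unsigned long this_load = 300, prev_load = 500;  /* target_load(this_cpu), source_load(prev_cpu) */
    unsigned long this_cap = 1024, prev_cap = 1024;  /* capacity_of() */
    unsigned long imbalance_pct = 125;               /* example sd->imbalance_pct */

    unsigned long this_eff = 100 * prev_cap * this_load;
    unsigned long prev_eff = (100 + (imbalance_pct - 100) / 2) * this_cap * prev_load;
    int balanced = this_eff <= prev_eff;

    printf("balanced = %d -> %s\n", balanced,
           balanced ? "pull p to the waker's cpu" : "keep prev_cpu");
    return 0;
}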
Only when the current cpu differs from prev_cpu and wake_affine() returns 1 is new_cpu set to the current cpu, i.e. task p may then be migrated to new_cpu. Continuing with the next block:
/* Runs only when sd is non-NULL and the balance request is not SD_BALANCE_FORK. */
if (sd && !(sd_flag & SD_BALANCE_FORK)) {
/*
* We're going to need the task's util for capacity_spare_wake
* in find_idlest_group. Sync it up to prev_cpu's
* last_update_time.
* (p's per-entity load is synced here using the PELT machinery.)
*/
sync_entity_load_avg(&p->se);
}
We know that whenever affine_sd != NULL, sd is set to NULL; so for wakeups the code path that picks new_cpu goes through select_idle_sibling():
if (!sd) {
/* If affine_sd was non-NULL, sd has been cleared; SD_BALANCE_WAKE is set for wakeups. */
if (sd_flag & SD_BALANCE_WAKE) /* XXX always ? */
new_cpu = select_idle_sibling(p, prev_cpu, new_cpu);
/*
* Try and locate an idle CPU in the sched_domain.
*/
static int select_idle_sibling(struct task_struct *p, int prev, int target)
{
struct sched_domain *sd;
struct sched_group *sg;
int best_idle_cpu = -1;
int best_idle_cstate = INT_MAX;
unsigned long best_idle_capacity = ULONG_MAX;
schedstat_inc(p, se.statistics.nr_wakeups_sis_attempts);
schedstat_inc(this_rq(), eas_stats.sis_attempts);
/* Is c-state-aware scheduling enabled, i.e. do we factor in how deep each cpu's idle state is? */
if (!sysctl_sched_cstate_aware) {
/* If not, simply check whether the target cpu is idle, ignoring whether a cpu in a
shallower idle state exists elsewhere; if it is idle, return it right away. */
if (idle_cpu(target)) {
schedstat_inc(p, se.statistics.nr_wakeups_sis_idle);
schedstat_inc(this_rq(), eas_stats.sis_idle);
return target;
}
/* Intel platforms only */
#ifdef CONFIG_INTEL_DWS
if (sched_feat(INTEL_DWS)) {
/*
* For either waker or wakee CPU, if it is idle, then select it, but
* if not, we lower down the bar to use a threshold of runnable avg
* to determine whether it is capable of handling the wakee task
*/
if (sysctl_sched_wakeup_threshold && cpu_more_runnable(target))
return target;
if (prev != target) {
/*
* If the prevous cpu is cache affine and idle, don't be stupid.
*/
if (cpus_share_cache(prev, target) && idle_cpu(prev))
return prev;
if (sysctl_sched_wakeup_threshold && cpu_more_runnable(prev))
return prev;
}
} else
#endif
/*
* If the prevous cpu is cache affine and idle, don't be stupid.
*/ /* If p's previous cpu (prev) is not the target, shares cache with the target,
and is idle, pick prev as the target cpu. */
if (prev != target && cpus_share_cache(prev, target) && idle_cpu(prev)) {
schedstat_inc(p, se.statistics.nr_wakeups_sis_cache_affine);
schedstat_inc(this_rq(), eas_stats.sis_cache_affine);
return prev;
}
}
/*
* Otherwise, iterate the domains and find an elegible idle cpu.
*/
sd = rcu_dereference(per_cpu(sd_llc, target));
/* Look for a cpu in the shallowest idle state with the smallest capacity on which to place task p. */
for_each_lower_domain(sd) {
sg = sd->groups;
do {
int i;
/* If p's allowed cpus have no overlap with this sched group's span, move on to the next sg. */
if (!cpumask_intersects(sched_group_cpus(sg),
tsk_cpus_allowed(p)))
goto next;
if (sysctl_sched_cstate_aware) {
for_each_cpu_and(i, tsk_cpus_allowed(p), sched_group_cpus(sg)) { /* get cpu i's idle-state index */
int idle_idx = idle_get_state_idx(cpu_rq(i));
unsigned long new_usage = boosted_task_util(p);
unsigned long capacity_orig = capacity_orig_of(i);
if (new_usage > capacity_orig || !idle_cpu(i))
goto next;
/* If the cpu being visited is the target and p's util fits within the target's
current capacity, return the target directly. */
if (i == target && new_usage <= capacity_curr_of(target)) {
schedstat_inc(p, se.statistics.nr_wakeups_sis_suff_cap);
schedstat_inc(this_rq(), eas_stats.sis_suff_cap);
schedstat_inc(sd, eas_stats.sis_suff_cap);
return target;
}
/* Track the cpu in the shallowest idle state with the smallest capacity. */
if (idle_idx < best_idle_cstate &&
capacity_orig <= best_idle_capacity) {
best_idle_cpu = i; /* remember this candidate */
best_idle_cstate = idle_idx;
best_idle_capacity = capacity_orig;
}
}
} else { /* Without c-state awareness, look for a group (not containing the target) in which every cpu is idle. */
for_each_cpu(i, sched_group_cpus(sg)) {
if (i == target || !idle_cpu(i))
goto next;
}
/* Every cpu in this group is idle: pick the first allowed cpu in it. */
target = cpumask_first_and(sched_group_cpus(sg),
tsk_cpus_allowed(p));
schedstat_inc(p, se.statistics.nr_wakeups_sis_idle_cpu);
schedstat_inc(this_rq(), eas_stats.sis_idle_cpu);
schedstat_inc(sd, eas_stats.sis_idle_cpu);
goto done;
}
next:
sg = sg->next;
} while (sg != sd->groups);
}
if (best_idle_cpu >= 0)
target = best_idle_cpu;
done:
schedstat_inc(p, se.statistics.nr_wakeups_sis_count);
schedstat_inc(this_rq(), eas_stats.sis_count);
return target;
}
The logic is fairly simple.
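To see what the cstate-aware inner loop selects, here is a small user-space rendering of its ranking rule, shallowest idle state first and breaking towards smaller original capacity, with invented per-cpu data:

#include <stdio.h>
#include <limits.h>

struct cpu_state {
    int idle_idx;                 /* idle_get_state_idx(): smaller = shallower idle */
    unsigned long capacity_orig;  /* capacity_orig_of() */
};

int main(void)
{
    struct cpu_state cpus[4] = { {2, 1024}, {1, 512}, {1, 1024}, {3, 512} };  /* invented */
    int best_idle_cpu = -1, best_idle_cstate = INT_MAX;
    unsigned long best_idle_capacity = ULONG_MAX;

    for (int i = 0; i < 4; i++) {
        /* Same condition as in the kernel loop above. */
        if (cpus[i].idle_idx < best_idle_cstate &&
            cpus[i].capacity_orig <= best_idle_capacity) {
            best_idle_cpu = i;
            best_idle_cstate = cpus[i].idle_idx;
            best_idle_capacity = cpus[i].capacity_orig;
        }
    }
    printf("best idle cpu = %d\n", best_idle_cpu);  /* cpu1: shallow idle and small capacity */
    return 0;
}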
To analyse the other core path, find_idlest_cpu(), we first need to understand two core subfunctions: find_idlest_group() and find_idlest_group_cpu().
They work in sequence: once the idlest group has been found, the idlest cpu is looked up within that group, which finally determines the target cpu id. We analyse each in turn.
Since we are looking at the ARM platform, we follow the functions called on ARM:
/*
* find_idlest_group finds and returns the least busy CPU group within the
* domain.
*
* Assumes p is allowed on at least one CPU in sd.
*/
static struct sched_group *
find_idlest_group(struct sched_domain *sd, struct task_struct *p,
int this_cpu, int sd_flag)
{
struct sched_group *idlest = NULL, *group = sd->groups;
struct sched_group *most_spare_sg = NULL;
unsigned long min_load = ULONG_MAX, this_load = ULONG_MAX;
unsigned long most_spare = 0, this_spare = 0;
int load_idx = sd->forkexec_idx;
int imbalance = 100 + (sd->imbalance_pct-100)/2;
/* Choose load_idx from sd_flag (wake_idx is used only for woken-up tasks). */
if (sd_flag & SD_BALANCE_WAKE)
load_idx = sd->wake_idx;
/* Iterate over all sched groups within this domain. */
do {
unsigned long load, avg_load, spare_cap, max_spare_cap;
int local_group;
int i;
/* The group's cpus do not intersect p's allowed cpus; move straight on to the next group. */
/* Skip over this group if it has no CPUs allowed */
if (!cpumask_intersects(sched_group_cpus(group),
tsk_cpus_allowed(p)))
continue;
/* When sd is the lowest MC SDTL, cpumask_weight(sched_group_cpus(group)) = 1.
local_group records whether this_cpu belongs to the group being visited. */
local_group = cpumask_test_cpu(this_cpu,
sched_group_cpus(group));
/*
* Tally up the load of all CPUs in the group and find
* the group containing the CPU with most spare capacity.
*/
avg_load = 0;
max_spare_cap = 0;
/* Accumulate this group's load and find its largest unused capacity. */
for_each_cpu(i, sched_group_cpus(group)) {
/* Bias balancing toward cpus of our domain */
if (local_group)
load = source_load(i, load_idx);
else
load = target_load(i, load_idx);
/* Accumulate the load; it is normalised to a relative load below. */
avg_load += load;
/* cpu i's spare capacity, i.e. capacity_of() - cpu_util(). */
spare_cap = capacity_spare_wake(i, p);
/* Track the largest spare capacity seen across this group. */
if (spare_cap > max_spare_cap)
max_spare_cap = spare_cap;
}
/* Normalise to a relative load. */
/* Adjust by relative CPU capacity of the group */
avg_load = (avg_load * SCHED_CAPACITY_SCALE) / group->sgc->capacity;
/* Update the decision variables depending on whether this is the local group. */
if (local_group) {
this_load = avg_load;
this_spare = max_spare_cap;
} else {
/* Remember the least-loaded group. */
if (avg_load < min_load) {
min_load = avg_load;
idlest = group;
}
/* Remember the group with the most spare capacity. */
if (most_spare < max_spare_cap) {
most_spare = max_spare_cap;
most_spare_sg = group;
}
}
} while (group = group->next, group != sd->groups);
/*
* The cross-over point between using spare capacity or least load
* is too conservative for high utilization tasks on partially
* utilized systems if we require spare_capacity > task_util(p),
* so we allow for some task stuffing by using
* spare_capacity > task_util(p)/2.
*
* Spare capacity can't be used for fork because the utilization has
* not been set yet, we must first select a rq to compute the initial
* utilization.
*/ /* Now decide, according to policy, which group counts as the "idlest" group:
1. For a freshly forked task (SD_BALANCE_FORK), skip the spare-capacity checks and use the least-loaded group.
2. If the local group's spare capacity exceeds half of p's util and, scaled by imbalance, beats the best remote spare capacity, return NULL (stay local).
3. Otherwise, if the best remote spare capacity exceeds half of p's util, return that group.
4. Finally fall back to the least-loaded group, unless the local load is already close enough (then return NULL). */
if (sd_flag & SD_BALANCE_FORK)
goto skip_spare;
if (this_spare > task_util(p) / 2 &&
imbalance*this_spare > 100*most_spare)
return NULL;
else if (most_spare > task_util(p) / 2)
return most_spare_sg;
skip_spare:
if (!idlest || 100*this_load < imbalance*min_load)
return NULL;
return idlest;
}
This is fairly easy to follow.
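With invented numbers, the spare-capacity-versus-least-load decision at the end of the function plays out as follows (in the kernel, returning NULL means "no better group; keep descending towards the local cpu"):

#include <stdio.h>
#include <stdbool.h>

/* Toy rendering of the decision tail of find_idlest_group(); all values are invented. */
int main(void)
{
    unsigned long task_util = 200;
    unsigned long this_spare = 150, most_spare = 80;  /* spare capacity: local group vs best remote group */
    unsigned long this_load = 900, min_load = 400;    /* relative load: local group vs least-loaded group */
    unsigned long imbalance = 100 + (125 - 100) / 2;  /* 100 + (imbalance_pct - 100) / 2 */
    bool is_fork = false;

    if (!is_fork) {
        if (this_spare > task_util / 2 && imbalance * this_spare > 100 * most_spare) {
            printf("return NULL: enough spare capacity locally\n");
            return 0;
        }
        if (most_spare > task_util / 2) {
            printf("return most_spare_sg\n");
            return 0;
        }
    }
    if (100 * this_load < imbalance * min_load)
        printf("return NULL: remote group not enough better\n");
    else
        printf("return idlest (least-loaded) group\n");
    return 0;
}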
Having selected the idlest group, we now pick the idlest cpu within it:
/*
* find_idlest_group_cpu - find the idlest cpu among the cpus in group.
*/
static int
find_idlest_group_cpu(struct sched_group *group, struct task_struct *p, int this_cpu)
{
unsigned long load, min_load = ULONG_MAX;
unsigned int min_exit_latency = UINT_MAX;
u64 latest_idle_timestamp = 0;
int least_loaded_cpu = this_cpu;
int shallowest_idle_cpu = -1;
int i;
/* Check if we have any choice: */
if (group->group_weight == 1)
return cpumask_first(sched_group_cpus(group));
/* Traverse only the allowed CPUs */
for_each_cpu_and(i, sched_group_cpus(group), tsk_cpus_allowed(p)) {
if (idle_cpu(i)) {
struct rq *rq = cpu_rq(i);
struct cpuidle_state *idle = idle_get_state(rq);
/* An idle cpu whose exit latency is the smallest is in the shallowest idle state;
the deeper the idle state, the larger the exit latency. */
if (idle && idle->exit_latency < min_exit_latency) {
/*
* We give priority to a CPU whose idle state
* has the smallest exit latency irrespective
* of any idle timestamp.
*/
min_exit_latency = idle->exit_latency;
latest_idle_timestamp = rq->idle_stamp;
shallowest_idle_cpu = i;
/* If the exit latency ties with the minimum (or there is no active idle state) and
this cpu went idle more recently, prefer it; the comment below explains why. */
} else if ((!idle || idle->exit_latency == min_exit_latency) &&
rq->idle_stamp > latest_idle_timestamp) {
/*
* If equal or no active idle state, then
* the most recently idled CPU might have
* a warmer cache.
*/
latest_idle_timestamp = rq->idle_stamp;
shallowest_idle_cpu = i;
}
/* If no cpu is idle, fall back to the least-loaded cpu. */
} else if (shallowest_idle_cpu == -1) {
load = weighted_cpuload(i);
if (load < min_load || (load == min_load && i == this_cpu)) {
min_load = load;
least_loaded_cpu = i;
}
}
}
/* Return the shallowest-idle cpu if one exists, otherwise the least-loaded cpu. */
return shallowest_idle_cpu != -1 ? shallowest_idle_cpu : least_loaded_cpu;
}
The principle here is also quite simple.
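As a compact restatement with invented per-cpu data, the priority order is: the idle cpu with the smallest exit latency, then (on a latency tie or with no idle-state info) the most recently idled cpu, and only if no cpu is idle at all, the least loaded one:

#include <stdio.h>
#include <limits.h>

struct cpu_snapshot {
    int idle;                      /* idle_cpu(i) */
    unsigned int exit_latency;     /* idle-state exit latency; smaller = shallower */
    unsigned long long idle_stamp; /* when the cpu last went idle */
    unsigned long load;            /* weighted_cpuload(i) */
};

int main(void)
{
    struct cpu_snapshot c[3] = { {1, 400, 100, 0}, {1, 50, 90, 0}, {0, 0, 0, 300} };  /* invented */
    unsigned int min_exit_latency = UINT_MAX;
    unsigned long long latest_idle = 0;
    unsigned long min_load = ULONG_MAX;
    int shallowest_idle_cpu = -1, least_loaded_cpu = 0;

    for (int i = 0; i < 3; i++) {
        if (c[i].idle) {
            if (c[i].exit_latency < min_exit_latency) {
                min_exit_latency = c[i].exit_latency;
                latest_idle = c[i].idle_stamp;
                shallowest_idle_cpu = i;
            } else if (c[i].exit_latency == min_exit_latency &&
                       c[i].idle_stamp > latest_idle) {
                latest_idle = c[i].idle_stamp;
                shallowest_idle_cpu = i;
            }
        } else if (shallowest_idle_cpu == -1 && c[i].load < min_load) {
            min_load = c[i].load;
            least_loaded_cpu = i;
        }
    }
    printf("chosen cpu = %d\n",
           shallowest_idle_cpu != -1 ? shallowest_idle_cpu : least_loaded_cpu);  /* -> cpu1 */
    return 0;
}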
With both core subfunctions covered, we can now look at find_idlest_cpu() itself:
static inline int find_idlest_cpu(struct sched_domain *sd, struct task_struct *p, int cpu, int prev_cpu, int sd_flag)
{
int new_cpu = cpu;
int wu = sd_flag & SD_BALANCE_WAKE;
int cas_cpu = -1;
if (wu) {
schedstat_inc(p, se.statistics.nr_wakeups_cas_attempts);
schedstat_inc(this_rq(), eas_stats.cas_attempts);
}
/* Does p's allowed-cpu mask intersect this sched domain's span at all? */
if (!cpumask_intersects(sched_domain_span(sd), &p->cpus_allowed))
return prev_cpu;
/* Within this domain, find the idlest group, then the idlest cpu inside it. */
while (sd) {
struct sched_group *group = NULL;
struct sched_domain *tmp;
int weight;
if (wu)
schedstat_inc(sd, eas_stats.cas_attempts);
if (!(sd->flags & sd_flag)) {
sd = sd->child;
continue;
}
#ifdef CONFIG_INTEL_DWS
if (sched_feat(INTEL_DWS)) {
if (sd->flags & SD_INTEL_DWS)
group = dws_find_group(sd, p, cpu);
if (!group)
group = find_idlest_group(sd, p, cpu, sd_flag);
} else
#endif /* find the group with the least load / most spare capacity */
group = find_idlest_group(sd, p, cpu, sd_flag);
if (!group) {
sd = sd->child;
continue;
}
/* find the shallowest-idle / least-loaded cpu within that group */
new_cpu = find_idlest_group_cpu(group, p, cpu);
if (new_cpu == cpu) {
/* Now try balancing at a lower domain level of cpu */
sd = sd->child;
continue;
}
/* Now try balancing at a lower domain level of new_cpu */
cpu = cas_cpu = new_cpu;
weight = sd->span_weight;
sd = NULL; /* loop-exit condition: the while ends unless a lower-level domain is found below */
for_each_domain(cpu, tmp) {
/* If tmp is at a different level from the sd just searched, its span weight differs:
with two SDTLs (MC as child, DIE as parent) on an 8-core, two-cluster system, DIE's
span weight is larger than MC's. This loop re-walks new_cpu's domains and keeps the
highest level (with the requested balance flag) whose span is still smaller than the
one just searched, so balancing continues at a lower domain level of new_cpu. */
if (weight <= tmp->span_weight)
break;
if (tmp->flags & sd_flag)
sd = tmp;
}
/* while loop will break here if sd == NULL */
}
if (wu && (cas_cpu >= 0)) {
schedstat_inc(p, se.statistics.nr_wakeups_cas_count);
schedstat_inc(this_rq(), eas_stats.cas_count);
}
return new_cpu;
}
The logic here is also straightforward.
The key point is still to understand the relationship between scheduling domains and scheduling groups; an illustrative layout is sketched below.
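For reference, here is how the two levels relate on a hypothetical 8-core, two-cluster ARM system (an illustrative layout, not dumped from real hardware):

DIE level: each cpu has a sched_domain spanning cpu0-7; its sched_groups are the two clusters:
    group0 = {cpu0, cpu1, cpu2, cpu3}   (little cluster)
    group1 = {cpu4, cpu5, cpu6, cpu7}   (big cluster)
MC level: each cpu has a sched_domain spanning only its own cluster; its sched_groups are single cpus:
    on cpu0-3: span = {cpu0..cpu3}, groups = {cpu0} {cpu1} {cpu2} {cpu3}
    on cpu4-7: span = {cpu4..cpu7}, groups = {cpu4} {cpu5} {cpu6} {cpu7}

find_idlest_cpu() walks down this hierarchy: pick the idlest group at one level, then repeat inside that group's span at the level below until a single cpu remains.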