Note:
The analysis of select_task_rq in this chapter is not very thorough; two dedicated follow-up articles will cover how the traditional load balancer migrates tasks and how EAS picks a target CPU for a task based on energy efficiency.
The two articles are:
1. Chapter 9: How EAS selects a target CPU for a task based on energy efficiency
2. Chapter 10: How traditional load balancing picks a suitable CPU for a task
We often see drivers, cpufreq governors and the like create kernel threads and, right after creation, wake them up directly with wake_up_process(struct task_struct *p). Let's look at how wake_up_process actually gets the task scheduled and how it decides which CPU the task will run on.
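As a quick illustration of that pattern, here is a minimal sketch (not taken from any particular driver; the names demo_thread/demo_thread_fn are made up for the example) showing how a driver typically creates a kernel thread, which kthread_create() leaves sleeping, and then starts it with wake_up_process():
#include <linux/init.h>
#include <linux/kthread.h>
#include <linux/module.h>
#include <linux/sched.h>
static struct task_struct *demo_thread;
static int demo_thread_fn(void *data)
{
        while (!kthread_should_stop()) {
                /* do the driver's periodic work here ... */
                set_current_state(TASK_INTERRUPTIBLE);
                schedule_timeout(HZ);
        }
        return 0;
}
static int __init demo_init(void)
{
        /* kthread_create() creates the thread but leaves it sleeping ... */
        demo_thread = kthread_create(demo_thread_fn, NULL, "demo_thread");
        if (IS_ERR(demo_thread))
                return PTR_ERR(demo_thread);
        /* ... so the driver starts it explicitly with wake_up_process() */
        wake_up_process(demo_thread);
        return 0;
}
module_init(demo_init);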
First, the source code and call chain:
wake_up_process(p)---->try_to_wake_up(p,TASK_NORMAL,0,1)---->
/**
* wake_up_process - Wake up a specific process
* @p: The process to be woken up.
*
* Attempt to wake up the nominated process and move it to the set of runnable
* processes.
*
* Return: 1 if the process was woken up, 0 if it was already running.
*
* It may be assumed that this function implies a write memory barrier before
* changing the task state if and only if any tasks are woken up.
*/
int wake_up_process(struct task_struct *p)
{ /* wake_up_process simply calls try_to_wake_up() with three fixed arguments */
return try_to_wake_up(p, TASK_NORMAL, 0, 1);
}
/* Convenience macros for the sake of wake_up */
#define TASK_NORMAL (TASK_INTERRUPTIBLE | TASK_UNINTERRUPTIBLE)
/**
* try_to_wake_up - wake up a thread
* @p: the thread to be awakened
* @state: the mask of task states that can be woken
* @wake_flags: wake modifier flags (WF_*)
* @sibling_count_hint: A hint at the number of threads that are being woken up
* in this event.
*
* Put it on the run-queue if it's not already there. The "current"
* thread is always on the run-queue (except when the actual
* re-schedule is in progress), and as such you're allowed to do
* the simpler "current->state = TASK_RUNNING" to mark yourself
* runnable without the overhead of this.
*
* Return: %true if @p was woken up, %false if it was already running.
* or @state didn't match @p's state.
*/
static int
try_to_wake_up(struct task_struct *p, unsigned int state, int wake_flags,
int sibling_count_hint)
{
unsigned long flags;
int cpu, success = 0;
#ifdef CONFIG_SMP
struct rq *rq;
u64 wallclock;
#endif
/*
* If we are going to wake up a thread waiting for CONDITION we
* need to ensure that CONDITION=1 done by the caller can not be
* reordered with p->state check below. This pairs with mb() in
* set_current_state() the waiting thread does.
*/ /* The waker usually sets some CONDITION before waking the thread; the memory
barrier makes sure the CONDITION store cannot be reordered with the p->state check below. */
smp_mb__before_spinlock();
raw_spin_lock_irqsave(&p->pi_lock, flags);
/* If the task is not in TASK_INTERRUPTIBLE | TASK_UNINTERRUPTIBLE it does not match
TASK_NORMAL and the wakeup path bails out right away. That is why callers of
wake_up_process() in the kernel always put the task into one of these two states first. */
if (!(p->state & state))
goto out;
trace_sched_waking(p);
success = 1; /* we're going to change ->state */
/* Get the CPU the task is currently associated with; this is not necessarily where it will run, a CPU is selected further below. */
cpu = task_cpu(p);
/*
* Ensure we load p->on_rq _after_ p->state, otherwise it would
* be possible to, falsely, observe p->on_rq == 0 and get stuck
* in smp_cond_load_acquire() below.
*
* sched_ttwu_pending() try_to_wake_up()
* [S] p->on_rq = 1; [L] P->state
* UNLOCK rq->lock -----.
* \
* +--- RMB
* schedule() /
* LOCK rq->lock -----'
* UNLOCK rq->lock
*
* [task p]
* [S] p->state = UNINTERRUPTIBLE [L] p->on_rq
*
* Pairs with the UNLOCK+LOCK on rq->lock from the
* last wakeup of our task and the schedule that got our task
* current.
*/
smp_rmb();
/* The barrier guarantees we read an up-to-date p->on_rq. If the task is already on a
runqueue, i.e. already runnable/running and not yet fully descheduled, ttwu_remote()
only has to flip its state back to TASK_RUNNING so it keeps running on that rq. In
that case we skip the rest of the path and only update the scheduling statistics. */
if (p->on_rq && ttwu_remote(p, wake_flags))
goto stat;
#ifdef CONFIG_SMP
/*
* Ensure we load p->on_cpu _after_ p->on_rq, otherwise it would be
* possible to, falsely, observe p->on_cpu == 0.
*
* One must be running (->on_cpu == 1) in order to remove oneself
* from the runqueue.
*
* [S] ->on_cpu = 1; [L] ->on_rq
* UNLOCK rq->lock
* RMB
* LOCK rq->lock
* [S] ->on_rq = 0; [L] ->on_cpu
*
* Pairs with the full barrier implied in the UNLOCK+LOCK on rq->lock
* from the consecutive calls to schedule(); the first switching to our
* task, the second putting it to sleep.
*/
smp_rmb();
/*
* If the owning (remote) cpu is still in the middle of schedule() with
* this task as prev, wait until its done referencing the task.
*/
/* If the owning (remote) CPU is still in the middle of schedule() with this task as
prev, wait until it is done referencing the task. In other words, task p is still
being switched out on another CPU, so we spin until that context switch finishes;
the barrier guarantees on_cpu is read fresh from memory. */
while (p->on_cpu)
cpu_relax(); /* busy-wait until the other CPU clears p->on_cpu */
/*
* Combined with the control dependency above, we have an effective
* smp_load_acquire() without the need for full barriers.
*
* Pairs with the smp_store_release() in finish_lock_switch().
*
* This ensures that tasks getting woken will be fully ordered against
* their previous state and preserve Program Order.
*/
smp_rmb();
/* Get the rq of the CPU the task is currently on */
rq = cpu_rq(task_cpu(p));
raw_spin_lock(&rq->lock);
/* Current time, used as the WALT wallclock */
wallclock = walt_ktime_clock();
/* Update rq->curr's WALT load/accumulated runnable time up to now */
walt_update_task_ravg(rq->curr, rq, TASK_UPDATE, wallclock, 0);
/* Update the woken task p's WALT load/accumulated runnable time as well */
walt_update_task_ravg(p, rq, TASK_WAKE, wallclock, 0);
raw_spin_unlock(&rq->lock);
/* Record, based on p's state, whether this task contributes to the load average */
p->sched_contributes_to_load = !!task_contributes_to_load(p);
/* Mark the task as TASK_WAKING */
p->state = TASK_WAKING;
/* Call the task_waking callback of p's scheduling class. For CFS this subtracts the
min_vruntime of the cfs_rq the task is leaving; the target CPU is not known yet, and
the new CPU's cfs_rq min_vruntime will be added back at enqueue time. */
if (p->sched_class->task_waking)
p->sched_class->task_waking(p);
/* Pick a suitable CPU for p based on its parameters and the current system state */
cpu = select_task_rq(p, p->wake_cpu, SD_BALANCE_WAKE, wake_flags,
sibling_count_hint);
/* If the chosen CPU differs from the CPU p is currently on, flag the wakeup as a
migration and move p to the new CPU. */
if (task_cpu(p) != cpu) {
wake_flags |= WF_MIGRATED;
set_task_cpu(p, cpu);
}
#endif /* CONFIG_SMP */
/* Enqueue p, mark it runnable and perform wakeup preemption. */
ttwu_queue(p, cpu);
stat:
/* Scheduling statistics */
ttwu_stat(p, cpu, wake_flags);
out:
raw_spin_unlock_irqrestore(&p->pi_lock, flags);
return success;
}
The key points are analyzed in detail below, starting with:
cpu = select_task_rq(p, p->wake_cpu, SD_BALANCE_WAKE, wake_flags,
sibling_count_hint);
select_task_rq is implemented as follows:
/*
* The caller (fork, wakeup) owns p->pi_lock, ->cpus_allowed is stable.
*/
static inline
int select_task_rq(struct task_struct *p, int cpu, int sd_flags, int wake_flags,
int sibling_count_hint)
{
lockdep_assert_held(&p->pi_lock);
/* nr_cpus_allowed is the number of CPUs p is allowed to run on; it is usually set
at init time and can be changed through the CPU affinity interface. */
if (p->nr_cpus_allowed > 1)
/* The per-class select_task_rq callback does the real work */
cpu = p->sched_class->select_task_rq(p, cpu, sd_flags, wake_flags,
sibling_count_hint);
/*
* In order not to call set_task_cpu() on a blocking task we need
* to rely on ttwu() to place the task on a valid ->cpus_allowed
* cpu.
*
* Since this is common to all placement strategies, this lives here.
*
* [ this allows ->select_task() to simply return task_cpu(p) and
* not worry about this generic constraint ]
*/
/* If either (1) the chosen CPU is not in p's allowed mask, or
(2) the chosen CPU is offline,
then fall back and pick a CPU again from p's cpus_allowed. */
if (unlikely(!cpumask_test_cpu(cpu, tsk_cpus_allowed(p)) ||
!cpu_online(cpu)))
cpu = select_fallback_rq(task_cpu(p), p);
return cpu;
}
So select_task_rq is analyzed in two parts. The first part is the per-class callback:
cpu = p->sched_class->select_task_rq(p, cpu, sd_flags, wake_flags,
sibling_count_hint);
----->
/*
* select_task_rq_fair: Select target runqueue for the waking task in domains
* that have the 'sd_flag' flag set. In practice, this is SD_BALANCE_WAKE,
* SD_BALANCE_FORK, or SD_BALANCE_EXEC.
*
* Balances load by selecting the idlest cpu in the idlest group, or under
* certain conditions an idle sibling cpu if the domain has SD_WAKE_AFFINE set.
*
* Returns the target cpu number.
*
* preempt must be disabled.
*/
static int
select_task_rq_fair(struct task_struct *p, int prev_cpu, int sd_flag, int wake_flags,
int sibling_count_hint)
{
struct sched_domain *tmp, *affine_sd = NULL, *sd = NULL;
int cpu = smp_processor_id(); /* id of the CPU we are currently running on (the waker's CPU) */
int new_cpu = prev_cpu; /* start with prev_cpu (the CPU p last ran on) as new_cpu */
int want_affine = 0;
/* wake_flags == 0 here, so sync == 0 */
int sync = wake_flags & WF_SYNC;
#ifdef CONFIG_64BIT_ONLY_CPU
struct cpumask tmpmask;
if (find_packing_cpu(p, &new_cpu))
return new_cpu;
cpumask_andnot(&tmpmask, cpu_present_mask, &b64_only_cpu_mask);
if (cpumask_test_cpu(cpu, &tmpmask)) {
if (weighted_cpuload_32bit(cpu) >
sysctl_sched_32bit_load_threshold &&
!test_tsk_thread_flag(p, TIF_32BIT))
return min_load_64bit_only_cpu();
}
#endif
/* sd_flag is SD_BALANCE_WAKE here, so this branch is taken. want_affine is a key
variable formed by ANDing three conditions; it is analyzed in detail below. */
if (sd_flag & SD_BALANCE_WAKE) {
record_wakee(p);
want_affine = !wake_wide(p, sibling_count_hint) &&
!wake_cap(p, cpu, prev_cpu) &&
cpumask_test_cpu(cpu, &p->cpus_allowed);
}
/* If the system decides placement with EAS, this branch is taken: the target CPU is
chosen based on CPU energy efficiency and capacity. This newer scheme is our focus. */
if (energy_aware())
return select_energy_cpu_brute(p, prev_cpu, sync);
rcu_read_lock();
for_each_domain(cpu, tmp) {
if (!(tmp->flags & SD_LOAD_BALANCE))
break;
/*
* If both cpu and prev_cpu are part of this domain,
* cpu is a valid SD_WAKE_AFFINE target.
*/
if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {
affine_sd = tmp;
break;
}
if (tmp->flags & sd_flag)
sd = tmp;
else if (!want_affine)
break;
}
if (affine_sd) {
sd = NULL; /* Prefer wake_affine over balance flags */
if (cpu != prev_cpu && wake_affine(affine_sd, p, prev_cpu, sync))
new_cpu = cpu;
}
if (sd && !(sd_flag & SD_BALANCE_FORK)) {
/*
* We're going to need the task's util for capacity_spare_wake
* in find_idlest_group. Sync it up to prev_cpu's
* last_update_time.
*/
sync_entity_load_avg(&p->se);
}
if (!sd) {
if (sd_flag & SD_BALANCE_WAKE) /* XXX always ? */
new_cpu = select_idle_sibling(p, prev_cpu, new_cpu);
} else {
new_cpu = find_idlest_cpu(sd, p, cpu, prev_cpu, sd_flag);
}
rcu_read_unlock();
return new_cpu;
}
select_task_rq_fair is analyzed piece by piece, starting with want_affine:
want_affine = !wake_wide(p, sibling_count_hint) &&
!wake_cap(p, cpu, prev_cpu) &&
cpumask_test_cpu(cpu, &p->cpus_allowed);
---->
/*
* Detect M:N waker/wakee relationships via a switching-frequency heuristic.
* A waker of many should wake a different task than the one last awakened
* at a frequency roughly N times higher than one of its wakees. In order
* to determine whether we should let the load spread vs consolodating to
* shared cache, we look for a minimum 'flip' frequency of llc_size in one
* partner, and a factor of lls_size higher frequency in the other. With
* both conditions met, we can be relatively sure that the relationship is
* non-monogamous, with partner count exceeding socket size. Waker/wakee
* being client/server, worker/dispatcher, interrupt source or whatever is
* irrelevant, spread criteria is apparent partner count exceeds socket size.
*/
/* Heuristic: returns 1 when the waker/wakee flip frequencies exceed the LLC-size thresholds, i.e. the wakeup should be spread instead of kept cache-affine */
static int wake_wide(struct task_struct *p, int sibling_count_hint)
{
unsigned int master = current->wakee_flips;
unsigned int slave = p->wakee_flips;
int llc_size = this_cpu_read(sd_llc_size);
if (sibling_count_hint >= llc_size)
return 1;
if (master < slave)
swap(master, slave);
if (slave < llc_size || master < slave * llc_size)
return 0;
return 1;
}
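To make the flip heuristic concrete, here is a small user-space sketch (the sibling_count_hint check is left out and the numbers are hypothetical) of the same comparison with llc_size = 4:
#include <stdio.h>
/* Simplified stand-in for the kernel's wake_wide() check: returns 1
 * ("wide" wakeup, spread instead of staying cache-affine) when both
 * flip counters exceed the llc_size-based thresholds. */
static int wake_wide_check(unsigned int waker_flips,
                           unsigned int wakee_flips, int llc_size)
{
        unsigned int master = waker_flips, slave = wakee_flips;
        if (master < slave) {
                unsigned int tmp = master;
                master = slave;
                slave = tmp;
        }
        if (slave < (unsigned int)llc_size ||
            master < slave * (unsigned int)llc_size)
                return 0;
        return 1;
}
int main(void)
{
        printf("%d\n", wake_wide_check(40, 5, 4)); /* 5 >= 4 && 40 >= 20 -> 1 */
        printf("%d\n", wake_wide_check(10, 2, 4)); /* slave < llc_size   -> 0 */
        return 0;
}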
/*
* Disable WAKE_AFFINE in the case where task @p doesn't fit in the
* capacity of either the waking CPU @cpu or the previous CPU @prev_cpu.
*
* In that case WAKE_AFFINE doesn't make sense and we'll let
* BALANCE_WAKE sort things out.
*/
static int wake_cap(struct task_struct *p, int cpu, int prev_cpu)
{
long min_cap, max_cap;
/* The smaller of the original capacities of the waking CPU and p's previous CPU
*/
min_cap = min(capacity_orig_of(prev_cpu), capacity_orig_of(cpu));
/* The largest CPU capacity in the root domain, normally 1024 */
max_cap = cpu_rq(cpu)->rd->max_cpu_capacity.val;
/* Minimum capacity is close to max, no need to abort wake_affine */
if (max_cap - min_cap < max_cap >> 3)
return 0;
/* Update p's sched-entity load with PELT */
/* Bring task utilization in sync with prev_cpu */
sync_entity_load_avg(&p->se);
/* Can a CPU with min_cap capacity still fit task p? */
/* The check is min_cap * 1024 < task_util(p) * capacity_margin
(capacity_margin = 1138 in this tree), with task_util(p) in [0, 1024] */
return min_cap * 1024 < task_util(p) * capacity_margin;
}
static inline unsigned long task_util(struct task_struct *p)
{ /* WALT path */
#ifdef CONFIG_SCHED_WALT
if (!walt_disabled && sysctl_sched_use_walt_task_util) {
unsigned long demand = p->ravg.demand; /* the task's actual runtime demand */
/* Scale the demand to one window, multiplied by 1024: if the task used 50% of the
window, task_util = 0.5 * 1024 = 512. */
return (demand << 10) / walt_ravg_window;
}
#endif
return p->se.avg.util_avg;
}
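Putting the two pieces together, here is a user-space sketch of the WALT task_util() scaling and the wake_cap() margin check. The window length and capacities are hypothetical, and capacity_margin is assumed here to be 1280 (~25% headroom, the common upstream value); the tree quoted above uses 1138, so substitute your kernel's value:
#include <stdio.h>
#define SCHED_CAPACITY_SCALE    1024
#define WALT_RAVG_WINDOW_NS     20000000ULL     /* hypothetical 20 ms window */
#define CAPACITY_MARGIN         1280            /* tree-dependent headroom factor */
/* WALT-style utilization: demand scaled to [0, 1024] over one window */
static unsigned long walt_task_util(unsigned long long demand_ns)
{
        return (unsigned long)((demand_ns << 10) / WALT_RAVG_WINDOW_NS);
}
/* wake_cap()-style check: non-zero means the smaller CPU cannot fit the
 * task once the margin is applied, so WAKE_AFFINE is disabled. */
static int task_too_big(unsigned long min_cap, unsigned long util)
{
        return min_cap * SCHED_CAPACITY_SCALE < util * CAPACITY_MARGIN;
}
int main(void)
{
        /* a task that ran 10 ms of the 20 ms window -> util = 512 */
        unsigned long util = walt_task_util(10000000ULL);
        printf("util = %lu\n", util);                             /* 512 */
        printf("fits cap 430?  %d\n", !task_too_big(430, util));  /* 0: 440320 < 655360 */
        printf("fits cap 1024? %d\n", !task_too_big(1024, util)); /* 1 */
        return 0;
}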
/*
* Synchronize entity load avg of dequeued entity without locking
* the previous rq.
*/
void sync_entity_load_avg(struct sched_entity *se)
{
struct cfs_rq *cfs_rq = cfs_rq_of(se);
u64 last_update_time;
last_update_time = cfs_rq_last_update_time(cfs_rq);
/* Update the sched_entity load with PELT */
__update_load_avg(last_update_time, cpu_of(rq_of(cfs_rq)), &se->avg, 0, 0, NULL);
}
/**
* cpumask_test_cpu - test for a cpu in a cpumask
* @cpu: cpu number (< nr_cpu_ids)
* @cpumask: the cpumask pointer
*
* Returns 1 if @cpu is set in @cpumask, else returns 0
*/
/* Is the currently running CPU part of task p's affinity mask? */
static inline int cpumask_test_cpu(int cpu, const struct cpumask *cpumask)
{
return test_bit(cpumask_check(cpu), cpumask_bits((cpumask)));
}
want_affine is 1 only when all three of the following hold:
1. wake_wide() returns 0, i.e. the waker/wakee flip heuristic still allows an affine wakeup;
2. wake_cap() returns 0, i.e. both the waking CPU and prev_cpu have enough capacity for the task;
3. the waking CPU is in p's cpus_allowed mask.
If the EAS scheduling path is used, energy_aware() is true:
if (energy_aware())
return select_energy_cpu_brute(p, prev_cpu, sync);
static inline bool energy_aware(void)
{ /* true when the ENERGY_AWARE scheduler feature is enabled */
return sched_feat(ENERGY_AWARE);
}
static int select_energy_cpu_brute(struct task_struct *p, int prev_cpu, int sync)
{
struct sched_domain *sd;
int target_cpu = prev_cpu, tmp_target, tmp_backup;
bool boosted, prefer_idle;
/* Scheduling statistics */
schedstat_inc(p, se.statistics.nr_wakeups_secb_attempts);
schedstat_inc(this_rq(), eas_stats.secb_attempts);
/* sync == 0 on this wakeup path, so this branch is not taken */
if (sysctl_sched_sync_hint_enable && sync) {
int cpu = smp_processor_id();
if (cpumask_test_cpu(cpu, tsk_cpus_allowed(p))) {
schedstat_inc(p, se.statistics.nr_wakeups_secb_sync);
schedstat_inc(this_rq(), eas_stats.secb_sync);
return cpu;
}
}
rcu_read_lock();
/* Both parameters below can be configured from init.rc; boost is usually set, especially for the top-app group */
#ifdef CONFIG_CGROUP_SCHEDTUNE
/* Whether this task gets a util boost; if so, boosted = true. With an original load
of util, the boosted load is util + boost/100 * util. */
boosted = schedtune_task_boost(p) > 0;
/* Whether to prefer idle CPUs when picking a CPU; defaults to 0, i.e. prefer_idle = false */
prefer_idle = schedtune_prefer_idle(p) > 0;
#else
boosted = get_sysctl_sched_cfs_boost() > 0;
prefer_idle = 0;
#endif
/* Update the entity load again with PELT. Oddly, wake_cap() (reached via the
want_affine computation) has already synced the entity load, so why do it again here? */
sync_entity_load_avg(&p->se);
/* DEFINE_PER_CPU(struct sched_domain *, sd_ea); as covered when analyzing the creation
and initialization of sched domains/groups, every CPU has a sched domain at each SD topology level */
sd = rcu_dereference(per_cpu(sd_ea, prev_cpu));
/* Find a cpu with sufficient capacity */
/* the core function */
tmp_target = find_best_target(p, &tmp_backup, boosted, prefer_idle);
………………………………………………….
######1.2.1 The core function find_best_target (it is fairly long):
static inline int find_best_target(struct task_struct *p, int *backup_cpu,
bool boosted, bool prefer_idle)
{
unsigned long best_idle_min_cap_orig = ULONG_MAX;
/* util of task p after boost, i.e. task_util(p) plus boost% of it */
unsigned long min_util = boosted_task_util(p);
unsigned long target_capacity = ULONG_MAX;
unsigned long min_wake_util = ULONG_MAX;
unsigned long target_max_spare_cap = 0;
int best_idle_cstate = INT_MAX;
unsigned long target_cap = ULONG_MAX;
unsigned long best_idle_cap_orig = ULONG_MAX;
int best_idle = INT_MAX;
int backup_idle_cpu = -1;
struct sched_domain *sd;
struct sched_group *sg;
int best_active_cpu = -1;
int best_idle_cpu = -1;
int target_cpu = -1;
int cpu, i;
/* root_domain of the current CPU's runqueue */
struct root_domain *rd = cpu_rq(smp_processor_id())->rd;
/* Maximum CPU capacity in this root_domain */
unsigned long max_cap = rd->max_cpu_capacity.val;
*backup_cpu = -1;
schedstat_inc(p, se.statistics.nr_wakeups_fbt_attempts);
schedstat_inc(this_rq(), eas_stats.fbt_attempts);
/* Find start CPU based on boost value */
/* start_cpu() picks the start CPU based on the boost value; here it is
rd->min_cap_orig_cpu, the first CPU of the minimum-capacity cluster (CPU 0) */
cpu = start_cpu(boosted);
if (cpu < 0) {
schedstat_inc(p, se.statistics.nr_wakeups_fbt_no_cpu);
schedstat_inc(this_rq(), eas_stats.fbt_no_cpu);
return -1;
}
/* Find SD for the start CPU */
/* the sd_ea sched domain of the start CPU */
sd = rcu_dereference(per_cpu(sd_ea, cpu));
if (!sd) {
schedstat_inc(p, se.statistics.nr_wakeups_fbt_no_sd);
schedstat_inc(this_rq(), eas_stats.fbt_no_sd);
return -1;
}
/* Scan CPUs in all SDs */
sg = sd->groups;
do {
for_each_cpu_and(i, tsk_cpus_allowed(p), sched_group_cpus(sg)) {
unsigned long capacity_orig = capacity_orig_of(i);
unsigned long wake_util, new_util;
if (!cpu_online(i))
continue;
if (walt_cpu_high_irqload(i))
continue;
/*
* p's blocked utilization is still accounted for on prev_cpu
* so prev_cpu will receive a negative bias due to the double
* accounting. However, the blocked utilization may be zero.
*/
wake_util = cpu_util_wake(i, p);
new_util = wake_util + task_util(p);
/*
* Ensure minimum capacity to grant the required boost.
* The target CPU can be already at a capacity level higher
* than the one required to boost the task.
*/
new_util = max(min_util, new_util);
if (new_util > capacity_orig) {
if (idle_cpu(i)) {
int idle_idx;
idle_idx =
idle_get_state_idx(cpu_rq(i));
if (capacity_orig >
best_idle_cap_orig) {
best_idle_cap_orig =
capacity_orig;
best_idle = idle_idx;
backup_idle_cpu = i;
continue;
}
/*
* Skip CPUs in deeper idle state, but
* only if they are also less energy
* efficient.
* IOW, prefer a deep IDLE LITTLE CPU
* vs a shallow idle big CPU.
*/
if (sysctl_sched_cstate_aware &&
best_idle <= idle_idx)
continue;
/* Keep track of best idle CPU */
best_idle_cap_orig = capacity_orig;
best_idle = idle_idx;
backup_idle_cpu = i;
continue;
}
if (capacity_orig > target_cap) {
target_cap = capacity_orig;
min_wake_util = wake_util;
best_active_cpu = i;
continue;
}
if (wake_util > min_wake_util)
continue;
min_wake_util = wake_util;
best_active_cpu = i;
continue;
}
/*
* Enforce EAS mode
*
* For non latency sensitive tasks, skip CPUs that
* will be overutilized by moving the task there.
*
* The goal here is to remain in EAS mode as long as
* possible at least for !prefer_idle tasks.
*/
if (capacity_orig == max_cap)
if (idle_cpu(i))
goto skip;
if ((new_util * capacity_margin) >
(capacity_orig * SCHED_CAPACITY_SCALE))
continue;
skip:
if (idle_cpu(i)) {
int idle_idx;
if (prefer_idle ||
cpumask_test_cpu(i, &min_cap_cpu_mask)) {
trace_sched_find_best_target(p,
prefer_idle, min_util, cpu,
best_idle_cpu, best_active_cpu,
i);
return i;
}
idle_idx = idle_get_state_idx(cpu_rq(i));
/* Select idle CPU with lower cap_orig */
if (capacity_orig > best_idle_min_cap_orig)
continue;
/*
* Skip CPUs in deeper idle state, but only
* if they are also less energy efficient.
* IOW, prefer a deep IDLE LITTLE CPU vs a
* shallow idle big CPU.
*/
if (sysctl_sched_cstate_aware &&
best_idle_cstate <= idle_idx)
continue;
/* Keep track of best idle CPU */
best_idle_min_cap_orig = capacity_orig;
best_idle_cstate = idle_idx;
best_idle_cpu = i;
continue;
}
/* Favor CPUs with smaller capacity */
if (capacity_orig > target_capacity)
continue;
/* Favor CPUs with maximum spare capacity */
if ((capacity_orig - new_util) < target_max_spare_cap)
continue;
target_max_spare_cap = capacity_orig - new_util;
target_capacity = capacity_orig;
target_cpu = i;
}
} while (sg = sg->next, sg != sd->groups);
/*
* For non latency sensitive tasks, cases B and C in the previous loop,
* we pick the best IDLE CPU only if we was not able to find a target
* ACTIVE CPU.
*
* Policies priorities:
*
* - prefer_idle tasks:
*
* a) IDLE CPU available, we return immediately
* b) ACTIVE CPU where task fits and has the bigger maximum spare
* capacity (i.e. target_cpu)
* c) ACTIVE CPU with less contention due to other tasks
* (i.e. best_active_cpu)
*
* - NON prefer_idle tasks:
*
* a) ACTIVE CPU: target_cpu
* b) IDLE CPU: best_idle_cpu
*/
if (target_cpu == -1) {
if (best_idle_cpu != -1)
target_cpu = best_idle_cpu;
else
target_cpu = (backup_idle_cpu != -1)
? backup_idle_cpu
: best_active_cpu;
} else
*backup_cpu = best_idle_cpu;
trace_sched_find_best_target(p, prefer_idle, min_util, cpu,
best_idle_cpu, best_active_cpu,
target_cpu);
schedstat_inc(p, se.statistics.nr_wakeups_fbt_count);
schedstat_inc(this_rq(), eas_stats.fbt_count);
return target_cpu;
}
The do {} while () loop above is analyzed in the following parts:
/**
* for_each_cpu_and - iterate over every cpu in both masks
* @cpu: the (optionally unsigned) integer iterator
* @mask: the first cpumask pointer
* @and: the second cpumask pointer
*
* This saves a temporary CPU mask in many places. It is equivalent to:
* struct cpumask tmp;
* cpumask_and(&tmp, &mask, &and);
* for_each_cpu(cpu, &tmp)
* ...
*
* After the loop, cpu is >= nr_cpu_ids.
*/
#define for_each_cpu_and(cpu, mask, and) \
for ((cpu) = -1; \
(cpu) = cpumask_next_and((cpu), (mask), (and)), \
(cpu) < nr_cpu_ids;)
After those checks, the loop adds the woken task's task_util(p) to the utilization of the CPU being visited (wake_util) to get new_util, and then compares capacity against utilization to pick target_cpu. The body of the loop works as follows.
While iterating over the candidate CPUs:
If new_util > the visited CPU's capacity_orig (taken from DT), there are two cases:
1. The CPU is idle: record its idle state index, update the three variables below, and move on to the next candidate CPU:
/* capacity of this idle CPU, kept in best_idle_cap_orig */
best_idle_cap_orig = capacity_orig;
best_idle = idle_idx; // its idle state index
backup_idle_cpu = i; // idle CPU id, used later as a backup
2. The CPU is not idle: update the two variables below and move on to the next candidate CPU:
min_wake_util = wake_util; // smallest wake_util seen so far
best_active_cpu = i; // best active (non-idle) CPU id
If new_util <= the visited CPU's capacity_orig, the logic is:
1. If capacity_orig == max_cap and the visited CPU happens to be idle, jump straight to updating the three variables below:
/* capacity of this idle CPU, kept in best_idle_min_cap_orig */
best_idle_min_cap_orig = capacity_orig;
best_idle_cstate = idle_idx; // its idle state index
best_idle_cpu = i; // best idle CPU, used later
2. If (new_util * capacity_margin) > (capacity_orig * SCHED_CAPACITY_SCALE), skip to the next CPU: placing the task here would over-utilize it, so there is no point considering it further.
3. If capacity_orig == max_cap does not hold, the idle check still runs, and an idle CPU goes through the same bookkeeping as in step 1.
4. If the visited CPU is not idle, compare capacity_orig - new_util against target_max_spare_cap. The goal is to pick the CPU with the largest spare headroom, so that the newly woken task p does not land on a CPU with too little headroom and later trigger load balancing, wasting system resources. The three variables below are updated whenever a CPU wins this comparison:
/* spare util headroom; used to find the CPU with the most headroom */
target_max_spare_cap = capacity_orig - new_util;
/* capacity of the current target */
target_capacity = capacity_orig;
/* the chosen target CPU */
target_cpu = i;
Now for the final selection after the loop. As the comment block in the code explains, tasks split into two kinds according to the prefer_idle flag, and that flag decides which type of CPU is preferred. With that in mind, the fallback logic after the loop (i.e. when the loop never managed to record a maximum-spare-capacity target) is:
If target_cpu == -1: use best_idle_cpu if one was found; otherwise fall back to backup_idle_cpu, and finally to best_active_cpu.
If target_cpu != -1: keep target_cpu and report best_idle_cpu through *backup_cpu as a backup.
Finally, target_cpu is returned as the chosen CPU id.
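To see only the "smallest capacity, then largest spare headroom" preference in isolation, here is a toy user-space sketch over a hypothetical CPU table (idle CPUs, c-state awareness and the boost/prefer_idle handling are deliberately left out):
#include <stdio.h>
struct cpu_stat {
        int id;
        unsigned long capacity_orig;    /* capacity from DT, scaled to 1024 */
        unsigned long util;             /* current utilization of the CPU */
};
/* Mirrors only the "active target" branch of find_best_target():
 * pick a CPU that still fits new_util, favouring the smallest
 * capacity_orig and, among equals, the largest spare capacity. */
static int pick_target(const struct cpu_stat *cpus, int nr,
                       unsigned long task_util)
{
        unsigned long target_capacity = ~0UL;
        unsigned long target_max_spare = 0;
        int target_cpu = -1;
        for (int i = 0; i < nr; i++) {
                unsigned long new_util = cpus[i].util + task_util;
                if (new_util > cpus[i].capacity_orig)
                        continue;       /* the task would not fit */
                if (cpus[i].capacity_orig > target_capacity)
                        continue;       /* favour smaller CPUs */
                if (cpus[i].capacity_orig - new_util < target_max_spare)
                        continue;       /* favour more headroom */
                target_max_spare = cpus[i].capacity_orig - new_util;
                target_capacity = cpus[i].capacity_orig;
                target_cpu = cpus[i].id;
        }
        return target_cpu;
}
int main(void)
{
        struct cpu_stat cpus[] = {
                { 0,  512, 100 }, { 1,  512, 300 },     /* LITTLE cluster */
                { 4, 1024,  50 }, { 5, 1024, 400 },     /* big cluster */
        };
        /* util 150 fits CPU0 (spare 262) better than CPU1 (spare 62) */
        printf("target = %d\n", pick_target(cpus, 4, 150));
        return 0;
}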
static int select_energy_cpu_brute(struct task_struct *p, int prev_cpu, int sync)
{
.......................
/* Find a cpu with sufficient capacity */
/* find_best_target() was analyzed above */
tmp_target = find_best_target(p, &tmp_backup, boosted, prefer_idle);
if (!sd)
goto unlock;
if (tmp_target >= 0) {
target_cpu = tmp_target;
/* If (boosted or prefer_idle) and target_cpu is idle, or target_cpu belongs to the
minimum-capacity cluster, target_cpu is the final choice and we bail out here. */
if (((boosted || prefer_idle) && idle_cpu(target_cpu)) ||
cpumask_test_cpu(target_cpu, &min_cap_cpu_mask)) {
schedstat_inc(p, se.statistics.nr_wakeups_secb_idle_bt);
schedstat_inc(this_rq(), eas_stats.secb_idle_bt);
goto unlock;
}
}
/* If target_cpu equals p's previous CPU and a backup (best idle) CPU exists, switch
target_cpu to that backup so the task does not stay on prev_cpu. Why??? */
if (target_cpu == prev_cpu && tmp_backup >= 0) {
target_cpu = tmp_backup;
tmp_backup = -1;
}
if (target_cpu != prev_cpu) {
int delta = 0;
/* Build the energy environment describing the would-be migration */
struct energy_env eenv = {
.util_delta = task_util(p),
.src_cpu = prev_cpu,
.dst_cpu = target_cpu,
.task = p,
.trg_cpu = target_cpu,
};
#ifdef CONFIG_SCHED_WALT
if (!walt_disabled && sysctl_sched_use_walt_cpu_util &&
p->state == TASK_WAKING)
/* p's own util load */
delta = task_util(p);
#endif
/* Not enough spare capacity on previous cpu */
/* prev_cpu is over-utilized: its load exceeds roughly 90% of its capacity. */
if (__cpu_overutilized(prev_cpu, delta, p)) {
/* Prefer the more energy-reasonable CPU, i.e. an idle CPU in the smaller cluster */
if (tmp_backup >= 0 &&
capacity_orig_of(tmp_backup) <
capacity_orig_of(target_cpu))
target_cpu = tmp_backup;
schedstat_inc(p, se.statistics.nr_wakeups_secb_insuff_cap);
schedstat_inc(this_rq(), eas_stats.secb_insuff_cap);
goto unlock;
}
/* Compute the energy difference between prev_cpu and target_cpu, i.e. the difference
of the total energy summed over the MC up to the DIE topology levels; if it is >= 0 the block below runs. */
if (energy_diff(&eenv) >= 0) {
/* No energy saving for target_cpu, try backup */
target_cpu = tmp_backup;
eenv.dst_cpu = target_cpu;
eenv.trg_cpu = target_cpu;
if (tmp_backup < 0 ||
tmp_backup == prev_cpu ||
energy_diff(&eenv) >= 0) {
schedstat_inc(p, se.statistics.nr_wakeups_secb_no_nrg_sav);
schedstat_inc(this_rq(), eas_stats.secb_no_nrg_sav);
target_cpu = prev_cpu;
goto unlock;
}
}
schedstat_inc(p, se.statistics.nr_wakeups_secb_nrg_sav);
schedstat_inc(this_rq(), eas_stats.secb_nrg_sav);
goto unlock;
}
schedstat_inc(p, se.statistics.nr_wakeups_secb_count);
schedstat_inc(this_rq(), eas_stats.secb_count);
unlock:
rcu_read_unlock();
return target_cpu;
}
To summarize, there are four outcomes:
1. If (boosted or prefer_idle) and target_cpu is idle, or target_cpu sits in the minimum-capacity cluster, target_cpu is used directly.
2. If target_cpu turned out to be prev_cpu and a backup idle CPU exists, target_cpu is switched to that backup.
3. If prev_cpu is over-utilized, the smaller-capacity backup is preferred and the function returns.
4. Otherwise energy_diff() decides: if moving to target_cpu saves no energy, the backup is tried, and if that saves nothing either, the task stays on prev_cpu.
These four cases spell out how prev_cpu and target_cpu trade places. Next, let's look at how energy_diff(&eenv) computes the power relationship between prev_cpu and target_cpu.
static inline int
energy_diff(struct energy_env *eenv)
{
int boost = schedtune_task_boost(eenv->task);
int nrg_delta;
/* First compute the absolute energy difference */
/* Conpute "absolute" energy diff */
__energy_diff(eenv);
/* Return energy diff when boost margin is 0 */
if (1 || boost == 0) {
trace_sched_energy_diff(eenv->task,
eenv->src_cpu, eenv->dst_cpu, eenv->util_delta,
eenv->nrg.before, eenv->nrg.after, eenv->nrg.diff,
eenv->cap.before, eenv->cap.after, eenv->cap.delta,
0, -eenv->nrg.diff);
return eenv->nrg.diff;
}
/* Compute normalized energy diff */
nrg_delta = normalize_energy(eenv->nrg.diff);
eenv->nrg.delta = nrg_delta;
eenv->payoff = schedtune_accept_deltas(
eenv->nrg.delta,
eenv->cap.delta,
eenv->task);
trace_sched_energy_diff(eenv->task,
eenv->src_cpu, eenv->dst_cpu, eenv->util_delta,
eenv->nrg.before, eenv->nrg.after, eenv->nrg.diff,
eenv->cap.before, eenv->cap.after, eenv->cap.delta,
eenv->nrg.delta, eenv->payoff);
/*
* When SchedTune is enabled, the energy_diff() function will return
* the computed energy payoff value. Since the energy_diff() return
* value is expected to be negative by its callers, this evaluation
* function return a negative value each time the evaluation return a
* positive payoff, which is the condition for the acceptance of
* a scheduling decision
*/
return -eenv->payoff;
}
/*
* energy_diff(): Estimate the energy impact of changing the utilization
* distribution. eenv specifies the change: utilisation amount, source, and
* destination cpu. Source or destination cpu may be -1 in which case the
* utilization is removed from or added to the system (e.g. task wake-up). If
* both are specified, the utilization is migrated.
*/
static inline int __energy_diff(struct energy_env *eenv)
{
struct sched_domain *sd;
struct sched_group *sg;
int sd_cpu = -1, energy_before = 0, energy_after = 0;
int diff, margin;
/* eenv was filled in by select_energy_cpu_brute(); build the "before migration" environment from it */
struct energy_env eenv_before = {
.util_delta = task_util(eenv->task),
.src_cpu = eenv->src_cpu,
.dst_cpu = eenv->dst_cpu,
.trg_cpu = eenv->src_cpu,
.nrg = { 0, 0, 0, 0},
.cap = { 0, 0, 0 },
.task = eenv->task,
};
if (eenv->src_cpu == eenv->dst_cpu)
return 0;
/* sd comes from the sd_ea cache: the CPU's top-level sched domain (the DIE level) */
sd_cpu = (eenv->src_cpu != -1) ? eenv->src_cpu : eenv->dst_cpu;
sd = rcu_dereference(per_cpu(sd_ea, sd_cpu));
if (!sd)
return 0; /* Error */
sg = sd->groups;
/* Walk the sched_group list and, for every group that matters, accumulate the energy
for both eenv_before and eenv */
do { /* only groups containing src_cpu or dst_cpu contribute */
if (cpu_in_sg(sg, eenv->src_cpu) || cpu_in_sg(sg, eenv->dst_cpu)) {
/* this top-level sg becomes eenv's sg_top */
eenv_before.sg_top = eenv->sg_top = sg;
/* energy of this sg under the eenv_before (pre-migration) utilization */
if (sched_group_energy(&eenv_before))
return 0; /* Invalid result abort */
energy_before += eenv_before.energy;
/* Keep track of SRC cpu (before) capacity */
eenv->cap.before = eenv_before.cap.before;
eenv->cap.delta = eenv_before.cap.delta;
/* energy of this sg under the eenv (post-migration) utilization */
if (sched_group_energy(eenv))
return 0; /* Invalid result abort */
energy_after += eenv->energy;
}
} while (sg = sg->next, sg != sd->groups);
/* diff = energy_after - energy_before */
eenv->nrg.before = energy_before;
eenv->nrg.after = energy_after;
eenv->nrg.diff = eenv->nrg.after - eenv->nrg.before;
eenv->payoff = 0;
#ifndef CONFIG_SCHED_TUNE
trace_sched_energy_diff(eenv->task,
eenv->src_cpu, eenv->dst_cpu, eenv->util_delta,
eenv->nrg.before, eenv->nrg.after, eenv->nrg.diff,
eenv->cap.before, eenv->cap.after, eenv->cap.delta,
eenv->nrg.delta, eenv->payoff);
#endif
/*
* Dead-zone margin preventing too many migrations.
*/
margin = eenv->nrg.before >> 6; /* ~1.56% */
diff = eenv->nrg.after - eenv->nrg.before;
eenv->nrg.diff = (abs(diff) < margin) ? 0 : eenv->nrg.diff;
return eenv->nrg.diff;
}
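The dead-zone at the end is easy to check with numbers: with nrg.before = 4000 the margin is 4000 >> 6 = 62, so a saving of only 50 is rounded to zero and no migration happens, while a saving of 100 is kept. A tiny stand-alone sketch (unit-less, hypothetical energy values):
#include <stdio.h>
#include <stdlib.h>
/* Stand-alone version of the dead-zone filter at the end of
 * __energy_diff(): differences below ~1.56% of the "before" energy
 * are treated as 0 so they never justify a migration. */
static int energy_diff_filtered(int before, int after)
{
        int margin = before >> 6;       /* ~1.56% */
        int diff = after - before;
        return (abs(diff) < margin) ? 0 : diff;
}
int main(void)
{
        printf("%d\n", energy_diff_filtered(4000, 3950)); /* |-50| < 62 -> 0 */
        printf("%d\n", energy_diff_filtered(4000, 3900)); /* -100 kept */
        return 0;
}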
Next, let's look at how sched_group_energy() is implemented:
/*
* sched_group_energy(): Computes the absolute energy consumption of cpus
* belonging to the sched_group including shared resources shared only by
* members of the group. Iterates over all cpus in the hierarchy below the
* sched_group starting from the bottom working it's way up before going to
* the next cpu until all cpus are covered at all levels. The current
* implementation is likely to gather the same util statistics multiple times.
* This can probably be done in a faster but more complex way.
* Note: sched_group_energy() may fail when racing with sched_domain updates.
*/
static int sched_group_energy(struct energy_env *eenv)
{
struct cpumask visit_cpus;
u64 total_energy = 0;
int cpu_count;
WARN_ON(!eenv->sg_top->sge);
cpumask_copy(&visit_cpus, sched_group_cpus(eenv->sg_top));
/* If a cpu is hotplugged in while we are in this function,
* it does not appear in the existing visit_cpus mask
* which came from the sched_group pointer of the
* sched_domain pointed at by sd_ea for either the prev
* or next cpu and was dereferenced in __energy_diff.
* Since we will dereference sd_scs later as we iterate
* through the CPUs we expect to visit, new CPUs can
* be present which are not in the visit_cpus mask.
* Guard this with cpu_count.
*/
cpu_count = cpumask_weight(&visit_cpus);
/* Starting from the top-level sg_top, build the set of CPUs to visit (visit_cpus) and
walk them one by one. Despite the complicated looping, only a handful of energies are
computed: taking cpu0-cpu3 as an example, the power of the 4 bottom-level sched groups plus the power of 1 top-level sched group. */
while (!cpumask_empty(&visit_cpus)) {
struct sched_group *sg_shared_cap = NULL;
/* take the first CPU left in visit_cpus */
int cpu = cpumask_first(&visit_cpus);
struct sched_domain *sd;
/*
* Is the group utilization affected by cpus outside this
* sched_group?
* This sd may have groups with cpus which were not present
* when we took visit_cpus.
*/
sd = rcu_dereference(per_cpu(sd_scs, cpu));
if (sd && sd->parent)
sg_shared_cap = sd->parent->groups;
/* walk this CPU's sched domains from the bottom level up to the top */
for_each_domain(cpu, sd) {
struct sched_group *sg = sd->groups;
/* at the top-level sd only one sg is computed */
/* Has this sched_domain already been visited? */
if (sd->child && group_first_cpu(sg) != cpu)
break;
/* iterate over every sg in this level's sg list */
do {
unsigned long group_util;
int sg_busy_energy, sg_idle_energy;
int cap_idx, idle_idx;
if (sg_shared_cap && sg_shared_cap->group_weight >= sg->group_weight)
eenv->sg_cap = sg_shared_cap;
else
eenv->sg_cap = sg;
/* With the utilization change described by eenv, find the capacity index that
covers the most loaded CPU in this sg */
cap_idx = find_new_capacity(eenv, sg->sge);
if (sg->group_weight == 1) {
/* Remove capacity of src CPU (before task move) */
if (eenv->trg_cpu == eenv->src_cpu &&
cpumask_test_cpu(eenv->src_cpu, sched_group_cpus(sg))) {
eenv->cap.before = sg->sge->cap_states[cap_idx].cap;
eenv->cap.delta -= eenv->cap.before;
}
/* Add capacity of dst CPU (after task move) */
if (eenv->trg_cpu == eenv->dst_cpu &&
cpumask_test_cpu(eenv->dst_cpu, sched_group_cpus(sg))) {
eenv->cap.after = sg->sge->cap_states[cap_idx].cap;
eenv->cap.delta += eenv->cap.after;
}
}
/* shallowest idle state index among all CPUs of the sg */
idle_idx = group_idle_state(eenv, sg);
/* sum of the normalized utilization of all CPUs in the sg, relative to
sg->sge->cap_states[eenv->cap_idx].cap */
group_util = group_norm_util(eenv, sg);
/* energy = busy energy + idle energy */
sg_busy_energy = (group_util * sg->sge->cap_states[cap_idx].power);
sg_idle_energy = ((SCHED_LOAD_SCALE-group_util)
* sg->sge->idle_states[idle_idx].power);
total_energy += sg_busy_energy + sg_idle_energy;
if (!sd->child) {
/*
* cpu_count here is the number of
* cpus we expect to visit in this
* calculation. If we race against
* hotplug, we can have extra cpus
* added to the groups we are
* iterating which do not appear in
* the visit_cpus mask. In that case
* we are not able to calculate energy
* without restarting so we will bail
* out and use prev_cpu this time.
*/
if (!cpu_count)
return -EINVAL;
/* bottom-level sd visited: remove this sg's CPUs from visit_cpus */
cpumask_xor(&visit_cpus, &visit_cpus, sched_group_cpus(sg));
cpu_count--;
}
if (cpumask_equal(sched_group_cpus(sg), sched_group_cpus(eenv->sg_top)))
goto next_cpu;
} while (sg = sg->next, sg != sd->groups);
}
/*
* If we raced with hotplug and got an sd NULL-pointer;
* returning a wrong energy estimation is better than
* entering an infinite loop.
* Specifically: If a cpu is unplugged after we took
* the visit_cpus mask, it no longer has an sd_scs
* pointer, so when we dereference it, we get NULL.
*/
if (cpumask_test_cpu(cpu, &visit_cpus))
return -EINVAL;
next_cpu: /* all of this CPU's sched domains, bottom to top, have been visited: drop it from visit_cpus */
cpumask_clear_cpu(cpu, &visit_cpus);
continue;
}
eenv->energy = total_energy >> SCHED_CAPACITY_SHIFT;
return 0;
}
The idea behind the calculation is simple, but working through the code is fairly brain-burning and painful...
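Still, the per-group term itself is just busy energy plus idle energy, scaled back down by SCHED_CAPACITY_SHIFT at the very end. A worked example with hypothetical energy-model numbers for a single LITTLE sched_group:
#include <stdio.h>
#define SCHED_LOAD_SCALE        1024
#define SCHED_CAPACITY_SHIFT    10
/* One term of the sched_group_energy() sum. group_util is the group's
 * normalized utilization in [0, 1024]; busy_power and idle_power would
 * come from the energy model tables (sge->cap_states / sge->idle_states). */
static unsigned long long sg_energy(unsigned long group_util,
                                    unsigned long busy_power,
                                    unsigned long idle_power)
{
        unsigned long long busy = (unsigned long long)group_util * busy_power;
        unsigned long long idle = (unsigned long long)
                                  (SCHED_LOAD_SCALE - group_util) * idle_power;
        return busy + idle;
}
int main(void)
{
        /* e.g. a LITTLE cluster group: 40% utilized, busy power 140,
         * idle power 10 (energy-model units) */
        unsigned long long total = sg_energy(410, 140, 10);
        /* eenv->energy = total_energy >> SCHED_CAPACITY_SHIFT */
        printf("group energy = %llu\n", total >> SCHED_CAPACITY_SHIFT); /* 62 */
        return 0;
}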
Next, the remaining part of select_task_rq_fair:
static int
select_task_rq_fair(struct task_struct *p, int prev_cpu, int sd_flag, int wake_flags,
int sibling_count_hint)
{
............
if (energy_aware())
return select_energy_cpu_brute(p, prev_cpu, sync);
/* If EAS is not enabled, fall back to the traditional way of picking a suitable CPU */
rcu_read_lock();
for_each_domain(cpu, tmp) {
if (!(tmp->flags & SD_LOAD_BALANCE))
break;
/*
* If both cpu and prev_cpu are part of this domain,
* cpu is a valid SD_WAKE_AFFINE target.
*/
if (want_affine && (tmp->flags & SD_WAKE_AFFINE) &&
cpumask_test_cpu(prev_cpu, sched_domain_span(tmp))) {
affine_sd = tmp;
break;
}
if (tmp->flags & sd_flag)
sd = tmp;
else if (!want_affine)
break;
}
if (affine_sd) {
sd = NULL; /* Prefer wake_affine over balance flags */
if (cpu != prev_cpu && wake_affine(affine_sd, p, prev_cpu, sync))
new_cpu = cpu;
}
if (sd && !(sd_flag & SD_BALANCE_FORK)) {
/*
* We're going to need the task's util for capacity_spare_wake
* in find_idlest_group. Sync it up to prev_cpu's
* last_update_time.
*/
sync_entity_load_avg(&p->se);
}
if (!sd) {
if (sd_flag & SD_BALANCE_WAKE) /* XXX always ? */
new_cpu = select_idle_sibling(p, prev_cpu, new_cpu);
} else {
new_cpu = find_idlest_cpu(sd, p, cpu, prev_cpu, sd_flag);
}
rcu_read_unlock();
return new_cpu;
}
This involves the traditional load balancer, which will be analyzed in a later article.
###2 What happens when the chosen CPU is not the CPU the woken task is currently on?
cpu = select_task_rq(p, p->wake_cpu, SD_BALANCE_WAKE, wake_flags,
sibling_count_hint);
if (task_cpu(p) != cpu) {
wake_flags |= WF_MIGRATED;
set_task_cpu(p, cpu);
}
……………..
void set_task_cpu(struct task_struct *p, unsigned int new_cpu)
{
#ifdef CONFIG_SCHED_DEBUG
/*
* We should never call set_task_cpu() on a blocked task,
* ttwu() will sort out the placement.
*/
WARN_ON_ONCE(p->state != TASK_RUNNING && p->state != TASK_WAKING &&
!p->on_rq);
#ifdef CONFIG_LOCKDEP
/*
* The caller should hold either p->pi_lock or rq->lock, when changing
* a task's CPU. ->pi_lock for waking tasks, rq->lock for runnable tasks.
*
* sched_move_task() holds both and thus holding either pins the cgroup,
* see task_group().
*
* Furthermore, all task_rq users should acquire both locks, see
* task_rq_lock().
*/
WARN_ON_ONCE(debug_locks && !(lockdep_is_held(&p->pi_lock) ||
lockdep_is_held(&task_rq(p)->lock)));
#endif
#endif
trace_sched_migrate_task(p, new_cpu);
if (task_cpu(p) != new_cpu) {
if (p->sched_class->migrate_task_rq)
p->sched_class->migrate_task_rq(p);
p->se.nr_migrations++;
perf_event_task_migrate(p);
walt_fixup_busy_time(p, new_cpu);
}
__set_task_cpu(p, new_cpu);
}
/*
* Called immediately before a task is migrated to a new cpu; task_cpu(p) and
* cfs_rq_of(p) references at time of call are still valid and identify the
* previous cpu. However, the caller only guarantees p->pi_lock is held; no
* other assumptions, including the state of rq->lock, should be made.
*/
static void migrate_task_rq_fair(struct task_struct *p)
{
/*
* We are supposed to update the task to "current" time, then its up to date
* and ready to go to new CPU/cfs_rq. But we have difficulty in getting
* what current time is, so simply throw away the out-of-date time. This
* will result in the wakee task is less decayed, but giving the wakee more
* load sounds not bad.
*/
/* Remove the entity's load contribution from the previous CPU's cfs_rq; it will be re-attached on the new CPU */
remove_entity_load_avg(&p->se);
/* Reset the entity's load last_update_time and its exec_start */
/* Tell new CPU we are migrated */
p->se.avg.last_update_time = 0;
/* We have migrated, no longer consider this task hot */
p->se.exec_start = 0;
}
/*
* Synchronize entity load avg of dequeued entity without locking
* the previous rq.
*/
void sync_entity_load_avg(struct sched_entity *se)
{
struct cfs_rq *cfs_rq = cfs_rq_of(se);
u64 last_update_time;
last_update_time = cfs_rq_last_update_time(cfs_rq);
__update_load_avg(last_update_time, cpu_of(rq_of(cfs_rq)), &se->avg, 0, 0, NULL);
}
/*
* Task first catches up with cfs_rq, and then subtract
* itself from the cfs_rq (task must be off the queue now).
*/
void remove_entity_load_avg(struct sched_entity *se)
{
struct cfs_rq *cfs_rq = cfs_rq_of(se);
/*
* tasks cannot exit without having gone through wake_up_new_task() ->
* post_init_entity_util_avg() which will have added things to the
* cfs_rq, so we can remove unconditionally.
*
* Similarly for groups, they will have passed through
* post_init_entity_util_avg() before unregister_sched_fair_group()
* calls this.
*/
sync_entity_load_avg(se);
atomic_long_add(se->avg.load_avg, &cfs_rq->removed_load_avg);
atomic_long_add(se->avg.util_avg, &cfs_rq->removed_util_avg);
}
###3 How is task p enqueued?
The enqueue operation: ttwu_queue(p, cpu);
static void ttwu_queue(struct task_struct *p, int cpu)
{
struct rq *rq = cpu_rq(cpu);
#if defined(CONFIG_SMP)
if (sched_feat(TTWU_QUEUE) && !cpus_share_cache(smp_processor_id(), cpu)) {
sched_clock_cpu(cpu); /* sync clocks x-cpu */
ttwu_queue_remote(p, cpu);
return;
}
#endif
raw_spin_lock(&rq->lock);
lockdep_pin_lock(&rq->lock);
ttwu_do_activate(rq, p, 0);
lockdep_unpin_lock(&rq->lock);
raw_spin_unlock(&rq->lock);
}
static void
ttwu_do_activate(struct rq *rq, struct task_struct *p, int wake_flags)
{
lockdep_assert_held(&rq->lock);
#ifdef CONFIG_SMP
if (p->sched_contributes_to_load)
rq->nr_uninterruptible--;
#endif
ttwu_activate(rq, p, ENQUEUE_WAKEUP | ENQUEUE_WAKING);
ttwu_do_wakeup(rq, p, wake_flags);
}
static inline void ttwu_activate(struct rq *rq, struct task_struct *p, int en_flags)
{ /* The core enqueue path: activate_task() updates the task's vruntime and inserts it into the rb-tree */
activate_task(rq, p, en_flags);
p->on_rq = TASK_ON_RQ_QUEUED;
/* if a worker is waking up, notify workqueue */
if (p->flags & PF_WQ_WORKER)
wq_worker_waking_up(p, cpu_of(rq));
}
/*
* Mark the task runnable and perform wakeup-preemption.
*/
static void
ttwu_do_wakeup(struct rq *rq, struct task_struct *p, int wake_flags)
{
check_preempt_curr(rq, p, wake_flags);
/* Mark the task TASK_RUNNING, i.e. it is now runnable/running on a CPU */
p->state = TASK_RUNNING;
trace_sched_wakeup(p);
#ifdef CONFIG_SMP
/* not defined for CFS */
if (p->sched_class->task_woken) {
/*
* Our task @p is fully woken up and running; so its safe to
* drop the rq->lock, hereafter rq is only used for statistics.
*/
lockdep_unpin_lock(&rq->lock);
p->sched_class->task_woken(rq, p);
lockdep_pin_lock(&rq->lock);
}
/* If the rq (i.e. the CPU) was idle before, update its idle timestamp and average idle time */
if (rq->idle_stamp) {
u64 delta = rq_clock(rq) - rq->idle_stamp;
u64 max = 2*rq->max_idle_balance_cost;//max=1ms
update_avg(&rq->avg_idle, delta);
if (rq->avg_idle > max)
rq->avg_idle = max;
rq->idle_stamp = 0;
}
#endif
}
void activate_task(struct rq *rq, struct task_struct *p, int flags)
{ /* If the task was in TASK_UNINTERRUPTIBLE, decrement the count of uninterruptible
tasks, since the task is now being woken up to run */
if (task_contributes_to_load(p))
rq->nr_uninterruptible--;
/* the actual enqueue */
enqueue_task(rq, p, flags);
}
#define task_contributes_to_load(task) \
((task->state & TASK_UNINTERRUPTIBLE) != 0 && \
(task->flags & PF_FROZEN) == 0 && \
(task->state & TASK_NOLOAD) == 0)
static inline void enqueue_task(struct rq *rq, struct task_struct *p, int flags)
{
update_rq_clock(rq);
/* flags = 0x3, and 0x3 & ENQUEUE_RESTORE (0x10) == 0, so the branch below runs */
if (!(flags & ENQUEUE_RESTORE))
/* record the timestamp at which p is queued */
sched_info_queued(rq, p);
#ifdef CONFIG_INTEL_DWS /* not defined in this config */
if (sched_feat(INTEL_DWS))
update_rq_runnable_task_avg(rq);
#endif
/* calls back into enqueue_task_fair() for CFS tasks */
p->sched_class->enqueue_task(rq, p, flags);
}
The core function enqueue_task_fair is analyzed below:
/*
* The enqueue_task method is called before nr_running is
* increased. Here we update the fair scheduling stats and
* then put the task into the rbtree:
*/
static void
enqueue_task_fair(struct rq *rq, struct task_struct *p, int flags)
{
struct cfs_rq *cfs_rq;
struct sched_entity *se = &p->se;
#ifdef CONFIG_SMP
/* task_new = 0x3 & ENQUEUE_WAKEUP_NEW (0x20) = 0 */
int task_new = flags & ENQUEUE_WAKEUP_NEW;
#endif
/* Task p is about to run: add its WALT demand to the rq's cumulative_runnable_avg,
which serves as the rq load */
walt_inc_cumulative_runnable_avg(rq, p);
/*
* Update SchedTune accounting.
*
* We do it before updating the CPU capacity to ensure the
* boost value of the current task is accounted for in the
* selection of the OPP.
*
* We do it also in the case where we enqueue a throttled task;
* we could argue that a throttled task should not boost a CPU,
* however:
* a) properly implementing CPU boosting considering throttled
* tasks will increase a lot the complexity of the solution
* b) it's not easy to quantify the benefits introduced by
* such a more complex solution.
* Thus, for the time being we go for the simple solution and boost
* also for throttled RQs.
*/
/* Update the SchedTune task-group boost accounting according to this task's attributes */
schedtune_enqueue_task(p, cpu_of(rq));
/*
* If in_iowait is set, the code below may not trigger any cpufreq
* utilization updates, so do it here explicitly with the IOWAIT flag
* passed.
*/
/* If p was in iowait, trigger a cpufreq utilization update with the IOWAIT flag */
if (p->in_iowait)
cpufreq_update_util(rq, SCHED_CPUFREQ_IOWAIT);
for_each_sched_entity(se) {
if (se->on_rq)
break;
cfs_rq = cfs_rq_of(se);
walt_inc_cfs_cumulative_runnable_avg(cfs_rq, p);
enqueue_entity(cfs_rq, se, flags);
/*
* end evaluation on encountering a throttled cfs_rq
*
* note: in the case of encountering a throttled cfs_rq we will
* post the final h_nr_running increment below.
*/
if (cfs_rq_throttled(cfs_rq))
break;
cfs_rq->h_nr_running++;
flags = ENQUEUE_WAKEUP;
}
for_each_sched_entity(se) {
cfs_rq = cfs_rq_of(se);
cfs_rq->h_nr_running++;
walt_inc_cfs_cumulative_runnable_avg(cfs_rq, p);
if (cfs_rq_throttled(cfs_rq))
break;
update_load_avg(se, UPDATE_TG);
update_cfs_shares(se);
}
if (!se)
add_nr_running(rq, 1);
#ifdef CONFIG_SMP
if (!se) {
struct sched_domain *sd;
rcu_read_lock();
sd = rcu_dereference(rq->sd);
if (!task_new && sd) {
if (cpu_overutilized(rq->cpu))
set_sd_overutilized(sd);
if (rq->misfit_task && sd->parent)
set_sd_overutilized(sd->parent);
}
rcu_read_unlock();
}
#endif /* CONFIG_SMP */
hrtick_update(rq);
}
From here on the flow is the same as for a newly created task: https://blog.csdn.net/wukongmingjing/article/details/82466628
Remaining action item: load balancing. It is one of the nastier parts of the scheduler and the hardest bone to chew; once over that hurdle, the beautiful sunrise is waiting.