Spark Source Code Walkthrough (Part 1: The Job Call Flow)


My manager recently asked me to give a talk about Spark, so I walked through the whole job flow once and wrote it up — partly just to practice Markdown.

Environment: Spark 2.3.3, JDK 1.8, IntelliJ IDEA 2019, MacBook Pro.

Starting from the RDD

In Spark, an action operator is what triggers the actual computation. Let's start with count on RDD:

```scala
/**
 * Return the number of elements in the RDD.
 */
def count(): Long = sc.runJob(this, Utils.getIteratorSize _).sum
```

This is just an ordinary method call; the only part with any substance is the function it passes in:

```scala
/**
 * Counts the number of elements of an iterator using a while loop rather than calling
 * [[scala.collection.Iterator#size]] because it uses a for loop, which is slightly slower
 * in the current version of Scala.
 */
def getIteratorSize[T](iterator: Iterator[T]): Long = {
  var count = 0L
  while (iterator.hasNext) {
    count += 1L
    iterator.next()
  }
  count
}
```

This method is trivial — it just counts the elements of an iterator. So there is nothing difficult about the action operator itself.
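Since count is really just "run a per-partition function through runJob and sum the results on the driver", you can reproduce it yourself through the public sc.runJob API. A minimal sketch (the object name, local master, and partition count are my own choices, not from the article):

```scala
import org.apache.spark.{SparkConf, SparkContext}

// A hand-rolled "count": run a per-partition counter via runJob, then sum on the driver
object MyCountSketch {
  def main(args: Array[String]): Unit = {
    val sc  = new SparkContext(new SparkConf().setAppName("my-count").setMaster("local[2]"))
    val rdd = sc.parallelize(1 to 100, numSlices = 4)
    // Equivalent in spirit to Utils.getIteratorSize: one Long per partition
    val perPartition: Array[Long] = sc.runJob(rdd, (it: Iterator[Int]) => it.size.toLong)
    println(perPartition.mkString(","))        // one count per partition
    println(perPartition.sum == rdd.count())   // true
    sc.stop()
  }
}
```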

What the action operator actually triggers is SparkContext's runJob method, so let's go in.

SparkContext

```scala
/**
 * Run a job on all partitions in an RDD and return the results in an array.
 *
 * @param rdd target RDD to run tasks on
 * @param func a function to run on each partition of the RDD
 * @return in-memory collection with a result of the job (each collection element will contain
 * a result from one partition)
 */
def runJob[T, U: ClassTag](rdd: RDD[T], func: Iterator[T] => U): Array[U] = {
  runJob(rdd, func, 0 until rdd.partitions.length)
}
```

Nothing tricky here either — it is just an overload call that adds the set of partitions to compute:

```scala
/**
 * Run a function on a given set of partitions in an RDD and return the results as an array.
 *
 * @param rdd target RDD to run tasks on
 * @param func a function to run on each partition of the RDD
 * @param partitions set of partitions to run on; some jobs may not want to compute on all
 * partitions of the target RDD, e.g. for operations like `first()`
 * @return in-memory collection with a result of the job (each collection element will contain
 * a result from one partition)
 */
def runJob[T, U: ClassTag](
    rdd: RDD[T],
    func: Iterator[T] => U,
    partitions: Seq[Int]): Array[U] = {
  val cleanedFunc = clean(func)
  runJob(rdd, (ctx: TaskContext, it: Iterator[T]) => cleanedFunc(it), partitions)
}
```

The one thing worth noting is what the clean function does:

val cleanedFunc = clean(func)

The anonymous function we pass in may capture outer variables; clean processes the closure so that it can be serialized.

I don't fully understand the technical details, but after carefully reading the comments, the purpose of clean is clear to me.

If you are interested, jump into the source and read the comments on ClosureCleaner.
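To make the motivation concrete, here is a small sketch of the kind of accidental capture that closure cleaning deals with. This is not Spark's ClosureCleaner implementation; it only shows why a closure that refers to a field drags the whole enclosing object along, and the usual local-copy workaround (class and field names are made up):

```scala
// A closure over a field actually captures `this`, i.e. the whole enclosing object
class Driver {                       // pretend Driver itself is not serializable
  val factor = 10
  // Risky: x => x * factor really reads this.factor, so the closure carries all of Driver
  def badFunc: Int => Int = x => x * factor
  // Manual "clean": copy the needed value into a local, so only that value is captured
  def goodFunc: Int => Int = {
    val localFactor = factor
    x => x * localFactor
  }
}

object CleanSketch {
  def main(args: Array[String]): Unit = {
    val f = new Driver().goodFunc
    println(f(3)) // 30
  }
}
```

Besides trying to null out such unneeded references, Spark's clean also checks up front that the closure is serializable, so a bad closure fails on the driver instead of at task launch.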

It then calls the next runJob overload; the interesting part is the function argument it builds.

The parameter that may be harder to understand is this one:

(ctx: TaskContext, it: Iterator[T]) => cleanedFunc(it)

This was the part that confused me. For comparison, put the two runJob calls side by side:

```scala
runJob(rdd, func, 0 until rdd.partitions.length)
runJob(rdd, (ctx: TaskContext, it: Iterator[T]) => cleanedFunc(it), partitions)
```

In the first call every argument is clearly an actual argument, but in the second call, no matter how I looked at it, the second argument

(ctx: TaskContext, it: Iterator[T]) => cleanedFunc(it)

looked to me like a $\color{red}{\text{formal parameter}}$ rather than an actual argument — that was the point of confusion.

After thinking about it for an evening, I figured it out: what is being passed here is simply a function value. In count, Utils.getIteratorSize is also just a function value. Look at the declared parameter type:

```scala
/**
 * Run a function on a given set of partitions in an RDD and return the results as an array.
 * The function that is run against each partition additionally takes `TaskContext` argument.
 *
 * @param rdd target RDD to run tasks on
 * @param func a function to run on each partition of the RDD
 * @param partitions set of partitions to run on; some jobs may not want to compute on all
 * partitions of the target RDD, e.g. for operations like `first()`
 * @return in-memory collection with a result of the job (each collection element will contain
 * a result from one partition)
 */
def runJob[T, U: ClassTag](
    rdd: RDD[T],
    func: (TaskContext, Iterator[T]) => U,
    partitions: Seq[Int]): Array[U] = {
  val results = new Array[U](partitions.size)
  runJob[T, U](rdd, func, partitions, (index, res) => results(index) = res)
  results
}
```

The parameter's declared type is

func: (TaskContext, Iterator[T]) => U

The call site supplies that parameter with a function that simply wraps cleanedFunc and ignores the TaskContext — it does nothing else:

(ctx: TaskContext, it: Iterator[T]) => cleanedFunc(it)
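In other words, this is ordinary higher-order-function plumbing: a one-argument function is adapted to a two-argument signature and the extra argument is ignored. A tiny self-contained sketch (using String in place of TaskContext; all names here are mine):

```scala
// Adapt an Iterator[T] => U into a (TaskContext, Iterator[T]) => U
object AdaptSketch {
  def main(args: Array[String]): Unit = {
    val cleanedFunc: Iterator[Int] => Long = it => it.size.toLong
    // Mirrors (ctx: TaskContext, it: Iterator[T]) => cleanedFunc(it)
    val func: (String, Iterator[Int]) => Long = (ctx, it) => cleanedFunc(it)
    println(func("fake-task-context", Iterator(1, 2, 3))) // 3
  }
}
```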

$\color{red}{\text{So when and where does this function actually get used?}}$

The answer: later on it is serialized into the ResultTasks of the ResultStage, shipped from the driver to the executors, and executed there.

The results are pulled back to the driver and used to fill the results array:

  /**   * Run a function on a given set of partitions in an RDD and pass the results to the given   * handler function. This is the main entry point for all actions in Spark.   *   * @param rdd target RDD to run tasks on   * @param func a function to run on each partition of the RDD   * @param partitions set of partitions to run on; some jobs may not want to compute on all   * partitions of the target RDD, e.g. for operations like `first()`   * @param resultHandler callback to pass each result to   */  def runJob[T, U: ClassTag](      rdd: RDD[T],      func: (TaskContext, Iterator[T]) => U,      partitions: Seq[Int],      resultHandler: (Int, U) => Unit): Unit = {    if (stopped.get()) {      throw new IllegalStateException("SparkContext has been shutdown")    }    val callSite = getCallSite    val cleanedFunc = clean(func)    logInfo("Starting job: " + callSite.shortForm)    if (conf.getBoolean("spark.logLineage", false)) {      logInfo("RDD's recursive dependencies:\n" + rdd.toDebugString)    }    dagScheduler.runJob(rdd, cleanedFunc, partitions, callSite, resultHandler, localProperties.get)    progressBar.foreach(_.finishAll())    rdd.doCheckpoint()  }

The closure gets cleaned once more here. Also note the last statement:

rdd.doCheckpoint()

It recursively saves any RDDs that were marked for checkpointing, which shows that the checkpoint operation runs after the job completes. From here we move into the high-level scheduler.
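As a quick usage reminder (the directory and RDD contents below are made up), checkpoint() only marks the RDD; the data is actually written out by doCheckpoint() after an action finishes:

```scala
import org.apache.spark.{SparkConf, SparkContext}

object CheckpointSketch {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("ckpt-demo").setMaster("local[2]"))
    sc.setCheckpointDir("/tmp/ckpt")               // hypothetical local directory
    val rdd = sc.parallelize(1 to 100).map(_ * 2)
    rdd.checkpoint()                               // only marks the RDD
    rdd.count()                                    // after the action, runJob calls rdd.doCheckpoint() and the data is materialized
    sc.stop()
  }
}
```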

DAGScheduler

/**   * Run an action job on the given RDD and pass all the results to the resultHandler function as   * they arrive.   *   * @param rdd target RDD to run tasks on   * @param func a function to run on each partition of the RDD   * @param partitions set of partitions to run on; some jobs may not want to compute on all   *   partitions of the target RDD, e.g. for operations like first()   * @param callSite where in the user program this job was called   * @param resultHandler callback to pass each result to   * @param properties scheduler properties to attach to this job, e.g. fair scheduler pool name   *   * @note Throws `Exception` when the job fails   */  def runJob[T, U](      rdd: RDD[T],      func: (TaskContext, Iterator[T]) => U,      partitions: Seq[Int],      callSite: CallSite,      resultHandler: (Int, U) => Unit,      properties: Properties): Unit = {    val start = System.nanoTime    val waiter = submitJob(rdd, func, partitions, callSite, resultHandler, properties)    ThreadUtils.awaitReady(waiter.completionFuture, Duration.Inf)    waiter.completionFuture.value.get match {      case scala.util.Success(_) =>        logInfo("Job %d finished: %s, took %f s".format          (waiter.jobId, callSite.shortForm, (System.nanoTime - start) / 1e9))      case scala.util.Failure(exception) =>        logInfo("Job %d failed: %s, took %f s".format          (waiter.jobId, callSite.shortForm, (System.nanoTime - start) / 1e9))        // SPARK-8644: Include user stack trace in exceptions coming from DAGScheduler.        val callerStackTrace = Thread.currentThread().getStackTrace.tail        exception.setStackTrace(exception.getStackTrace ++ callerStackTrace)        throw exception    }  }

There is nothing hard to understand in DAGScheduler.runJob; internally it calls submitJob to submit the job.

 /**   * Submit an action job to the scheduler.   *   * @param rdd target RDD to run tasks on   * @param func a function to run on each partition of the RDD   * @param partitions set of partitions to run on; some jobs may not want to compute on all   *   partitions of the target RDD, e.g. for operations like first()   * @param callSite where in the user program this job was called   * @param resultHandler callback to pass each result to   * @param properties scheduler properties to attach to this job, e.g. fair scheduler pool name   *   * @return a JobWaiter object that can be used to block until the job finishes executing   *         or can be used to cancel the job.   *   * @throws IllegalArgumentException when partitions ids are illegal   */  def submitJob[T, U](      rdd: RDD[T],      func: (TaskContext, Iterator[T]) => U,      partitions: Seq[Int],      callSite: CallSite,      resultHandler: (Int, U) => Unit,      properties: Properties): JobWaiter[U] = {    // Check to make sure we are not launching a task on a partition that does not exist.    val maxPartitions = rdd.partitions.length    partitions.find(p => p >= maxPartitions || p < 0).foreach { p =>      throw new IllegalArgumentException(        "Attempting to access a non-existent partition: " + p + ". " +          "Total number of partitions: " + maxPartitions)    }    val jobId = nextJobId.getAndIncrement()    if (partitions.size == 0) {      // Return immediately if the job is running 0 tasks      return new JobWaiter[U](this, jobId, 0, resultHandler)    }    assert(partitions.size > 0)    val func2 = func.asInstanceOf[(TaskContext, Iterator[_]) => _]    val waiter = new JobWaiter(this, jobId, partitions.size, resultHandler)    eventProcessLoop.post(JobSubmitted(      jobId, rdd, func2, partitions.toArray, callSite, waiter,      SerializationUtils.clone(properties)))    waiter  }

As you can see, submitJob posts a JobSubmitted event to eventProcessLoop, and the DAGScheduler's internal message loop, DAGSchedulerEventProcessLoop#doOnReceive, handles the event:

private def doOnReceive(event: DAGSchedulerEvent): Unit = event match {    case JobSubmitted(jobId, rdd, func, partitions, callSite, listener, properties) =>      dagScheduler.handleJobSubmitted(jobId, rdd, func, partitions, callSite, listener, properties)    case MapStageSubmitted(jobId, dependency, callSite, listener, properties) =>      dagScheduler.handleMapStageSubmitted(jobId, dependency, callSite, listener, properties)    case StageCancelled(stageId, reason) =>      dagScheduler.handleStageCancellation(stageId, reason)    case JobCancelled(jobId, reason) =>      dagScheduler.handleJobCancellation(jobId, reason)    case JobGroupCancelled(groupId) =>      dagScheduler.handleJobGroupCancelled(groupId)    case AllJobsCancelled =>      dagScheduler.doCancelAllJobs()    case ExecutorAdded(execId, host) =>      dagScheduler.handleExecutorAdded(execId, host)    case ExecutorLost(execId, reason) =>      val workerLost = reason match {        case SlaveLost(_, true) => true        case _ => false      }      dagScheduler.handleExecutorLost(execId, workerLost)    case WorkerRemoved(workerId, host, message) =>      dagScheduler.handleWorkerRemoved(workerId, host, message)    case BeginEvent(task, taskInfo) =>      dagScheduler.handleBeginEvent(task, taskInfo)    case SpeculativeTaskSubmitted(task) =>      dagScheduler.handleSpeculativeTaskSubmitted(task)    case GettingResultEvent(taskInfo) =>      dagScheduler.handleGetTaskResult(taskInfo)    case completion: CompletionEvent =>      dagScheduler.handleTaskCompletion(completion)    case TaskSetFailed(taskSet, reason, exception) =>      dagScheduler.handleTaskSetFailed(taskSet, reason, exception)    case ResubmitFailedStages =>      dagScheduler.resubmitFailedStages()  }
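If the "message loop" pattern is unfamiliar, here is a stripped-down sketch of the idea (not Spark's EventLoop code; all names are mine): post only enqueues an event, and a dedicated thread consumes and dispatches events one at a time:

```scala
import java.util.concurrent.LinkedBlockingDeque

object EventLoopSketch {
  sealed trait Event
  case class JobSubmitted(jobId: Int) extends Event
  case object Stop extends Event

  def main(args: Array[String]): Unit = {
    val queue = new LinkedBlockingDeque[Event]()
    // The consumer thread plays the role of doOnReceive: serial, single-threaded dispatch
    val loop = new Thread(new Runnable {
      override def run(): Unit = {
        var running = true
        while (running) {
          queue.take() match {
            case JobSubmitted(id) => println(s"handleJobSubmitted($id)")
            case Stop             => running = false
          }
        }
      }
    })
    loop.start()
    queue.put(JobSubmitted(1))   // mirrors eventProcessLoop.post(JobSubmitted(...))
    queue.put(Stop)
    loop.join()
  }
}
```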

It then calls back into DAGScheduler's handleJobSubmitted, which does the real job submission:

private[scheduler] def handleJobSubmitted(jobId: Int,      finalRDD: RDD[_],      func: (TaskContext, Iterator[_]) => _,      partitions: Array[Int],      callSite: CallSite,      listener: JobListener,      properties: Properties) {    var finalStage: ResultStage = null    try {      // New stage creation may throw an exception if, for example, jobs are run on a      // HadoopRDD whose underlying HDFS files have been deleted.      finalStage = createResultStage(finalRDD, func, partitions, jobId, callSite)    } catch {      case e: Exception =>        logWarning("Creating new stage failed due to exception - job: " + jobId, e)        listener.jobFailed(e)        return    }    val job = new ActiveJob(jobId, finalStage, callSite, listener, properties)    clearCacheLocs()    logInfo("Got job %s (%s) with %d output partitions".format(      job.jobId, callSite.shortForm, partitions.length))    logInfo("Final stage: " + finalStage + " (" + finalStage.name + ")")    logInfo("Parents of final stage: " + finalStage.parents)    logInfo("Missing parents: " + getMissingParentStages(finalStage))    val jobSubmissionTime = clock.getTimeMillis()    jobIdToActiveJob(jobId) = job    activeJobs += job    finalStage.setActiveJob(job)    val stageIds = jobIdToStageIds(jobId).toArray    val stageInfos = stageIds.flatMap(id => stageIdToStage.get(id).map(_.latestInfo))    listenerBus.post(      SparkListenerJobStart(job.jobId, jobSubmissionTime, stageInfos, properties))    submitStage(finalStage)  }

handleJobSubmitted splits into two phases.

Splitting into stages:

finalStage = createResultStage(finalRDD, func, partitions, jobId, callSite)

Submitting the stages:

submitStage(finalStage)

Before the code, let me describe the stage-splitting phase in plain words.

In source-code order:

1. Creating the final ResultStage requires the second-to-last ShuffleMapStage as its parent.
2. Creating the second-to-last ShuffleMapStage requires the third-to-last ShuffleMapStage as its parent.
3. Apart from the last two stages, which are created individually, the remaining stages are collected on a stack and then created in bulk.

So the actual creation order is:

1. Using a LIFO stack, walk backwards through the ShuffleDependencies, pushing them and then popping them to create the ancestor ShuffleMapStages.
2. Create the second-to-last ShuffleMapStage.
3. Create the final ResultStage.

Stage submission is recursive:

1. If any parent stage has not been submitted yet, submit the parent stage first.
2. Once all parent stages have been submitted, submit the stage itself.

Now the actual code, starting with stage splitting:

```scala
finalStage = createResultStage(finalRDD, func, partitions, jobId, callSite)
```

```scala
/**
 * Create a ResultStage associated with the provided jobId.
 */
private def createResultStage(
    rdd: RDD[_],
    func: (TaskContext, Iterator[_]) => _,
    partitions: Array[Int],
    jobId: Int,
    callSite: CallSite): ResultStage = {
  val parents = getOrCreateParentStages(rdd, jobId)
  val id = nextStageId.getAndIncrement()
  val stage = new ResultStage(id, rdd, func, partitions, parents, jobId, callSite)
  stageIdToStage(id) = stage
  updateJobIdStageIdMaps(jobId, stage)
  stage
}
```

The logic is straightforward: create the parent stages first, then create the ResultStage. Because of how the calls nest, the ResultStage built here is naturally the last stage of the job, and its parent is the second-to-last stage (or, if the ResultStage comes from a join, the second- and third-to-last stages).

```scala
/**
 * Get or create the list of parent stages for a given RDD.  The new Stages will be created with
 * the provided firstJobId.
 */
private def getOrCreateParentStages(rdd: RDD[_], firstJobId: Int): List[Stage] = {
  getShuffleDependencies(rdd).map { shuffleDep =>
    getOrCreateShuffleMapStage(shuffleDep, firstJobId)
  }.toList
}
```

The code is short and easy to follow.

/**   * Returns shuffle dependencies that are immediate parents of the given RDD.   *   * This function will not return more distant ancestors.  For example, if C has a shuffle   * dependency on B which has a shuffle dependency on A:   *   * A <-- B <-- C   *   * calling this function with rdd C will only return the B <-- C dependency.   *   * This function is scheduler-visible for the purpose of unit testing.   */  private[scheduler] def getShuffleDependencies(rdd: RDD[_]): HashSet[ShuffleDependency[_, _, _]] = {    val parents = new HashSet[ShuffleDependency[_, _, _]]    val visited = new HashSet[RDD[_]]    val waitingForVisit = new ArrayStack[RDD[_]]    waitingForVisit.push(rdd)    while (waitingForVisit.nonEmpty) {      val toVisit = waitingForVisit.pop()      if (!visited(toVisit)) {        visited += toVisit        toVisit.dependencies.foreach {          case shuffleDep: ShuffleDependency[_, _, _] =>            parents += shuffleDep          case dependency =>            waitingForVisit.push(dependency.rdd)        }      }    }    parents  }

$\color{red}{\text{Note the stack structure used here.}}$ RDDs reached through a NarrowDependency are pushed onto the stack, while the RDDs behind a ShuffleDependency are not. So every RDD inside the final stage goes onto and off the stack exactly once, and the method finishes once it has collected the ShuffleDependencies directly attached to the ResultStage.
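Here is a toy version of that traversal (my own model, not Spark's classes), just to show why only the shuffle dependencies immediately upstream of the starting RDD are returned:

```scala
import scala.collection.mutable

object ShuffleDepSketch {
  // A node knows which parents it reaches via narrow deps and which via shuffle deps
  case class Node(name: String, narrowParents: Seq[Node], shuffleParents: Seq[Node])

  def immediateShuffleParents(root: Node): Set[Node] = {
    val found   = mutable.Set[Node]()
    val visited = mutable.Set[Node]()
    val stack   = mutable.Stack[Node]()
    stack.push(root)
    while (stack.nonEmpty) {
      val cur = stack.pop()
      if (visited.add(cur)) {
        found ++= cur.shuffleParents                    // mirrors parents += shuffleDep
        cur.narrowParents.foreach(p => stack.push(p))   // mirrors waitingForVisit.push(dependency.rdd)
      }
    }
    found.toSet
  }

  def main(args: Array[String]): Unit = {
    // A <-shuffle- B <-shuffle- C: starting from C we only collect B, never cross the boundary to A
    val a = Node("A", Nil, Nil)
    val b = Node("B", Nil, Seq(a))
    val c = Node("C", Nil, Seq(b))
    println(immediateShuffleParents(c).map(_.name)) // Set(B)
  }
}
```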

This method also gives us some extra insight.

```scala
toVisit.dependencies
dependency.rdd
```

An RDD holds its dependencies — one of the five core properties of an RDD — and each Dependency in turn holds an RDD.

In code, that looks like this:

/** * :: DeveloperApi :: * Base class for dependencies where each partition of the child RDD depends on a small number * of partitions of the parent RDD. Narrow dependencies allow for pipelined execution. */@DeveloperApiabstract class NarrowDependency[T](_rdd: RDD[T]) extends Dependency[T] {  /**   * Get the parent partitions for a child partition.   * @param partitionId a partition of the child RDD   * @return the partitions of the parent RDD that the child partition depends upon   */  def getParents(partitionId: Int): Seq[Int]  override def rdd: RDD[T] = _rdd}

NarrowDependency exposes the _rdd constructor parameter through its rdd method.

I used to think a NarrowDependency would not carry the RDD through serialization, since there is no explicit field for it, but a quick experiment shows that the constructor parameter does get written to disk:

```scala
package com.wsy

import java.io.{File, FileInputStream, FileOutputStream, ObjectInputStream, ObjectOutputStream}

import org.apache.spark.internal.Logging

// A constructor parameter that is only read by a method still gets serialized
class Student(name: String) extends Serializable {
  def getName(): String = name
}

object Test extends Logging {
  def main(args: Array[String]): Unit = {
    val a = new Student("lilei")
    val file = new File("student.txt")
    val oout = new ObjectOutputStream(new FileOutputStream(file))
    oout.writeObject(a)
    oout.close()

    val oin = new ObjectInputStream(new FileInputStream(file))
    val student = oin.readObject().asInstanceOf[Student]
    oin.close()
    println(student.getName())
  }
}
```

(Figure: console output of the serialization test above.)

In the ShuffleDependency class, by contrast, the RDD is explicitly kept in the _rdd field, but you can see it is marked @transient:

/** * :: DeveloperApi :: * Represents a dependency on the output of a shuffle stage. Note that in the case of shuffle, * the RDD is transient since we don't need it on the executor side. * * @param _rdd the parent RDD * @param partitioner partitioner used to partition the shuffle output * @param serializer [[org.apache.spark.serializer.Serializer Serializer]] to use. If not set *                   explicitly then the default serializer, as specified by `spark.serializer` *                   config option, will be used. * @param keyOrdering key ordering for RDD's shuffles * @param aggregator map/reduce-side aggregator for RDD's shuffle * @param mapSideCombine whether to perform partial aggregation (also known as map-side combine) */@DeveloperApiclass ShuffleDependency[K: ClassTag, V: ClassTag, C: ClassTag](    @transient private val _rdd: RDD[_ <: Product2[K, V]],    val partitioner: Partitioner,    val serializer: Serializer = SparkEnv.get.serializer,    val keyOrdering: Option[Ordering[K]] = None,    val aggregator: Option[Aggregator[K, V, C]] = None,    val mapSideCombine: Boolean = false)  extends Dependency[Product2[K, V]] {  override def rdd: RDD[Product2[K, V]] = _rdd.asInstanceOf[RDD[Product2[K, V]]]  private[spark] val keyClassName: String = reflect.classTag[K].runtimeClass.getName  private[spark] val valueClassName: String = reflect.classTag[V].runtimeClass.getName  // Note: It's possible that the combiner class tag is null, if the combineByKey  // methods in PairRDDFunctions are used instead of combineByKeyWithClassTag.  private[spark] val combinerClassName: Option[String] =    Option(reflect.classTag[C]).map(_.runtimeClass.getName)  val shuffleId: Int = _rdd.context.newShuffleId()  val shuffleHandle: ShuffleHandle = _rdd.context.env.shuffleManager.registerShuffle(    shuffleId, _rdd.partitions.length, this)  _rdd.sparkContext.cleaner.foreach(_.registerShuffleForCleanup(this))}

Because of @transient, the Dependency holds the RDD on the driver, but after it is serialized, shipped to an executor, and deserialized, the RDD reference is gone. I wrote another small program to simulate this — serialize to disk, read back — and rdd1 and rdd2 do not show up in the output, which means they were not written out:

```scala
package com.wsy.rdd

import java.io.{File, FileInputStream, FileOutputStream, ObjectInputStream, ObjectOutputStream}

class Rdd(val name: String, var deps: Dependency) extends Serializable {
  override def toString: String = s"Rdd(${name},${deps})"
}

abstract class Dependency(val name: String) extends Serializable {
  def rdd: Rdd
}

abstract class NarrowDependency(name: String, _rdd: Rdd) extends Dependency(name) {
  override def rdd: Rdd = _rdd
}

class ShuffleDependency(name: String, @transient private val _rdd: Rdd) extends Dependency(name) {
  override def rdd: Rdd = _rdd
  override def toString: String = s"ShuffleDependency(${name},${_rdd})"
}

class OneToOneDependency(name: String, rdd: Rdd) extends NarrowDependency(name, rdd) {
  override def toString: String = s"OneToOneDependency(${name},${rdd})"
}

object Util {
  def main(args: Array[String]): Unit = {
    // When an RDD is shipped to an executor, only the RDDs of one stage go with it
    val rdd1 = new Rdd("rdd1", null)
    val dep1 = new OneToOneDependency("OneToOneDependency1", rdd1)
    val rdd2 = new Rdd("rdd2", dep1)
    val dep2 = new ShuffleDependency("ShuffleDependency1", rdd2)
    val rdd3 = new Rdd("rdd3", dep2)
    val dep3 = new OneToOneDependency("OneToOneDependency2", rdd3)
    val rdd4 = new Rdd("rdd4", dep3)

    val file = new File("rdd4.txt")
    val oout = new ObjectOutputStream(new FileOutputStream(file))
    oout.writeObject(rdd4)
    oout.close()

    val oin = new ObjectInputStream(new FileInputStream(file))
    val newPerson = oin.readObject()
    oin.close()

    println("rdd4 in memory")
    println(rdd4)
    println("rdd4 serialized to disk and read back")
    println(newPerson)
  }
}
```

This RDD detour does not help much with what we are doing right now, but it helps a lot later when Tasks get serialized. I never used to understand what exactly is serialized and sent to the executor when a task is shipped; as we will see, for a ShuffleMapTask it is (rdd, shuffleDependency) and for a ResultTask it is (rdd, func). I suspect plenty of people share that confusion, and these two little programs make it much easier to see.

We often say RDDs form a DAG, and this is the structure behind it: an RDD holds Dependencies, each Dependency holds an RDD, and the chain keeps extending.
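You can make that chain visible with toDebugString, which simply walks the dependencies. A usage sketch (the transformations are my own example; the exact output depends on your environment):

```scala
import org.apache.spark.{SparkConf, SparkContext}

object LineageSketch {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("lineage").setMaster("local[2]"))
    val rdd = sc.parallelize(1 to 10)
      .map(x => (x % 3, x))
      .reduceByKey(_ + _)   // introduces a ShuffleDependency
      .map(_._2)
    println(rdd.toDebugString) // prints the whole lineage, stage boundaries indented at the shuffle
    sc.stop()
  }
}
```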

Once the ShuffleDependencies of the final RDD have been found, the ShuffleMapStages are created:

/**   * Gets a shuffle map stage if one exists in shuffleIdToMapStage. Otherwise, if the   * shuffle map stage doesn't already exist, this method will create the shuffle map stage in   * addition to any missing ancestor shuffle map stages.   */  private def getOrCreateShuffleMapStage(      shuffleDep: ShuffleDependency[_, _, _],      firstJobId: Int): ShuffleMapStage = {    shuffleIdToMapStage.get(shuffleDep.shuffleId) match {      case Some(stage) =>        stage      case None =>        // Create stages for all missing ancestor shuffle dependencies.        getMissingAncestorShuffleDependencies(shuffleDep.rdd).foreach { dep =>          // Even though getMissingAncestorShuffleDependencies only returns shuffle dependencies          // that were not already in shuffleIdToMapStage, it's possible that by the time we          // get to a particular dependency in the foreach loop, it's been added to          // shuffleIdToMapStage by the stage creation process for an earlier dependency. See          // SPARK-13902 for more information.          if (!shuffleIdToMapStage.contains(dep.shuffleId)) {            createShuffleMapStage(dep, firstJobId)          }        }        // Finally, create a stage for the given shuffle dependency.        createShuffleMapStage(shuffleDep, firstJobId)    }  }

This mirrors createResultStage almost exactly:

/**   * Create a ResultStage associated with the provided jobId.   */  private def createResultStage(      rdd: RDD[_],      func: (TaskContext, Iterator[_]) => _,      partitions: Array[Int],      jobId: Int,      callSite: CallSite): ResultStage = {    val parents = getOrCreateParentStages(rdd, jobId)    val id = nextStageId.getAndIncrement()    val stage = new ResultStage(id, rdd, func, partitions, parents, jobId, callSite)    stageIdToStage(id) = stage    updateJobIdStageIdMaps(jobId, stage)    stage  }

In createResultStage, the parents are created first and then the ResultStage itself. In getOrCreateShuffleMapStage, the missing ancestor shuffle stages are created first and then the stage for the given dependency — which is the second-to-last stage of the job, i.e. the last ShuffleMapStage, since the ResultStage is the last stage overall.

/** Find ancestor shuffle dependencies that are not registered in shuffleToMapStage yet */  private def getMissingAncestorShuffleDependencies(      rdd: RDD[_]): ArrayStack[ShuffleDependency[_, _, _]] = {    val ancestors = new ArrayStack[ShuffleDependency[_, _, _]]    val visited = new HashSet[RDD[_]]    // We are manually maintaining a stack here to prevent StackOverflowError    // caused by recursively visiting    val waitingForVisit = new ArrayStack[RDD[_]]    waitingForVisit.push(rdd)    while (waitingForVisit.nonEmpty) {      val toVisit = waitingForVisit.pop()      if (!visited(toVisit)) {        visited += toVisit        getShuffleDependencies(toVisit).foreach { shuffleDep =>          if (!shuffleIdToMapStage.contains(shuffleDep.shuffleId)) {            ancestors.push(shuffleDep)            waitingForVisit.push(shuffleDep.rdd)          } // Otherwise, the dependency and its ancestors have already been registered.        }      }    }    ancestors  }

Again a stack is used to walk backwards, all the way to the first RDD, pushing every ShuffleDependency it finds onto the ancestors stack.

Note that these ShuffleDependencies start from the second-to-last one; the very last ShuffleDependency (the one feeding the ResultStage) is not among them.

/**   * Creates a ShuffleMapStage that generates the given shuffle dependency's partitions. If a   * previously run stage generated the same shuffle data, this function will copy the output   * locations that are still available from the previous shuffle to avoid unnecessarily   * regenerating data.   */  def createShuffleMapStage(shuffleDep: ShuffleDependency[_, _, _], jobId: Int): ShuffleMapStage = {    val rdd = shuffleDep.rdd    val numTasks = rdd.partitions.length    val parents = getOrCreateParentStages(rdd, jobId)    val id = nextStageId.getAndIncrement()    val stage = new ShuffleMapStage(      id, rdd, numTasks, parents, jobId, rdd.creationSite, shuffleDep, mapOutputTracker)    stageIdToStage(id) = stage    shuffleIdToMapStage(shuffleDep.shuffleId) = stage    updateJobIdStageIdMaps(jobId, stage)    if (!mapOutputTracker.containsShuffle(shuffleDep.shuffleId)) {      // Kind of ugly: need to register RDDs with the cache and map output tracker here      // since we can't do it in the RDD constructor because # of partitions is unknown      logInfo("Registering RDD " + rdd.id + " (" + rdd.getCreationSite + ")")      mapOutputTracker.registerShuffle(shuffleDep.shuffleId, rdd.partitions.length)    }    stage  }

createShuffleMapStage calls getOrCreateParentStages again, so we get the cycle:

getOrCreateParentStages -->getOrCreateShuffleMapStage -->createShuffleMapStage --> getOrCreateParentStages

From createShuffleMapStage we can also see that

a ShuffleMapStage is determined by a ShuffleDependency. In the figure below, for example, stage 1 is bounded by the groupBy shuffle, stage 2 by the join, and stage 3 is the ResultStage.

(Figure: example stage split — stage 1 bounded by the groupBy shuffle, stage 2 by the join, stage 3 is the ResultStage.)
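For reference, a job along the lines of that figure might look like the sketch below (my own example, not from the article). groupByKey and join can each introduce a ShuffleDependency and hence a ShuffleMapStage, while the final count lands in the ResultStage — whether a particular join really shuffles also depends on the partitioners, so check the DAG in the Spark UI:

```scala
import org.apache.spark.{SparkConf, SparkContext}

object StageSketch {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("stage-demo").setMaster("local[2]"))
    val left   = sc.parallelize(Seq((1, "a"), (2, "b"), (1, "c"))).groupByKey() // shuffle -> ShuffleMapStage
    val right  = sc.parallelize(Seq((1, "x"), (2, "y")))
    val joined = left.join(right)                                               // shuffle -> ShuffleMapStage
    println(joined.count())                                                     // ResultStage
    sc.stop()
  }
}
```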

Once all the stages are created they can be submitted, and submitting a stage means submitting a group of tasks.

/** Submits stage, but first recursively submits any missing parents. */  private def submitStage(stage: Stage) {    val jobId = activeJobForStage(stage)    if (jobId.isDefined) {      logDebug("submitStage(" + stage + ")")      if (!waitingStages(stage) && !runningStages(stage) && !failedStages(stage)) {        val missing = getMissingParentStages(stage).sortBy(_.id)        logDebug("missing: " + missing)        if (missing.isEmpty) {          logInfo("Submitting " + stage + " (" + stage.rdd + "), which has no missing parents")          submitMissingTasks(stage, jobId.get)        } else {          for (parent <- missing) {            submitStage(parent)          }          waitingStages += stage        }      }    } else {      abortStage(stage, "No active job for stage " + stage.id, None)    }  }

Stage creation is not recursive (it uses an explicit stack), yet the real creation order is still front to back; stage submission is recursive, and the real submission order is likewise front to back.
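A toy sketch of that recursion (not Spark code; names and the always-false isAvailable are my own simplifications): if any parent is missing, recurse into the parents and park the current stage in waiting; only a stage with no missing parents gets its tasks submitted:

```scala
object SubmitStageSketch {
  case class Stage(id: Int, parents: Seq[Stage])

  val waiting = scala.collection.mutable.Set[Int]()

  def submitStage(stage: Stage): Unit = {
    val missing = stage.parents.filterNot(isAvailable).sortBy(_.id)
    if (missing.isEmpty) {
      println(s"submitMissingTasks(stage ${stage.id})")
    } else {
      missing.foreach(submitStage)   // submit missing parents first
      waiting += stage.id            // the stage waits until CompletionEvents wake it up
    }
  }

  // Toy assumption: no parent stage has been computed yet
  def isAvailable(stage: Stage): Boolean = false

  def main(args: Array[String]): Unit = {
    val s0 = Stage(0, Nil)
    val s1 = Stage(1, Seq(s0))
    val s2 = Stage(2, Seq(s1))
    submitStage(s2) // prints submitMissingTasks(stage 0); stages 1 and 2 end up in waiting
  }
}
```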

The work of submitting tasks lives in submitMissingTasks. The method is long, so let's focus only on the parts we care about.

/** Called when stage's parents are available and we can now do its task. */  private def submitMissingTasks(stage: Stage, jobId: Int) {    logDebug("submitMissingTasks(" + stage + ")")    // First figure out the indexes of partition ids to compute.    val partitionsToCompute: Seq[Int] = stage.findMissingPartitions()    // Use the scheduling pool, job group, description, etc. from an ActiveJob associated    // with this Stage    val properties = jobIdToActiveJob(jobId).properties    runningStages += stage    // SparkListenerStageSubmitted should be posted before testing whether tasks are    // serializable. If tasks are not serializable, a SparkListenerStageCompleted event    // will be posted, which should always come after a corresponding SparkListenerStageSubmitted    // event.    stage match {      case s: ShuffleMapStage =>        outputCommitCoordinator.stageStart(stage = s.id, maxPartitionId = s.numPartitions - 1)      case s: ResultStage =>        outputCommitCoordinator.stageStart(          stage = s.id, maxPartitionId = s.rdd.partitions.length - 1)    }    val taskIdToLocations: Map[Int, Seq[TaskLocation]] = try {      stage match {        case s: ShuffleMapStage =>          partitionsToCompute.map { id => (id, getPreferredLocs(stage.rdd, id))}.toMap        case s: ResultStage =>          partitionsToCompute.map { id =>            val p = s.partitions(id)            (id, getPreferredLocs(stage.rdd, p))          }.toMap      }    } catch {      case NonFatal(e) =>        stage.makeNewStageAttempt(partitionsToCompute.size)        listenerBus.post(SparkListenerStageSubmitted(stage.latestInfo, properties))        abortStage(stage, s"Task creation failed: $e\n${Utils.exceptionString(e)}", Some(e))        runningStages -= stage        return    }    stage.makeNewStageAttempt(partitionsToCompute.size, taskIdToLocations.values.toSeq)    // If there are tasks to execute, record the submission time of the stage. Otherwise,    // post the even without the submission time, which indicates that this stage was    // skipped.    if (partitionsToCompute.nonEmpty) {      stage.latestInfo.submissionTime = Some(clock.getTimeMillis())    }    listenerBus.post(SparkListenerStageSubmitted(stage.latestInfo, properties))    // TODO: Maybe we can keep the taskBinary in Stage to avoid serializing it multiple times.    // Broadcasted binary for the task, used to dispatch tasks to executors. Note that we broadcast    // the serialized copy of the RDD and for each task we will deserialize it, which means each    // task gets a different copy of the RDD. This provides stronger isolation between tasks that    // might modify state of objects referenced in their closures. This is necessary in Hadoop    // where the JobConf/Configuration object is not thread-safe.    var taskBinary: Broadcast[Array[Byte]] = null    var partitions: Array[Partition] = null    try {      // For ShuffleMapTask, serialize and broadcast (rdd, shuffleDep).      // For ResultTask, serialize and broadcast (rdd, func).      var taskBinaryBytes: Array[Byte] = null      // taskBinaryBytes and partitions are both effected by the checkpoint status. We need      // this synchronization in case another concurrent job is checkpointing this RDD, so we get a      // consistent view of both variables.      
RDDCheckpointData.synchronized {        taskBinaryBytes = stage match {          case stage: ShuffleMapStage =>            JavaUtils.bufferToArray(              closureSerializer.serialize((stage.rdd, stage.shuffleDep): AnyRef))          case stage: ResultStage =>            JavaUtils.bufferToArray(closureSerializer.serialize((stage.rdd, stage.func): AnyRef))        }        partitions = stage.rdd.partitions      }      taskBinary = sc.broadcast(taskBinaryBytes)    } catch {      // In the case of a failure during serialization, abort the stage.      case e: NotSerializableException =>        abortStage(stage, "Task not serializable: " + e.toString, Some(e))        runningStages -= stage        // Abort execution        return      case e: Throwable =>        abortStage(stage, s"Task serialization failed: $e\n${Utils.exceptionString(e)}", Some(e))        runningStages -= stage        // Abort execution        return    }    val tasks: Seq[Task[_]] = try {      val serializedTaskMetrics = closureSerializer.serialize(stage.latestInfo.taskMetrics).array()      stage match {        case stage: ShuffleMapStage =>          stage.pendingPartitions.clear()          partitionsToCompute.map { id =>            val locs = taskIdToLocations(id)            val part = partitions(id)            stage.pendingPartitions += id            new ShuffleMapTask(stage.id, stage.latestInfo.attemptNumber,              taskBinary, part, locs, properties, serializedTaskMetrics, Option(jobId),              Option(sc.applicationId), sc.applicationAttemptId)          }        case stage: ResultStage =>          partitionsToCompute.map { id =>            val p: Int = stage.partitions(id)            val part = partitions(p)            val locs = taskIdToLocations(id)            new ResultTask(stage.id, stage.latestInfo.attemptNumber,              taskBinary, part, locs, id, properties, serializedTaskMetrics,              Option(jobId), Option(sc.applicationId), sc.applicationAttemptId)          }      }    } catch {      case NonFatal(e) =>        abortStage(stage, s"Task creation failed: $e\n${Utils.exceptionString(e)}", Some(e))        runningStages -= stage        return    }    if (tasks.size > 0) {      logInfo(s"Submitting ${tasks.size} missing tasks from $stage (${stage.rdd}) (first 15 " +        s"tasks are for partitions ${tasks.take(15).map(_.partitionId)})")      taskScheduler.submitTasks(new TaskSet(        tasks.toArray, stage.id, stage.latestInfo.attemptNumber, jobId, properties))    } else {      // Because we posted SparkListenerStageSubmitted earlier, we should mark      // the stage as completed here in case there are no tasks to run      markStageAsFinished(stage, None)      stage match {        case stage: ShuffleMapStage =>          logDebug(s"Stage ${stage} is actually done; " +              s"(available: ${stage.isAvailable}," +              s"available outputs: ${stage.numAvailableOutputs}," +              s"partitions: ${stage.numPartitions})")          markMapStageJobsAsFinished(stage)        case stage : ResultStage =>          logDebug(s"Stage ${stage} is actually done; (partitions: ${stage.numPartitions})")      }      submitWaitingChildStages(stage)    }  }

Inside submitMissingTasks:

val taskIdToLocations: Map[Int, Seq[TaskLocation]] = try {      stage match {        case s: ShuffleMapStage =>          partitionsToCompute.map { id => (id, getPreferredLocs(stage.rdd, id))}.toMap        case s: ResultStage =>          partitionsToCompute.map { id =>            val p = s.partitions(id)            (id, getPreferredLocs(stage.rdd, p))          }.toMap

This part looks up where each partition's data physically lives in the cluster — these are absolute locations.

The TaskLocality values below, by contrast,

```scala
@DeveloperApi
object TaskLocality extends Enumeration {
  // Process local is expected to be used ONLY within TaskSetManager for now.
  val PROCESS_LOCAL, NODE_LOCAL, NO_PREF, RACK_LOCAL, ANY = Value

  type TaskLocality = Value

  def isAllowed(constraint: TaskLocality, condition: TaskLocality): Boolean = {
    condition <= constraint
  }
}
```
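Since TaskLocality is an Enumeration, its values are ordered by declaration, and isAllowed(constraint, condition) is just condition <= constraint. A standalone toy enumeration to demonstrate that comparison semantics (not the Spark object itself, which is private to the scheduler):

```scala
object LocalitySketch extends Enumeration {
  val PROCESS_LOCAL, NODE_LOCAL, NO_PREF, RACK_LOCAL, ANY = Value

  def isAllowed(constraint: Value, condition: Value): Boolean = condition <= constraint

  def main(args: Array[String]): Unit = {
    println(isAllowed(NODE_LOCAL, PROCESS_LOCAL)) // true: a PROCESS_LOCAL task can still launch when the constraint is relaxed to NODE_LOCAL
    println(isAllowed(NODE_LOCAL, RACK_LOCAL))    // false: a RACK_LOCAL task has to wait for a looser constraint
  }
}
```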

are relative priority levels — how close the data is to the computing resources. Once the data locations are known, the next step is to serialize the task:

var taskBinary: Broadcast[Array[Byte]] = null    var partitions: Array[Partition] = null    try {      // For ShuffleMapTask, serialize and broadcast (rdd, shuffleDep).      // For ResultTask, serialize and broadcast (rdd, func).      var taskBinaryBytes: Array[Byte] = null      // taskBinaryBytes and partitions are both effected by the checkpoint status. We need      // this synchronization in case another concurrent job is checkpointing this RDD, so we get a      // consistent view of both variables.      RDDCheckpointData.synchronized {        taskBinaryBytes = stage match {          case stage: ShuffleMapStage =>            JavaUtils.bufferToArray(              closureSerializer.serialize((stage.rdd, stage.shuffleDep): AnyRef))          case stage: ResultStage =>            JavaUtils.bufferToArray(closureSerializer.serialize((stage.rdd, stage.func): AnyRef))        }        partitions = stage.rdd.partitions      }

Here we can see that ShuffleMapStage and ResultStage produce different task payloads; the function from our count is serialized as part of the ResultStage.

```scala
case stage: ShuffleMapStage =>
  JavaUtils.bufferToArray(
    closureSerializer.serialize((stage.rdd, stage.shuffleDep): AnyRef))
case stage: ResultStage =>
  JavaUtils.bufferToArray(closureSerializer.serialize((stage.rdd, stage.func): AnyRef))
```

Moreover, stage.shuffleDep and stage.func play symmetric roles, as the two Task subclasses show:

```scala
// ShuffleMapTask
writer.write(rdd.iterator(partition, context).asInstanceOf[Iterator[_ <: Product2[Any, Any]]])

// ResultTask
func(context, rdd.iterator(partition, context))
```

Later, when the computation runs on the executor, the RDD's iterator method is consumed by the ShuffleWriter in a ShuffleMapTask (the writer is obtained from stage.shuffleDep) and by func in a ResultTask.

The serialized task binary is then broadcast, because the computation is identical for every partition of the stage:

taskBinary = sc.broadcast(taskBinaryBytes)

With the shared, partition-agnostic computation serialized and broadcast, what remains is to build the sequence of Task objects that carry the per-partition information:

val tasks: Seq[Task[_]] = try {      val serializedTaskMetrics = closureSerializer.serialize(stage.latestInfo.taskMetrics).array()      stage match {        case stage: ShuffleMapStage =>          stage.pendingPartitions.clear()          partitionsToCompute.map { id =>            val locs = taskIdToLocations(id)            val part = partitions(id)            stage.pendingPartitions += id            new ShuffleMapTask(stage.id, stage.latestInfo.attemptNumber,              taskBinary, part, locs, properties, serializedTaskMetrics, Option(jobId),              Option(sc.applicationId), sc.applicationAttemptId)          }        case stage: ResultStage =>          partitionsToCompute.map { id =>            val p: Int = stage.partitions(id)            val part = partitions(p)            val locs = taskIdToLocations(id)            new ResultTask(stage.id, stage.latestInfo.attemptNumber,              taskBinary, part, locs, id, properties, serializedTaskMetrics,              Option(jobId), Option(sc.applicationId), sc.applicationAttemptId)          }      }    } 

So one stage produces one group of tasks, one task per partition.

One thing worth noting: as we will see later, what is sent to the executor is not the Task itself but a TaskDescription. At this point a Task carries no executor information, which means any of these tasks can complete its computation on whichever executor it lands on — the big difference is only how long it takes.

Once the tasks are built, the group is packaged into a TaskSet and handed to the low-level scheduler, the TaskScheduler:

```scala
taskScheduler.submitTasks(new TaskSet(
  tasks.toArray, stage.id, stage.latestInfo.attemptNumber, jobId, properties))
```

Inside submitTasks, the TaskSet is wrapped into a TaskSetManager.

TaskSchedulerImpl

 override def submitTasks(taskSet: TaskSet) {    val tasks = taskSet.tasks    logInfo("Adding task set " + taskSet.id + " with " + tasks.length + " tasks")    this.synchronized {      val manager = createTaskSetManager(taskSet, maxTaskFailures)      val stage = taskSet.stageId      val stageTaskSets =        taskSetsByStageIdAndAttempt.getOrElseUpdate(stage, new HashMap[Int, TaskSetManager])      stageTaskSets(taskSet.stageAttemptId) = manager      val conflictingTaskSet = stageTaskSets.exists { case (_, ts) =>        ts.taskSet != taskSet && !ts.isZombie      }      if (conflictingTaskSet) {        throw new IllegalStateException(s"more than one active taskSet for stage $stage:" +          s" ${stageTaskSets.toSeq.map{_._2.taskSet.id}.mkString(",")}")      }      schedulableBuilder.addTaskSetManager(manager, manager.taskSet.properties)      if (!isLocal && !hasReceivedTask) {        starvationTimer.scheduleAtFixedRate(new TimerTask() {          override def run() {            if (!hasLaunchedTask) {              logWarning("Initial job has not accepted any resources; " +                "check your cluster UI to ensure that workers are registered " +                "and have sufficient resources")            } else {              this.cancel()            }          }        }, STARVATION_TIMEOUT_MS, STARVATION_TIMEOUT_MS)      }      hasReceivedTask = true    }    backend.reviveOffers()  }

In this submitTasks method, the TaskSetManager is created by createTaskSetManager:

```scala
val manager = createTaskSetManager(taskSet, maxTaskFailures)
```

```scala
// Label as private[scheduler] to allow tests to swap in different task set managers if necessary
private[scheduler] def createTaskSetManager(
    taskSet: TaskSet,
    maxTaskFailures: Int): TaskSetManager = {
  new TaskSetManager(this, taskSet, maxTaskFailures, blacklistTrackerOpt)
}
```

The TaskSetManager constructor first buckets the tasks into several Maps according to where each task's partition data physically lives:

// Add all our tasks to the pending lists. We do this in reverse order  // of task index so that tasks with low indices get launched first.  for (i <- (0 until numTasks).reverse) {    addPendingTask(i)  }
  /** Add a task to all the pending-task lists that it should be on. */  private[spark] def addPendingTask(index: Int) {    for (loc <- tasks(index).preferredLocations) {      loc match {        case e: ExecutorCacheTaskLocation =>          pendingTasksForExecutor.getOrElseUpdate(e.executorId, new ArrayBuffer) += index        case e: HDFSCacheTaskLocation =>          val exe = sched.getExecutorsAliveOnHost(loc.host)          exe match {            case Some(set) =>              for (e <- set) {                pendingTasksForExecutor.getOrElseUpdate(e, new ArrayBuffer) += index              }              logInfo(s"Pending task $index has a cached location at ${e.host} " +                ", where there are executors " + set.mkString(","))            case None => logDebug(s"Pending task $index has a cached location at ${e.host} " +                ", but there are no executors alive there.")          }        case _ =>      }      pendingTasksForHost.getOrElseUpdate(loc.host, new ArrayBuffer) += index      for (rack <- sched.getRackForHost(loc.host)) {        pendingTasksForRack.getOrElseUpdate(rack, new ArrayBuffer) += index      }    }    if (tasks(index).preferredLocations == Nil) {      pendingTasksWithNoPrefs += index    }    allPendingTasks += index  // No point scanning this whole list to find the old task there  }
  // Set of pending tasks for each executor. These collections are actually  // treated as stacks, in which new tasks are added to the end of the  // ArrayBuffer and removed from the end. This makes it faster to detect  // tasks that repeatedly fail because whenever a task failed, it is put  // back at the head of the stack. These collections may contain duplicates  // for two reasons:  // (1): Tasks are only removed lazily; when a task is launched, it remains  // in all the pending lists except the one that it was launched from.  // (2): Tasks may be re-added to these lists multiple times as a result  // of failures.  // Duplicates are handled in dequeueTaskFromList, which ensures that a  // task hasn't already started running before launching it.  private val pendingTasksForExecutor = new HashMap[String, ArrayBuffer[Int]]  // Set of pending tasks for each host. Similar to pendingTasksForExecutor,  // but at host level.  private val pendingTasksForHost = new HashMap[String, ArrayBuffer[Int]]  // Set of pending tasks for each rack -- similar to the above.  private val pendingTasksForRack = new HashMap[String, ArrayBuffer[Int]]  // Set containing pending tasks with no locality preferences.  private[scheduler] var pendingTasksWithNoPrefs = new ArrayBuffer[Int]  // Set containing all pending tasks (also used as a stack, as above).  private val allPendingTasks = new ArrayBuffer[Int]

After that, it is clear where every task's data lives, and the TaskSet's locality levels are computed:

 /**   * Track the set of locality levels which are valid given the tasks locality preferences and   * the set of currently available executors.  This is updated as executors are added and removed.   * This allows a performance optimization, of skipping levels that aren't relevant (eg., skip   * PROCESS_LOCAL if no tasks could be run PROCESS_LOCAL for the current set of executors).   */  private[scheduler] var myLocalityLevels = computeValidLocalityLevels()
  /**   * Compute the locality levels used in this TaskSet. Assumes that all tasks have already been   * added to queues using addPendingTask.   *   */  private def computeValidLocalityLevels(): Array[TaskLocality.TaskLocality] = {    import TaskLocality.{PROCESS_LOCAL, NODE_LOCAL, NO_PREF, RACK_LOCAL, ANY}    val levels = new ArrayBuffer[TaskLocality.TaskLocality]    if (!pendingTasksForExecutor.isEmpty &&        pendingTasksForExecutor.keySet.exists(sched.isExecutorAlive(_))) {      levels += PROCESS_LOCAL    }    if (!pendingTasksForHost.isEmpty &&        pendingTasksForHost.keySet.exists(sched.hasExecutorsAliveOnHost(_))) {      levels += NODE_LOCAL    }    if (!pendingTasksWithNoPrefs.isEmpty) {      levels += NO_PREF    }    if (!pendingTasksForRack.isEmpty &&        pendingTasksForRack.keySet.exists(sched.hasHostAliveOnRack(_))) {      levels += RACK_LOCAL    }    levels += ANY    logDebug("Valid locality levels for " + taskSet + ": " + levels.mkString(", "))    levels.toArray  }

As the code shows, the locality levels of a TaskSet are the result of matching the tasks' data locations against all currently registered compute resources.

A misconception of mine worth mentioning: I used to believe locality levels were computed against the currently available (free) resources. Reading the code, it is not the free resources but all registered ones. For example, if a partition is cached in an executor on node A, the corresponding task's locality level is still PROCESS_LOCAL even if node A currently has no free CPU. That makes computing the levels much simpler.
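My simplified reading of that logic as a sketch (not Spark source; all names are mine): a level is considered valid as long as some alive executor or host matches a pending task's location, regardless of free CPU:

```scala
object LocalityLevelsSketch {
  def computeLevels(pendingForExecutor: Map[String, Seq[Int]],
                    pendingForHost: Map[String, Seq[Int]],
                    pendingNoPrefs: Seq[Int],
                    aliveExecutors: Set[String],
                    aliveHosts: Set[String]): List[String] = {
    val levels = scala.collection.mutable.ArrayBuffer[String]()
    if (pendingForExecutor.nonEmpty && pendingForExecutor.keySet.exists(aliveExecutors)) levels += "PROCESS_LOCAL"
    if (pendingForHost.nonEmpty && pendingForHost.keySet.exists(aliveHosts)) levels += "NODE_LOCAL"
    if (pendingNoPrefs.nonEmpty) levels += "NO_PREF"
    levels += "ANY"
    levels.toList
  }

  def main(args: Array[String]): Unit = {
    // Task 0's data is cached in executor-1 on hostA; even if its CPUs are all busy, PROCESS_LOCAL is still a valid level
    println(computeLevels(
      pendingForExecutor = Map("executor-1" -> Seq(0)),
      pendingForHost     = Map("hostA" -> Seq(0)),
      pendingNoPrefs     = Nil,
      aliveExecutors     = Set("executor-1"),
      aliveHosts         = Set("hostA")))
    // List(PROCESS_LOCAL, NODE_LOCAL, ANY)
  }
}
```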

After the TaskSetManager is initialized, it is added to the scheduling tree via schedulableBuilder, which was set up when the SparkContext was initialized. So submitTasks only puts the tasks into the TaskScheduler's scheduling pool — nothing has actually been launched yet.

schedulableBuilder.addTaskSetManager(manager, manager.taskSet.properties)

When running Spark on YARN, this backend is actually a YarnClusterSchedulerBackend. Look at reviveOffers in CoarseGrainedSchedulerBackend — it just sends a message to the driver endpoint.

DriverEndpoint

```scala
backend.reviveOffers()
```

```scala
override def reviveOffers() {
  driverEndpoint.send(ReviveOffers)
}
```

When the driver endpoint receives the message,

```scala
case ReviveOffers =>
  makeOffers()
```

it invokes makeOffers in the DriverEndpoint class:

// Make fake resource offers on all executors    private def makeOffers() {      // Make sure no executor is killed while some task is launching on it      val taskDescs = CoarseGrainedSchedulerBackend.this.synchronized {        // Filter out executors under killing        val activeExecutors = executorDataMap.filterKeys(executorIsAlive)        val workOffers = activeExecutors.map {          case (id, executorData) =>            new WorkerOffer(id, executorData.executorHost, executorData.freeCores)        }.toIndexedSeq        scheduler.resourceOffers(workOffers)      }      if (!taskDescs.isEmpty) {        launchTasks(taskDescs)      }    }

The decision of which task goes to which executor is made in the call below:

scheduler.resourceOffers(workOffers)

Here workOffers is the logical representation of the free resources on the alive executors that the driver knows about, and scheduler is the TaskSchedulerImpl.

TaskSchedulerImpl.submitTasks
  --> CoarseGrainedSchedulerBackend#reviveOffers
  --> CoarseGrainedSchedulerBackend.DriverEndpoint#makeOffers
  --> CoarseGrainedSchedulerBackend.DriverEndpoint#launchTasks

So the call flow goes from TaskSchedulerImpl over to the driver endpoint, and the driver endpoint then calls back into TaskSchedulerImpl — the resources are held by the driver backend, not by TaskSchedulerImpl, which only does the scheduling.

  /**   * Called by cluster manager to offer resources on slaves. We respond by asking our active task   * sets for tasks in order of priority. We fill each node with tasks in a round-robin manner so   * that tasks are balanced across the cluster.   */  def resourceOffers(offers: IndexedSeq[WorkerOffer]): Seq[Seq[TaskDescription]] = synchronized {    // Mark each slave as alive and remember its hostname    // Also track if new executor is added    var newExecAvail = false    for (o <- offers) {      if (!hostToExecutors.contains(o.host)) {        hostToExecutors(o.host) = new HashSet[String]()      }      if (!executorIdToRunningTaskIds.contains(o.executorId)) {        hostToExecutors(o.host) += o.executorId        executorAdded(o.executorId, o.host)        executorIdToHost(o.executorId) = o.host        executorIdToRunningTaskIds(o.executorId) = HashSet[Long]()        newExecAvail = true      }      for (rack <- getRackForHost(o.host)) {        hostsByRack.getOrElseUpdate(rack, new HashSet[String]()) += o.host      }    }    // Before making any offers, remove any nodes from the blacklist whose blacklist has expired. Do    // this here to avoid a separate thread and added synchronization overhead, and also because    // updating the blacklist is only relevant when task offers are being made.    blacklistTrackerOpt.foreach(_.applyBlacklistTimeout())    val filteredOffers = blacklistTrackerOpt.map { blacklistTracker =>      offers.filter { offer =>        !blacklistTracker.isNodeBlacklisted(offer.host) &&          !blacklistTracker.isExecutorBlacklisted(offer.executorId)      }    }.getOrElse(offers)    val shuffledOffers = shuffleOffers(filteredOffers)    // Build a list of tasks to assign to each worker.    val tasks = shuffledOffers.map(o => new ArrayBuffer[TaskDescription](o.cores / CPUS_PER_TASK))    val availableCpus = shuffledOffers.map(o => o.cores).toArray    val sortedTaskSets = rootPool.getSortedTaskSetQueue    for (taskSet <- sortedTaskSets) {      logDebug("parentName: %s, name: %s, runningTasks: %s".format(        taskSet.parent.name, taskSet.name, taskSet.runningTasks))      if (newExecAvail) {        taskSet.executorAdded()      }    }    // Take each TaskSet in our scheduling order, and then offer it each node in increasing order    // of locality levels so that it gets a chance to launch local tasks on all of them.    // NOTE: the preferredLocality order: PROCESS_LOCAL, NODE_LOCAL, NO_PREF, RACK_LOCAL, ANY    for (taskSet <- sortedTaskSets) {      var launchedAnyTask = false      var launchedTaskAtCurrentMaxLocality = false      for (currentMaxLocality <- taskSet.myLocalityLevels) {        do {          launchedTaskAtCurrentMaxLocality = resourceOfferSingleTaskSet(            taskSet, currentMaxLocality, shuffledOffers, availableCpus, tasks)          launchedAnyTask |= launchedTaskAtCurrentMaxLocality        } while (launchedTaskAtCurrentMaxLocality)      }      if (!launchedAnyTask) {        taskSet.abortIfCompletelyBlacklisted(hostToExecutors)      }    }    if (tasks.size > 0) {      hasLaunchedTask = true    }    return tasks  }

The key parts of this method are as follows. First, the bookkeeping structures are updated:

for (o <- offers) {      if (!hostToExecutors.contains(o.host)) {        hostToExecutors(o.host) = new HashSet[String]()      }      if (!executorIdToRunningTaskIds.contains(o.executorId)) {        hostToExecutors(o.host) += o.executorId        executorAdded(o.executorId, o.host)        executorIdToHost(o.executorId) = o.host        executorIdToRunningTaskIds(o.executorId) = HashSet[Long]()        newExecAvail = true      }      for (rack <- getRackForHost(o.host)) {        hostsByRack.getOrElseUpdate(rack, new HashSet[String]()) += o.host      }    }
 // IDs of the tasks running on each executor  private val executorIdToRunningTaskIds = new HashMap[String, HashSet[Long]]  // The set of executors we have on each host; this is used to compute hostsAlive, which  // in turn is used to decide when we can attain data locality on a given host  protected val hostToExecutors = new HashMap[String, HashSet[String]]  protected val hostsByRack = new HashMap[String, HashSet[String]]  protected val executorIdToHost = new HashMap[String, String]

The resources live with the driver backend, so these Maps in TaskSchedulerImpl start out empty and are filled in gradually as tasks are scheduled; they are what the locality levels are computed against.

// Take each TaskSet in our scheduling order, and then offer it each node in increasing order    // of locality levels so that it gets a chance to launch local tasks on all of them.    // NOTE: the preferredLocality order: PROCESS_LOCAL, NODE_LOCAL, NO_PREF, RACK_LOCAL, ANY    for (taskSet <- sortedTaskSets) {      var launchedAnyTask = false      var launchedTaskAtCurrentMaxLocality = false      for (currentMaxLocality <- taskSet.myLocalityLevels) {        do {          launchedTaskAtCurrentMaxLocality = resourceOfferSingleTaskSet(            taskSet, currentMaxLocality, shuffledOffers, availableCpus, tasks)          launchedAnyTask |= launchedTaskAtCurrentMaxLocality        } while (launchedTaskAtCurrentMaxLocality)      }      if (!launchedAnyTask) {        taskSet.abortIfCompletelyBlacklisted(hostToExecutors)      }    }

TaskSets in the scheduling pool are normally handled FIFO. At this point we have both the compute resources and the physical data locations, so it is time to decide which task is assigned to which executor. It is worth pausing on a question:

Does a task decide which executor it is sent to, or does an executor decide which task runs on it? Put differently, do tasks pick executors, or do executors pick tasks — which side is active and which passive?

It is a question worth thinking about, because we casually say "send tasks to executors" without ever pinning down what actually happens. The code here is two nested loops: one over the TaskSets, and one over each TaskSet's locality levels.

private def resourceOfferSingleTaskSet(      taskSet: TaskSetManager,      maxLocality: TaskLocality,      shuffledOffers: Seq[WorkerOffer],      availableCpus: Array[Int],      tasks: IndexedSeq[ArrayBuffer[TaskDescription]]) : Boolean = {    var launchedTask = false    // nodes and executors that are blacklisted for the entire application have already been    // filtered out by this point    for (i <- 0 until shuffledOffers.size) {      val execId = shuffledOffers(i).executorId      val host = shuffledOffers(i).host      if (availableCpus(i) >= CPUS_PER_TASK) {        try {          for (task <- taskSet.resourceOffer(execId, host, maxLocality)) {            tasks(i) += task            val tid = task.taskId            taskIdToTaskSetManager.put(tid, taskSet)            taskIdToExecutorId(tid) = execId            executorIdToRunningTaskIds(execId).add(tid)            availableCpus(i) -= CPUS_PER_TASK            assert(availableCpus(i) >= 0)            launchedTask = true          }        } catch {          case e: TaskNotSerializableException =>            logError(s"Resource offer failed, task set ${taskSet.name} was not serializable")            // Do not offer resources for this task, but don't throw an error to allow other            // task sets to be submitted.            return launchedTask        }      }    }    return launchedTask  }

For one particular locality level, it iterates over shuffledOffers:

for (i <- 0 until shuffledOffers.size)

and on each concrete execId, the TaskSet looks for a task that satisfies this locality level:

taskSet.resourceOffer(execId, host, maxLocality)

That lookup is fairly involved and I have not fully worked through it yet, so I won't expand on it here, but what it does is clear enough.

My own reading: it is neither purely executors picking tasks nor purely tasks picking executors — within the constraints of data locality, the executors pick the tasks.
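Here is how I picture those loops, as a toy sketch (not Spark code; the names, the pending map, and the simplified resourceOffer are mine): the outer loop relaxes the locality level step by step, the inner loop walks the executors that still have free cores, and the TaskSet hands back a task that satisfies the current level on that executor. The real code also wraps each level in a do-while so it keeps offering at the same level until nothing more launches; that is omitted here.

```scala
object OfferSketch {
  case class Offer(execId: String, var freeCores: Int)

  // Toy stand-in for the TaskSetManager's pending lists: task -> its PROCESS_LOCAL location
  val pending = scala.collection.mutable.Map(0 -> "exec-1", 1 -> "exec-2", 2 -> "exec-1")

  // Toy stand-in for taskSet.resourceOffer(execId, host, maxLocality)
  def resourceOffer(execId: String, maxLocality: String): Option[Int] = maxLocality match {
    case "PROCESS_LOCAL" => pending.find(_._2 == execId).map { case (t, _) => pending -= t; t }
    case _               => pending.keys.headOption.map { t => pending -= t; t } // any task will do once relaxed
  }

  def main(args: Array[String]): Unit = {
    val offers = Seq(Offer("exec-1", 2), Offer("exec-2", 1))
    for (level <- Seq("PROCESS_LOCAL", "ANY"); offer <- offers if offer.freeCores > 0) {
      resourceOffer(offer.execId, level).foreach { task =>
        offer.freeCores -= 1
        println(s"task $task -> ${offer.execId} @ $level")
      }
    }
  }
}
```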

In this process, once an executor has been matched with a Task, the Task is serialized and wrapped into a TaskDescription.

Count how many serializations have happened so far: (stage.rdd, stage.func) or (stage.rdd, stage.shuffleDep) is serialized and packed into the Task; the Task is serialized and packed into the TaskDescription; and the TaskDescription is serialized and sent to the executor.

Once everything on the driver side is ready, the tasks can be launched on the executors:

 // Launch tasks returned by a set of resource offers    private def launchTasks(tasks: Seq[Seq[TaskDescription]]) {      for (task <- tasks.flatten) {        val serializedTask = TaskDescription.encode(task)        if (serializedTask.limit() >= maxRpcMessageSize) {          Option(scheduler.taskIdToTaskSetManager.get(task.taskId)).foreach { taskSetMgr =>            try {              var msg = "Serialized task %s:%d was %d bytes, which exceeds max allowed: " +                "spark.rpc.message.maxSize (%d bytes). Consider increasing " +                "spark.rpc.message.maxSize or using broadcast variables for large values."              msg = msg.format(task.taskId, task.index, serializedTask.limit(), maxRpcMessageSize)              taskSetMgr.abort(msg)            } catch {              case e: Exception => logError("Exception in error callback", e)            }          }        }        else {          val executorData = executorDataMap(task.executorId)          executorData.freeCores -= scheduler.CPUS_PER_TASK          logDebug(s"Launching task ${task.taskId} on executor id: ${task.executorId} hostname: " +            s"${executorData.executorHost}.")          executorData.executorEndpoint.send(LaunchTask(new SerializableBuffer(serializedTask)))        }      }    }

Launching involves communicating with the executor side, so it is the driver endpoint that does it:

CoarseGrainedSchedulerBackend.DriverEndpoint#launchTasks

and with that the task is on its way to the Executor.

Executor

org.apache.spark.executor.CoarseGrainedExecutorBackend#receive

```scala
case LaunchTask(data) =>
  if (executor == null) {
    exitExecutor(1, "Received LaunchTask command but executor was null")
  } else {
    val taskDesc = TaskDescription.decode(data.value)
    logInfo("Got assigned task " + taskDesc.taskId)
    executor.launchTask(this, taskDesc)
  }
```

After receiving the TaskDescription, it hands it over to Executor.launchTask:

```scala
def launchTask(context: ExecutorBackend, taskDescription: TaskDescription): Unit = {
  val tr = new TaskRunner(context, taskDescription)
  runningTasks.put(taskDescription.taskId, tr)
  threadPool.execute(tr)
}
```

TaskRunner is a Runnable; once the thread pool picks it up, its run method executes. That method is fairly long, so I won't paste it here — if you are interested, see:

org.apache.spark.executor.Executor.TaskRunner#run
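The threading model itself is plain java.util.concurrent: each TaskDescription is wrapped in a Runnable and handed to a thread pool. A minimal sketch (Spark actually uses a cached, task-named thread pool; everything below is simplified and made up):

```scala
import java.util.concurrent.Executors

object TaskRunnerSketch {
  class TaskRunner(taskId: Long) extends Runnable {
    override def run(): Unit = println(s"running task $taskId on ${Thread.currentThread().getName}")
  }

  def main(args: Array[String]): Unit = {
    val threadPool = Executors.newFixedThreadPool(2)
    (1L to 4L).foreach(id => threadPool.execute(new TaskRunner(id)))
    threadPool.shutdown()
  }
}
```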

The key steps: first the Task object is deserialized,

```scala
task = ser.deserialize[Task[Any]](
  taskDescription.serializedTask, Thread.currentThread.getContextClassLoader)
```

and then the task runs its computation:

```scala
val value = try {
  val res = task.run(
    taskAttemptId = taskId,
    attemptNumber = taskDescription.attemptNumber,
    metricsSystem = env.metricsSystem)
  threwException = false
  res
}
```

task.run ends up invoking the runTask(context) method implemented by the Task subclass.

/**   * Called by [[org.apache.spark.executor.Executor]] to run this task.   *   * @param taskAttemptId an identifier for this task attempt that is unique within a SparkContext.   * @param attemptNumber how many times this task has been attempted (0 for the first attempt)   * @return the result of the task along with updates of Accumulators.   */  final def run(      taskAttemptId: Long,      attemptNumber: Int,      metricsSystem: MetricsSystem): T = {    SparkEnv.get.blockManager.registerTask(taskAttemptId)    context = new TaskContextImpl(      stageId,      stageAttemptId, // stageAttemptId and stageAttemptNumber are semantically equal      partitionId,      taskAttemptId,      attemptNumber,      taskMemoryManager,      localProperties,      metricsSystem,      metrics)    TaskContext.setTaskContext(context)    taskThread = Thread.currentThread()    if (_reasonIfKilled != null) {      kill(interruptThread = false, _reasonIfKilled)    }    new CallerContext(      "TASK",      SparkEnv.get.conf.get(APP_CALLER_CONTEXT),      appId,      appAttemptId,      jobId,      Option(stageId),      Option(stageAttemptId),      Option(taskAttemptId),      Option(attemptNumber)).setCurrentContext()    try {      runTask(context)    } catch {      case e: Throwable =>        // Catch all errors; run task failure callbacks, and rethrow the exception.        try {          context.markTaskFailed(e)        } catch {          case t: Throwable =>            e.addSuppressed(t)        }        context.markTaskCompleted(Some(e))        throw e    } finally {      try {        // Call the task completion callbacks. If "markTaskCompleted" is called twice, the second        // one is no-op.        context.markTaskCompleted(None)      } finally {        try {          Utils.tryLogNonFatalError {            // Release memory used by this thread for unrolling blocks            SparkEnv.get.blockManager.memoryStore.releaseUnrollMemoryForThisTask(MemoryMode.ON_HEAP)            SparkEnv.get.blockManager.memoryStore.releaseUnrollMemoryForThisTask(              MemoryMode.OFF_HEAP)            // Notify any tasks waiting for execution memory to be freed to wake up and try to            // acquire memory again. This makes impossible the scenario where a task sleeps forever            // because there are no other tasks left to notify it. Since this is safe to do but may            // not be strictly necessary, we should revisit whether we can remove this in the            // future.            val memoryManager = SparkEnv.get.memoryManager            memoryManager.synchronized { memoryManager.notifyAll() }          }        } finally {          // Though we unset the ThreadLocal here, the context member variable itself is still          // queried directly in the TaskRunner to check for FetchFailedExceptions.          TaskContext.unset()        }      }    }  }

Let's look at the two Task subclasses, ShuffleMapTask and ResultTask, in turn.

ShuffleMapTask

  override def runTask(context: TaskContext): MapStatus = {    // Deserialize the RDD using the broadcast variable.    val threadMXBean = ManagementFactory.getThreadMXBean    val deserializeStartTime = System.currentTimeMillis()    val deserializeStartCpuTime = if (threadMXBean.isCurrentThreadCpuTimeSupported) {      threadMXBean.getCurrentThreadCpuTime    } else 0L    val ser = SparkEnv.get.closureSerializer.newInstance()    val (rdd, dep) = ser.deserialize[(RDD[_], ShuffleDependency[_, _, _])](      ByteBuffer.wrap(taskBinary.value), Thread.currentThread.getContextClassLoader)    _executorDeserializeTime = System.currentTimeMillis() - deserializeStartTime    _executorDeserializeCpuTime = if (threadMXBean.isCurrentThreadCpuTimeSupported) {      threadMXBean.getCurrentThreadCpuTime - deserializeStartCpuTime    } else 0L    var writer: ShuffleWriter[Any, Any] = null    try {      val manager = SparkEnv.get.shuffleManager      writer = manager.getWriter[Any, Any](dep.shuffleHandle, partitionId, context)      writer.write(rdd.iterator(partition, context).asInstanceOf[Iterator[_ <: Product2[Any, Any]]])      writer.stop(success = true).get    } catch {      case e: Exception =>        try {          if (writer != null) {            writer.stop(success = false)          }        } catch {          case e: Exception =>            log.debug("Could not stop writer", e)        }        throw e    }  }

The few key lines:

```scala
val (rdd, dep) = ser.deserialize[(RDD[_], ShuffleDependency[_, _, _])]
val manager = SparkEnv.get.shuffleManager
writer = manager.getWriter[Any, Any](dep.shuffleHandle, partitionId, context)
writer.write(rdd.iterator(partition, context).asInstanceOf[Iterator[_ <: Product2[Any, Any]]])
writer.stop(success = true).get
```

Here dep is used to obtain the shuffle writer from the shuffleManager, and rdd is used to obtain the iterator.

Although only one rdd shows up in the deserialized pair, the RDDs of the whole stage are actually deserialized along with it — the whole stage. And except for the first ShuffleMapStage, the first RDD of a stage is generally a ShuffledRDD.
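To see why shipping the "last" RDD effectively ships the whole stage, and why computation inside a stage is pipelined through iterators, here is a toy model (my own classes, not Spark's):

```scala
object IteratorChainSketch {
  abstract class ToyRdd[T] extends Serializable {
    def iterator(partition: Int): Iterator[T]
  }
  class SourceRdd(data: Seq[Seq[Int]]) extends ToyRdd[Int] {
    def iterator(partition: Int): Iterator[Int] = data(partition).iterator
  }
  class MapRdd[T, U](parent: ToyRdd[T], f: T => U) extends ToyRdd[U] {
    // Narrow dependency: map lazily over the parent's iterator, nothing is materialized
    def iterator(partition: Int): Iterator[U] = parent.iterator(partition).map(f)
  }

  def main(args: Array[String]): Unit = {
    val source = new SourceRdd(Seq(Seq(1, 2), Seq(3, 4)))
    val mapped = new MapRdd[Int, Int](new MapRdd[Int, Int](source, _ + 1), _ * 10)
    // Serializing `mapped` would drag the whole parent chain with it, just like a stage's RDDs
    println(mapped.iterator(0).toList) // List(20, 30)
  }
}
```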

ResultTask

 override def runTask(context: TaskContext): U = {    // Deserialize the RDD and the func using the broadcast variables.    val threadMXBean = ManagementFactory.getThreadMXBean    val deserializeStartTime = System.currentTimeMillis()    val deserializeStartCpuTime = if (threadMXBean.isCurrentThreadCpuTimeSupported) {      threadMXBean.getCurrentThreadCpuTime    } else 0L    val ser = SparkEnv.get.closureSerializer.newInstance()    val (rdd, func) = ser.deserialize[(RDD[T], (TaskContext, Iterator[T]) => U)](      ByteBuffer.wrap(taskBinary.value), Thread.currentThread.getContextClassLoader)    _executorDeserializeTime = System.currentTimeMillis() - deserializeStartTime    _executorDeserializeCpuTime = if (threadMXBean.isCurrentThreadCpuTimeSupported) {      threadMXBean.getCurrentThreadCpuTime - deserializeStartCpuTime    } else 0L    func(context, rdd.iterator(partition, context))  }

The key lines here are much simpler than in ShuffleMapTask:

```scala
val (rdd, func) = ser.deserialize[(RDD[T], (TaskContext, Iterator[T]) => U)]
func(context, rdd.iterator(partition, context))
```
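Closing the loop on count: the deserialized func is essentially the wrapped getIteratorSize, each partition yields a Long, the resultHandler stores it into the results array, and the driver sums the array. A semantic sketch (not source code; the strings and names are mine):

```scala
object CountEndToEndSketch {
  def main(args: Array[String]): Unit = {
    val partitions: Seq[Iterator[String]] = Seq(Iterator("a", "b"), Iterator("c"))
    val func: (String, Iterator[String]) => Long = (ctx, it) => it.size.toLong // i.e. cleanedFunc(it)
    val results = new Array[Long](partitions.size)
    partitions.zipWithIndex.foreach { case (it, index) =>
      results(index) = func(s"task-$index", it)   // mirrors resultHandler: (index, res) => results(index) = res
    }
    println(results.sum) // 3, which is what count() returns
  }
}
```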

That wraps up the overall job flow. The shuffle internals deserve another pass of their own.


This article was first published on GitChat and may not be reproduced without permission; contact GitChat for reprint requests.

Read the original: http://gitbook.cn/gitchat/activity/5d63f3ddeafb4d1efa9e7ef5


