Pitfalls of SparkSession in Spark 2.0


  • Pitfalls of SparkSession in Spark 2.0
    • Background
    • The abstracted code
    • Initial diagnosis
    • Further diagnosis
    • Source code analysis
    • Revisiting how SparkSession is created
    • Final fix

SparkSession, the replacement for SQLContext (and HiveContext)

Background

My server-side logic runs inside actors, and I found that while several actors were running, one actor could see temporary views that had been registered in another actor's session.

The abstracted code

The actor logic can be abstracted roughly as follows:

package com.ximalaya.xql.datatask

import akka.actor.{Actor, ActorLogging}
import com.ximalaya.xql.datatask.TestActor.{Bean, CreateView}
import org.apache.spark.SparkConf
import org.apache.spark.sql.SparkSession

import scala.util.Random

/**
  * @author todd.chen at 17/11/2016 13:52.
  *         email : todd.chen@ximalaya.com
  */
class TestActor(sparkConf: SparkConf) extends Actor with ActorLogging {
  var sparkSession: SparkSession = _
  override def preStart(): Unit = {
    sparkSession = SparkSession.builder().config(sparkConf).getOrCreate()
  }
  override def postStop(): Unit = sparkSession.stop()
  override def receive: Receive = {
    case CreateView(limit) ⇒
      sparkSession.createDataFrame(getBeans(limit)).createOrReplaceTempView("test1")
    case s: String ⇒
      log.info(s"exec $s")
      sparkSession.sql(s).show(1000)
    case e: Any ⇒ println(e)
  }
  val ids = 1 to 1000
  val names = ('a' to 'z').map(_.toString)
  val beans = for {
    id ← ids
    name ← names
  } yield Bean(id, name)

  def getBeans(limit: Int): Seq[Bean] = {
    Random.shuffle(beans).take(limit)
  }
}

object TestActor {
  case class CreateView(limit: Int)
  case class Bean(id: Int, name: String)
}

The driver test class:

package com.ximalaya.xql.datatask
import akka.actor.{ActorSystem, Props}
import com.ximalaya.xql.datatask.TestActor.CreateView
import org.apache.spark.SparkConf

import scala.util.Random

/**
  * @author todd.chen at 17/11/2016 13:59.
  *         email : todd.chen@ximalaya.com
  */
object Test {
  def main(args: Array[String]): Unit = {
    val sparkConf = new SparkConf().setAppName("local").setMaster("local[*]")
    val actorSystem = ActorSystem("testSystem")
    val actor1 = actorSystem.actorOf(Props(new TestActor(sparkConf)), name = "actor1")
    val actor2 = actorSystem.actorOf(Props(new TestActor(sparkConf)), name = "actor2")
    val actor3 = actorSystem.actorOf(Props(new TestActor(sparkConf)), name = "actor3")
    val actor4 = actorSystem.actorOf(Props(new TestActor(sparkConf)), name = "actor4")
    val actors = actor1 :: actor2 :: actor3 :: actor4 :: Nil
    def getLimit = Random.nextInt(100)
    actors.foreach(_ ! CreateView(getLimit))
    Thread sleep 10000
    actors.foreach(_ ! "select * from test1")
    Thread sleep 10000
    actors.foreach(_ ! "show tables")
    Thread sleep 10000
  }
}

Initial diagnosis

The symptom: the query results from all four actors were identical. With most of the INFO logging filtered out, the log looked roughly like this:

20:07:46 731  WARN (org.apache.spark.SparkContext:66) - Use an existing SparkContext, some configuration may not take effect.
20:07:46 845  INFO (org.apache.spark.sql.internal.SharedState:54) - Warehouse path is '/user/hive/warehouse'.
20:07:46 849  WARN (org.apache.spark.sql.SparkSession$Builder:66) - Using an existing SparkSession; some configuration may not take effect.
20:07:46 849  WARN (org.apache.spark.sql.SparkSession$Builder:66) - Using an existing SparkSession; some configuration may not take effect.
20:07:46 850  WARN (org.apache.spark.sql.SparkSession$Builder:66) - Using an existing SparkSession; some configuration may not take effect.
20:07:49 997  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: test1
20:07:49 999  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: test1
20:07:49 997  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: test1
20:07:49 997  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: test1
20:07:54 342  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: select * from test1
20:07:54 343  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: select * from test1
20:07:54 343  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: select * from test1
[INFO] [11/17/2016 20:07:54.341] [testSystem-akka.actor.default-dispatcher-4] [akka://testSystem/user/actor4] exec select * from test1
[INFO] [11/17/2016 20:07:54.341] [testSystem-akka.actor.default-dispatcher-3] [akka://testSystem/user/actor2] exec select * from test1
[INFO] [11/17/2016 20:07:54.341] [testSystem-akka.actor.default-dispatcher-5] [akka://testSystem/user/actor1] exec select * from test1
20:07:54 351  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: select * from test1
[INFO] [11/17/2016 20:07:54.351] [testSystem-akka.actor.default-dispatcher-2] [akka://testSystem/user/actor3] exec select * from test1
+---+----+
| id|name|
+---+----+
|110|   u|
|720|   f|
+---+----+

+---+----+
| id|name|
+---+----+
|110|   u|
|720|   f|
+---+----+

+---+----+
| id|name|
+---+----+
|110|   u|
|720|   f|
+---+----+

+---+----+
| id|name|
+---+----+
|110|   u|
|720|   f|
+---+----+

+---------+-----------+
|tableName|isTemporary|
+---------+-----------+
|    test1|       true|
+---------+-----------+

+---------+-----------+
|tableName|isTemporary|
+---------+-----------+
|    test1|       true|
+---------+-----------+

+---------+-----------+
|tableName|isTemporary|
+---------+-----------+
|    test1|       true|
+---------+-----------+

+---------+-----------+
|tableName|isTemporary|
+---------+-----------+
|    test1|       true|
+---------+-----------+

Further diagnosis

Note the WARN messages above: they point to the real cause, namely that the actors ended up sharing one and the same SparkSession. To confirm this, we swap createOrReplaceTempView for the blunter createTempView:

  override def receive: Receive = {
    case CreateView(limit) ⇒
      sparkSession.createDataFrame(getBeans(limit)).createTempView("test1")
    case s: String ⇒
      log.info(s"exec $s")
      sparkSession.sql(s).show(1000)
    case e: Any ⇒ println(e)
  }

The difference between the two methods is explained quite clearly in the source:

  /**
   * Creates a temporary view using the given name. The lifetime of this
   * temporary view is tied to the [[SparkSession]] that was used to create this Dataset.
   *
   * @throws AnalysisException if the view name already exists
   *
   * @group basic
   * @since 2.0.0
   */
  @throws[AnalysisException]
  def createTempView(viewName: String): Unit = withPlan {
    val tableDesc = CatalogTable(
      identifier = sparkSession.sessionState.sqlParser.parseTableIdentifier(viewName),
      tableType = CatalogTableType.VIEW,
      schema = Seq.empty[CatalogColumn],
      storage = CatalogStorageFormat.empty)
    CreateViewCommand(tableDesc, logicalPlan, allowExisting = false, replace = false,
      isTemporary = true)
  }

  /**
   * Creates a temporary view using the given name. The lifetime of this
   * temporary view is tied to the [[SparkSession]] that was used to create this Dataset.
   *
   * @group basic
   * @since 2.0.0
   */
  def createOrReplaceTempView(viewName: String): Unit = withPlan {
    val tableDesc = CatalogTable(
      identifier = sparkSession.sessionState.sqlParser.parseTableIdentifier(viewName),
      tableType = CatalogTableType.VIEW,
      schema = Seq.empty[CatalogColumn],
      storage = CatalogStorageFormat.empty)
    CreateViewCommand(tableDesc, logicalPlan, allowExisting = false, replace = true,
      isTemporary = true)
  }
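
To make the difference concrete, here is a tiny single-session sketch (the view name and the DataFrame are made up for illustration, and `sparkSession` is assumed to be an already-created session):

val df = sparkSession.range(3).toDF("id")

df.createOrReplaceTempView("t") // creates the view
df.createOrReplaceTempView("t") // fine: silently replaces the existing view
df.createTempView("t")          // throws TempTableAlreadyExistsException: 't' already exists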

That is, createTempView will fail if a view with that name already exists. Let's run the test again and look at the log:

[ERROR] [11/17/2016 20:13:34.499] [testSystem-akka.actor.default-dispatcher-6] [akka://testSystem/user/actor1] Temporary table 'test1' already exists;
org.apache.spark.sql.catalyst.analysis.TempTableAlreadyExistsException: Temporary table 'test1' already exists;
    at org.apache.spark.sql.catalyst.catalog.SessionCatalog.createTempView(SessionCatalog.scala:341)
    at org.apache.spark.sql.execution.command.CreateViewCommand.createTemporaryView(views.scala:146)
    at org.apache.spark.sql.execution.command.CreateViewCommand.run(views.scala:97)
    at org.apache.spark.sql.execution.command.ExecutedCommandExec.sideEffectResult$lzycompute(commands.scala:58)
    at org.apache.spark.sql.execution.command.ExecutedCommandExec.sideEffectResult(commands.scala:56)
    at org.apache.spark.sql.execution.command.ExecutedCommandExec.doExecute(commands.scala:74)
    at org.apache.spark.sql.execution.SparkPlan$$anonfun$execute$1.apply(SparkPlan.scala:115)
	at org.apache.spark.sql.execution.SparkPlan$$anonfun$execute$1.apply(SparkPlan.scala:115)
    at org.apache.spark.sql.execution.SparkPlan$$anonfun$executeQuery$1.apply(SparkPlan.scala:136)
	at org.apache.spark.rdd.RDDOperationScope$.withScope(RDDOperationScope.scala:151)
	at org.apache.spark.sql.execution.SparkPlan.executeQuery(SparkPlan.scala:133)
	at org.apache.spark.sql.execution.SparkPlan.execute(SparkPlan.scala:114)
	at org.apache.spark.sql.execution.QueryExecution.toRdd$lzycompute(QueryExecution.scala:86)
	at org.apache.spark.sql.execution.QueryExecution.toRdd(QueryExecution.scala:86)
    at org.apache.spark.sql.Dataset.<init>(Dataset.scala:186)
    at org.apache.spark.sql.Dataset.<init>(Dataset.scala:167)
	at org.apache.spark.sql.Dataset$.ofRows(Dataset.scala:65)
	at org.apache.spark.sql.Dataset.org$apache$spark$sql$Dataset$$withPlan(Dataset.scala:2603)
    at org.apache.spark.sql.Dataset.createTempView(Dataset.scala:2398)
    at com.ximalaya.xql.datatask.TestActor$$anonfun$receive$1.applyOrElse(TestActor.scala:43)
    at akka.actor.Actor$class.aroundReceive(Actor.scala:482)
    at com.ximalaya.xql.datatask.TestActor.aroundReceive(TestActor.scala:30)
    at akka.actor.ActorCell.receiveMessage(ActorCell.scala:526)
    at akka.actor.ActorCell.invoke(ActorCell.scala:495)
    at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:257)
    at akka.dispatch.Mailbox.run(Mailbox.scala:224)
    at akka.dispatch.Mailbox.exec(Mailbox.scala:234)
    at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
    at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
    at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
    at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)

Exactly as we suspected.

Source code analysis

  • Following createOrReplaceTempView leads us to the run method of CreateViewCommand (fully qualified: org.apache.spark.sql.execution.command.CreateViewCommand):

    override def run(sparkSession: SparkSession): Seq[Row] = {
      // If the plan cannot be analyzed, throw an exception and don't proceed.
      val qe = sparkSession.sessionState.executePlan(child)
      qe.assertAnalyzed()
      val analyzedPlan = qe.analyzed

      if (tableDesc.schema != Nil && tableDesc.schema.length != analyzedPlan.output.length) {
        throw new AnalysisException(s"The number of columns produced by the SELECT clause " +
          s"(num: `${analyzedPlan.output.length}`) does not match the number of column names " +
          s"specified by CREATE VIEW (num: `${tableDesc.schema.length}`).")
      }
      val sessionState = sparkSession.sessionState

      if (isTemporary) {
        createTemporaryView(tableDesc.identifier, sparkSession, analyzedPlan)
      } else {
        // Adds default database for permanent table if it doesn't exist, so that tableExists()
        // only check permanent tables.
        val database = tableDesc.identifier.database.getOrElse(
          sessionState.catalog.getCurrentDatabase)
        val tableIdentifier = tableDesc.identifier.copy(database = Option(database))

        if (sessionState.catalog.tableExists(tableIdentifier)) {
          if (allowExisting) {
            // Handles `CREATE VIEW IF NOT EXISTS v0 AS SELECT ...`. Does nothing when the target view
            // already exists.
          } else if (replace) {
            // Handles `CREATE OR REPLACE VIEW v0 AS SELECT ...`
            sessionState.catalog.alterTable(prepareTable(sparkSession, analyzedPlan))
          } else {
            // Handles `CREATE VIEW v0 AS SELECT ...`. Throws exception when the target view already
            // exists.
            throw new AnalysisException(
              s"View $tableIdentifier already exists. If you want to update the view definition, " +
                "please use ALTER VIEW AS or CREATE OR REPLACE VIEW AS")
          }
        } else {
          // Create the view if it doesn't exist.
          sessionState.catalog.createTable(
            prepareTable(sparkSession, analyzedPlan), ignoreIfExists = false)
        }
      }
      Seq.empty[Row]
    }
  • run mainly operates on the sessionState of the SparkSession that was passed in:

    val sessionState = sparkSession.sessionState
  • which in turn goes through the SessionCatalog class:

  /**
   * Internal catalog for managing table and database states.
   */
  lazy val catalog = new SessionCatalog(
    sparkSession.sharedState.externalCatalog,
    functionResourceLoader,
    functionRegistry,
    conf,
    newHadoopConf())
  • SessionCatalog keeps temporary tables in a member variable, and its createTempView writes into it:
  /** List of temporary tables, mapping from table name to their logical plan. */
  @GuardedBy("this")
  protected val tempTables = new mutable.HashMap[String, LogicalPlan]

  /**
   * Create a temporary table.
   */
  def createTempView(
      name: String,
      tableDefinition: LogicalPlan,
      overrideIfExists: Boolean): Unit = synchronized {
    val table = formatTableName(name)
    if (tempTables.contains(table) && !overrideIfExists) {
      throw new TempTableAlreadyExistsException(name)
    }
    tempTables.put(table, tableDefinition)
  }

At this point we understand about 80% of the picture: all the actors share the same sessionState, and therefore the same catalog, so a temp view registered by one actor is visible to (and replaceable by) every other actor.
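
This is easy to verify outside the actors as well. A small sketch, assuming the default session has already been created once in this JVM and `sparkConf` is the same SparkConf used in the test:

// Both calls hit the "existing default session" path of getOrCreate,
// so they hand back the very same SparkSession instance.
val s1 = SparkSession.builder().config(sparkConf).getOrCreate()
val s2 = SparkSession.builder().config(sparkConf).getOrCreate()
assert(s1 eq s2) // same object => same sessionState => same SessionCatalog / tempTables

// A temp view registered through one reference is visible through the other.
s1.range(1).createOrReplaceTempView("leak_check")
s2.table("leak_check").show() // resolves fine: s2 sees the view registered via s1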

Revisiting how SparkSession is created

The problem is that after the first call, a defaultSession already exists:

    /**
     * Gets an existing [[SparkSession]] or, if there is no existing one, creates a new
     * one based on the options set in this builder.
     *
     * This method first checks whether there is a valid thread-local SparkSession,
     * and if yes, return that one. It then checks whether there is a valid global
     * default SparkSession, and if yes, return that one. If no valid global default
     * SparkSession exists, the method creates a new SparkSession and assigns the
     * newly created SparkSession as the global default.
     *
     * In case an existing SparkSession is returned, the config options specified in
     * this builder will be applied to the existing SparkSession.
     *
     * @since 2.0.0
     */
    def getOrCreate(): SparkSession = synchronized {
      // Get the session from current thread's active session.
      var session = activeThreadSession.get()
      if ((session ne null) && !session.sparkContext.isStopped) {
        options.foreach { case (k, v) => session.conf.set(k, v) }
        if (options.nonEmpty) {
          logWarning("Using an existing SparkSession; some configuration may not take effect.")
        }
        return session
      }

      // Global synchronization so we will only set the default session once.
      SparkSession.synchronized {
        // If the current thread does not have an active session, get it from the global session.
        session = defaultSession.get()
        if ((session ne null) && !session.sparkContext.isStopped) {
          options.foreach { case (k, v) => session.conf.set(k, v) }
          if (options.nonEmpty) {
            logWarning("Using an existing SparkSession; some configuration may not take effect.")
          }
          return session
        }

        // No active nor global default session. Create a new one.
        val sparkContext = userSuppliedContext.getOrElse {
          // set app name if not given
          val randomAppName = java.util.UUID.randomUUID().toString
          val sparkConf = new SparkConf()
          options.foreach { case (k, v) => sparkConf.set(k, v) }
          if (!sparkConf.contains("spark.app.name")) {
            sparkConf.setAppName(randomAppName)
          }
          val sc = SparkContext.getOrCreate(sparkConf)
          // maybe this is an existing SparkContext, update its SparkConf which maybe used
          // by SparkSession
          options.foreach { case (k, v) => sc.conf.set(k, v) }
          if (!sc.conf.contains("spark.app.name")) {
            sc.conf.setAppName(randomAppName)
          }
          sc
        }
        session = new SparkSession(sparkContext)
        options.foreach { case (k, v) => session.conf.set(k, v) }
        defaultSession.set(session)

        // Register a successfully instantiated context to the singleton. This should be at the
        // end of the class definition so that the singleton is updated only if there is no
        // exception in the construction of the instance.
        sparkContext.addSparkListener(new SparkListener {
          override def onApplicationEnd(applicationEnd: SparkListenerApplicationEnd): Unit = {
            defaultSession.set(null)
            sqlListener.set(null)
          }
        })
      }

      return session
    }
  }

And defaultSession is an AtomicReference held in the SparkSession companion object, i.e. a single slot shared by the whole JVM:

  /** Reference to the root SparkSession. */
  private val defaultSession = new AtomicReference[SparkSession]

To keep getOrCreate from taking the branch under "Global synchronization so we will only set the default session once" and returning the existing default session, we need to clear this reference after each session is created.

Setting a breakpoint on that branch confirmed the guess:

with 4 actors, the breakpoint was hit 3 times (the first actor creates the session; the other three are handed the existing default one).

  /**
   * Clears the default SparkSession that is returned by the builder.
   *
   * @since 2.0.0
   */
  def clearDefaultSession(): Unit = {
    defaultSession.set(null)
  }

Final fix

  // In object TestActor: always build a fresh session, and never leave it behind
  // as the global default.
  def getSparkSession(sparkConf: SparkConf): SparkSession = SparkSession.synchronized {
    SparkSession.clearDefaultSession()
    val session = SparkSession.builder().config(sparkConf).getOrCreate()
    SparkSession.clearDefaultSession()
    session
  }

  // In class TestActor:
  override def preStart(): Unit = {
    sparkSession = TestActor.getSparkSession(sparkConf)
  }
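
Two notes on top of this fix (my additions, not part of the original code). First, getOrCreate also consults a thread-local "active" session, so if your code ever calls SparkSession.setActiveSession you may want a matching SparkSession.clearActiveSession() as well. Second, Spark 2.0 already ships an official way to get isolated session state: SparkSession.newSession() reuses the SparkContext and shared state but gives each caller its own SessionState, and thus its own temp-view namespace. A sketch of a helper built on it; the object and method names here are hypothetical:

import org.apache.spark.SparkConf
import org.apache.spark.sql.SparkSession

object SparkSessions {
  // Build (or reuse) one root session per JVM, then hand out isolated children.
  def isolated(sparkConf: SparkConf): SparkSession = this.synchronized {
    val root = SparkSession.builder().config(sparkConf).getOrCreate()
    // newSession() shares the SparkContext and cached data but has a separate
    // SessionState, so temp views created in one child are invisible to the others.
    root.newSession()
  }
}

With something like this, each actor's preStart would simply call SparkSessions.isolated(sparkConf), and the defaultSession bookkeeping would no longer matter.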

Log output after the fix:

20:41:35 942  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: test1
20:41:35 943  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: test1
20:41:35 942  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: test1
20:41:35 943  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: test1
20:41:40 85  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: select * from test1
20:41:40 85  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: select * from test1
20:41:40 86  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: select * from test1
[INFO] [11/17/2016 20:41:40.083] [testSystem-akka.actor.default-dispatcher-4] [akka://testSystem/user/actor1] exec select * from test1
[INFO] [11/17/2016 20:41:40.083] [testSystem-akka.actor.default-dispatcher-2] [akka://testSystem/user/actor2] exec select * from test1
[INFO] [11/17/2016 20:41:40.083] [testSystem-akka.actor.default-dispatcher-3] [akka://testSystem/user/actor4] exec select * from test1
[INFO] [11/17/2016 20:41:40.093] [testSystem-akka.actor.default-dispatcher-5] [akka://testSystem/user/actor3] exec select * from test1
20:41:40 93  INFO (org.apache.spark.sql.execution.SparkSqlParser:54) - Parsing command: select * from test1
20:41:40 709  INFO (org.apache.spark.sql.catalyst.expressions.codegen.CodeGenerator:54) - Code generated in 365.008452 ms
+---+----+
| id|name|
+---+----+
+---+----+

20:41:40 785  INFO (org.apache.spark.sql.catalyst.expressions.codegen.CodeGenerator:54) - Code generated in 17.495479 ms
+---+----+
| id|name|
+---+----+
|192|   p|
| 65|   t|
|594|   c|
| 26|   w|
|787|   s|
|456|   j|
|687|   m|
+---+----+

+---+----+
| id|name|
+---+----+
|993|   b|
|290|   a|
|706|   n|
| 85|   f|
+---+----+

+---+----+
| id|name|
+---+----+
|510|   w|
|473|   x|
|877|   c|
| 80|   f|
|826|   k|
+---+----+

The breakpoint was never hit again, and the problem is solved.

my github
