SparkML Prediction (Part 1): Reading the Linear Regression Source Code

The package org.apache.spark.mllib.regression contains two main pieces: LinearRegressionModel and LinearRegressionWithSGD.

1. The regression model (a class plus a companion object). The class extends GeneralizedLinearModel, the generalized linear model, to form a complete linear regression model; the companion object's methods load a previously saved model so it can be used for prediction.

2. LinearRegressionWithSGD: stochastic gradient descent. The cost function is f(weights) = 1/n ||A weights - y||^2; keeping the 1/n factor (dividing by the number of samples) is what makes this the mean squared error.

LinearRegressionWithSGD extends the generalized linear algorithm class GeneralizedLinearAlgorithm[LinearRegressionModel].



1. The regression model source code:

/**
 * Regression model trained using LinearRegression.
 *
 * @param weights Weights computed for every feature. (the weight vector, one entry per feature)
 * @param intercept Intercept computed for this model. (the model's intercept/bias term)
 */
@Since("0.8.0")
class LinearRegressionModel @Since("1.1.0") (
    @Since("1.0.0") override val weights: Vector,
    @Since("0.8.0") override val intercept: Double)
  extends GeneralizedLinearModel(weights, intercept) with RegressionModel with Serializable
  with Saveable with PMMLExportable {

  // Make a prediction: y = w * x + intercept
  override protected def predictPoint(
      dataMatrix: Vector,
      weightMatrix: Vector,
      intercept: Double): Double = {
    weightMatrix.toBreeze.dot(dataMatrix.toBreeze) + intercept
  }
  // Saving the model stores the path, class name, weights and intercept
  @Since("1.3.0")
  override def save(sc: SparkContext, path: String): Unit = {
    GLMRegressionModel.SaveLoadV1_0.save(sc, path, this.getClass.getName, weights, intercept)
  }

  override protected def formatVersion: String = "1.0"
}
// Load a model saved as above with load(sc, path)
@Since("1.3.0")
object LinearRegressionModel extends Loader[LinearRegressionModel] {

  @Since("1.3.0")
  override def load(sc: SparkContext, path: String): LinearRegressionModel = {
    val (loadedClassName, version, metadata) = Loader.loadMetadata(sc, path)
    // Hard-code class name string in case it changes in the future
    val classNameV1_0 = "org.apache.spark.mllib.regression.LinearRegressionModel"
    (loadedClassName, version) match {
      case (className, "1.0") if className == classNameV1_0 =>
        val numFeatures = RegressionModel.getNumFeatures(metadata)
        val data = GLMRegressionModel.SaveLoadV1_0.loadData(sc, path, classNameV1_0, numFeatures)
        new LinearRegressionModel(data.weights, data.intercept)
      case _ => throw new Exception(
        s"LinearRegressionModel.load did not recognize model with (className, format version):" +
        s"($loadedClassName, $version).  Supported:\n" +
        s"  ($classNameV1_0, 1.0)")
    }
  }
}
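To see how the object-side load pairs with the class-side save and predictPoint, here is a minimal usage sketch; it assumes an existing SparkContext sc, an already trained model, and a made-up storage path.

import org.apache.spark.mllib.linalg.Vectors
import org.apache.spark.mllib.regression.LinearRegressionModel

// Persist the trained model (hypothetical path, adjust to your environment)
model.save(sc, "hdfs:///tmp/linear-regression-model")

// Later, reload it and predict: y = w * x + intercept
val reloaded = LinearRegressionModel.load(sc, "hdfs:///tmp/linear-regression-model")
val yHat = reloaded.predict(Vectors.dense(0.5, 1.2, -0.3))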

2. The LinearRegressionWithSGD class. It trains with stochastic gradient descent and no regularization, and extends the generalized linear algorithm class GeneralizedLinearAlgorithm[LinearRegressionModel].

/**
 * Train a linear regression model with no regularization using Stochastic Gradient Descent.
 * This solves the least squares regression formulation
 *   f(weights) = 1/n ||A weights-y||^2
 * (which is the mean squared error).
 * Here the data matrix has n rows, and the input RDD holds the set of rows of A, each with
 * its corresponding right hand side label y.
 * See also the documentation for the precise formulation.
 */
@Since("0.8.0")
class LinearRegressionWithSGD private[mllib] (
    private var stepSize: Double,           // step size
    private var numIterations: Int,         // number of iterations
    private var miniBatchFraction: Double)  // fraction of samples used in each iteration
  extends GeneralizedLinearAlgorithm[LinearRegressionModel] with Serializable {

  private val gradient = new LeastSquaresGradient()  // see section 3
  private val updater = new SimpleUpdater()          // see section 4
  @Since("0.8.0")
  override val optimizer = new GradientDescent(gradient, updater)  // see section 5
    .setStepSize(stepSize)
    .setNumIterations(numIterations)
    .setMiniBatchFraction(miniBatchFraction)

  /**
   * Construct a LinearRegression object with default parameters: {stepSize: 1.0,
   * numIterations: 100, miniBatchFraction: 1.0}.
   */
  @Since("0.8.0")
  def this() = this(1.0, 100, 1.0) 

  override protected[mllib] def createModel(weights: Vector, intercept: Double) = {
    new LinearRegressionModel(weights, intercept)
  }
}

/**
 * Top-level methods for calling LinearRegression.
 */
@Since("0.8.0")
object LinearRegressionWithSGD {

  /**
   * Train a Linear Regression model given an RDD of (label, features) pairs. We run a fixed number
   * of iterations of gradient descent using the specified step size. Each iteration uses
   * `miniBatchFraction` fraction of the data to calculate a stochastic gradient. The weights used
   * in gradient descent are initialized using the initial weights provided.
   *
   * @param input RDD of (label, array of features) pairs. Each pair describes a row of the data
   *              matrix A as well as the corresponding right hand side label y
   * @param numIterations Number of iterations of gradient descent to run.
   * @param stepSize Step size to be used for each iteration of gradient descent.
   * @param miniBatchFraction Fraction of data to be used per iteration.
   * @param initialWeights Initial set of weights to be used. Array should be equal in size to
   *                       the number of features in the data.
   */
  @Since("1.0.0")
  def train(
      input: RDD[LabeledPoint],
      numIterations: Int,
      stepSize: Double,
      miniBatchFraction: Double,
      initialWeights: Vector): LinearRegressionModel = {
    new LinearRegressionWithSGD(stepSize, numIterations, miniBatchFraction)
      .run(input, initialWeights)
  }

  /**
   * Train a LinearRegression model given an RDD of (label, features) pairs. We run a fixed number
   * of iterations of gradient descent using the specified step size. Each iteration uses
   * `miniBatchFraction` fraction of the data to calculate a stochastic gradient.
   *
   * @param input RDD of (label, array of features) pairs. Each pair describes a row of the data
   *              matrix A as well as the corresponding right hand side label y
   * @param numIterations Number of iterations of gradient descent to run.
   * @param stepSize Step size to be used for each iteration of gradient descent.
   * @param miniBatchFraction Fraction of data to be used per iteration.
   */
  @Since("0.8.0")
  def train(
      input: RDD[LabeledPoint],
      numIterations: Int,
      stepSize: Double,
      miniBatchFraction: Double): LinearRegressionModel = {
    new LinearRegressionWithSGD(stepSize, numIterations, miniBatchFraction).run(input)
  }

  /**
   * Train a LinearRegression model given an RDD of (label, features) pairs. We run a fixed number
   * of iterations of gradient descent using the specified step size. We use the entire data set to
   * compute the true gradient in each iteration.
   *
   * @param input RDD of (label, array of features) pairs. Each pair describes a row of the data
   *              matrix A as well as the corresponding right hand side label y
   * @param stepSize Step size to be used for each iteration of Gradient Descent.
   * @param numIterations Number of iterations of gradient descent to run.
   * @return a LinearRegressionModel which has the weights and offset from training.
   */
  @Since("0.8.0")
  def train(
      input: RDD[LabeledPoint],
      numIterations: Int,
      stepSize: Double): LinearRegressionModel = {
    train(input, numIterations, stepSize, 1.0)
  }

  /**
   * Train a LinearRegression model given an RDD of (label, features) pairs. We run a fixed number
   * of iterations of gradient descent using a step size of 1.0. We use the entire data set to
   * compute the true gradient in each iteration.
   *
   * @param input RDD of (label, array of features) pairs. Each pair describes a row of the data
   *              matrix A as well as the corresponding right hand side label y
   * @param numIterations Number of iterations of gradient descent to run.
   * @return a LinearRegressionModel which has the weights and offset from training.
   */
  @Since("0.8.0")
  def train(
      input: RDD[LabeledPoint],
      numIterations: Int): LinearRegressionModel = {
    train(input, numIterations, 1.0, 1.0)
  }
}
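Putting the pieces together, here is a minimal training sketch. It assumes an existing SparkContext sc; the toy data roughly follows y = 2*x1 + 3*x2 and is made up for illustration.

import org.apache.spark.mllib.linalg.Vectors
import org.apache.spark.mllib.regression.{LabeledPoint, LinearRegressionWithSGD}

val training = sc.parallelize(Seq(
  LabeledPoint(5.0, Vectors.dense(1.0, 1.0)),
  LabeledPoint(8.0, Vectors.dense(1.0, 2.0)),
  LabeledPoint(7.0, Vectors.dense(2.0, 1.0)),
  LabeledPoint(10.0, Vectors.dense(2.0, 2.0))
)).cache()

// 200 iterations of SGD, step size 0.1, full batch (miniBatchFraction = 1.0)
val model = LinearRegressionWithSGD.train(training, 200, 0.1, 1.0)

println(model.weights)                           // should approach [2.0, 3.0]
println(model.predict(Vectors.dense(1.5, 1.5)))  // should approach 7.5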

3. Least squares gradient (LeastSquaresGradient). Recall the cost (loss) function; the source code annotates it as L = 1/(2n) ||A weights - y||^2.

For a single sample (x, y) with weight vector w, writing h(x) = w·x:

- per-sample loss: (h(x) - y)^2 / 2
- per-sample gradient: (h(x) - y) * x

The first compute returns the pair (gradient, loss); the second compute adds the gradient into cumGradient in place and returns only the loss.

// dot, scal and axpy used below come from org.apache.spark.mllib.linalg.BLAS
class LeastSquaresGradient extends Gradient {
  override def compute(data: Vector, label: Double, weights: Vector): (Vector, Double) = {
    val diff = dot(data, weights) - label
    val loss = diff * diff / 2.0  // per-sample loss: (h(x) - y)^2 / 2
    val gradient = data.copy
    scal(diff, gradient)  // gradient = (h(x) - y) * x
    (gradient, loss)
  }

  override def compute(
      data: Vector,
      label: Double,
      weights: Vector,
      cumGradient: Vector): Double = {
    val diff = dot(data, weights) - label  // h(x) - y
    axpy(diff, data, cumGradient)  // cumGradient += (h(x) - y) * x
    /**
     * axpy computes y += x * a, possibly doing less work than actually doing that operation:
     * def axpy[A, X, Y](a: A, x: X, y: Y)(implicit axpy: CanAxpy[A, X, Y]) { axpy(a,x,y) }
     */
    diff * diff / 2.0
  }
}
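As a sanity check, here is a hand-worked sketch of what the first compute does for a single sample, using plain arrays instead of MLlib vectors; the numbers are made up for illustration.

val x = Array(1.0, 2.0)   // features
val y = 5.0               // label
val w = Array(0.5, 1.0)   // current weights

val h    = x.zip(w).map { case (xi, wi) => xi * wi }.sum  // h(x) = w·x = 2.5
val diff = h - y                                          // 2.5 - 5.0 = -2.5
val loss = diff * diff / 2.0                              // 3.125
val grad = x.map(_ * diff)                                // Array(-2.5, -5.0)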

4. Weight update (SimpleUpdater). The update rule is

w(i+1) = w(i) - (stepSize / sqrt(iter)) * gradient

The second element of the returned tuple is the regularization value, which SimpleUpdater always sets to 0 because it applies no regularization.

class SimpleUpdater extends Updater {
  override def compute(
      weightsOld: Vector,   // weights after the previous iteration
      gradient: Vector,     // gradient computed in this iteration
      stepSize: Double,     // step size
      iter: Int,            // current iteration number
      regParam: Double): (Vector, Double) = {
    val thisIterStepSize = stepSize / math.sqrt(iter)  // learning rate for this iteration
    val brzWeights: BV[Double] = weightsOld.toBreeze.toDenseVector
    brzAxpy(-thisIterStepSize, gradient.toBreeze, brzWeights)
    // brzWeights += -thisIterStepSize * gradient.toBreeze, i.e. w = w - a * gradient

    (Vectors.fromBreeze(brzWeights), 0)  // the regularization value is always 0 (no regularization)
  }
}
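A quick numeric sketch of one SimpleUpdater step (values are made up): with stepSize = 1.0, at iteration iter = 4 the effective learning rate is 1.0 / sqrt(4) = 0.5, so each weight moves half a gradient step.

val stepSize = 1.0
val iter = 4
val a = stepSize / math.sqrt(iter)  // 0.5

val wOld = Array(0.5, 1.0)
val grad = Array(-2.5, -5.0)        // gradient from the previous sketch
val wNew = wOld.zip(grad).map { case (w, g) => w - a * g }  // Array(1.75, 3.5)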

5. Weight optimization (GradientDescent)

The optimizer is mini-batch stochastic gradient descent; with the default miniBatchFraction = 1.0 every iteration uses the full data set, i.e. classical (deterministic) gradient descent.


/**
 * Class used to solve an optimization problem using Gradient Descent.
 * @param gradient Gradient function to be used.
 * @param updater Updater to be used to update weights after every iteration.
 */
class GradientDescent private[spark] (private var gradient: Gradient, private var updater: Updater)
  extends Optimizer with Logging {

  private var stepSize: Double = 1.0
  private var numIterations: Int = 100
  private var regParam: Double = 0.0
  private var miniBatchFraction: Double = 1.0
  private var convergenceTol: Double = 0.001  // convergence tolerance

  /**
   * Set the initial step size of SGD for the first step. Default 1.0.
   * In subsequent steps, the step size will decrease with stepSize/sqrt(t)
   */
  def setStepSize(step: Double): this.type = {
    this.stepSize = step
    this
  }

  /**
   * :: Experimental ::
   * Set fraction of data to be used for each SGD iteration.
   * Default 1.0 (corresponding to deterministic/classical gradient descent)
   */
  @Experimental
  def setMiniBatchFraction(fraction: Double): this.type = {
    this.miniBatchFraction = fraction
    this
  }

  /**
   * Set the number of iterations for SGD. Default 100.
   */
  def setNumIterations(iters: Int): this.type = {
    this.numIterations = iters
    this
  }

  /**
   * Set the regularization parameter. Default 0.0.
   */
  def setRegParam(regParam: Double): this.type = {
    this.regParam = regParam
    this
  }

  /**
   * Set the convergence tolerance. Default 0.001
   * convergenceTol is a condition which decides iteration termination.
   * The end of iteration is decided based on below logic.
   *
   * - If the norm of the new solution vector is >1, the diff of solution vectors
   *   is compared to relative tolerance which means normalizing by the norm of
   *   the new solution vector.
   * - If the norm of the new solution vector is <=1, the diff of solution vectors
   *   is compared to absolute tolerance which is not normalizing.
   *
   * Must be between 0.0 and 1.0 inclusively.
   */
  def setConvergenceTol(tolerance: Double): this.type = {
    require(0.0 <= tolerance && tolerance <= 1.0)
    this.convergenceTol = tolerance
    this
  }

  /**
   * Set the gradient function (of the loss function of one single data example)
   * to be used for SGD.
   */
  def setGradient(gradient: Gradient): this.type = {
    this.gradient = gradient
    this
  }


  /**
   * Set the updater function to actually perform a gradient step in a given direction.
   * The updater is responsible to perform the update from the regularization term as well,
   * and therefore determines what kind or regularization is used, if any.
   */
  def setUpdater(updater: Updater): this.type = {
    this.updater = updater
    this
  }

  /**
   * :: DeveloperApi ::
   * Runs gradient descent on the given training data.
   * @param data training data
   * @param initialWeights initial weights
   * @return solution vector
   */
  @DeveloperApi
  def optimize(data: RDD[(Double, Vector)], initialWeights: Vector): Vector = {
    val (weights, _) = GradientDescent.runMiniBatchSGD(
      data,
      gradient,
      updater,
      stepSize,
      numIterations,
      regParam,
      miniBatchFraction,
      initialWeights,
      convergenceTol)
    weights
  }

}
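Since optimizer is exposed as a public val on LinearRegressionWithSGD, the setters above can be chained to tune the optimization. A minimal sketch, assuming an RDD[LabeledPoint] named trainingData already exists:

import org.apache.spark.mllib.regression.LinearRegressionWithSGD

val lr = new LinearRegressionWithSGD()  // defaults: stepSize 1.0, 100 iterations, full batch
lr.optimizer
  .setStepSize(0.01)
  .setNumIterations(200)
  .setMiniBatchFraction(0.5)
  .setConvergenceTol(0.0001)

val model = lr.run(trainingData)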

/**
 * :: DeveloperApi ::
 * Top-level method to run gradient descent.
 */
@DeveloperApi
object GradientDescent extends Logging {
  /**
   * Run stochastic gradient descent (SGD) in parallel using mini batches.
   * In each iteration, we sample a subset (fraction miniBatchFraction) of the total data
   * in order to compute a gradient estimate.
   * Sampling, and averaging the subgradients over this subset is performed using one standard
   * spark map-reduce in each iteration.
   *
   * @param data Input data for SGD. RDD of the set of data examples, each of
   *             the form (label, [feature values]).
   * @param gradient Gradient object (used to compute the gradient of the loss function of
   *                 one single data example)
   * @param updater Updater function to actually perform a gradient step in a given direction.
   * @param stepSize initial step size for the first step
   * @param numIterations number of iterations that SGD should be run.
   * @param regParam regularization parameter
   * @param miniBatchFraction fraction of the input data set that should be used for
   *                          one iteration of SGD. Default value 1.0.
   * @param convergenceTol Minibatch iteration will end before numIterations if the relative
   *                       difference between the current weight and the previous weight is less
   *                       than this value. In measuring convergence, L2 norm is calculated.
   *                       Default value 0.001. Must be between 0.0 and 1.0 inclusively.
   * @return A tuple containing two elements. The first element is a column matrix containing
   *         weights for every feature, and the second element is an array containing the
   *         stochastic loss computed for every iteration.
   */
  def runMiniBatchSGD(
      data: RDD[(Double, Vector)],
      gradient: Gradient,
      updater: Updater,
      stepSize: Double,
      numIterations: Int,
      regParam: Double,
      miniBatchFraction: Double,
      initialWeights: Vector,
      convergenceTol: Double): (Vector, Array[Double]) = {

    // convergenceTol should be set with non minibatch settings
    if (miniBatchFraction < 1.0 && convergenceTol > 0.0) {
      logWarning("Testing against a convergenceTol when using miniBatchFraction " +
        "< 1.0 can be unstable because of the stochasticity in sampling.")
    }
    // store the stochastic loss from each iteration in an array
    val stochasticLossHistory = new ArrayBuffer[Double](numIterations)
    // Record previous weight and current one to calculate solution vector difference
    var previousWeights: Option[Vector] = None
    var currentWeights: Option[Vector] = None
    // number of training examples
    val numExamples = data.count()

    // if no data, return initial weights to avoid NaNs
    if (numExamples == 0) {
      logWarning("GradientDescent.runMiniBatchSGD returning initial weights, no data found")
      return (initialWeights, stochasticLossHistory.toArray)
    }

    if (numExamples * miniBatchFraction < 1) {
      logWarning("The miniBatchFraction is too small")
    }

    // Initialize weights as a column vector
    var weights = Vectors.dense(initialWeights.toArray)
    val n = weights.size

    /**
     * For the first iteration, the regVal will be initialized as sum of weight squares
     * if it's L2 updater; for L1 updater, the same logic is followed.
     */
    var regVal = updater.compute(
      weights, Vectors.zeros(weights.size), 0, 1, regParam)._2

    var converged = false // indicates whether converged based on convergenceTol
    var i = 1
    while (!converged && i <= numIterations) {
      // broadcast the current weights to all executors
      val bcWeights = data.context.broadcast(weights)

      // Sample a subset (fraction miniBatchFraction) of the total data
      // compute and sum up the subgradients on this subset (this is one map-reduce)
      val (gradientSum, lossSum, miniBatchSize) = data.sample(false, miniBatchFraction, 42 + i)
        .treeAggregate((BDV.zeros[Double](n), 0.0, 0L))(
          seqOp = (c, v) => {
            // c: (grad, loss, count), v: (label, features)
            val l = gradient.compute(v._2, v._1, bcWeights.value, Vectors.fromBreeze(c._1))
            (c._1, c._2 + l, c._3 + 1)
          },
          combOp = (c1, c2) => {
            // c: (grad, loss, count)
            (c1._1 += c2._1, c1._2 + c2._2, c1._3 + c2._3)
          })

      if (miniBatchSize > 0) {
        /**
         * lossSum is computed using the weights from the previous iteration
         * and regVal is the regularization value computed in the previous iteration as well.
         */
        // record the loss, then update the weights
        stochasticLossHistory.append(lossSum / miniBatchSize + regVal)
        val update = updater.compute(
          weights, Vectors.fromBreeze(gradientSum / miniBatchSize.toDouble),
          stepSize, i, regParam)
        weights = update._1
        regVal = update._2

        previousWeights = currentWeights
        currentWeights = Some(weights)
        if (previousWeights != None && currentWeights != None) {
          converged = isConverged(previousWeights.get,
            currentWeights.get, convergenceTol)
        }
      } else {
        logWarning(s"Iteration ($i/$numIterations). The size of sampled batch is zero")
      }
      i += 1
    }

    logInfo("GradientDescent.runMiniBatchSGD finished. Last 10 stochastic losses %s".format(
      stochasticLossHistory.takeRight(10).mkString(", ")))
    // return the final weights and the array of per-iteration losses
    (weights, stochasticLossHistory.toArray)

  }

  // The private helper isConverged, which compares the L2 norm of the difference between
  // successive weight vectors against convergenceTol, is omitted from this excerpt.
}
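For completeness, here is a sketch of what the omitted isConverged helper roughly does, based on the logic described in setConvergenceTol; treat it as a reconstruction for illustration, not the exact Spark source.

import breeze.linalg.{norm, DenseVector => BDV}
import org.apache.spark.mllib.linalg.Vector

def isConverged(
    previousWeights: Vector,
    currentWeights: Vector,
    convergenceTol: Double): Boolean = {
  val previous = new BDV(previousWeights.toArray)
  val current = new BDV(currentWeights.toArray)
  // L2 norm of the change in the solution vector
  val solutionVecDiff = norm(previous - current)
  // relative tolerance if ||current|| > 1, absolute tolerance otherwise
  solutionVecDiff < convergenceTol * math.max(norm(current), 1.0)
}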