Occasionally, you may need to process a small, special-purpose language.
Essentially, you have only a few choices. One choice is to roll your own parser (and lexical analyzer). If you are not an expert, this is hard; if you are an expert, it is still time-consuming.
An alternative choice is to use a parser generator; quite a few of these generators exist. Some of the better known are Yacc and Bison for parsers written in C, and ANTLR for parsers written in Java.
You will probably also need a scanner generator such as Lex, Flex, or JFlex to go with it. However, this means learning new tools, including their sometimes obscure error messages.
This chapter presents a third alternative. Instead of using the standalone domain-specific language of a parser generator, you will use an internal domain-specific language, or internal DSL for short. The internal DSL consists of a library of parser combinators: functions and operators defined in Scala that serve as building blocks for parsers. These building blocks map one to one onto the constructions of a context-free grammar, which makes them easy to understand.
Let's start with an example. Suppose the grammar for arithmetic expressions is as follows:
expr   ::= term { "+" term | "-" term }.
term   ::= factor { "*" factor | "/" factor }.
factor ::= floatingPointNumber | "(" expr ")".

|     denotes alternative productions
{...} denotes repetition (zero or more times)
[...] denotes an optional occurrence

Since we have the grammar, we can convert it into combinators:
import scala.util.parsing.combinator._

class Arith extends JavaTokenParsers {
  def expr: Parser[Any] = term ~ rep("+" ~ term | "-" ~ term)
  def term: Parser[Any] = factor ~ rep("*" ~ factor | "/" ~ factor)
  def factor: Parser[Any] = floatingPointNumber | "(" ~ expr ~ ")"
  // Where is floatingPointNumber defined? Is it built in?
  // It is inherited from the trait JavaTokenParsers, where it reads:
  //   def floatingPointNumber: Parser[String] =
  //     """-?(\d+(\.\d*)?|\d*\.\d+)([eE][+-]?\d+)?[fFdD]?""".r
}

As you can see, there is a general rule for converting grammar productions into combinator expressions.
Now we have the grammar, and we have formalized it in combinator syntax. The next step is to run the parser.
// running your parser
import scala.util.parsing.combinator._

object ParseExpr extends Arith {
  def main(args: Array[String]) {
    println("input : " + args(0))
    println(parseAll(expr, args(0)))
  }
}

The key here is parseAll(expr, input), which applies the parser expr to the given input and requires that all of the input be consumed.
To run the main method:
ParseExpr.main(Array("2 * (3 + 7)"))
// input : 2 * (3 + 7)
// [1.12] parsed: ((2~List((*~(((~((3~List())~List((+~(7~List())))))~)))))~List())

ParseExpr.main(Array("2 * (3 + 7))"))   // deliberately ill-formed input
// [1.12] failure: string matching regex `\z' expected but `)' found
// 2 * (3 + 7))
//            ^

Calling main directly like this is a bit of a hack; you can also run the parser from the command line with scala ParseExpr "2 * (3 + 7)".
In the combinator above we used floatingPointNumber, which is inherited from Arith's supertrait, JavaTokenParsers. It is a regular-expression parser. The idea is that you can use any regular expression as a parser; the regular expression parses all the strings it can match, and its result is the parsed string.
import scala.util.parsing.combinator._

object MyParsers extends RegexParsers {
  val ident: Parser[String] = """[a-zA-Z_]\w*""".r

  // to test
  def main(args: Array[String]) {
    println("input " + args(0))
    println(parseAll(ident, args(0)))
  }
}

MyParsers.main(Array("joe"))
MyParsers.main(Array("-joe"))     // fails
MyParsers.main(Array("joe.wang")) // fails
MyParsers.main(Array("123"))      // fails
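The idea can be previewed with the plain scala.util.matching.Regex API, without the parser library: a regex parser succeeds on exactly the prefixes the regex matches, and parseAll additionally demands that the match cover the whole input. The helper names prefix and matchesAll below are hypothetical, introduced just for this sketch.

```scala
import scala.util.matching.Regex

object RegexDemo {
  val ident: Regex = """[a-zA-Z_]\w*""".r

  // one parse step: match the longest prefix of the input, if any
  def prefix(s: String): Option[String] = ident.findPrefixOf(s)

  // parseAll-like check: the matched prefix must be the entire input
  def matchesAll(s: String): Boolean = prefix(s).contains(s)
}
```

This mirrors the outcomes above: "joe" parses; "-joe" and "123" fail outright; "joe.wang" matches only the prefix "joe", so a parseAll-style check fails at the ".".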
A second example: a parser for JSON. The grammar is:

value   ::= obj | arr | stringLiteral | floatingPointNumber | "null" | "true" | "false".
obj     ::= "{" [members] "}".
arr     ::= "[" [values] "]".
members ::= member {"," member}.
member  ::= stringLiteral ":" value.
values  ::= value {"," value}.

A well-formed, valid JSON file could look like this:
{
  "address book": {
    "name": "John Smith",
    "address": {
      "street": "10 Market Street",
      "city": "San Francisco, CA",
      "zip": 94111
    },
    "phone numbers": [
      "408 338-4238",
      "408 111-6892"
    ]
  }
}

With the grammar in hand, we can transform it into combinator syntax:
import java.io.FileReader
import scala.util.parsing.combinator._

class JSON extends JavaTokenParsers {
  def value:  Parser[Any] = obj | arr | stringLiteral |
                            floatingPointNumber | "null" | "true" | "false"
  def obj:    Parser[Any] = "{" ~ repsep(member, ",") ~ "}"
  def arr:    Parser[Any] = "[" ~ repsep(value, ",") ~ "]"
  def member: Parser[Any] = stringLiteral ~ ":" ~ value
}

Its driver method is as follows:
object ParseJSON extends JSON {
  def main(args: Array[String]) {
    val reader = new FileReader(args(0))
    println(parseAll(value, reader))
  }
}

To run the driver method:
// scala ParseJSON address-book.json
ParseJSON.main(Array("address-book.json"))
// you might need to give the right path to the .json file
If you run the command, you might get results such as
[14.2] parsed: (({~List((("address book"~:)~(({~List((("name"~:)~"John Smith"), (("address"~:)~(({~List((("street"~:)~"10 Market Street"), (("city"~:)~"San Francisco, CA"), (("zip"~:)~94111)))~})), (("phone numbers"~:)~(([~List("408 338-4238", "408 111-6892"))~]))))~}))))~})
The problem with the output of this JSON program is that it carries no intuitive meaning. It is just a sequence composed of bits and pieces of the input glued together with lists and ~ combinations.
Looking at the code, each and every production returns just a Parser[Any]; the result does not reflect the domain model. A JSON object would more naturally produce a Map[String, Any], a JSON array a List[Any], and the literals "true", "false", and "null" would simply become true, false, and null.
The operator ^^ performs exactly this kind of result transformation: P ^^ f succeeds whenever P succeeds, and applies the function f to P's result.
Also, from the output, something like ~(~("{", ms), "}") is illegible. Why keep "{" or "}" in the result at all? The combinator library provides two operators, ~> and <~, which perform sequential composition but keep only the right or left result, respectively.
Now, equipped with the new operators, we can write:
import scala.util.parsing.combinator._

class JSON1 extends JavaTokenParsers {
  def obj: Parser[Map[String, Any]] =
    "{" ~> repsep(member, ",") <~ "}" ^^ (Map() ++ _)
  def arr: Parser[List[Any]] =
    "[" ~> repsep(value, ",") <~ "]"
  def member: Parser[(String, Any)] =
    stringLiteral ~ ":" ~ value ^^ { case name ~ ":" ~ value => (name, value) }
  def value: Parser[Any] = (
      obj
    | arr
    | stringLiteral
    | floatingPointNumber ^^ (_.toDouble)
    | "null"  ^^ (x => null)
    | "true"  ^^ (x => true)
    | "false" ^^ (x => false)
  )
}

import java.io.FileReader

object JSON1Test extends JSON1 {
  def main(args: Array[String]) {
    val reader = new FileReader(args(0))
    println(parseAll(value, reader))
  }
}

Now, if we run it, we get:
[14.2] parsed: Map("address book" -> Map("name" -> "John Smith", "address" -> Map("street" -> "10 Market Street", "city" -> "San Francisco, CA", "zip" -> 94111.0), "phone numbers" -> List("408 338-4238", "408 111-6892")))

Much easier to read, isn't it?
As a side note: because Scala automatically inserts semicolons at some line ends, you cannot break the alternatives of a production across lines arbitrarily. If you write

def value: Parser[Any] =
  obj | arr | stringLiteral | ...

everything is fine, but if a line ends right after an alternative, as in

obj  // semicolon implicitly inserted
| arr

the code does not compile as intended. Putting the whole expression in parentheses avoids the implicit semicolons and makes the code compile correctly.
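A minimal illustration of the parenthesization trick, using a toy function unrelated to the parser: inside parentheses, a newline is not treated as a statement terminator, so the expression may continue on the next line with a leading operator. The object and method names here are hypothetical.

```scala
object SemiDemo {
  def joined(xs: List[Int]): List[Int] = (
    xs
      ++ List(4)   // still the same expression: no semicolon inferred inside ( )
  )
}
```

Without the surrounding parentheses, the line `xs` would be a complete (and useless) statement, and `++ List(4)` would not compile.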
A summary of the parser combinators:

"..."            literal
"...".r          regular expression
P ~ Q            sequential composition
P <~ Q, P ~> Q   sequential composition; keep left/right only
P | Q            alternative
opt(P)           option
rep(P)           repetition
repsep(P, Q)     interleaved repetition
P ^^ f           result conversion
Why does the library use these symbolic operators rather than alphabetic names? Alphabetic names would take up too much visual real estate, and the symbolic operators were specially chosen so that their precedence decreases in just the right order: ~ binds tighter than ^^, which binds tighter than |.
Suppose for a moment that we had only alphabetic operators instead of the symbolic ones:

class ArithHypothetical extends JavaTokenParsers {
  def expr: Parser[Any] =
    term andThen rep(("+" andThen term) orElse ("-" andThen term))
  def term: Parser[Any] =
    factor andThen rep(("*" andThen factor) orElse ("/" andThen factor))
  def factor: Parser[Any] =
    floatingPointNumber orElse ("(" andThen expr andThen ")")
}
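The decreasing-precedence claim can be checked with a toy class whose operators merely record how an expression is grouped. The class P and its methods are hypothetical stand-ins, not the parser library; Scala determines precedence by an operator's first character, with ~ above ^ above |.

```scala
object PrecedenceDemo {
  case class P(repr: String) {
    def ~(q: P): P       = P(s"(${repr} ~ ${q.repr})")
    def ^^(f: String): P = P(s"(${repr} ^^ ${f})")
    def |(q: P): P       = P(s"(${repr} | ${q.repr})")
  }

  // same shape as a real production: P ~ Q ^^ f | R
  val grouped: P = P("a") ~ P("b") ^^ "f" | P("c")
}
```

The recorded grouping shows that without any parentheses the expression is read as ((a ~ b) ^^ f) | c, which is exactly the grouping a grammar production needs.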
The previous sections have shown that Scala's parser combinators provide a convenient means for constructing your own parsers.

The core of Scala's combinator parsing framework is contained in the trait scala.util.parsing.combinator.Parsers:
package scala.util.parsing.combinator

trait Parsers {
  ... // code goes here unless otherwise stated
}

// a parser is in essence just a function from some input type
// to a parser result
type Parser[T] = Input => ParseResult[T]
// Reader comes from scala.util.parsing.input; it is similar to a Stream,
// but it also keeps track of the positions of all the elements it reads
type Input = Reader[Elem]

As an example of Elem: in RegexParsers, Elem is fixed to Char, but it would also be possible to set Elem to some other type, such as the type of tokens returned by a separate lexer.
sealed abstract class ParseResult[+T]
case class Success[T](result: T, in: Input) extends ParseResult[T]
case class Failure(msg: String, in: Input) extends ParseResult[Nothing]
In Success, T stands for the type of result returned by a successful match, while the second parameter, in, refers to the input immediately following the match; it is used for chaining.

For Failure, extending ParseResult[Nothing] denotes that no result is returned; the second parameter, in, is used not for chaining but for positioning the error message.
abstract class Parser[+T] extends (Input => ParseResult[T]) { p =>
  // An unspecified method that defines
  // the behavior of this parser.
  def apply(in: Input): ParseResult[T]
  def ~ ...
  def | ...
  ...
}

Because parsers are (i.e., inherit from) functions, they need to define an apply method. You can see an abstract apply method in class Parser, but this is just for documentation, as the same method is in any case inherited from the parent type Input => ParseResult[T].
abstract class Parser[+T] extends (Input => ParseResult[T]) { p =>

A clause such as "id =>" immediately after the opening brace of a class template defines the identifier id as an alias for this in that class. It is as if you had written:
class Outer { outer =>
  class Inner {
    println(Outer.this eq outer) // prints: true
  }
}
The primitive parser elem accepts a single input element satisfying a predicate:

def elem(kind: String, p: Elem => Boolean) = new Parser[Elem] {
  def apply(in: Input) =
    if (p(in.first)) Success(in.first, in.rest)
    else Failure(kind + " expected", in)
}
// The results of sequential composition are built with the case class ~,
// which pairs elements of type T and U
abstract class Parser[+T] extends (Input => ParseResult[T]) { p =>
  def ~[U](q: => Parser[U]) = new Parser[T ~ U] {
    def apply(in: Input) = p(in) match {
      case Success(x, in1) =>
        q(in1) match {
          case Success(y, in2) => Success(new ~(x, y), in2)
          case failure => failure
        }
      case failure => failure
    }
  }
}

The other two sequential composition operators, <~ and ~>, could be defined just like ~, only with some small adjustment in how the result is computed. A more elegant technique, though, is to define them in terms of ~, as follows:
def <~ [U](q: => Parser[U]): Parser[T] = (p ~ q) ^^ { case x ~ y => x }
def ~> [U](q: => Parser[U]): Parser[U] = (p ~ q) ^^ { case x ~ y => y }
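The x ~ y patterns in these definitions are ordinary case-class matching: ~ is a case class of two elements, usable both as an infix type (A ~ B) and as an infix pattern (x ~ y). A standalone sketch, where this ~ is a local stand-in rather than the library's class:

```scala
// A two-parameter case class named ~ can appear infix in both
// types and patterns; infix types and patterns are left-associative.
case class ~[+A, +B](_1: A, _2: B)

object TildeDemo {
  // the nested result a parser would produce for "1" ~ "+" ~ "2"
  val res: String ~ String ~ String = new ~(new ~("1", "+"), "2")

  def show: String = res match {
    case x ~ op ~ y => s"$x $op $y"   // matches ~(~(x, op), y)
  }
}
```

This is why a result like ((1~+)~2) can be taken apart with the flat-looking pattern x ~ op ~ y.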
def | [U >: T](q: => Parser[U]) = new Parser[U] {
  def apply(in: Input) = p(in) match {
    case s1 @ Success(_, _) => s1
    case failure => q(in)
  }
}

If P and Q both fail, the failure message is determined by Q. This subtle choice is discussed later.
Note that the q parameter in methods ~ and | is passed by name: its type is preceded by =>. This means the actual parser argument will be evaluated only when q is needed, which should only be the case after p has run. This makes it possible to write recursive parsers like the following:
def parens = floatingPointNumber | "(" ~ parens ~ ")"

If | and ~ took by-value parameters, this definition would immediately cause a stack overflow without reading anything, because the value of parens occurs in the middle of its own right-hand side.
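The deferral provided by a by-name parameter can be seen in isolation with a toy example, unrelated to the parser code (the names here are all hypothetical): the argument expression runs only when it is first used, not when it is passed.

```scala
object ByNameDemo {
  val log = scala.collection.mutable.ListBuffer[String]()

  // `p` is by-name: it is evaluated each time the returned
  // function is invoked, not when byName itself is called
  def byName(p: => Int): () => Int = () => p

  def run(): (List[String], Int, List[String]) = {
    val thunk = byName { log += "evaluated"; 42 }
    val before = log.toList       // nothing has been evaluated yet
    val v = thunk()               // forces evaluation now
    (before, v, log.toList)
  }
}
```

This is the mechanism that lets a recursive parser mention itself on its own right-hand side without immediately recursing at definition time.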
def ^^ [U](f: T => U): Parser[U] = new Parser[U] {
  def apply(in: Input) = p(in) match {
    case Success(x, in1) => Success(f(x), in1)
    case failure => failure
  }
}
// end Parser
The parsers success and failure do not consume any input:
def success[T](v: T) = new Parser[T] {
  def apply(in: Input) = Success(v, in)
}
def failure(msg: String) = new Parser[Nothing] {
  def apply(in: Input) = Failure(msg, in)
}
def opt[T](p: => Parser[T]): Parser[Option[T]] = (
    p ^^ Some(_)
  | success(None)
)
def rep[T](p: => Parser[T]): Parser[List[T]] = (
    p ~ rep(p) ^^ { case x ~ xs => x :: xs }
  | success(List())
)
def repsep[T](p: => Parser[T], q: => Parser[Any]): Parser[List[T]] = (
    p ~ rep(q ~> p) ^^ { case r ~ rs => r :: rs }
  | success(List())
)
// end Parsers
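The pieces above can be assembled into a self-contained miniature of the framework. This is a sketch, not the real library code: it uses plain String input instead of Reader[Elem] and omits positions, whitespace handling, and the implicit conversions, but the definitions of ~, |, and ^^ (and lit, standing in for literal/elem) mirror the ones shown above.

```scala
object MiniParsers {

  // simplified ParseResult: the remaining input replaces Reader[Elem]
  sealed abstract class Result[+T]
  case class Success[T](result: T, rest: String) extends Result[T]
  case class Failure(msg: String, rest: String) extends Result[Nothing]

  // the pairing case class behind the ~ combinator
  case class ~[+A, +B](_1: A, _2: B)

  abstract class Parser[+T] extends (String => Result[T]) { p =>

    // sequential composition: run p, then q on the remaining input
    def ~[U](q: => Parser[U]): Parser[T ~ U] = new Parser[T ~ U] {
      def apply(in: String) = p(in) match {
        case Success(x, in1) => q(in1) match {
          case Success(y, in2) => Success(new ~(x, y), in2)
          case f: Failure      => f
        }
        case f: Failure => f
      }
    }

    // alternative: try p; on failure, try q on the same input (backtracking)
    def |[U >: T](q: => Parser[U]): Parser[U] = new Parser[U] {
      def apply(in: String) = p(in) match {
        case s @ Success(_, _) => s
        case _                 => q(in)
      }
    }

    // result conversion: transform a successful result with f
    def ^^[U](f: T => U): Parser[U] = new Parser[U] {
      def apply(in: String) = p(in) match {
        case Success(x, in1) => Success(f(x), in1)
        case fail: Failure   => fail
      }
    }
  }

  // a literal parser, standing in for the library's implicit literal(...)
  def lit(s: String): Parser[String] = new Parser[String] {
    def apply(in: String) =
      if (in.startsWith(s)) Success(s, in.substring(s.length))
      else Failure("'" + s + "' expected", in)
  }
}
```

For example, lit("1") ~ lit("+") ~ lit("2") ^^ { case x ~ _ ~ y => x.toInt + y.toInt } parses the input "1+2" into the value 3, exercising the same operator precedence as the real library.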
The trait RegexParsers is defined like this:

trait RegexParsers extends Parsers {

It fixes Elem to Char:

type Elem = Char

and it defines two implicit conversions:

implicit def literal(s: String): Parser[String] = ...
implicit def regex(r: Regex): Parser[String] = ...

Because of the implicit modifier, these conversions are applied automatically. This is why you can write string literals and regular expressions directly in your grammar: a parser such as "(" ~ expr ~ ")" is automatically expanded to literal("(") ~ expr ~ literal(")").
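How the implicit lifting works can be sketched with a hypothetical Lit class (not the real library): whenever a plain string appears where a Lit is expected, or where a method such as ~ is called on it, the compiler inserts the conversion.

```scala
import scala.language.implicitConversions

object ImplicitDemo {
  // toy parser-like value that just records what it is composed of
  case class Lit(desc: String) {
    def ~(q: Lit): Lit = Lit(desc + " ~ " + q.desc)
  }

  // plays the role of RegexParsers' implicit literal conversion
  implicit def literal(s: String): Lit = Lit("\"" + s + "\"")

  // "(" is converted to literal("("); ")" is converted as the argument of ~
  val p: Lit = "(" ~ Lit("expr") ~ ")"
}
```

The recorded description shows that the expression was silently rewritten to literal("(") ~ Lit("expr") ~ literal(")").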
RegexParsers also skips white space between symbols, using the regular expression:

protected val whiteSpace = """\s+""".r

If you do not want this behavior, you can override whiteSpace:

object MyParsers extends RegexParsers {
  override val whiteSpace = "".r
  ...
}
The task of syntax analysis is often split into two phases.

The lexer phase recognizes individual words in the input and classifies them into token classes; this phase is also called lexical analysis.

The syntactic analysis phase combines the tokens into larger structures; it is sometimes simply called parsing.

The parsers described in the previous sections can be used for either phase, because their input elements are of the abstract type Elem.
Scala's parsing combinators provide several utility classes for lexical and syntactic analysis. They are contained in two sub-packages, one for each kind of analysis:

scala.util.parsing.combinator.lexical
scala.util.parsing.combinator.syntactical
Which error should be reported when an input fails to parse? Scala's parsing library implements a simple heuristic: among all failures, the one that occurred at the latest position in the input is chosen.

Suppose we give the JSON parser the illegal input:

{ "name" : John

You will get the following message:

[1.13] failure: "false" expected but identifier John found

{ "name" : John
           ^

The message mentions "false" because it is the last alternative of the value production to fail at that position. A better error message can be engineered by adding a "catch-all" failure as the last alternative:
def value: Parser[Any] =
  obj | arr | stringLiteral | floatingPointNumber |
  "null" | "true" | "false" | failure("illegal start of value")

With this change you get the error message:

[1.13] failure: illegal start of value

{ "name" : John
           ^

How does this happen? Internally, the combinator library keeps a lastFailure variable that records the failure at the latest input position:
var lastFailure : Option[Failure] = None
The field is initialized to None and updated in the constructor of the Failure class:
case class Failure(msg: String, in: Input)
    extends ParseResult[Nothing] {
  if (!lastFailure.isDefined || lastFailure.get.in.pos <= in.pos)
    lastFailure = Some(this)
}

It is used by the phrase method, which emits the final error message if the parser failed. Here is a slightly simplified rendition of phrase in trait Parsers:

def phrase[T](p: Parser[T]) = new Parser[T] {
  lastFailure = None
  def apply(in: Input) = p(in) match {
    case s @ Success(out, in1) =>
      if (in1.atEnd) s
      else Failure("end of input expected", in1)
    case f: Failure =>
      lastFailure
  }
}

Note that lastFailure is updated as a side effect, both by the constructor of Failure and by the phrase method itself.
These combinators parse by backtracking: when one alternative fails, the input position is reset and the next alternative is tried. Mainstream compilers, by contrast, usually do not use backtracking. Why not? Backtracking imposes restrictions on the grammar:

1. The grammar must avoid left recursion. A production such as

   expr ::= expr "+" term | term

   would never make progress: expr calls itself immediately without consuming any input, so it recurses forever.

2. Backtracking is potentially costly, because the same input can be parsed several times. Consider

   expr ::= term "+" expr | term

   If the first alternative fails only after the term has been read, the parser backtracks and parses the same term a second time.
So it is common to modify the grammar so that backtracking is avoided; e.g., either of the following works:

expr ::= term ["+" expr].
expr ::= term {"+" term}.

Grammars formulated this way are so-called LL(1) grammars; a parser for such a grammar never needs to backtrack. The combinator library lets you express this expectation with the ~! operator, which works like ~ but commits to a parse: it never backtracks to alternatives that were tried earlier:

def expr: Parser[Any] = term ~! rep("+" ~! term | "-" ~! term)
def term: Parser[Any] = factor ~! rep("*" ~! factor | "/" ~! factor)
def factor: Parser[Any] = "(" ~! expr ~! ")" | floatingPointNumber
A downside of combinator parsing in Scala: it is not very efficient compared to hand-written or generated parsers.