1. Spark SQL: reading JSON data and related processing
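The driver log below comes from a small local-mode run that reads Spark's bundled examples/src/main/resources/people.json and calls show() on the resulting DataFrame (the log traces the action back to sparksql_json.scala:13). As a minimal sketch of what that program likely looks like — only read.json on that exact path and show() are confirmed by the log; the object name, main method, and builder options are assumptions:

import org.apache.spark.sql.SparkSession

// sparksql_json.scala -- minimal reconstruction; only spark.read.json(...)
// and df.show() (line 13 in the log) are confirmed, the rest is assumed.
object sparksql_json {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("sparksql_json")
      .master("local[*]")   // the log shows tasks running on "executor driver"
      .getOrCreate()

    // people.json ships with Spark 2.4.4; the log reads it at byte range 0-73,
    // which matches the standard example file:
    //   {"name":"Michael"}
    //   {"name":"Andy", "age":30}
    //   {"name":"Justin", "age":19}
    val df = spark.read.json(
      "D:/Users/shashahu/Desktop/work/spark-2.4.4/examples/src/main/resources/people.json")

    df.show()      // "show at sparksql_json.scala:13" in the log

    spark.stop()   // the original likely relied on the shutdown hook instead;
                   // the log ends with "Invoking stop() from shutdown hook"
  }
}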

19/11/12 20:32:53 INFO CodeGenerator: Code generated in 12.9619 ms
19/11/12 20:32:53 INFO CodeGenerator: Code generated in 11.6207 ms
19/11/12 20:32:53 INFO MemoryStore: Block broadcast_2 stored as values in memory (estimated size 220.0 KB, free 4.1 GB)
19/11/12 20:32:53 INFO MemoryStore: Block broadcast_2_piece0 stored as bytes in memory (estimated size 20.6 KB, free 4.1 GB)
19/11/12 20:32:53 INFO BlockManagerInfo: Added broadcast_2_piece0 in memory on dst001825.cn1.global.ctrip.com:53907 (size: 20.6 KB, free: 4.1 GB)
19/11/12 20:32:53 INFO SparkContext: Created broadcast 2 from show at sparksql_json.scala:13
19/11/12 20:32:53 INFO FileSourceScanExec: Planning scan with bin packing, max size: 4194377 bytes, open cost is considered as scanning 4194304 bytes.
19/11/12 20:32:53 INFO SparkContext: Starting job: show at sparksql_json.scala:13
19/11/12 20:32:53 INFO DAGScheduler: Got job 1 (show at sparksql_json.scala:13) with 1 output partitions
19/11/12 20:32:53 INFO DAGScheduler: Final stage: ResultStage 1 (show at sparksql_json.scala:13)
19/11/12 20:32:53 INFO DAGScheduler: Parents of final stage: List()
19/11/12 20:32:53 INFO DAGScheduler: Missing parents: List()
19/11/12 20:32:53 INFO DAGScheduler: Submitting ResultStage 1 (MapPartitionsRDD[6] at show at sparksql_json.scala:13), which has no missing parents
19/11/12 20:32:53 INFO MemoryStore: Block broadcast_3 stored as values in memory (estimated size 10.6 KB, free 4.1 GB)
19/11/12 20:32:53 INFO MemoryStore: Block broadcast_3_piece0 stored as bytes in memory (estimated size 5.8 KB, free 4.1 GB)
19/11/12 20:32:53 INFO BlockManagerInfo: Added broadcast_3_piece0 in memory on dst001825.cn1.global.ctrip.com:53907 (size: 5.8 KB, free: 4.1 GB)
19/11/12 20:32:53 INFO SparkContext: Created broadcast 3 from broadcast at DAGScheduler.scala:1161
19/11/12 20:32:53 INFO DAGScheduler: Submitting 1 missing tasks from ResultStage 1 (MapPartitionsRDD[6] at show at sparksql_json.scala:13) (first 15 tasks are for partitions Vector(0))
19/11/12 20:32:53 INFO TaskSchedulerImpl: Adding task set 1.0 with 1 tasks
19/11/12 20:32:53 INFO TaskSetManager: Starting task 0.0 in stage 1.0 (TID 1, localhost, executor driver, partition 0, PROCESS_LOCAL, 8306 bytes)
19/11/12 20:32:53 INFO Executor: Running task 0.0 in stage 1.0 (TID 1)
19/11/12 20:32:53 INFO FileScanRDD: Reading File path: file:///D:/Users/shashahu/Desktop/work/spark-2.4.4/examples/src/main/resources/people.json, range: 0-73, partition values: [empty row]
19/11/12 20:32:53 INFO CodeGenerator: Code generated in 7.2391 ms
19/11/12 20:32:53 INFO Executor: Finished task 0.0 in stage 1.0 (TID 1). 1219 bytes result sent to driver
19/11/12 20:32:53 INFO TaskSetManager: Finished task 0.0 in stage 1.0 (TID 1) in 32 ms on localhost (executor driver) (1/1)
19/11/12 20:32:53 INFO TaskSchedulerImpl: Removed TaskSet 1.0, whose tasks have all completed, from pool 
19/11/12 20:32:53 INFO DAGScheduler: ResultStage 1 (show at sparksql_json.scala:13) finished in 0.041 s
19/11/12 20:32:53 INFO DAGScheduler: Job 1 finished: show at sparksql_json.scala:13, took 0.043552 s
+----+-------+
| age|   name|
+----+-------+
|null|Michael|
|  30|   Andy|
|  19| Justin|
+----+-------+

19/11/12 20:32:53 INFO SparkContext: Invoking stop() from shutdown hook

Spark SQL reads the JSON data and performs the related processing:
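A hedged sketch of the kind of follow-up processing this refers to, continuing from the df in the sketch above. The temp-view name and the queries are illustrative assumptions, not taken from the original run:

// Register the DataFrame as a SQL view and query it with Spark SQL;
// the view name "people" and the query below are illustrative assumptions.
df.createOrReplaceTempView("people")

val adults = spark.sql("SELECT name, age FROM people WHERE age >= 20")
adults.show()

// The same filtering works through the DataFrame API without SQL text,
// e.g. dropping the record whose age parsed as null (Michael above):
df.filter(df("age").isNotNull).select("name", "age").show()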

 
