Apache Spark学习

做编辑的,无时不刻得学习,各种技术,各种趋势,导致编辑这个身份蜻蜓点水式地能把自己涉及的一些方面讲个大概好像和大约,但是要真做项目或者下笔编程,就很难有人能做到。我时常在想编辑的核心竞争力究竟在哪里,离开出版社,能不能继续在这个社会上活下去呢?

我不知道编辑的最终出路在哪里,我见过很多同行的转行和职业规划,却不知道如何规划自己的:有的做得好(主要策划了许多畅销书)的编辑离开一家出版社单独成立公司或者工作室与更多出版社合作策划出书;有的则直接去了互联网公司做起了数字出版,有的甚至在网上卖起了男士内裤;有的则立足于工资微薄的出版社岗位自己另外支一摊做出版,相辅相成;有的利用图书做起了其他行业的广告传媒……

而我,则因为某一组织想写Apache Spark的系列图书,就得从零开始学习这究竟是个什么东东,先摘选一段:

Apache Spark is an open-source data analytics cluster computing framework originally developed in the AMPLab atUC Berkeley. Spark fits into the Hadoop open-source community, building on top of the Hadoop Distributed File System (HDFS). However, Spark is not tied to the two-stage MapReduce paradigm, and promises performance up to 100 times faster than Hadoop MapReduce for certain applications.Spark provides primitives for in-memory cluster computing that allows user programs to load data into a cluster's memory and query it repeatedly, making it well suited to machine learning algorithms.

Spark became an Apache Top-Level Project in February 2014,and was previously an Apache Incubator project since June 2013. It has received code contributions from large companies that use Spark, including Yahoo! and Intel as well as small companies and startups such as Conviva, Quantifind, ClearStory Data, Ooyala and many more.By March 2014, over 150 individual developers had contributed code to Spark, representing over 30 different companies. Prior to joining Apache Incubator, versions 0.7 and earlier were licensed under the BSD License.

我想,编辑最大的好处就是,不断处于学习的状态之中,尽管物质收获不多,但是能使一个人不断保持学习状态的工作并不多……

你可能感兴趣的:(Apache Spark学习)