昆山人在上海

使用mysql数据库作为Hive的元数据库

在hive/conf文件夹下找到hive-default.xml.template，复制该文件并改名为hive-site.xml。

修改一下内容：

hive.metastore.local
true

javax.jdo.option.ConnectionURL
jdbc:mysql://master:3306/metastore
JDBC connect string for a JDBC metastore

javax.jdo.option.ConnectionDriverName
com.mysql.jdbc.Driver
Driver class name for a JDBC metastore

javax.jdo.option.ConnectionUserName
hadoop
username to use against metastore database

javax.jdo.option.ConnectionPassword
hadoop
password to use against metastore database

这里把整个文件例举出来：











  


  mapred.reduce.tasks
  -1
    The default number of reduce tasks per job.  Typically set
  to a prime close to the number of available hosts.  Ignored when
  mapred.job.tracker is "local". Hadoop set this to 1 by default, whereas hive uses -1 as its default value.
  By setting this property to -1, Hive will automatically figure out what should be the number of reducers.
  



  hive.exec.reducers.bytes.per.reducer
  1000000000
  size per reducer.The default is 1G, i.e if the input size is 10G, it will use 10 reducers.



  hive.exec.reducers.max
  999
  max number of reducers will be used. If the one
	specified in the configuration parameter mapred.reduce.tasks is
	negative, hive will use this one as the max number of reducers when
	automatically determine number of reducers.



  hive.cli.print.header
  false
  Whether to print the names of the columns in query output.



  hive.cli.print.current.db
  false
  Whether to include the current database in the hive prompt.



  hive.exec.scratchdir
  /tmp/hive-${user.name}
  Scratch space for Hive jobs



  hive.test.mode
  false
  whether hive is running in test mode. If yes, it turns on sampling and prefixes the output tablename



  hive.test.mode.prefix
  test_
  if hive is running in test mode, prefixes the output table by this string










  hive.test.mode.samplefreq
  32
  if hive is running in test mode and table is not bucketed, sampling frequency



  hive.test.mode.nosamplelist
  
  if hive is running in test mode, dont sample the above comma seperated list of tables



  hive.metastore.local
  true
  controls whether to connect to remove metastore server or open a new metastore server in Hive Client JVM



  javax.jdo.option.ConnectionURL
  
  jdbc:mysql://master:3306/metastore
  JDBC connect string for a JDBC metastore



  javax.jdo.option.ConnectionDriverName
  
  com.mysql.jdbc.Driver
  Driver class name for a JDBC metastore



  javax.jdo.PersistenceManagerFactoryClass
  org.datanucleus.jdo.JDOPersistenceManagerFactory
  class implementing the jdo persistence



  javax.jdo.option.DetachAllOnCommit
  true
  detaches all objects from session so that they can be used after transaction is committed



  javax.jdo.option.NonTransactionalRead
  true
  reads outside of transactions



  javax.jdo.option.ConnectionUserName
  
  root
  username to use against metastore database



  javax.jdo.option.ConnectionPassword
  
  root
  password to use against metastore database



  javax.jdo.option.Multithreaded
  true
  Set this to true if multiple threads access metastore through JDO concurrently.



  datanucleus.connectionPoolingType
  DBCP
  Uses a DBCP connection pool for JDBC metastore



  datanucleus.validateTables
  false
  validates existing schema against code. turn this on if you want to verify existing schema 



  datanucleus.validateColumns
  false
  validates existing schema against code. turn this on if you want to verify existing schema 



  datanucleus.validateConstraints
  false
  validates existing schema against code. turn this on if you want to verify existing schema 



  datanucleus.storeManagerType
  rdbms
  metadata store type



  datanucleus.autoCreateSchema
  true
  creates necessary schema on a startup if one doesn't exist. set this to false, after creating it once



  datanucleus.autoStartMechanismMode
  checked
  throw exception if metadata tables are incorrect



  datanucleus.transactionIsolation
  read-committed
  Default transaction isolation level for identity generation. 



  datanucleus.cache.level2
  false
  Use a level 2 cache. Turn this off if metadata is changed independently of hive metastore server



  datanucleus.cache.level2.type
  SOFT
  SOFT=soft reference based cache, WEAK=weak reference based cache.



  datanucleus.identifierFactory
  datanucleus
  Name of the identifier factory to use when generating table/column names etc. 'datanucleus' is used for backward compatibility



  datanucleus.plugin.pluginRegistryBundleCheck
  LOG
  Defines what happens when plugin bundles are found and are duplicated [EXCEPTION|LOG|NONE]



  hive.metastore.warehouse.dir
  /user/hive/warehouse
  location of default database for the warehouse



  hive.metastore.event.listeners
  
  list of comma seperated listeners for metastore events.



  hive.metastore.end.function.listeners
  
  list of comma separated listeners for the end of metastore functions.



  hive.metastore.event.expiry.duration
  0
  Duration after which events expire from events table (in seconds)



  hive.metastore.event.clean.freq
  0
  Frequency at which timer task runs to purge expired events in metastore(in seconds).



  hive.metastore.connect.retries
  5
  Number of retries while opening a connection to metastore



  hive.metastore.client.connect.retry.delay
  1
  Number of seconds for the client to wait between consecutive connection attempts



  hive.metastore.client.socket.timeout
  20
  MetaStore Client socket timeout in seconds



  hive.metastore.rawstore.impl
  org.apache.hadoop.hive.metastore.ObjectStore
  Name of the class that implements org.apache.hadoop.hive.metastore.rawstore interface. This class is used to store and retrieval of raw metadata objects such as table, database



  hive.metastore.batch.retrieve.max
  300
  Maximum number of objects (tables/partitions) can be retrieved from metastore in one batch. The higher the number, the less the number of round trips is needed to the Hive metastore server, but it may also cause higher memory requirement at the client side.



  hive.default.fileformat
  TextFile
  Default file format for CREATE TABLE statement. Options are TextFile and SequenceFile. Users can explicitly say CREATE TABLE ... STORED AS  to override



  hive.fileformat.check
  true
  Whether to check file format or not when loading data files



  hive.map.aggr
  true
  Whether to use map-side aggregation in Hive Group By queries



  hive.groupby.skewindata
  false
  Whether there is skew in data to optimize group by queries



  hive.groupby.mapaggr.checkinterval
  100000
  Number of rows after which size of the grouping keys/aggregation classes is performed



  hive.mapred.local.mem
  0
  For local mode, memory of the mappers/reducers



  hive.mapjoin.followby.map.aggr.hash.percentmemory
  0.3
  Portion of total memory to be used by map-side grup aggregation hash table, when this group by is followed by map join



  hive.map.aggr.hash.force.flush.memory.threshold
  0.9
  The max memory to be used by map-side grup aggregation hash table, if the memory usage is higher than this number, force to flush data



  hive.map.aggr.hash.percentmemory
  0.5
  Portion of total memory to be used by map-side grup aggregation hash table



  hive.map.aggr.hash.min.reduction
  0.5
  Hash aggregation will be turned off if the ratio between hash
  table size and input rows is bigger than this number. Set to 1 to make sure
  hash aggregation is never turned off.



  hive.optimize.cp
  true
  Whether to enable column pruner



  hive.optimize.index.filter
  false
  Whether to enable automatic use of indexes



  hive.optimize.index.groupby
  false
  Whether to enable optimization of group-by queries using Aggregate indexes.



  hive.optimize.ppd
  true
  Whether to enable predicate pushdown



  hive.optimize.ppd.storage
  true
  Whether to push predicates down into storage handlers.  Ignored when hive.optimize.ppd is false.



  hive.ppd.recognizetransivity
  true
  Whether to transitively replicate predicate filters over equijoin conditions.



  hive.optimize.groupby
  true
  Whether to enable the bucketed group by from bucketed partitions/tables.



  hive.multigroupby.singlemr
  false
  Whether to optimize multi group by query to generate single M/R
  job plan. If the multi group by query has common group by keys, it will be
  optimized to generate single M/R job.


  hive.join.emit.interval
  1000
  How many rows in the right-most join operand Hive should buffer before emitting the join result. 



  hive.join.cache.size
  25000
  How many rows in the joining tables (except the streaming table) should be cached in memory. 



  hive.mapjoin.bucket.cache.size
  100
  How many values in each keys in the map-joined table should be cached in memory. 



  hive.mapjoin.cache.numrows
  25000
  How many rows should be cached by jdbm for map join. 



  hive.optimize.skewjoin
  false
  Whether to enable skew join optimization. 



  hive.skewjoin.key
  100000
  Determine if we get a skew key in join. If we see more
	than the specified number of rows with the same key in join operator,
	we think the key as a skew join key. 



  hive.skewjoin.mapjoin.map.tasks
  10000
   Determine the number of map task used in the follow up map join job
	for a skew join. It should be used together with hive.skewjoin.mapjoin.min.split
	to perform a fine grained control.



  hive.skewjoin.mapjoin.min.split
  33554432
   Determine the number of map task at most used in the follow up map join job
	for a skew join by specifying the minimum split size. It should be used together with
	hive.skewjoin.mapjoin.map.tasks to perform a fine grained control.



  hive.mapred.mode
  nonstrict
  The mode in which the hive operations are being performed. In strict mode, some risky queries are not allowed to run



  hive.exec.script.maxerrsize
  100000
  Maximum number of bytes a script is allowed to emit to standard error (per map-reduce task). This prevents runaway scripts from filling logs partitions to capacity 



  hive.exec.script.allow.partial.consumption
  false
   When enabled, this option allows a user script to exit successfully without consuming all the data from the standard input.
  



  hive.script.operator.id.env.var
  HIVE_SCRIPT_OPERATOR_ID
   Name of the environment variable that holds the unique script operator ID in the user's transform function (the custom mapper/reducer that the user has specified in the query)
  



  hive.exec.compress.output
  false
   This controls whether the final outputs of a query (to a local/hdfs file or a hive table) is compressed. The compression codec and other options are determined from hadoop config variables mapred.output.compress* 



  hive.exec.compress.intermediate
  false
   This controls whether intermediate files produced by hive between multiple map-reduce jobs are compressed. The compression codec and other options are determined from hadoop config variables mapred.output.compress* 



  hive.exec.parallel
  false
  Whether to execute jobs in parallel



  hive.exec.parallel.thread.number
  8
  How many jobs at most can be executed in parallel



  hive.exec.rowoffset
  false
  Whether to provide the row offset virtual column



  hive.task.progress
  false
  Whether Hive should periodically update task progress counters during execution.  Enabling this allows task progress to be monitored more closely in the job tracker, but may impose a performance penalty.  This flag is automatically set to true for jobs with hive.exec.dynamic.partition set to true.



  hive.hwi.war.file
  lib/hive-hwi-0.8.0.war
  This sets the path to the HWI war file, relative to ${HIVE_HOME}. 



  hive.hwi.listen.host
  0.0.0.0
  This is the host address the Hive Web Interface will listen on



  hive.hwi.listen.port
  9999
  This is the port the Hive Web Interface will listen on



  hive.exec.pre.hooks
  
  Comma-separated list of pre-execution hooks to be invoked for each statement.  A pre-execution hook is specified as the name of a Java class which implements the org.apache.hadoop.hive.ql.hooks.ExecuteWithHookContext interface.



  hive.exec.post.hooks
  
  Comma-separated list of post-execution hooks to be invoked for each statement.  A post-execution hook is specified as the name of a Java class which implements the org.apache.hadoop.hive.ql.hooks.ExecuteWithHookContext interface.



  hive.exec.failure.hooks
  
  Comma-separated list of on-failure hooks to be invoked for each statement.  An on-failure hook is specified as the name of Java class which implements the org.apache.hadoop.hive.ql.hooks.ExecuteWithHookContext interface.



  hive.client.stats.publishers
  
  Comma-separated list of statistics publishers to be invoked on counters on each job.  A client stats publisher is specified as the name of a Java class which implements the org.apache.hadoop.hive.ql.stats.ClientStatsPublisher interface.



  hive.client.stats.counters
  
  Subset of counters that should be of interest for hive.client.stats.publishers (when one wants to limit their publishing). Non-display names should be used



  hive.merge.mapfiles
  true
  Merge small files at the end of a map-only job



  hive.merge.mapredfiles
  false
  Merge small files at the end of a map-reduce job



  hive.mergejob.maponly
  true
  Try to generate a map-only job for merging files if CombineHiveInputFormat is supported.



  hive.heartbeat.interval
  1000
  Send a heartbeat after this interval - used by mapjoin and filter operators



  hive.merge.size.per.task
  256000000
  Size of merged files at the end of the job



  hive.merge.smallfiles.avgsize
  16000000
  When the average output file size of a job is less than this number, Hive will start an additional map-reduce job to merge the output files into bigger files.  This is only done for map-only jobs if hive.merge.mapfiles is true, and for map-reduce jobs if hive.merge.mapredfiles is true.



  hive.mapjoin.smalltable.filesize
  25000000
  The threshold for the input file size of the small tables; if the file size is smaller than this threshold, it will try to convert the common join into map join



  hive.mapjoin.localtask.max.memory.usage
  0.90
  This number means how much memory the local task can take to hold the key/value into in-memory hash table; If the local task's memory usage is more than this number, the local task will be abort by themself. It means the data of small table is too large to be hold in the memory.



  hive.mapjoin.followby.gby.localtask.max.memory.usage
  0.55
  This number means how much memory the local task can take to hold the key/value into in-memory hash table when this map join followed by a group by; If the local task's memory usage is more than this number, the local task will be abort by themself. It means the data of small table is too large to be hold in the memory.



  hive.mapjoin.check.memory.rows
  100000
  The number means after how many rows processed it needs to check the memory usage



  hive.auto.convert.join
  false
  Whether Hive enable the optimization about converting common join into mapjoin based on the input file size




  hive.script.auto.progress
  false
  Whether Hive Tranform/Map/Reduce Clause should automatically send progress information to TaskTracker to avoid the task getting killed because of inactivity.  Hive sends progress information when the script is outputting to stderr.  This option removes the need of periodically producing stderr messages, but users should be cautious because this may prevent infinite loops in the scripts to be killed by TaskTracker.  



  hive.script.serde
  org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
  The default serde for trasmitting input data to and reading output data from the user scripts. 



  hive.script.recordreader
  org.apache.hadoop.hive.ql.exec.TextRecordReader
  The default record reader for reading data from the user scripts. 



  hive.script.recordwriter
  org.apache.hadoop.hive.ql.exec.TextRecordWriter
  The default record writer for writing data to the user scripts. 



  hive.input.format
  org.apache.hadoop.hive.ql.io.CombineHiveInputFormat
  The default input format. Set this to HiveInputFormat if you encounter problems with CombineHiveInputFormat.



  hive.udtf.auto.progress
  false
  Whether Hive should automatically send progress information to TaskTracker when using UDTF's to prevent the task getting killed because of inactivity.  Users should be cautious because this may prevent TaskTracker from killing tasks with infinte loops.  



  hive.mapred.reduce.tasks.speculative.execution
  true
  Whether speculative execution for reducers should be turned on. 



  hive.exec.counters.pull.interval
  1000
  The interval with which to poll the JobTracker for the counters the running job. The smaller it is the more load there will be on the jobtracker, the higher it is the less granular the caught will be.



  hive.enforce.bucketing
  false
  Whether bucketing is enforced. If true, while inserting into the table, bucketing is enforced. 



  hive.enforce.sorting
  false
  Whether sorting is enforced. If true, while inserting into the table, sorting is enforced. 



  hive.metastore.ds.connection.url.hook
  
  Name of the hook to use for retriving the JDO connection URL. If empty, the value in javax.jdo.option.ConnectionURL is used 



  hive.metastore.ds.retry.attempts
  1
  The number of times to retry a metastore call if there were a connection error



   hive.metastore.ds.retry.interval
   1000
   The number of miliseconds between metastore retry attempts



  hive.metastore.server.min.threads
  200
  Minimum number of worker threads in the Thrift server's pool.



  hive.metastore.server.max.threads
  100000
  Maximum number of worker threads in the Thrift server's pool.



  hive.metastore.server.tcp.keepalive
  true
  Whether to enable TCP keepalive for the metastore server. Keepalive will prevent accumulation of half-open connections.



  hive.metastore.sasl.enabled
  false
  If true, the metastore thrift interface will be secured with SASL. Clients must authenticate with Kerberos.



  hive.metastore.kerberos.keytab.file
  
  The path to the Kerberos Keytab file containing the metastore thrift server's service principal.



  hive.metastore.kerberos.principal
  hive-metastore/[email protected]
  The service principal for the metastore thrift server. The special string _HOST will be replaced automatically with the correct host name.



  hive.metastore.cache.pinobjtypes
  Table,StorageDescriptor,SerDeInfo,Partition,Database,Type,FieldSchema,Order
  List of comma separated metastore object types that should be pinned in the cache



  hive.optimize.reducededuplication
  true
  Remove extra map-reduce jobs if the data is already clustered by the same key which needs to be used again. This should always be set to true. Since it is a new feature, it has been made configurable.



  hive.exec.dynamic.partition
  false
  Whether or not to allow dynamic partitions in DML/DDL.



  hive.exec.dynamic.partition.mode
  strict
  In strict mode, the user must specify at least one static partition in case the user accidentally overwrites all partitions.



  hive.exec.max.dynamic.partitions
  1000
  Maximum number of dynamic partitions allowed to be created in total.



  hive.exec.max.dynamic.partitions.pernode
  100
  Maximum number of dynamic partitions allowed to be created in each mapper/reducer node.



  hive.exec.max.created.files
  100000
  Maximum number of HDFS files created by all mappers/reducers in a MapReduce job.



  hive.exec.default.partition.name
  __HIVE_DEFAULT_PARTITION__
  The default partition name in case the dynamic partition column value is null/empty string or anyother values that cannot be escaped. This value must not contain any special character used in HDFS URI (e.g., ':', '%', '/' etc). The user has to be aware that the dynamic partition value should not contain this value to avoid confusions.



  hive.stats.dbclass
  jdbc:derby
  The default database that stores temporary hive statistics.



  hive.stats.autogather
  true
  A flag to gather statistics automatically during the INSERT OVERWRITE command.



  hive.stats.jdbcdriver
  org.apache.derby.jdbc.EmbeddedDriver
  The JDBC driver for the database that stores temporary hive statistics.



  hive.stats.dbconnectionstring
  jdbc:derby:;databaseName=TempStatsStore;create=true
  The default connection string for the database that stores temporary hive statistics.



  hive.stats.default.publisher
  
  The Java class (implementing the StatsPublisher interface) that is used by default if hive.stats.dbclass is not JDBC or HBase.



  hive.stats.default.aggregator
  
  The Java class (implementing the StatsAggregator interface) that is used by default if hive.stats.dbclass is not JDBC or HBase.



  hive.stats.jdbc.timeout
  30
  Timeout value (number of seconds) used by JDBC connection and statements.



  hive.stats.retries.max
  0
  Maximum number of retries when stats publisher/aggregator got an exception updating intermediate database. Default is no tries on failures.



  hive.stats.retries.wait
  3000
  The base waiting window (in milliseconds) before the next retry. The actual wait time is calculated by baseWindow * failues + baseWindow * (failure + 1) * (random number between [0.0,1.0]).



  hive.support.concurrency
  false
  Whether hive supports concurrency or not. A zookeeper instance must be up and running for the default hive lock manager to support read-write locks.



  hive.lock.numretries
  100
  The number of times you want to try to get all the locks



  hive.unlock.numretries
  10
  The number of times you want to retry to do one unlock



  hive.lock.sleep.between.retries
  60
  The sleep time (in seconds) between various retries



  hive.zookeeper.quorum
  
  The list of zookeeper servers to talk to. This is only needed for read/write locks.



  hive.zookeeper.client.port
  2181
  The port of zookeeper servers to talk to. This is only needed for read/write locks.



  hive.zookeeper.session.timeout
  600000
  Zookeeper client's session timeout. The client is disconnected, and as a result, all locks released, if a heartbeat is not sent in the timeout.



  hive.zookeeper.namespace
  hive_zookeeper_namespace
  The parent node under which all zookeeper nodes are created.



  hive.zookeeper.clean.extra.nodes
  false
  Clean extra nodes at the end of the session.



  fs.har.impl
  org.apache.hadoop.hive.shims.HiveHarFileSystem
  The implementation for accessing Hadoop Archives. Note that this won't be applicable to Hadoop vers less than 0.20



  hive.archive.enabled
  false
  Whether archiving operations are permitted



  hive.archive.har.parentdir.settable
  false
  In new Hadoop versions, the parent directory must be set while
  creating a HAR. Because this functionality is hard to detect with just version
  numbers, this conf var needs to be set manually.



  hive.fetch.output.serde
  org.apache.hadoop.hive.serde2.DelimitedJSONSerDe
  The serde used by FetchTask to serialize the fetch output.



  hive.exec.mode.local.auto
  false
   Let hive determine whether to run in local mode automatically 



  hive.exec.drop.ignorenonexistent
  true
  
    Do not report an error if DROP TABLE/VIEW specifies a non-existent table/view
  



  hive.exec.show.job.failure.debug.info
  true
  
  	If a job fails, whether to provide a link in the CLI to the task with the
  	most failures, along with debugging hints if applicable.
  



  hive.auto.progress.timeout
  0
  
    How long to run autoprogressor for the script/UDTF operators (in seconds).
    Set to 0 for forever.
  





  hive.hbase.wal.enabled
  true
  Whether writes to HBase should be forced to the write-ahead log.  Disabling this improves HBase write performance at the risk of lost writes in case of a crash.



  hive.table.parameters.default
  
  Default property values for newly created tables



  hive.variable.substitute
  true
  This enables substitution using syntax like ${var} ${system:var} and ${env:var}.




  hive.security.authorization.enabled
  false
  enable or disable the hive client authorization



  hive.security.authorization.manager
  org.apache.hadoop.hive.ql.security.authorization.DefaultHiveAuthorizationProvider
  the hive client authorization manager class name.
  The user defined authorization class should implement interface org.apache.hadoop.hive.ql.security.authorization.HiveAuthorizationProvider. 
  



  hive.security.authenticator.manager
  org.apache.hadoop.hive.ql.security.HadoopDefaultAuthenticator
  hive client authenticator manager class name. 
  The user defined authenticator should implement interface org.apache.hadoop.hive.ql.security.HiveAuthenticationProvider.



  hive.security.authorization.createtable.user.grants
  
  the privileges automatically granted to some users whenever a table gets created. 
   An example like "userX,userY:select;userZ:create" will grant select privilege to userX and userY, 
   and grant create privilege to userZ whenever a new table created.



  hive.security.authorization.createtable.group.grants
  
  the privileges automatically granted to some groups whenever a table gets created. 
   An example like "groupX,groupY:select;groupZ:create" will grant select privilege to groupX and groupY, 
   and grant create privilege to groupZ whenever a new table created.



  hive.security.authorization.createtable.role.grants
  
  the privileges automatically granted to some roles whenever a table gets created. 
   An example like "roleX,roleY:select;roleZ:create" will grant select privilege to roleX and roleY, 
   and grant create privilege to roleZ whenever a new table created.



  hive.security.authorization.createtable.owner.grants
  
  the privileges automatically granted to the owner whenever a table gets created. 
   An example like "select,drop" will grant select and drop privilege to the owner of the table



  hive.metastore.authorization.storage.checks
  false
  Should the metastore do authorization checks against the underlying storage
  for operations like drop-partition (disallow the drop-partition if the user in 
  question doesn't have permissions to delete the corresponding directory
  on the storage).



  hive.error.on.empty.partition
  false
  Whether to throw an excpetion if dynamic partition insert generates empty results.



  hive.index.compact.file.ignore.hdfs
  false
  True the hdfs location stored in the index file will be igbored at runtime. 
  If the data got moved or the name of the cluster got changed, the index data should still be usable.



  hive.optimize.index.filter.compact.minsize
  5368709120
  Minimum size (in bytes) of the inputs on which a compact index is automatically used.



  hive.optimize.index.filter.compact.maxsize
  -1
  Maximum size (in bytes) of the inputs on which a compact index is automatically used.
  A negative number is equivalent to infinity.



  hive.index.compact.query.max.size
  10737418240
  The maximum number of bytes that a query using the compact index can read. Negative value is equivalent to infinity.



  hive.index.compact.query.max.entries
  10000000
  The maximum number of index entries to read during a query that uses the compact index. Negative value is equivalent to infinity.



  hive.index.compact.binary.search
  true
  Whether or not to use a binary search to find the entries in an index table that match the filter, where possible



  hive.exim.uri.scheme.whitelist
  hdfs,pfile
  A comma separated list of acceptable URI schemes for import and export.



  hive.lock.mapred.only.operation
  false
  This param is to control whether or not only do lock on queries 
  that need to execute at least one mapred job.



  hive.limit.row.max.size
  100000
  When trying a smaller subset of data for simple LIMIT, how much size we need to guarantee
   each row to have at least.



  hive.limit.optimize.limit.file
  10
  When trying a smaller subset of data for simple LIMIT, maximum number of files we can
   sample.



  hive.limit.optimize.enable
  false
  Whether to enable to optimization to trying a smaller subset of data for simple LIMIT first.



  hive.limit.optimize.fetch.max
  50000
  Maximum number of rows allowed for a smaller subset of data for simple LIMIT, if it is a fetch query.
   Insert queries are not restricted by this limit.



  hive.rework.mapredwork
  false
  should rework the mapred work or not. 
  This is first introduced by SymlinkTextInputFormat to replace symlink files with real paths at compile time.



  hive.exec.concatenate.check.index
  true
  If this sets to true, hive will throw error when doing
   'alter table tbl_name [partSpec] concatenate' on a table/partition 
    that has indexes on it. The reason the user want to set this to true 
    is because it can help user to avoid handling all index drop, recreation, 
    rebuild work. This is very helpful for tables with thousands of partitions.



  hive.sample.seednumber
  0
  A number used to percentage sampling. By changing this number, user will change the subsets
   of data sampled.



	hive.io.exception.handlers
	
	A list of io exception handler class names. This is used
		to construct a list exception handlers to handle exceptions thrown 
		by record readers



  hive.autogen.columnalias.prefix.label
  _c
  String used as a prefix when auto generating column alias. 
  By default the prefix label will be appended with a column position number to form the column alias. Auto generation would happen if an aggregate function is used in a select clause without an explicit alias.



  hive.autogen.columnalias.prefix.includefuncname
  false
  Whether to include function name in the column alias auto generated by hive.



  hive.exec.perf.logger
  org.apache.hadoop.hive.ql.log.PerfLogger
  The class responsible logging client side performance metrics.  Must be a subclass of org.apache.hadoop.hive.ql.log.PerfLogger



  hive.start.cleanup.scratchdir
  false
  To cleanup the hive scratchdir while starting the hive server



  hive.output.file.extension
  
  String used as a file extension for output files. If not set, defaults to the codec extension for text files (e.g. ".gz"), or no extension otherwise.



  hive.insert.into.multilevel.dirs
  false
  Where to insert into multilevel directories like 
  "insert directory '/HIVEFT25686/chinna/' from table"

下载mysql-connector-java-5.1.15-bin.jar，保存到hive\lib文件目录下。

在mysql数据库中新建metastore数据库。

运行Hive:

[root@master bin]# ./hive
WARNING: org.apache.hadoop.metrics.jvm.EventCounter is deprecated. Please use org.apache.hadoop.log.metrics.EventCounter in all the log4j.properties files.
Logging initialized using configuration in jar:file:/usr/local/hive/lib/hive-common-0.8.0.jar!/hive-log4j.properties
Hive history file=/tmp/root/hive_job_log_root_201201190021_1677611134.txt
hive> show tables;
OK

Time taken: 7.965 seconds

查看mysql,时候已经生成数据表。

Redis集群的高可用架构及维护 AI天才研究院 Python实战自然语言处理人工智能语言模型编程实践开发语言架构设计
作者：禅与计算机程序设计艺术1.简介2019年，随着云计算、微服务架构和容器技术的流行，NoSQL数据库和缓存技术越来越受到企业应用需求的关注。Redis集群作为一款开源内存键值存储数据库，在高性能、易用性等方面都给予了开发者更高的满意度。但在实际生产环境中运行Redis集群却并不容易，如何保证Redis集群的高可用、可靠性和持久化一直是很多公司关心的问题。本文将从以下两个角度出发，分析Redis
云计算的概念与特点：开启数字化时代的新篇章 ivwdcwso 运维云计算
在当今数字化时代，云计算（CloudComputing）已经成为推动技术创新和业务转型的核心力量。无论是大型企业、中小型企业，还是个人用户，云计算都为其提供了高效、灵活和经济的解决方案。本文将深入探讨云计算的概念及其核心特点，帮助读者全面了解这一革命性技术。©ivwdcwso(ID:u012172506)一、云计算的概念云计算是一种基于互联网的计算模式，通过将计算资源（如服务器、存储、网络、数据库
【笔记总结】华为云：应用上云后的安全规划及设计通信_楠木笔记华为云安全系统架构安全架构
一、背景和问题数字化时代，随着信息技术的飞速发展，企业和各类组织纷纷将自身的应用程序迁移至云端。云计算凭借其诸多优势，如成本效益、可扩展性、灵活性以及便捷的资源共享等，已然成为了现代业务运营的重要支撑。今年，我所在企业也将IT系统全面迁移上云，究其原因是为了在激烈的市场竞争中保持敏捷性和创新性，需要快速部署新的应用并实现高效的数据处理，云平台提供的丰富资源和便捷的服务模式使其能够迅速满足这些需求。
在Ubuntu上使用Apache+MariaDB安装部署Nextcloud并修改默认存储路径戴草帽的大z ubuntu linux 经验分享 nextcloud php apache mariadb
一、前言Nextcloud是一款开源的私有云存储解决方案，允许用户轻松搭建自己的云服务。它不仅支持文件存储和共享，还提供了日历、联系人、任务管理、笔记等丰富的功能。本文将详细介绍如何在Ubuntu22.04LTS上使用Apache和MariaDB安装部署Nextcloud，并修改默认存储路径为/home/nextcloud_data。二、环境操作系统：Ubuntu22.04LTSWeb服务器：Ap
Coze，Dify，FastGPT，对比云连山 AI编程 AI编程
在当今AI技术迅速发展的背景下，AIAgent智能体成为了关键领域，Coze、Dify和FastGPT作为其中的佼佼者，各有千秋。平台介绍-FastGPT：由环界云计算公司发起，是基于大语言模型（LLM）的开源知识库问答系统。其亮点是支持Flow可视化工作流编排，在知识问答领域表现出色，拥有庞大用户群体，包括数百家企业付费客户等。网址为https://fastgpt.cn/。-Dify：苏州语灵人
MinIO xiaolin0333 #微服务 minio 对象存储服务
简介Golang语言实现兼容亚马逊S3云存储服务接口，适合存储大量非结构化数据官方文档：MinIODocker安装MinIO创建并运行容器dockerrun-d\--nameminio\-p9000:9000\--restart=always\-e"MINIO_ACCESS_KEY=minio"\-e"MINIO_SECRET_KEY=minio123"\-v/home/data:/data\-v
数据项目相关的AWS云计算架构设计 weixin_30777913 云计算数据仓库 aws spark python
电商数据平台架构高性能：使用AmazonEC2的计算优化实例处理业务逻辑和数据计算，搭配AmazonElastiCache内存缓存，加速数据读取。应用负载均衡器（ALB）在EC2实例间分发流量，实现负载均衡。高可用性：采用多可用区（Multi-AZ）部署，将EC2实例、数据库等资源分布在多个可用区。使用AmazonRDS并开启多AZ部署，实现数据库自动故障转移。利用AWSAutoScaling根据
使用 Azure Functions 开发 Serverless 应用：详解与实战孟章豪 azure serverless flask
使用AzureFunctions开发Serverless应用：详解与实战随着云计算的发展，Serverless（无服务器架构）已成为构建现代应用的重要模式。它能够让开发者专注于业务逻辑，而不需要关注底层的服务器管理、扩展等问题。AzureFunctions是微软提供的Serverless计算服务，具有高度的可扩展性和易用性。本篇博客将详细介绍如何使用AzureFunctions开发Serverle
【云原生应用与Docker】如何在Centos7安装docker及其compose？奇墨 ITQM 云原生 docker 容器
随着云计算的深入发展，越来越多的企业开始采用云原生应用来优化他们的IT架构，提升业务敏捷性和效率。云原生应用是一种针对云环境进行优化，以容器化、微服务化、动态编排等为特点的应用形态。它能帮助企业快速响应市场变化，提高应用性能，并降低运维成本。在这个过程中，Docker作为一种开源的应用容器引擎，以其快速部署、可重复性和易于管理的特点，成为部署云原生应用的重要工具。Docker是一种轻量级的虚拟化技
AI Agent：一场智能革命的开始机器人openai区块链
在当今科技日新月异的时代，AI（人工智能）技术正以前所未有的速度改变着我们的生活和工作方式。其中，AIAgent作为AI领域的一个新兴分支，正逐渐展现出其巨大的潜力和价值。本文将深入探讨AIAgent的发展现状、核心优势以及未来的发展方向，带您领略这一前沿技术的无限魅力。一、AIAgent的发展现状：技术突破与广泛应用近年来，随着大数据、云计算和机器学习等技术的飞速发展，AIAgent的技术水平得
云桌面的应用场景有哪些？云计算服务器
01什么是云桌面？云桌面，又称桌面虚拟化、云电脑，是云计算时代的一种新型应用模式。它采用虚拟化技术，将传统电脑主机的硬件资源（如CPU、内存、硬盘）在服务器端进行集中管理和虚拟化，然后通过特定的通信协议将虚拟桌面推送至用户终端，从而实现远程桌面共享和操作。总之，云桌面，只要在有网络的地方就能高效办公。02云桌面的应用场景呼叫中心对于呼叫中心来说，云桌面意味着坐席不再受限于固定工位。员工无论身处何地
云电脑室，云电脑室的作用？
在当今数字化飞速发展的时代，云电脑作为云计算技术的璀璨明珠，正逐渐走进人们的视野。它以一种全新的计算模式，将传统电脑的硬件和软件资源虚拟化后放置在云端，用户只需通过网络连接，即可在任何终端设备上访问和使用个人桌面、应用程序及数据，仿佛将一台功能强大的电脑装进了“云端口袋”。今天小编给大家介绍云电脑室的作用。云电脑室是基于云计算技术的电脑机房，通过虚拟化技术将服务器、存储和网络等资源集中起来，提供云
云起无垠入选中国信息通信研究院2024年度首期“磐安”优秀案例人工智能
近日，中国信通院举办的深度观察报告会系列论坛在北京顺利召开。在数字生态治理分论坛上，2024年度首期“磐安”优秀案例——AI+数字安全应用优秀案例遴选结果正式公布，云起无垠凭借其在生成式AI网络安全攻防对抗垂直领域扎实的研究及应用成果，成功入选该年度首期“磐安”优秀案例。当下，数字化浪潮席卷全球，信息技术广泛渗透各个产业。云计算、大数据、人工智能、物联网等前沿技术深度融合，传统制造业生产线、现代服
优化性能：高性能云计算的虚拟化技术 xidianjiapei001 性能分析云原生与微服务治理云计算高性能计算性能优化虚拟化
优化性能：高性能云计算的虚拟化技术云计算已经改变了企业获取和利用计算资源的方式。从云服务器的按需处理能力，到托管数据存储等可扩展的存储解决方案，云计算提供了无与伦比的灵活性和成本效益。然而，对于特定的应用程序，尤其是那些需要高性能计算（HPC）的应用，传统的云解决方案可能会带来一些性能开销。这时，虚拟化技术就发挥作用了，它能帮助我们针对HPC工作负载优化云环境。理解虚拟化及其对性能的影响虚拟化是云
十分钟精通MinIO：minio的原理、部署、操作周盛欢 minio java springboot spring
一、认识MinIOMinio是一个简单易用的云存储服务，就像是一个放在网络上的大文件柜。想象一下，你有一间放满了各种文件的房间，有时候你需要把这些文件分享给朋友或者在不同地方访问它们。Minio就是帮你做到这一点的工具，它让你可以轻松地把文件上传到互联网上，这样无论你在哪里，只要有网络，就能访问或分享这些文件。现在，如果你想要从这个仓库里取出一张图片或一段视频，让网站的访客能看到或者下载，Mini
人工智能和云计算带来的技术变革：人工智能实现自动化营销的方式 AI天才研究院 AI实战 AI大模型企业级应用开发实战大数据人工智能语言模型 AI LLM Java Python 架构设计 Agent RPA
1.背景介绍随着人工智能（AI）和云计算技术的不断发展，我们正面临着一场技术革命。这场革命正在改变我们的生活方式、工作方式和商业模式。在这篇文章中，我们将探讨人工智能如何实现自动化营销的方式，并深入了解其背后的核心概念、算法原理、代码实例等。1.1人工智能简介人工智能是一种计算机科学的分支，旨在让计算机具有人类智能的能力，如学习、推理、感知、语言理解等。人工智能的目标是让计算机能够理解自然语言、解
揭秘！云勒索软件：云端安全新威胁，企业数据岌岌可危知白守黑V 安全运营数据安全云安全数据安全信息安全安全云计算勒索软件网络攻击网络安全
近年来，云勒索软件成为网络安全领域最具威胁性的攻击手段之一，全球各类规模的云存储企业都深受其害。云基础设施巨大的攻击面以及存储的海量敏感数据，为网络犯罪组织提供了前所未有的“丰厚回报”，使其成为勒索软件团伙追逐的高利润目标。云服务为何成为勒索软件的首选目标随着亚马逊AWS和微软Azure等云服务提供商（CSPs）的持续扩展，网络犯罪分子正将攻击重心从传统的终端设备转向云平台。正如SentinelL
【系统设计】服务型软件的部署方式乘风而来的思绪系统设计系统架构系统设计软件软件部署 SaaS 私有化部署
文章目录部署方式的诉求SaaS软件私有化部署之私有云部署私有化部署之机房部署挑战案例部署方式的诉求在云计算的时代，以IaaS、PaaS、SaaS等为代表的XaaS风靡一时，尤其是其中作为软件服务提供商，市值3000亿美元的Salesforce给大家看到了SaaS软件的巨大价值，不少公司将其作为构建未来软件的目标。但在toB领域，尤其是涉及到数据隐私等问题时，客户的定制难以避免，因此也一直有着独立部
云计算相关 xianKOG 云计算云计算
文章目录一、虚拟化1、虚拟化技术概述特点2、虚拟化与云化3、计算虚拟化分类与作用常见的计算服务架构4、存储虚拟化5、网络虚拟化二、行业管理规章制度1、服务器管理制度访问控制变更管理备份与恢复监控与审计2、操作系统安全管理规范更新与补丁管理用户账号管理防火墙与安全软件日志管理3、虚拟化管理规定资源分配隔离策略模版与镜像三、操作系统1、操作系统安装2、操作系统调优3、操作系统管理维护4、常见服务安装与
物联网导论复习材料物腐虫生物联网学习
简答题Q1：物联网的概述，特点，模型，应用，重点是应用层，云计算，数据集成。物联网的概述物联网（IoT，InternetofThings）是指通过各种传感器、设备和网络技术，将物理世界中的物体连接到互联网，实现数据的采集、传输、处理和应用的智能化系统。物联网的特点全面感知：通过传感器实时采集数据。可靠传输：通过互联网和无线网络传输数据。智能处理：利用云计算和大数据技术对数据进行分析和处理，实现智能
Powershell语言的云计算萧澄华包罗万象 golang 开发语言后端
PowerShell与云计算：新时代的自动化管理工具在当今快速发展的信息技术时代，云计算已经成为企业和个人计算资源的主要选择。随着云服务的普及，如何高效地管理和自动化云环境中的资源，成为了IT管理员和开发者们面临的重要挑战。PowerShell作为一款强大的脚本语言和自动化框架，凭借其优秀的功能和灵活性，逐渐在云计算管理中扮演了不可或缺的角色。一、PowerShell简介1.1什么是PowerSh
AI时代，需要怎样的架构师？腾讯云架构师峰会来了！架构
引言架构设计对应用有关键性的影响，不仅决定应用的整体品质，还直接影响开发、维护和扩展的难易度。卓越的架构设计不仅能够确保系统的稳定性、高效性和可扩展性，还能大幅提升研发效能，同时显著降低维护成本。在快速变化的技术环境中，架构师们面临业务需求快速迭代、数据量急剧膨胀以及系统复杂性不断提升等挑战。随着云计算、大数据、人工智能等前沿技术的蓬勃发展，一系列创新解决方案如微服务架构、AI大模型、自动化运维工
Databend 实现高效实时查询：深入解读 Dictionary 功能数据库
作者：洪文丽开源之夏2024“支持ExternalDictionaries”项目参与者东北大学软件工程专业云计算方向大二在读，喜欢挑战自我，尝试新鲜事物背景介绍在大型系统中，数据通常存储在多个不同的数据源中，例如PostgreSQL、MySQL和Redis负责存储在线数据，而Databend和ClickHouse则用于存储分析数据。传统的分析查询方法往往需要同时使用到多种不同的数据，通常通过ETL
程序员转行做什么好：数据分析师、AI大模型工程师、产品经理和云计算工程师？雪碧没气阿人工智能产品经理云计算大模型训练 LLM AI大模型程序员
程序员转行做什么好先给结论再说理由：数据分析师、AI大模型工程师、产品经理和云计算工程师。这些领域不仅因应了当前技术发展的趋势，也为程序员提供了转型的广阔舞台和职业发展的新机遇。一起来看看吧！数据分析师：数据驱动决策的关键程序员转行时，应考虑当前市场上的热门行业和岗位需求。例如，AI大模型工程师、数据分析师、前端开发工程师、全栈开发工程师等都是当前市场上需求量较大的职位。就拿数据分析师来说，因其在
Azure 基础 SmallFatMan #Azure azure microsoft 运维 linux 服务器学习面试
Azure基础一、Azure基础知识简介二、云计算简介？三、责任共担四、你始终负责：五、云服务提供商始终负责：六、云模型1、私有云2、公有云3、混合云4、多云一、Azure基础知识简介MicrosoftAzure是一个云计算平台，提供一系列不断扩展的服务，可帮助你构建解决方案来满足业务目标。Azure服务支持从简单到复杂的一切内容。Azure具有简单的Web服务，用于在云中托管业务。Azure还支
云计算运维工程师面试道亦无名面试云计算运维
1.云计算运维工程师的角色和职责是什么？回答：云计算运维工程师负责确保云计算环境（包括硬件和软件系统）的高可用性和稳定性。他们的主要职责包括：监测系统和应用程序的性能，确保它们正常运行。故障排除，快速响应并解决系统或应用程序中出现的问题。容量规划，根据业务需求预测和规划未来的资源需求。升级和维护操作系统、应用程序及相关的基础设施。与开发团队紧密合作，确保新功能的顺利部署和现有功能的持续优化。2.请
Python自动化运维：一键掌控服务器的高效之道蒙娜丽宁 Python杂谈运维 python 自动化
《PythonOpenCV从菜鸟到高手》带你进入图像处理与计算机视觉的大门！解锁Python编程的无限可能：《奇妙的Python》带你漫游代码世界在互联网和云计算高速发展的今天，服务器数量的指数增长使得手动运维和管理变得异常繁琐。Python凭借其强大的可读性和丰富的生态系统，成为实现自动化运维的理想语言。本文以“Python自动化运维：编写自动化脚本进行服务器管理”为主题，深入探讨了如何利用Py
深入探索Go中的网络编程 AI天才研究院一天一门编程语言自然语言处理人工智能语言模型编程实践开发语言架构设计
作者：禅与计算机程序设计艺术深入探索Go中的网络编程1.引言1.1.背景介绍网络编程是计算机网络领域中的一个重要分支,涉及如何在程序中实现网络通信,使程序具有网络访问能力。随着云计算、大数据、物联网等技术的普及,网络编程的需求也越来越大。Go作为一个静态类型的编程语言,以其简洁、高效、安全等特点,成为了许多开发者首选的网络编程语言。本文将深入探索Go中网络编程的特点、原理和实现,帮助读者更好地利用
程序员创业公司的技术栈选择与性能优化 AI天才研究院 ChatGPT AI大模型企业级应用开发实战大数据AI人工智能大厂Offer收割机面试题简历程序员读书硅基计算碳基计算认知计算生物计算深度学习神经网络大数据 AIGC AGI LLM Java Python 架构设计 Agent 程序员实现财富自由
《程序员创业公司的技术栈选择与性能优化》概述本文旨在探讨程序员创业公司在选择技术栈和进行性能优化方面的策略与实践。随着技术的不断进步和市场的快速变化，技术栈的选择和优化成为创业公司成功的关键因素。正确的技术栈选择不仅能够提升系统的性能和可扩展性，还能降低开发成本和维护难度。关键词技术栈选择性能优化创业公司云计算数据库微服务人工智能区块链边缘计算摘要本文首先分析了技术栈选择的重要性以及创业公司在技术
Linux 内核中的 InfiniBand 核心模块：drivers/infiniband/core/device.c 分析 109702008 #linux系统编程网络网络 linux 人工智能
InfiniBand是一种高性能、低延迟的网络互连技术，广泛应用于高性能计算（HPC）、数据中心和云计算等领域。Linux内核中的InfiniBand子系统提供了对InfiniBand设备的支持，而drivers/infiniband/core/device.c文件则是InfiniBand核心模块的重要组成部分。本文将对device.c文件的功能、数据结构、关键函数以及驱动核心入口进行详细分析。一
[黑洞与暗粒子]没有光的世界 comsci
无论是相对论还是其它现代物理学,都显然有个缺陷,那就是必须有光才能够计算但是,我相信,在我们的世界和宇宙平面中,肯定存在没有光的世界.... 那么,在没有光的世界,光子和其它粒子的规律无法被应用和考察,那么以光速为核心的 &nbs
jQuery Lazy Load 图片延迟加载 aijuans jquery
基于 jQuery 的图片延迟加载插件，在用户滚动页面到图片之后才进行加载。对于有较多的图片的网页，使用图片延迟加载，能有效的提高页面加载速度。版本： jQuery v1.4.4+ jQuery Lazy Load v1.7.2 注意事项：需要真正实现图片延迟加载，必须将真实图片地址写在 data-original 属性中。若 src
使用Jodd的优点 Kai_Ge jodd
1. 简化和统一 controller ，抛弃 extends SimpleFormController ，统一使用 implements Controller 的方式。 2. 简化 JSP 页面的 bind, 不需要一个字段一个字段的绑定。 3. 对 bean 没有任何要求，可以使用任意的 bean 做为 formBean。使用方法简介
jpa Query转hibernate Query 120153216 Hibernate
public List<Map> getMapList(String hql, Map map) { org.hibernate.Query jpaQuery = entityManager.createQuery(hql); if (null != map) { for (String parameter : map.keySet()) { jp
Django_Python3添加MySQL/MariaDB支持 2002wmj mariaDB
现状首先，[email protected] 中默认的引擎为 django.db.backends.mysql 。但是在Python3中如果这样写的话，会发现 django.db.backends.mysql 依赖 MySQLdb[5] ，而 MySQLdb 又不兼容 Python3 于是要找一种新的方式来继续使用MySQL。 MySQL官方的方案首先据MySQL文档[3]说，自从MySQL
在SQLSERVER中查找消耗IO最多的SQL 357029540 SQL Server
返回做IO数目最多的50条语句以及它们的执行计划。 select top 50 (total_logical_reads/execution_count) as avg_logical_reads, (total_logical_writes/execution_count) as avg_logical_writes, (tot
spring UnChecked 异常官方定义！ 7454103 spring
如果你接触过spring的事物管理！那么你必须明白 spring的非捕获异常！即 unchecked 异常！因为 spring 默认这类异常事物自动回滚！！ public static boolean isCheckedException(Throwable ex) { return !(ex instanceof RuntimeExcep
mongoDB 入门指南、示例 adminjun java mongodb 操作
一、准备工作 1、下载mongoDB 下载地址：http://www.mongodb.org/downloads 选择合适你的版本相关文档：http://www.mongodb.org/display/DOCS/Tutorial 2、安装mongoDB A、不解压模式：将下载下来的mongoDB-xxx.zip打开，找到bin目录，运行mongod.exe就可以启动服务，默
CUDA 5 Release Candidate Now Available aijuans CUDA
The CUDA 5 Release Candidate is now available at http://developer.nvidia.com/<wbr></wbr>cuda/cuda-pre-production. Now applicable to a broader set of algorithms, CUDA 5 has advanced fe
Essential Studio for WinRT网格控件测评 Axiba JavaScript html5
Essential Studio for WinRT界面控件包含了商业平板应用程序开发中所需的所有控件，如市场上运行速度最快的grid 和chart、地图、RDL报表查看器、丰富的文本查看器及图表等等。同时，该控件还包含了一组独特的库，用于从WinRT应用程序中生成Excel、Word以及PDF格式的文件。此文将对其另外一个强大的控件——网格控件进行专门的测评详述。网格控件功能 1、
java 获取windows系统安装的证书或证书链 bewithme windows
有时需要获取windows系统安装的证书或证书链，比如说你要通过证书来创建java的密钥库。有关证书链的解释可以查看此处。 public static void main(String[] args) { SunMSCAPI providerMSCAPI = new SunMSCAPI(); S
NoSQL数据库之Redis数据库管理(set类型和zset类型) bijian1013 redis 数据库 NoSQL
4.sets类型 Set是集合，它是string类型的无序集合。set是通过hash table实现的，添加、删除和查找的复杂度都是O(1)。对集合我们可以取并集、交集、差集。通过这些操作我们可以实现sns中的好友推荐和blog的tag功能。 sadd：向名称为key的set中添加元
异常捕获何时用Exception，何时用Throwable bingyingao
用Exception的情况 try { //可能发生空指针、数组溢出等异常 } catch (Exception e) {
【Kafka四】Kakfa伪分布式安装 bit1129 kafka
在http://bit1129.iteye.com/blog/2174791一文中，实现了单Kafka服务器的安装，在Kafka中，每个Kafka服务器称为一个broker。本文简单介绍下，在单机环境下Kafka的伪分布式安装和测试验证 1. 安装步骤 Kafka伪分布式安装的思路跟Zookeeper的伪分布式安装思路完全一样，不过比Zookeeper稍微简单些(不
Project Euler bookjovi haskell
Project Euler是个数学问题求解网站，网站设计的很有意思，有很多problem，在未提交正确答案前不能查看problem的overview，也不能查看关于problem的discussion thread，只能看到现在problem已经被多少人解决了，人数越多往往代表问题越容易。看看problem 1吧： Add all the natural num
Java-Collections Framework学习与总结-ArrayDeque BrokenDreams Collections
表、栈和队列是三种基本的数据结构，前面总结的ArrayList和LinkedList可以作为任意一种数据结构来使用，当然由于实现方式的不同，操作的效率也会不同。这篇要看一下java.util.ArrayDeque。从命名上看
读《研磨设计模式》-代码笔记-装饰模式-Decorator bylijinnan java 设计模式
声明：本文只为方便我个人查阅和理解，详细的分析以及源代码请移步原作者的博客http://chjavach.iteye.com/ import java.io.BufferedOutputStream; import java.io.DataOutputStream; import java.io.FileOutputStream; import java.io.Fi
Maven学习(一) chenyu19891124 Maven私服
学习一门技术和工具总得花费一段时间，5月底6月初自己学习了一些工具，maven+Hudson+nexus的搭建，对于maven以前只是听说，顺便再自己的电脑上搭建了一个maven环境，但是完全不了解maven这一强大的构建工具，还有ant也是一个构建工具，但ant就没有maven那么的简单方便，其实简单点说maven是一个运用命令行就能完成构建，测试，打包，发布一系列功
[原创]JWFD工作流引擎设计----节点匹配搜索算法(用于初步解决条件异步汇聚问题) 补充 comsci 算法工作 PHP 搜索引擎嵌入式
本文主要介绍在JWFD工作流引擎设计中遇到的一个实际问题的解决方案，请参考我的博文"带条件选择的并行汇聚路由问题"中图例A2描述的情况(http://comsci.iteye.com/blog/339756),我现在把我对图例A2的一个解决方案公布出来，请大家多指点节点匹配搜索算法(用于解决标准对称流程图条件汇聚点运行控制参数的算法) 需要解决的问题：已知分支
Linux中用shell获取昨天、明天或多天前的日期 daizj linux shell 上几年昨天获取上几个月
在Linux中可以通过date命令获取昨天、明天、上个月、下个月、上一年和下一年 # 获取昨天 date -d 'yesterday' # 或 date -d 'last day' # 获取明天 date -d 'tomorrow' # 或 date -d 'next day' # 获取上个月 date -d 'last month' #
我所理解的云计算 dongwei_6688 云计算
在刚开始接触到一个概念时，人们往往都会去探寻这个概念的含义，以达到对其有一个感性的认知，在Wikipedia上关于“云计算”是这么定义的，它说： Cloud computing is a phrase used to describe a variety of computing co
YII CMenu配置 dcj3sjt126com yii
Adding id and class names to CMenu We use the id and htmlOptions to accomplish this. Watch. //in your view $this->widget('zii.widgets.CMenu', array( 'id'=>'myMenu', 'items'=>$this-&g
设计模式之静态代理与动态代理 come_for_dream 设计模式
静态代理与动态代理代理模式是java开发中用到的相对比较多的设计模式，其中的思想就是主业务和相关业务分离。所谓的代理设计就是指由一个代理主题来操作真实主题，真实主题执行具体的业务操作，而代理主题负责其他相关业务的处理。比如我们在进行删除操作的时候需要检验一下用户是否登陆，我们可以删除看成主业务，而把检验用户是否登陆看成其相关业务
【转】理解Javascript 系列 gcc2ge JavaScript
理解Javascript_13_执行模型详解摘要: 在《理解Javascript_12_执行模型浅析》一文中,我们初步的了解了执行上下文与作用域的概念，那么这一篇将深入分析执行上下文的构建过程，了解执行上下文、函数对象、作用域三者之间的关系。函数执行环境简单的代码:当调用say方法时，第一步是创建其执行环境，在创建执行环境的过程中，会按照定义的先后顺序完成一系列操作:1.首先会创建一个
Subsets II hcx2013 set
Given a collection of integers that might contain duplicates, nums, return all possible subsets. Note: Elements in a subset must be in non-descending order. The solution set must not conta
Spring4.1新特性——Spring缓存框架增强 jinnianshilongnian spring4
目录 Spring4.1新特性——综述 Spring4.1新特性——Spring核心部分及其他 Spring4.1新特性——Spring缓存框架增强 Spring4.1新特性——异步调用和事件机制的异常处理 Spring4.1新特性——数据库集成测试脚本初始化 Spring4.1新特性——Spring MVC增强 Spring4.1新特性——页面自动化测试框架Spring MVC T
shell嵌套expect执行命令 liyonghui160com
一直都想把expect的操作写到bash脚本里,这样就不用我再写两个脚本来执行了,搞了一下午终于有点小成就,给大家看看吧. 系统:centos 5.x 1.先安装expect yum -y install expect 2.脚本内容: cat auto_svn.sh #!/bin/bash
Linux实用命令整理 pda158 linux
0. 基本命令　　linux 基本命令整理　　1. 压缩解压　　tar -zcvf a.tar.gz a #把a压缩成a.tar.gz 　　tar -zxvf a.tar.gz #把a.tar.gz解压成a 　　2. vim小结　　2.1 vim替换　　:m,ns/word_1/word_2/gc
独立开发人员通向成功的29个小贴士 shoothao 独立开发
概述：本文收集了关于独立开发人员通向成功需要注意的一些东西,对于具体的每个贴士的注解有兴趣的朋友可以查看下面标注的原文地址。明白你从事独立开发的原因和目的。保持坚持制定计划的好习惯。万事开头难，第一份订单是关键。培养多元化业务技能。提供卓越的服务和品质。谨小慎微。营销是必备技能。学会组织，有条理的工作才是最有效率的。 “独立
JAVA中堆栈和内存分配原理 uule java
1、栈、堆 1.寄存器：最快的存储区, 由编译器根据需求进行分配,我们在程序中无法控制.2. 栈：存放基本类型的变量数据和对象的引用，但对象本身不存放在栈中，而是存放在堆（new 出来的对象）或者常量池中（字符串常量对象存放在常量池中。）3. 堆：存放所有new出来的对象。4. 静态域：存放静态成员（static定义的）5. 常量池：存放字符串常量和基本类型常量（public static f

使用mysql数据库作为Hive的元数据库

你可能感兴趣的:(云计算云存储,No-SQL)