[Hive - LanguageManual] DML: Load, Insert, Update, Delete

Hive Data Manipulation Language

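There are multiple ways to modify data in Hive: LOAD, INSERT (into Hive tables from queries, into directories from queries, and into Hive tables from SQL), UPDATE, and DELETE.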

EXPORT and IMPORT commands are also available (as of Hive 0.8).

Loading files into tables

Hive does not do any transformation while loading data into tables. Load operations are currently pure copy/move operations that move datafiles into locations corresponding to Hive tables.

Syntax

 
           LOAD DATA [LOCAL] INPATH 'filepath' [OVERWRITE] INTO TABLE tablename [PARTITION (partcol1=val1, partcol2=val2 ...)]

Synopsis

Load operations are currently pure copy/move operations that move datafiles into locations corresponding to Hive tables.

filepath can be:
- a relative path, such as project/data1
- an absolute path, such as /user/hive/project/data1
- a full URI with scheme and (optionally) an authority, such as hdfs://namenode:9000/user/hive/project/data1
The target being loaded to can be a table or a partition. If the table is partitioned, then one must specify a specific partition of the table by specifying values for all of the partitioning columns.
filepath can refer to a file (in which case Hive will move the file into the table) or it can be a directory (in which case Hive will move all the files within that directory into the table). In either case, filepath addresses a set of files.
If the keyword LOCAL is specified, then:
- the load command will look for filepath in the local file system. If a relative path is specified, it will be interpreted relative to the user's current working directory. The user can specify a full URI for local files as well - for example: file:///user/hive/project/data1
- the load command will try to copy all the files addressed by filepath to the target filesystem. The target file system is inferred by looking at the location attribute of the table. The copied data files will then be moved to the table.
If the keyword LOCAL is not specified, then Hive will either use the full URI of filepath, if one is specified, or will apply the following rules:
- If scheme or authority are not specified, Hive will use the scheme and authority from the hadoop configuration variable fs.default.name that specifies the Namenode URI.
- If the path is not absolute, then Hive will interpret it relative to /user/<username>
- Hive will move the files addressed by filepath into the table (or partition)
If the OVERWRITE keyword is used then the contents of the target table (or partition) will be deleted and replaced by the files referred to by filepath; otherwise the files referred by filepath will be added to the table.
- Note that if the target table (or partition) already has a file whose name collides with any of the filenames contained in filepath, then the existing file will be replaced with the new file.
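
The rules above can be sketched with two load statements. This is only an illustration: the table page_view, its partition columns dt and country, the partition values, and both paths are assumptions chosen for the example.

```sql
-- Copy a file from the local file system into one partition and replace its contents.
-- A relative local path is resolved against the user's current working directory.
LOAD DATA LOCAL INPATH 'project/data1'
OVERWRITE INTO TABLE page_view PARTITION (dt='2008-06-08', country='US');

-- Move files that already live in the table's filesystem (for example HDFS).
-- Without OVERWRITE the files are added to whatever the partition already holds.
LOAD DATA INPATH '/user/hive/project/data1'
INTO TABLE page_view PARTITION (dt='2008-06-09', country='US');
```

Because page_view is partitioned in this sketch, each statement gives a value for every partitioning column.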

Notes

filepath cannot contain subdirectories: it may refer to a file or to a directory, but a directory must not contain subdirectories.
If the keyword LOCAL is not given, filepath must refer to files within the same filesystem as the table's (or partition's) location.
Hive does some minimal checks to make sure that the files being loaded match the target table. Currently it checks that if the table is stored in sequencefile format, the files being loaded are also sequencefiles, and vice versa.
A bug that prevented loading a file when its name includes the "+" character is fixed in release 0.13.0 (HIVE-6048).
Please read CompressedStorage if your datafile is compressed.

Inserting data into Hive Tables from queries

Query results can be inserted into tables by using the insert clause.

Syntax

 
           Standard syntax:
           INSERT OVERWRITE TABLE tablename1 [PARTITION (partcol1=val1, partcol2=val2 ...) [IF NOT EXISTS]] select_statement1 FROM from_statement;
           INSERT INTO TABLE tablename1 [PARTITION (partcol1=val1, partcol2=val2 ...)] select_statement1 FROM from_statement;

           Hive extension (multiple inserts):
           FROM from_statement
           INSERT OVERWRITE TABLE tablename1 [PARTITION (partcol1=val1, partcol2=val2 ...) [IF NOT EXISTS]] select_statement1
           [INSERT OVERWRITE TABLE tablename2 [PARTITION ... [IF NOT EXISTS]] select_statement2]
           [INSERT INTO TABLE tablename2 [PARTITION ...] select_statement2] ...;
           FROM from_statement
           INSERT INTO TABLE tablename1 [PARTITION (partcol1=val1, partcol2=val2 ...)] select_statement1
           [INSERT INTO TABLE tablename2 [PARTITION ...] select_statement2]
           [INSERT OVERWRITE TABLE tablename2 [PARTITION ... [IF NOT EXISTS]] select_statement2] ...;

           Hive extension (dynamic partition inserts):
           INSERT OVERWRITE TABLE tablename PARTITION (partcol1[=val1], partcol2[=val2] ...) select_statement FROM from_statement;
           INSERT INTO TABLE tablename PARTITION (partcol1[=val1], partcol2[=val2] ...) select_statement FROM from_statement;

Synopsis

INSERT OVERWRITE will overwrite any existing data in the table or partition
- unless IF NOT EXISTS is provided for a partition (as of Hive 0.9.0).
INSERT INTO will append to the table or partition, keeping the existing data intact. (Note: INSERT INTO syntax is only available starting in version 0.8.)
- As of Hive 0.13.0, a table can be made immutable by creating it with TBLPROPERTIES ("immutable"="true"). The default is "immutable"="false".
  INSERT INTO an immutable table is disallowed if any data is already present, although INSERT INTO still works if the immutable table is empty. The behavior of INSERT OVERWRITE is not affected by the "immutable" table property.
  An immutable table is protected against accidental updates caused by a data-loading script being run multiple times by mistake. The first insert into an immutable table succeeds and successive inserts fail, resulting in only one set of data in the table, instead of silently succeeding with multiple copies of the data in the table.
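
A minimal sketch of the immutable behavior described above. The tables daily_snapshot and staging_snapshot and their columns are hypothetical, and the sketch assumes the first insert writes at least one row.

```sql
-- Hypothetical table; the "immutable" property requires Hive 0.13.0 or later.
CREATE TABLE daily_snapshot (id INT, payload STRING)
TBLPROPERTIES ("immutable"="true");

-- Succeeds: the table is still empty.
INSERT INTO TABLE daily_snapshot SELECT id, payload FROM staging_snapshot;

-- Fails when run again, because the table now holds data.
INSERT INTO TABLE daily_snapshot SELECT id, payload FROM staging_snapshot;

-- Still allowed: INSERT OVERWRITE is not affected by the property.
INSERT OVERWRITE TABLE daily_snapshot SELECT id, payload FROM staging_snapshot;
```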

Inserts can be done to a table or a partition. If the table is partitioned, then one must specify a specific partition of the table by specifying values for all of the partitioning columns.
Multiple insert clauses (also known as Multi Table Insert) can be specified in the same query.
The output of each of the select statements is written to the chosen table (or partition). The OVERWRITE keyword implies that the contents of the chosen table or partition are replaced with the output of the corresponding select statement; it was mandatory before INSERT INTO became available in Hive 0.8.
The output format and serialization class is determined by the table's metadata (as specified via DDL commands on the table).
As of Hive 0.14, if a table has an OutputFormat that implements AcidOutputFormat and the system is configured to use a transaction manager that implements ACID, then INSERT OVERWRITE will be disabled for that table. This is to avoid users unintentionally overwriting transaction history. The same functionality can be achieved by using TRUNCATE TABLE (for non-partitioned tables) or DROP PARTITION followed by INSERT INTO.
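
The replacement for INSERT OVERWRITE on ACID tables might look like the sketch below. All table and column names (acid_events, events_stg, acid_page_view, page_view_stg, dt) are assumptions made for illustration.

```sql
-- Non-partitioned ACID table: empty it, then append the new rows.
TRUNCATE TABLE acid_events;
INSERT INTO TABLE acid_events SELECT * FROM events_stg;

-- Partitioned ACID table: drop the partition, then append into it again.
ALTER TABLE acid_page_view DROP IF EXISTS PARTITION (dt='2014-09-23');
INSERT INTO TABLE acid_page_view PARTITION (dt='2014-09-23')
SELECT userid, page_url FROM page_view_stg WHERE dt='2014-09-23';
```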

Notes

Multi Table Inserts minimize the number of data scans required. Hive can insert data into multiple tables by scanning the input data just once (and applying different query operators) to the input data.
Starting with Hive 0.13.0, the select statement can include one or more common table expressions (CTEs) as shown in the SELECT syntax. For an example, see Common Table Expression.
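
As an illustration of a Multi Table Insert, the sketch below scans a staging table once and writes to two targets. The target tables page_view_us and page_view_other are hypothetical, and pvs.cnt is assumed to hold the country value.

```sql
-- One scan of page_view_stg feeds both insert clauses.
FROM page_view_stg pvs
INSERT OVERWRITE TABLE page_view_us PARTITION (dt='2008-06-08')
  SELECT pvs.userid, pvs.page_url WHERE pvs.cnt = 'US'
INSERT INTO TABLE page_view_other PARTITION (dt='2008-06-08')
  SELECT pvs.userid, pvs.page_url WHERE pvs.cnt <> 'US';
```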

Dynamic Partition Inserts

Version information

This information reflects the situation in Hive 0.12; dynamic partition inserts were added in Hive 0.6.

In the dynamic partition inserts, users can give partial partition specifications, which means just specifying the list of partition column names in the PARTITION clause. The column values are optional. If a partition column value is given, we call this a static partition, otherwise it is a dynamic partition. Each dynamic partition column has a corresponding input column from the select statement. This means that the dynamic partition creation is determined by the value of the input column. The dynamic partition columns must be specified last among the columns in the SELECT statement and in the same order in which they appear in the PARTITION() clause.

Dynamic partition inserts are disabled by default. These are the relevant configuration properties for dynamic partition inserts:

Configuration property	Default	Note
`hive.exec.dynamic.partition`	`false`	Needs to be set to `true` to enable dynamic partition inserts
`hive.exec.dynamic.partition.mode`	`strict`	In `strict` mode, the user must specify at least one static partition, to guard against accidentally overwriting all partitions; in `nonstrict` mode all partitions are allowed to be dynamic
`hive.exec.max.dynamic.partitions.pernode`	100	Maximum number of dynamic partitions allowed to be created in each mapper/reducer node
`hive.exec.max.dynamic.partitions`	1000	Maximum number of dynamic partitions allowed to be created in total
`hive.exec.max.created.files`	100000	Maximum number of HDFS files created by all mappers/reducers in a MapReduce job
`hive.error.on.empty.partition`	`false`	Whether to throw an exception if dynamic partition insert generates empty results
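
For a single session, these properties can be set before the insert statement. The fragment below is a sketch that enables dynamic partition inserts while keeping strict mode; the limit values shown are simply the defaults from the table above.

```sql
-- Enable dynamic partition inserts for this session.
SET hive.exec.dynamic.partition=true;
-- Keep strict mode: at least one partition column must be given a static value.
SET hive.exec.dynamic.partition.mode=strict;
-- Limits on how many dynamic partitions one job may create (default values shown).
SET hive.exec.max.dynamic.partitions.pernode=100;
SET hive.exec.max.dynamic.partitions=1000;
```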

Example

 
           FROM page_view_stg pvs
           INSERT OVERWRITE TABLE page_view PARTITION(dt='2008-06-08', country)
                  SELECT pvs.viewTime, pvs.userid, pvs.page_url, pvs.referrer_url, null, null, pvs.ip, pvs.cnt

Here the country partition will be dynamically created by the last column from the SELECT clause (i.e. pvs.cnt). Note that the name is not used. In nonstrict mode the dt partition could also be dynamically created.
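
A sketch of the nonstrict variant, in which dt is dynamic as well. It assumes the staging table has a column holding the date, here given the hypothetical name pvs.dt. The dynamic partition columns come last in the SELECT list, in the same order as in the PARTITION clause.

```sql
SET hive.exec.dynamic.partition=true;
SET hive.exec.dynamic.partition.mode=nonstrict;

-- Both dt and country are taken from the last two SELECT columns, in that order.
FROM page_view_stg pvs
INSERT OVERWRITE TABLE page_view PARTITION(dt, country)
       SELECT pvs.viewTime, pvs.userid, pvs.page_url, pvs.referrer_url, null, null, pvs.ip,
              pvs.dt, pvs.cnt;
```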

Writing data into the filesystem from queries

Query results can be inserted into filesystem directories by using a slight variation of the syntax above:

Syntax

 
           Standard syntax:
           INSERT OVERWRITE [LOCAL] DIRECTORY directory1
             [ROW FORMAT row_format] [STORED AS file_format] (Note: Only available starting with Hive 0.11.0)
             SELECT ... FROM ...

           Hive extension (multiple inserts):
           FROM from_statement
           INSERT OVERWRITE [LOCAL] DIRECTORY directory1 select_statement1
           [INSERT OVERWRITE [LOCAL] DIRECTORY directory2 select_statement2] ...

           row_format
             : DELIMITED [FIELDS TERMINATED BY char [ESCAPED BY char]] [COLLECTION ITEMS TERMINATED BY char]
                   [MAP KEYS TERMINATED BY char] [LINES TERMINATED BY char]
                   [NULL DEFINED AS char] (Note: Only available starting with Hive 0.13)

Synopsis

Directory can be a full URI. If scheme or authority are not specified, Hive will use the scheme and authority from the hadoop configuration variable fs.default.name that specifies the Namenode URI.
If the LOCAL keyword is used, Hive will write data to the directory on the local file system.
Data written to the filesystem is serialized as text with columns separated by ^A and rows separated by newlines. If any of the columns are not of primitive type, then those columns are serialized to JSON format.
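
A short sketch of both forms. The directory paths are hypothetical, and page_view_stg is reused from the dynamic partition example only for illustration.

```sql
-- Write to a directory on the default filesystem using the default text
-- serialization (columns separated by ^A, rows by newlines).
INSERT OVERWRITE DIRECTORY '/tmp/hive_export/page_view'
SELECT pvs.userid, pvs.page_url FROM page_view_stg pvs;

-- Write to the local file system with a comma as field separator.
-- ROW FORMAT on directory inserts requires Hive 0.11.0 or later.
INSERT OVERWRITE LOCAL DIRECTORY '/tmp/page_view_csv'
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
SELECT pvs.userid, pvs.page_url FROM page_view_stg pvs;
```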

Notes

INSERT OVERWRITE statements to directories, local directories, and tables (or partitions) can all be used together within the same query.
INSERT OVERWRITE statements to HDFS filesystem directories are the best way to extract large amounts of data from Hive. Hive can write to HDFS directories in parallel from within a map-reduce job.
The directory is, as you would expect, overwritten; in other words, if the specified path exists, it is clobbered and replaced with the output.
As of Hive 0.11.0 the separator used can be specified; in earlier versions it was always the ^A character (\001).
In Hive 0.14, inserts into ACID compliant tables will deactivate vectorization for the duration of the select and insert. This will be done automatically. ACID tables that have data inserted into them can still be queried using vectorization.
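
The first note can be illustrated with one query that writes to a table, an HDFS directory, and a local directory. This is a sketch; the target table page_view_urls and both paths are assumptions.

```sql
-- A single scan of page_view_stg feeds all three outputs.
FROM page_view_stg pvs
INSERT OVERWRITE TABLE page_view_urls
  SELECT pvs.userid, pvs.page_url
INSERT OVERWRITE DIRECTORY '/tmp/hive_export/referrers'
  SELECT pvs.userid, pvs.referrer_url WHERE pvs.referrer_url IS NOT NULL
INSERT OVERWRITE LOCAL DIRECTORY '/tmp/page_view_ips'
  SELECT pvs.userid, pvs.ip;
```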

Inserting values into tables from SQL

The INSERT...VALUES statement can be used to insert data into tables directly from SQL.

Version Information

INSERT...VALUES is available starting in Hive 0.14.

Inserting values from SQL statements can only be performed on tables that support ACID. See Hive Transactions for details.

Syntax

 
           Standard Syntax:
           INSERT INTO TABLE tablename [PARTITION (partcol1[=val1], partcol2[=val2] ...)] VALUES values_row [, values_row ...]

           Where values_row is:
           ( value [, value ...] )
           where a value is either null or any valid SQL literal

Synopsis

Each row listed in the VALUES clause is inserted into table tablename.
Values must be provided for every column in the table. The standard SQL syntax that allows the user to insert values into only some columns is not yet supported. To mimic the standard SQL, nulls can be provided for columns the user does not wish to assign a value to.
Dynamic partitioning is supported in the same way as for INSERT...SELECT.
If the table being inserted into supports ACID and a transaction manager that supports ACID is in use, this operation will be auto-committed upon successful completion.
Insert, update, delete operations are not supported on tables that are sorted (tables created with the SORTED BY clause).
Hive does not support literals for complex types (array, map, struct, union), so it is not possible to use them in INSERT INTO...VALUES clauses. This means the user cannot insert data into columns of a complex datatype using the INSERT INTO...VALUES clause.

Examples

 
      
       
         
           CREATE TABLE students (name VARCHAR(64), age INT, gpa DECIMAL(3, 2))
             CLUSTERED BY (age) INTO 2 BUCKETS STORED AS ORC;

           INSERT INTO TABLE students
             VALUES ('fred flintstone', 35, 1.28), ('barney rubble', 32, 2.32);


           -- from is a reserved keyword, so the column name is quoted with backticks
           CREATE TABLE pageviews (userid VARCHAR(64), link STRING, `from` STRING)
             PARTITIONED BY (datestamp STRING) CLUSTERED BY (userid) INTO 256 BUCKETS STORED AS ORC;

           INSERT INTO TABLE pageviews PARTITION (datestamp = '2014-09-23')
             VALUES ('jsmith', 'mail.com', 'sports.com'), ('jdoe', 'mail.com', null);

           INSERT INTO TABLE pageviews PARTITION (datestamp)
             VALUES ('tjohnson', 'sports.com', 'finance.com', '2014-09-23'), ('tlee', 'finance.com', null, '2014-09-21');

Update

Version Information

UPDATE is available starting in Hive 0.14.

Updates can only be performed on tables that support ACID. See Hive Transactions for details.

Syntax

 
           Standard Syntax: 
          
           UPDATE tablename SET column = value [, column = value ...] [WHERE expression]

Synopsis

The referenced column must be a column of the table being updated.
The value assigned must be an expression that Hive supports in the select clause. Thus arithmetic operators, UDFs, casts, literals, etc. are supported. Subqueries are not supported.
Only rows that match the WHERE clause will be updated.
Partitioning columns cannot be updated.
Bucketing columns cannot be updated.
In Hive 0.14, upon successful completion of this operation the changes will be auto-committed.

Notes

Vectorization will be turned off for update operations. This is automatic and requires no action on the part of the user. Non-update operations are not affected. Updated tables can still be queried using vectorization.
In version 0.14 it is recommended that you set hive.optimize.sort.dynamic.partition=false when doing updates, as this produces more efficient execution plans.
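
A minimal sketch of an update, reusing the students table from the INSERT...VALUES examples and assuming ACID support is configured for it. The new values are made up for illustration.

```sql
-- Recommended in Hive 0.14 when running updates.
SET hive.optimize.sort.dynamic.partition=false;

-- gpa is neither a partitioning nor a bucketing column, so it can be updated.
UPDATE students SET gpa = 3.12 WHERE name = 'fred flintstone';

-- The assigned value may be an expression over the row's own columns.
UPDATE students SET gpa = gpa + 0.10 WHERE age > 30;
```

An attempt to set age would be rejected, because students is bucketed on age.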

Delete

Version Information

DELETE is available starting in Hive 0.14.

Deletes can only be performed on tables that support ACID. See Hive Transactions for details.

Syntax

 
           Standard Syntax: 
          
           DELETE FROM tablename [WHERE expression]

Synopsis

Only rows that match the WHERE clause will be deleted.
In Hive 0.14, upon successful completion of this operation the changes will be auto-committed.

Notes

Vectorization will be turned off for delete operations. This is automatic and requires no action on the part of the user. Non-delete operations are not affected. Tables with deleted data can still be queried using vectorization.
In version 0.14 it is recommended that you set hive.optimize.sort.dynamic.partition=false when doing deletes, as this produces more efficient execution plans.
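
A minimal sketch of deletes, reusing the students and pageviews tables from the INSERT...VALUES examples and assuming ACID support is configured for them. The filter values are illustrative.

```sql
-- Recommended in Hive 0.14 when running deletes.
SET hive.optimize.sort.dynamic.partition=false;

-- Remove only the rows that match the WHERE clause.
DELETE FROM students WHERE gpa < 2.0;

-- Partition columns can be used in the filter like any other column.
DELETE FROM pageviews WHERE datestamp = '2014-09-21' AND userid = 'tlee';
```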
