thisissally

牛客刷题——题型总结

文章目录

（一）表连接
- 1、多表连接
- - （1）join-on和多个where等价
  - （2）通过多表连接解决A成立B不成立的问题
  - （3）自连接
  - （4）from相同表但是where不同
- 2、表连接函数
- - （1）union、union all
  - （2）left join、right join、join的区别
- 3、行对应性表连接
- 4、嵌套查询
（二）筛选：where、having、in
- 1、where和having的区别
- 2、where in与join等价的情况
（三）聚合信息：groupby与窗口函数
- 1、要显示所有信息因此不能直接使用group by后的结果
- 2、窗口函数+groupby
- 3、根据两个变量分组
- 4、聚合函数不一定要和groupby一起用
- 5、groupby最易错点：select 分组变量/聚合函数
- 6、根据不同字段group by
（四）排序
- 1、第n多
- 2、前n多：窗口函数
（五）执行顺序
（六）统计不同——去重
（七）行列转换
（八）时间函数
（九）字符串函数
- 1、内置函数
- 2、正则表达式
（十）类型转换
（十一）随机抽样
（十二）空值null
- null和空值得区别

（一）表连接

1、多表连接

（1）join-on和多个where等价

涉及多个表，要么join用on来筛选，要么多表查询限制很多个where条件

-- 1、查询"01"课程比"02"课程成绩高的学生的信息及课程分数

01-多表join，用on筛选
（1）a连b连c，不要a连（b连c），这样会把过程写复杂
（2）中间的筛选放在on里写，where只能在最后（查询前）写，不能在join的过程中
select a.* ,b.s_score as 01_score,c.s_score as 02_score
from student a 
join score b on a.s_id=b.s_id and b.c_id='01'  #01一定要有成绩，所以用了join
left join score c on b.s_id=c.s_id and c.c_id='02' #02成绩可有可无，所以用left join
where b.s_score>c.s_score;

02-不连接，直接一个where筛选出所有的结果，要哪些信息就直接选择
select a.*,b.s_score as 01_score,c.s_score as 02_score from student a,score b,score c 
		where a.s_id=b.s_id 
		and a.s_id=c.s_id 
		and b.c_id='01' 
		and c.c_id='02' 
		and b.s_score>c.s_score

-- 9、查询学过编号为"01"并且也学过编号为"02"的课程的同学的信息
（1）join 方法
select student.*
from student 
join score a on a.s_id=student.s_id and a.c_id='01'
join score b on b.s_id=student.s_id and b.c_id='02'

（2）where方法，注意要from所有表，筛选所有条件都具备的情况
select student.*
from student,score a ,score b
where student.s_id=a.s_id and student.s_id=b.s_id and a.c_id='01' and b.c_id='02'

【例】SQL19、查找所有员工的last_name和first_name以及对应的dept_name，也包括暂时没有分配部门的员工

思路：因为要包括暂时没有分配部门的员工，所以要把employees放在最左边，且用两次left join

# 两次LEFT JOIN连接
SELECT last_name, first_name, dept_name
FROM employees
LEFT JOIN dept_emp ON employees.emp_no=dept_emp.emp_no
LEFT JOIN departments ON dept_emp.dept_no=departments.dept_no

【例】SQL22 统计各个部门的工资记录数

SELECT d.dept_no, dept_name,count(*) as sum
FROM salaries s 
JOIN dept_emp de ON de.emp_no = s.emp_no
JOIN departments d ON d.dept_no = de.dept_no
GROUP BY dept_no  -- 从groupby可以开始用select中的别名
ORDER BY dept_no

（2）通过多表连接解决A成立B不成立的问题

【例】SQL25 满足条件的用户的试卷完成数和题目练习数
请你找到高难度SQL试卷得分平均值大于80并且是7级的红名大佬，统计他们的2021年试卷完成数和题目练习数，只保留2021年有试卷完成记录的用户。结果按试卷完成数升序，按题目练习数降序。

题目的意思是说，试卷一定要有完成记录，但是题目不一定要有，这种情况下应该把试卷完成情况作为左表，题目完成情况作为右表，其他情况再做筛选。

select 
    uid,
    exam_cnt,
    (case when question_cnt is null then 0 else question_cnt end)
    #if(question_cnt is null, 0, question_cnt)
from
(select uid,count(score) as exam_cnt
from exam_record
where YEAR(submit_time) = 2021
group by uid) t  -- 试卷有成绩
 
left join
 
(select uid,count(submit_time) as question_cnt
from practice_record
where YEAR(submit_time) = 2021
group by uid) t2 using(uid)  -- 题目不一定做了
 
where uid in
(
select
    uid
from exam_record
join examination_info using(exam_id)
join user_info using(uid)
where tag = 'SQL' and difficulty = 'hard' and `level` = 7
group by uid
having avg(score) >= 80
)
order by exam_cnt asc, question_cnt desc

（3）自连接

【例】SQL70 牛客每个人最近的登录日期(五)

法一：sum(case when 1 else 0 end)分组计算
法二：自连接：join on user_id相等并且datediff(nextday,today)=1
法三：lead窗口函数

要在group by之后还能得到所有日期的结果，可以把原表和现表左边界，
select distinct date from login或者select date from login group by date（原表的date也要唯一）

# 法一
select date,
ifnull(
round(
sum(case when (user_id,date) in 
   (select user_id,date_add(date,interval -1 day) from login)
   and (user_id,date) in
   (select user_id,min(date) from login group by user_id)
   then 1 else 0 end)/
sum(case when (user_id,date) in 
   (select user_id,min(date) from login group by user_id)
   then 1 else 0 end),3),0) as p
from login 
group by date
order by date

# 法二
# a最早登录的日期左连接第二天的日期：on用户和时间差
# b所有的日期连接a：所有天的最早和第二天情况
# 对b计数

select t0.date,
ifnull(round(count(t2.user_id)/count(t1.user_id),3),0)
from 
(select distinct date from login)t0
left join
(select user_id,min(date)as min_date from login group by user_id)t1
on t0.date=min_date
left join 
login t2 on t1.user_id=t2.user_id and datediff(t2.date,min_date)=1
group by t0.date

# 法三
select date,
ifnull(round(sum(case when date=min_date and datediff(next_date,date)=1 then 1 else 0 end) / sum(case when date=min_date then 1 else 0 end),3),0) as p
from(
    select user_id, date, min(date) over (partition by user_id) as min_date, lead(date,1) over(partition by user_id order by date) as next_date
    from login
) a
group by date
order by date;

【例】SQL46 大小写混乱时的筛选统计

#自连接得到符合大小写要求的exam_id
#on的妙用

select a.tag,b.answer_cnt
from
(select tag,count(start_time) as answer_cnt
from examination_info join exam_record using(exam_id)
group by tag)a

join

(select tag,count(start_time) as answer_cnt
from examination_info join exam_record using(exam_id)
group by tag)b
on a.tag!=b.tag and upper(a.tag)=b.tag
group by tag
order by answer_cnt desc

（4）from相同表但是where不同

with temp as (
select x from table 
where x1 in ('a','b','c')
)
select x from temp wehre x1='a'

2、表连接函数

（1）union、union all

行合并，要求列是同数量且有相似的数据类型，每条 SELECT 语句中的列的顺序必须相同。union会去重并降低效率，union all允许重复的值。UNION 结果集中的列名总是等于 UNION 中第一个 SELECT 语句中的列名。

# union
SELECT column_name(s) FROM table_name1
UNION
SELECT column_name(s) FROM table_name2
# union all
SELECT column_name(s) FROM table_name1
UNION ALL
SELECT column_name(s) FROM table_name2

orderby只能在最后使用一次，所以这边只能放进子表中才能使用两次

【例】SQL23 每个题目和每份试卷被作答的人数和次数

select * from
(select exam_id as tid,
count(distinct uid) as uv,
count(*) as pv
from exam_record
group by tid
order by uv desc,pv desc)a  -- orderby只能在最后使用一次，所以这边只能放进子表中才能使用两次

union all 
select * from
(select question_id as tid,
count(distinct uid) as uv,
count(*) as pv
from practice_record
group by tid
order by uv desc,pv desc)b

【例】SQL24 分别满足两个活动的人
输出2021年里，所有每次试卷得分都能到85分的人以及至少有一次用了一半时间就完成高难度试卷且分数大于80的人的id和活动号，按用户ID排序输出。输出形式：

所有每次试卷得分都能到85分的人：
（1）思路1：找到存在分数小于85分的用户，筛选的时候用not in排除
（2）思路2：根据用户分组，最小分数>=85的用户，就是符合条件的用户

难点：分别’activity1’ as activity和’activity2’ as activity之后union all

# 思路1
with a as (select uid from exam_record 
           where score<85
           and year(submit_time) = 2021)  -- 存在分数<85的用户，不符合activity1
select distinct uid,
(case when uid not in (select * from a) then 'activity1' else null end) as activity
from exam_record
where (case when uid not in (select * from a) then 'activity1' else null end) is not null
union all
select distinct uid,'activity2' as activity
from exam_record e_r left join examination_info e_i using(exam_id)
where year(submit_time) = 2021
and difficulty = 'hard'
and score > 80
and timestampdiff(minute, start_time, submit_time) * 2 < e_i.duration
order by uid

# 思路2
select uid,'activity1' as activity
from exam_record
where year(submit_time) = 2021
group by uid
having min(score) >= 85
union all
select distinct uid,'activity2' as activity
from exam_record e_r left join examination_info e_i using(exam_id)
where year(submit_time) = 2021
and difficulty = 'hard'
and score > 80
and timestampdiff(minute, start_time, submit_time) * 2 < e_i.duration
order by uid

（2）left join、right join、join的区别

left join：查出来的结果显示左边的所有数据，然后右边显示的是和左边有交集部分的数据。
right join：查出表2所有数据，以及表1和表2有交集的数据。
join(inner join)：查出两个表有交集的部分，其余没有关联就不额外显示出来。

3、行对应性表连接

SQL86 实习广场投递简历分析(三)

代码注意点：（1）select一定要写清哪些，这里写*会报错；（2）RIGHT(s,n) 返回字符串 s 的后 n 个字符；（3）用right(first_year_mon,2)=right(second_year_mon,2)控制每行上时间的对应性；（4）顺序最后一定要调整

select t1.job,first_year_mon,first_year_cnt,second_year_mon,second_year_cnt 
from 
(select job,DATE_FORMAT(date,'%Y-%m') as first_year_mon,sum(num)as first_year_cnt
from resume_info
where date like '2025%'
group by job,first_year_mon)t1
JOIN
(select job,DATE_FORMAT(date,'%Y-%m') as second_year_mon,sum(num)as second_year_cnt
from resume_info
where date like '2026%'
group by job,second_year_mon)t2
on t1.job=t2.job AND right(first_year_mon,2)=right(second_year_mon,2)
order by first_year_mon desc,job desc

希望用到两张表的信息——表连接+条件筛选
SQL76 考试分数(五)

代码注意点：所有语句中，如果变量名是唯一的，就不需要写表名，写表名是在易混淆的情况下才这么做。

-- 查询各个岗位分数的中位数位置上的所有grade信息，并且按id升序排序
select id,t1.job,score,t_rank
from
(select id,job,score,
row_number() over (partition by job order by score desc) as t_rank
from grade)t1
JOIN
(select job,
(case when count(id)%2=0 then count(id)/2 else ceiling(count(id)/2) end) as start,
(case when count(id)%2=0 then count(id)/2+1 else ceiling(count(id)/2) end) as end
from grade
group by job )t2
on t1.job=t2.job
where t_rank=start or t_rank=end
order by id

4、嵌套查询

当下一层计算结果是基于上一层时，需要用到层层嵌套的方法
【例】SQL28 第二快/慢用时之差大于试卷时长一半的试卷

-- 先用窗口函数找出每门考试的正数和倒数的排名
-- 然后根据每门课分组，计算正数和倒数对应时间的时间差
-- 最后筛选出时间差符合要求的情况
-- 涉及的知识点：并列计数窗口、分组条件计算、嵌套查询

select distinct exam_id, duration, release_time
from
    (select exam_id, duration, release_time,
           #sum(case when rank1 = 2 then costtime when rank2 = 2 then -costtime else 0 end) as sub
           max(case when rank1=2 then costtime else null end)-max(case when rank2=2 then costtime else null end) as sub

     from (
        select e_i.exam_id, duration, release_time,
        timestampdiff(minute, start_time, submit_time) as costtime,
        row_number() over(partition by e_r.exam_id order by timestampdiff(minute, start_time, submit_time) desc) rank1,
        row_number() over(partition by e_r.exam_id order by timestampdiff(minute, start_time, submit_time) asc) rank2
        from exam_record e_r join examination_info e_i
        on e_r.exam_id = e_i.exam_id
    ) table1
    group by exam_id
) table2
where sub*2 >= duration
order by exam_id desc

【例】SQL29 连续两次作答试卷的最大时间窗

时间函数：
datediff(end_time,start_time)
date(start_time)

-- 细节点：（1）作答过只需要有start_time就可以了；（2）根据题意算时间差都需要在公式的基础上+1
-- 需要的数据：每个人的前后期开始作答时间（窗口），
-- groupby：每个人的最大窗口时间，对窗口时间筛选，每个人的最先时间，最后时间，作答次数
-- 在上面的基础上计算count,max,min
-- 一层基于一层来计算，用层层嵌套来做

select uid,days_window,
round(counts/sub_day*days_window,2) as avg_exam_cnt
from
(select uid,
max(datediff(next_time,start_time))+1 as days_window,
datediff(max(date(start_time)),min(date(start_time)))+1 as sub_day,
count(start_time) as counts
from
(select uid,start_time,
lead(start_time,1)over(partition by uid order by start_time) as next_time
from exam_record
where year(start_time)=2021)a
group by uid
having count(distinct date(start_time))>=2
)b
order by days_window desc,avg_exam_cnt desc;

【例】SQL30 近三个月未完成试卷数为0的用户完成情况

# 每个人的试卷作答（start）记录的月份：窗口函数，序号
# 做筛选：序号前三，没有未完成
# 筛选出用户，得到该用户的试卷完成数（近三个月）
# 按试卷完成数和用户ID降序排名

select uid,count(submit_time) as exam_complete_cnt from
(select uid,date_format(start_time,'%Y%m') as ans_month,start_time,submit_time,
dense_rank()over(partition by uid order by date_format(start_time,'%Y%m') desc) as recent_months
from exam_record)a
where recent_months between 1 and 3  -- <=3
group by uid
having count(start_time)=count(submit_time)
order by exam_complete_cnt desc,uid desc

【例】SQL31 未完成率较高的50%用户近三个月答卷情况

代码注意点：（1）三表嵌套join很复杂，用where in代替反而简化问题；（2）count()在只有一类的情况下可以不和groupby连用，但是只能显示一行结果。count()over()可以在每一行都显示结果；（3）判断前50%（中位数及之后）：rank<=ceiling（总数/2），则是前50%，否则不是
【法一】count(distinct uid) over ()把总人数连接到表上
【法二】只join不on，可以把总人数连接到表上
【法三】（select count(distinct uid) from）表示总人数
思路：先把步骤和对应的方法按照先后顺序写出来，再写代码

# 数所有行数用count(1)或者count任意一个非空变量都可以
with a as (
select uid 
from 
(select *,row_number()over(order by incomplete_rate desc) incomplete_order,count(1)over() as numbers
from
(select uid,(count(1)-count(submit_time))/count(1) as incomplete_rate
from exam_record
where exam_id in (select exam_id from examination_info where tag='SQL')
group by uid)t1)t2 join (select count(distinct uid) as total_user from exam_record join examination_info using(exam_id) where tag='SQL') t_u
# where incomplete_order<=ceiling(numbers*0.5)   -- 法一
# where incomplete_order<=ceiling(total_user*0.5)   -- 法二
where incomplete_order<=ceiling((select count(distinct uid) as total_user from exam_record join examination_info using(exam_id) where tag='SQL')*0.5)  -- 法三
and uid in (select uid from user_info where level in (6,7))
)

select uid,start_month,count(start_time) as total_cnt,count(submit_time) as complete_cnt
from 
(select uid,date_format(start_time,'%Y%m') as start_month,start_time,submit_time,
dense_rank()over(partition by uid order by date_format(start_time,'%Y%m') desc) as recent_months
from exam_record)recent_table
where recent_months<=3
and uid in (select uid from a)
group by uid,start_month  -- 每个人每个月的登录情况
order by uid,start_month

（二）筛选：where、having、in

1、where和having的区别

（1）作用位置：都是筛选功能，where指定分组之前数据行的条件，having子句用来指定分组之后条件
（2）使用限制：where是对聚合前的信息进行筛选，having是对聚合后的信息进行筛选
（3）联系：where-groupby-having的使用顺序，where和having的区别在于筛选对象是分组前还是分组后
【易错点】涉及groupby的时候注意select的要么是聚合函数，要么是groupby的对象

-- 15、查询两门及其以上不及格课程的同学的学号，姓名及其平均成绩 
select a.s_id,a.s_name,avg(b.s_score) as avg_score 
from student a join score b 
on a.s_id=b.s_id
where s_score<60
group by a.s_id,a.s_name
having count(c_id)>=2

SQL78 牛客的课程订单分析(二)

select user_id
from order_info
where status='completed'
and product_name in ('C++','Java','Python')
and date>'2025-10-15'
group by user_id
having count(id)>=2
order by user_id

SQL88 最差是第几名(二)

涉及变量比较一定是在同一行上的数据
where的逻辑在from之后，select之前，所以这里的where筛选可以用到from表中有但是select中没有的变量

-- 中位数：正序和逆序的累积和都大于总和的一半，就是中位数
select grade FROM
(select
grade,(select sum(number) from class_grade) as total,
sum(number) over (order by grade) as up,
sum(number) over (order by grade desc) as down
from class_grade
)a
where up>=total/2  -- 涉及变量比较一定是在同一行上的，where在from之后，select之前
and down>=total/2
order by grade

2、where in与join等价的情况

当一次groupby，需要筛选条件时，where in和join时等价的
当多次groupby，需要筛选条件时，用where in () （注意不是where in ()a，不用标记表名）

等价：【例】SQL22 作答试卷得分大于过80的人的用户等级分布

# where in
select level,count(uid) as level_cnt
from user_info 
where (uid,level) in   # 字段数需要统一
(select ui.uid,level
from exam_record er
left join user_info ui using(uid)
left join examination_info ei using(exam_id)
where tag='SQL'
and score>80)
group by level
order by level_cnt DESC

# join
select level,count(distinct u_i.uid) as level_cnt
from exam_record e_r 
left join examination_info e_i on e_r.exam_id = e_i.exam_id
left join user_info u_i on e_r.uid = u_i.uid
where tag = 'SQL'
and score > 80
group by level
order by level_cnt desc, level desc

不等价【例】月均完成试卷数不小于3的用户爱作答的类别

SELECT tag,count(tag) as tag_cnt
from exam_record join examination_info using(exam_id)
where uid in 
(
select uid from exam_record
where submit_time is not null
group by uid
having count(submit_time)/count(distinct date_format(submit_time,'%Y%m'))>=3
)
group by tag
order by tag_cnt desc

【例】SQL70 牛客每个人最近的登录日期(五)
查询每个日期新用户的次日留存率，结果保留小数点后面3位数(3位之后的四舍五入)，并且查询结果按照日期升序排序

代码注意点：（1）iffull(value,0)表示如果是null就输出0；（2）(a,b) in (select A,B from…)列数一定要对等
明确问题：12号的新用户次留是指在12号是第一次登录，并且在13号也登录了。分母：当前日期新用户的特征是当前日期=该用户所有登录日期的最小值。分子：当前日期作为前一天有该用户的登录记录，并且是第一次登录。（12号作为前一天登陆了并且是第一次登录，13号要登录了）

-- 通过in来筛选
-- 分子：今天在，昨天也在，且昨天是第一天登录
-- 分母：每天的新用户数
-- 易错点：分母为0,ifnull

select date,
ifnull(round(sum(case when 
    (user_id,date) in (select user_id,date_add(date,interval -1 day) from login)
    and
    (user_id,date) in (select user_id,min(date) from login group by user_id)
    then 1 else 0 end)/
sum(case when 
    (user_id,date) in (select user_id,min(date) from login group by user_id)
    then 1 else 0 end),3)
 ,0) as p
 from login 
 group by date
 order by date

（三）聚合信息：groupby与窗口函数

1、要显示所有信息因此不能直接使用group by后的结果

SQL79 牛客的课程订单分析(三)
【法一】内表找出user_id，外表找出该user_id符合的记录

-- 要显示所有信息因此不能直接使用group by后的结果
-- 先找到符合条件的人
with temp1 as (select user_id from order_info
where date>'2025-10-15'
and status='completed'
and product_name in ('C++','Java','Python')
group by user_id 
having count(id)>=2)

-- 再找到符合条件的所有信息
select * from order_info
where user_id in (select * from temp1)  -- 注意不能直接写成temp1
and date>'2025-10-15'
and status='completed'
and product_name in ('C++','Java','Python')
order by id

同理也可以不用临时表来写

select * from order_info
where user_id in (select user_id from order_info
where date>'2025-10-15'
and status='completed'
and product_name in ('C++','Java','Python')
group by user_id 
having count(id)>=2)  -- 注意不能直接写成temp1
and date>'2025-10-15'
and status='completed'
and product_name in ('C++','Java','Python')
order by id

注：

临时表的写法：
with a as (),
b as (),
c as ()

【法二】窗口函数
groupby会把结果聚合成一行，所以如果需要所有信息，就要先内表后外表。但是窗口函数生成结果的行数不变，因此可以直接基于窗口函数做筛选，但是如果where筛选涉及窗口，还是要作为内表的，因为where的逻辑再select之前，但是可以少写很多筛选条件。

select id,user_id,product_name,status,client_id,date 
from
(select *,
count(id) over (partition by user_id) as counts
from order_info
where date>'2025-10-15'
and status ="completed"
and product_name in ("C++","Java","Python")
) a
where counts>=2
order by id

2、窗口函数+groupby

SQL80 牛客的课程订单分析(四)

思路：最后需要最小日期所以肯定做聚合，做聚合就需要全部信息，所以前一步的计数肯定用到窗口函数

-- 首先在有次数的内表上做筛选，然后基于筛选结果做聚合函数求最小日期
-- 窗口函数+groupby 
select user_id,min(date) as first_buy_date,cnt
from
(select *,
count(id) over (partition by user_id) as cnt
from order_info
where date>'2025-10-15'
and status ="completed"
and product_name in ("C++","Java","Python")
) a
where cnt>=2
group by user_id
order by user_id

SQL27 每类试卷得分前3名

select * from 
(select tag,er.uid,
row_number() over (partition by tag order by max(score) desc,min(score) desc,uid desc)as ranking
from examination_info ei join exam_record er using(exam_id)
group by uid,tag
) a
where ranking<=3

3、根据两个变量分组

SQL85 实习广场投递简历分析(二)


select job,date_format(date,'%Y-%m') as mon,sum(num) as cnt
from resume_info
where date like '2025%'  -- 符合最左前缀匹配原则，也走索引
group by job,mon
order by mon desc,cnt desc;

4、聚合函数不一定要和groupby一起用

当只有一类的情况下，聚合函数不一定要和groupby一起用

【例】SQL14 SQL类别高难度试卷得分的截断平均值

-- 法一：嵌套子函数，因为只有一类，所以不需要groupby直接就可以算出min和max
select tag,difficulty,round(avg(score),1) as clip_avg_score
from exam_record er join examination_info ei on er.exam_id=ei.exam_id
where tag='SQL' 
and difficulty='hard'
and score != (select max(score) from exam_record where tag='SQL' and difficulty='hard')
and score != (select min(score) from exam_record where tag='SQL' and difficulty='hard')


-- 法二：窗口函数，正序和倒序两次row_number来找到最大和最小
select tag,difficulty,round(avg(score),1) as clip_avg_score from
(
select tag,difficulty,score,
row_number() over (partition by tag order by score) as rank1,
row_number() over (partition by tag order by score desc) as rank2
from exam_record er join examination_info ei on er.exam_id=ei.exam_id
where tag='SQL' and difficulty='hard' and score is not null
) a
where rank1!=1 and rank2!=1

【例】SQL31 未完成率较高的50%用户近三个月答卷情况

select count(distinct uid) as total_user from exam_record

5、groupby最易错点：select 分组变量/聚合函数

SQL18 月总刷题数和日均刷题数

这里也可以用ifnull，ifnull和coalesce的区别：
ifnull只有两个参数，coalesce有多个参数，返回第一个非空的值

group by with rollup具有汇总加和的功能，但是列名那里自动为null，如果希望有列名，则需要辅助ifnull/coalesce函数
Hive中with rollup和with cude都可以用于group by的汇总，但是当分组依据是三组的情况下，二者呈现出的汇总效果不一样。cube是3222211111，而rollup是321321321。

这里最易错的点在于每月天数的计算
（1）计算每个月的天数可以用函数：day(last_day(time))
（2）也可以自己写：case when month(time) in (1,3,5,7,8,10,12) then 31 else 30 end
（3）最易错的点在于这里用到了groupby month，需要用max(day_of_month)或者min、first、last汇总出唯一结果，这样才不会报错

select coalesce(date_format(submit_time,'%Y%m'),'2021汇总') as submit_month,
count(score) as month_q_cnt,
round(count(score)/max(case when month(submit_time) in (1,3,5,7,8,10,12) then 31 else 30 end),3) as avg_day_q_cnt
FROM practice_record
where year(submit_time)=2021
group by DATE_FORMAT(submit_time, "%Y%m") with rollup

6、根据不同字段group by

问题：
（1）涉及两种不同的groupby：每个人购买每个商品的次数至少两次的人数（筛选）+每种商品的购买人数
（2）因为涉及对其中一个groupby的筛选，因此如果直接在两个groupby的基础上直接再groupby会导致范围不对

-- 法一：在两个字段groupby的表上套一个字段groupby，用if来筛选
SELECT product_id,
    ROUND(SUM(repurchase) / COUNT(1), 3) as repurchase_rate
FROM (
    SELECT uid, product_id, IF(COUNT(1)>1, 1, 0) as repurchase
    FROM tb_order_detail
    JOIN tb_order_overall USING(order_id)
    JOIN tb_product_info USING(product_id)
    WHERE tag="零食" AND event_time >= (
        SELECT DATE_SUB(MAX(event_time), INTERVAL 89 DAY)
        FROM tb_order_overall)
  
    GROUP BY uid, product_id
) as t_uid_product_info
GROUP BY product_id
ORDER BY repurchase_rate DESC, product_id
LIMIT 3;

-- 法二：直接表连接计算字段
select a.product_id,
       ifnull(round(cnt_2/cnt_total,3),0.000) repurchase_rate
from tb_product_info a
left join (select product_id,
                  count(distinct uid) cnt_total     -- 该产品被几个人购买过
           from tb_order_detail t1
           left join tb_order_overall t2
           on t1.order_id = t2.order_id
           GROUP BY product_id) b
on a.product_id = b.product_id    
left join (select product_id,count(distinct uid) as cnt_2  -- count(1)也可以
           from
          (select product_id,uid,count(1) as cnt
           from tb_order_detail join tb_order_overall using(order_id)
           where (DATEDIFF((select max(event_time) from tb_order_overall),date(event_time)) < 90)
           group by product_id,uid
           having cnt>=2)c
           group by product_id)d
on a.product_id = d.product_id           
where tag='零食'
order by repurchase_rate desc,product_id
limit 3

（四）排序

1、第n多

用orderby之后limit个数

limit y --读取 y 条数据
limit x, y --跳过 x 条数据，读取 y 条数据
limit y offset x --跳过 x 条数据，读取 y 条数据

2、前n多：窗口函数

【例】leetcode–185. 部门工资前三高的所有员工

select Department,Employee,Salary
from
    (select 
        b.name as Department,
        a.Name as Employee,
        a.Salary,
        dense_rank() over (partition by a.DepartmentId order by a.Salary desc) as salary_rank
        from Employee a join Department b
            on a.DepartmentId=b.Id) c
where salary_rank<=3

（五）执行顺序

where在join on后
SQL73 考试分数(二)
查询用户分数大于其所在工作(job)分数的平均分的所有grade的属性

代码注意点：表连接中如果是唯一字段可以不加表名

select id,grade.job,score from grade 
left join (select job,avg(score) as avg_score from grade group by job) a 
on grade.job=a.job
where score>avg_score
order by id

（六）统计不同——去重

distinct 会对结果集去重，对全部选择字段进行去重，并不能针对其中部分字段进行去重。使用count distinct进行去重统计会将reducer数量强制限定为1，而影响效率，因此适合改写为子查询。

-- 统计不同的id的个数
select count(distinct id) from  table_1

-- 优化版本的count distinct
select count(*) from
(select distinct id from table_1) tb

SQL15 统计作答次数

数行数用count(*)
count(var) 如果var有空值会自动忽略
count(distinct var)在计数时去重
如果是限制var2不要有空值，数var1有多少种，要用到case when

select
count(*) as total_pv,
count(score) as complete_pv, -- 聚合函数计算时会忽略空值
count(distinct case when score is not null then exam_id else null end) as complete_exam_cnt
from exam_record ;

（七）行列转换

tmp_column

select A,B,C from table
lateral view explode(split(column_C,',')) tmp_table as C
-- A，B，column_C 都是原表的列（字段）,tmp_table：explode形成的新虚拟表，可以不写；

select * from table LATERAL VIEW EXPLODE(SPLIT(ab_version, ',')) vidtb AS vid_explode
where vid_explode in ("1262091")

（八）时间函数

# 提取时间
DATE_FORMAT(NOW(),'%Y')
DATE_FORMAT(NOW(),'%m%d')
year()/month()/day()/hour()/minute()/second()/date()
# 转换类型
convert(log_time,date)
# 时间差day
datediff(string enddate, string startdate)
-- datediff函数只能处理'yyyy-MM-dd'这种格式的日期，如果日期形式是'yyyyMMdd'的话，需要进行格式转换

TIMESTAMPDIFF(interval, time_start, time_end)
-- 可计算time_start-time_end的时间差，单位以指定的interval为准:second,minute,hour,day,month,year
# 时间加
date_add(string startdate, int days)
A.T_DATE = B.T_DATE+ interval 1 hour  
'2021-09-01 22:11:12'+interval 50 minute  
# 时间减
date_sub (string startdate, int days)
A.T_DATE = B.T_DATE+ interval -1 hour

 -- 日期（2020-03-21 17:13:39）和unix时间戳（1584782175）之间相互转换
 ## 日期转化为时间戳 ##
select unix_timestamp('2020-03-21 17:13:39')：得到 1584782019
select unix_timestamp('20200321 13:01:03','yyyyMMdd HH:mm:ss') 得到 1584766863
select unix_timestamp('20200321','yyyyMMdd') 得到 1584720000


## 时间戳转化为日期 ## 
select from_unixtime (1584782175) 得到 2020-03-21 17:16:15
select from_unixtime (1584782175,'yyyyMMdd') 得到 20200321
select from_unixtime (1584782175,'yyyy-MM-dd')得到 2020-03-21


## 日期和日期之间，也可以通过时间戳来进行转换 ##
select from_unixtime(unix_timestamp('20200321','yyyymmdd'),'yyyy-mm-dd') 得到 2020-03-21
select from_unixtime(unix_timestamp('2020-03-21','yyyy-mm-dd'),'yyyymmdd')得到 20200321

（九）字符串函数

1、内置函数

-- 1、拼接
-- （1）concat( A, B...)返回将A和B按顺序连接在一起的字符串
select concat('abc', 'def','gh') 得到abcdefgh
concat(round(num,1),'%')  # 得到百分数

-- （2）concat_ws(string X, stringA, string B) 返回字符串A和B由X拼接的结果
select concat_ws(',', 'abc', 'def', 'gh') 得到 abc,def,gh

-- （3）根据分组情况连接字段
group_concat([DISTINCT] 要连接的字段 [Order BY ASC/DESC 排序字段] [Separator '分隔符'])
# DISTINCT用来给字段去重
# 默认用逗号分隔
# 等同于先用窗口函数排序再用collect_set去重组合

-- rk根据窗口函数得到
-- 把rankd的name组合在一起方便排序：['001张','002李']
SORT_ARRAY(COLLECT_SET(CONCAT(LPAD(CAST(rk AS STRING), 3, '0'),feature_name)))
-- 把['001张','002李']中的数字去掉：先把列表组合成字符串，然后替换，再根据逗号拆分成数组
split(REGEXP_REPLACE(CONCAT_WS(',', rank), '[0-9]\{3\}', ''),',')

-- 2、分割
substring_index(str,delim,count)
str=www.wikidm.cn
substring_index(str,'.',1)  结果是：www（从左向右数）
substring_index(str,'.',2)  结果是：www.wikidm
substring_index(str,'.',-2)  结果为：wikidm.cn（从右向左数）
substring_index(substring_index(str,'.',-2),'.',1)  结果是：wikidm（中间的数）

-- 3、切片
-- substr/substring（str,start,len) 截取字符串从0位开始的长度为len个字符。如果不加len，默认从start到end。
select substr('abcde',3,2) from iteblog;
-- 得到cd

-- 4、其他
select char_length('abcedfg') # 字符长度为7
## 使用trim(string A) 去除字符串两边的空格
select trim(' abc ') 得到 'abc'
## 使用lower(string A)/ lcase(string A)返回字符串的小写形式，常用于不确定原始字段是否统一为大小写
select lower('abSEd') 得到 absed
## 使用upper(string A)/ ucase(string A)返回字符串的大写形式，常用于不确定原始字段是否统一为大小写
select upper('abSEd') 得到 ABSED

【例】用户行为分析
用户行为表tracking_log

统计用户行为序列为A-B-D的用户数,其中:A-B之间可以有任何其他浏览记录(如C,E等),B-D之间除了C记录可以有任何其他浏览记录(如A,E等)

select count(*)
from(
		select user_id,group_concat(opr_id) ubp
		from tracking_log
		group by user_id
		) a
where ubp like '%A%B%D%' and ubp not like '%A%B%C%D%'
# 先提取子表后where筛选

【例】SQL19 未完成试卷数大于1的有效用户

拼接思路：首先字段拼接成新字段，然后是分组后的新字段拼接，要求分组时拼接的字段是不重复的，因此用distinct去重

select uid
        , sum(case when submit_time is null then 1 else 0 end) as incomplete_cnt
        , sum(case when submit_time is not null then 1 else 0 end) as complete_cnt
        , group_concat(distinct CONCAT(DATE_FORMAT(start_time, '%Y-%m-%d'),':',tag) separator ';') as detail
from exam_record er join examination_info ei using(exam_id)
where YEAR(start_time) = 2021 
group by uid
having incomplete_cnt>1
and incomplete_cnt<5
and complete_cnt >= 1
order by incomplete_cnt desc

2、正则表达式

regexp_extract 提取
regexp_replace 替换

##  regexp_extract(string subject, string pattern, int index)
## 将字符串subject按照pattern正则表达式的规则拆分，返回index指定的字符
select regexp_extract('foothebar', 'foo(.*?)(bar)', 1) 得到 the

## regexp_replace(string A, string B, string C)
## 将字符串A中的符合java正则表达式B的部分替换为C
select regexp_replace('foobar', 'oo|ar', '') 得到 fb

get_json_object(string json_string, string path)
第一个参数填写json对象变量，第二个参数使用$表示json变量标识，然后用 . 或 [] 读取对象或数组；
json对象相当于sql中的字典

data =
{
 "store":
        {
         "fruit":[{"weight":8,"type":"apple"}, {"weight":9,"type":"pear"}],  
         "bicycle":{"price":19.95,"color":"red"}
         }, 
 "email":"amy@only_for_json_udf_test.net", 
 "owner":"amy" 
}
hive> select  get_json_object(data, '$.owner') from test;
结果：amy
hive> select  get_json_object(data, '$.store.bicycle.price') from test;
结果：19.95
hive> select  get_json_object(data, '$.store.fruit[0]') from test;
结果：{"weight":8,"type":"apple"}

【例】SQL39 筛选昵称规则和试卷规则的作答记录

select ui.uid,ei.exam_id,round(avg(score)) as avg_score
from exam_record 
join user_info ui using(uid)
join examination_info ei using(exam_id)
where (nick_name rlike '^牛客[0-9]+号$'  -- ^开头，[0-9]任意一个字符，+一个或多个匹配
or nick_name rlike '^[0-9]+$') -- $结尾
and tag rlike '(c|C).*'  -- c或C，.任意字符，*0或多个匹配
and score is not null  -- 这一行要加，因为如果哪一行只有一个结果就是空值，就没办法通过avg的计算把null抵消掉
group by uid,exam_id
order by uid,avg_score

（十）类型转换

CAST (expression AS data_type)

可以转换的数据类型：
二进制，同带binary前缀的效果 : BINARY
字符型，可带参数 : CHAR()
日期 : DATE
时间: TIME
日期时间型 : DATETIME
浮点数 : DECIMAL
整数 : SIGNED
无符号整数 : UNSIGNED

SELECT CAST('9.0' AS decimal)  结果：9
SELECT CAST('9.5' AS decimal(10,2))  结果：9.50
SELECT  CAST(NOW() AS DATE) 结果：2017-11-27
cast(exam_cnt_rank_21 as signed) -- 字符串转化为数字

【例】收入区间分组

select id,
(case when CAST(salary as float)<50000 Then '0-5万'
when CAST(salary as float)>=50000 and CAST(salary as float)<100000 then '5-10万'
when CAST(salary as float) >=100000 and CAST(salary as float)<200000 then '10-20万'
when CAST(salary as float)>200000 then '20万以上'
else NULL end 
from table_1;

（十一）随机抽样

rand(),rand(int seed)

## 从数据表中随机取两条数据，设定了rand(1)之后，每次跑出来的都是一样的两条数据
select * from dm_growth_da.xdl_test_20200328 order by rand(1) limit 2

（十二）空值null

什么时候需要标记is not null？
（1）当设计窗口函数排序row_number，需要where is not null
（2）count，avg，sum会自动排除null
（3）限制其他变量非空，对该变量计数，则需要写成类似count(distinct case when score is not null then exam_id else null end) as complete_exam_cnt 的形式
【例】SQL15 统计作答次数

注意点：count中的casewhen在一般情况下也可以用where来替代，书写上会更加好读，（见下一个例子）但是因为这边要count(*)所以不能用where做统一筛选。

select
count(*) as total_pv,
count(score) as complete_pv, -- 聚合函数计算时会忽略空值
count(distinct case when score is not null then exam_id else null end) as complete_exam_cnt
from exam_record ;

【例】SQL17 平均活跃天数和月活人数

思路：因为平均活跃天数的分子是所有用户的活跃天数之和，需要用count来做，所以是根据用户和天来去重（每个用户一天如果登录多次，就记录一次）

select date_format(submit_time, '%Y%m') as month,
       round((count(distinct uid, date_format(submit_time, '%y%m%d'))) / count(distinct uid), 2) as avg_active_days,  -- 每个人的登录天数count，要对人和天去重distinct
       count(distinct uid) as mau
from exam_record
where submit_time is not null  # 很关键的非空
and year(submit_time) = 2021
group by month

【例】SQL19 未完成试卷数大于1的有效用户

count(非空字段)可以等价于sum(case is not null then end)

# 等价于sum(case when submit_time is null then 1 else null end)
select uid
        , sum(if(submit_time is null,1,null)) as incomplete_cnt
        , sum(if(submit_time is not null,1,null)) as complete_cnt
        , group_concat(distinct CONCAT(DATE_FORMAT(start_time, '%Y-%m-%d'),':',tag) separator ';') as detail
from exam_record er join examination_info ei using(exam_id)
where YEAR(start_time) = 2021 
group by uid
having incomplete_cnt>1
and incomplete_cnt<5
and complete_cnt >= 1
order by incomplete_cnt desc

# count
SELECT uid, 
    (count(*)-count(submit_time)) as incomplete_cnt,
    count(submit_time) as complete_cnt,
    group_concat(distinct concat_ws(':', date(start_time), tag) SEPARATOR ';') as detail
from exam_record left join examination_info using(exam_id)
where year(start_time)=2021
group by uid
having complete_cnt >= 1 and incomplete_cnt BETWEEN 2 and 4
order by incomplete_cnt DESC

null和空值得区别

1、空值不占空间，NULL值占空间。当字段不为NULL时，也可以插入空值。

2、当使用 IS NOT NULL 或者 IS NULL 时，只能查出字段中没有不为NULL的或者为 NULL 的，不能查出空值。

3、判断NULL 用IS NULL 或者 is not null,SQL 语句函数中可以使用IFNULL()函数来进行处理，判断空字符用 =’‘或者<>’'来进行处理。

4、在进行count()统计某列的记录数的时候，如果采用的NULL值，会别系统自动忽略掉，但是空值是会进行统计到其中的。

5、MySql中如果某一列中含有NULL，那么包含该列的索引就无效了。这一句不是很准确。

6、实际到底是使用NULL值还是空值(’’)，根据实际业务来进行区分。个人建议在实际开发中如果没有特殊的业务场景，可以直接使用空值。

题目参考:https://blog.csdn.net/fashion2014/article/details/78826299?ops_request_misc=%257B%2522request%255Fid%2522%253A%2522163132590216780269843900%2522%252C%2522scm%2522%253A%252220140713.130102334…%2522%257D&request_id=163132590216780269843900&biz_id=0&utm_medium=distribute.pc_search_result.none-task-blog-2_alltop_positive~default-1-78826299.pc_search_result_hbase_insert&utm_term=sql&spm=1018.2226.3001.4187

你可能感兴趣的:(SQL,sql,数据库)

Java架构师成长之路 hweiyu00 分享 spring 微服务 spring cloud java
概述本教程主要从6个方面，全面讲解Java技术栈的知识。1.性能调优深入理解MySQL底层原理、索引逻辑，数据结构与算法。使用Explain进行优化分析MVCC原理剖析日志机制解析2.框架源码掌握Spring底层原理带你手写一个Spring解析IOC、AOP源码、以及事务原理3.并发编程剖析Java底层锁机制CAS、JUC工具使用、AQS源码分析以及并发的集合类的讲解4.分布式开发剖析分布式中使用
sqlmap笔记君如尘网络安全-渗透笔记笔记
1.运行环境sqlmap是用Python编写的，因此首先需要确保你的系统上安装了Python。sqlmap支持Python2.6、2.7和Python3.4及以上版本。2.常用命令通用格式：bythonsqlmap.py-r注入点地址--参数-rpost请求-uget请求--level=测试等级--risk=测试风险-v显示详细信息级别-p针对某个注入点注入-threads更改线程数，加速--ba
Java面试高频问题深度解析：JVM、锁机制、SQL优化与并发处理 Debug Your Career 面试 java 面试 jvm
问题列表Java中如何实现一个工作流引擎？Bean的作用域有哪些？JVM中的锁机制是如何工作的？三个方法分别被synchronized锁住，方法a调用方法b，b能获取到a的锁吗？会有什么问题？SQL优化时，EXPLAIN中需要关注哪些关键点？什么是覆盖索引？SELECT*一定不会命中索引吗？SELECT*和SELECT全字段在性能上有区别吗？什么是回表？它与索引有什么关系？100万数据分给10个线
binlog和redolog 重生之我在成电转码 java mysql 日志
好的！这两个是MySQL面试核心知识点，下面详细解释：✅一、概念区分内容binlog（归档日志）redolog（重做日志）属于MySQL层（Server层）InnoDB存储引擎层作用记录所有修改数据库的数据操作（逻辑日志）保障事务的持久性（崩溃后可恢复数据）存储内容SQL语句或事件（INSERT、UPDATE、DELETE）物理页修改（物理日志）写入时机执行完SQL后写入执行SQL时先写入落盘时机
【读点论文】Chain Replication for Supporting High Throughput and Availability 寻雾&启示分布式系统论文阅读
在分布式系统中，强一致性往往和高可用、高吞吐是矛盾的。比如传统的关系型数据库，其保证了强一致性，但往往牺牲了可用性和吞吐量。而像NoSQL数据库，虽然其吞吐量、和扩展性很高，但往往只支持最终一致性，无法保证强一致性。由此ChainReplicationforSupportingHighThroughputandAvailability提出了链式复制协议，旨在保证高吞吐、高可用的同时，支持数据的强一
spark explain如何使用 fzip Spark spark 执行计划
在Spark中，explain是分析SQL或DataFrame执行计划的核心工具，通过不同模式可展示查询优化和执行的详细信息，默认情况下，这个语句只提供关于物理计划的信息。以下是具体使用方法及不同模式的作用：1.explain的基本语法在Spark3.0及以上版本，explain支持多种模式参数，通过mode指定输出格式：#DataFrame调用方式df.explain(mode="simple"
【自建分布式数据库详细指南】（五）使用：常见API及使用问题大板牙花生分布式
延续前几篇文章，下面着重从一些基本的API讲讲从入门到习惯的常用方法，后续更新。USAGE1节点管理设置主节点，又成为协调节点SELECTcitus_set_coordinator_host('coord.example.com',5432);step1.创建节点select*frommaster_add_node('new-node',12345);step2.删除节点step3.新增节点后重新
【商城实战(55)】商城数据库备份：策略与实操指南奔跑吧邓邓子商城实战商城实战数据库备份 MySQL 策略与实操
【商城实战】专栏重磅来袭！这是一份专为开发者与电商从业者打造的超详细指南。从项目基础搭建，运用uniapp、ElementPlus、SpringBoot搭建商城框架，到用户、商品、订单等核心模块开发，再到性能优化、安全加固、多端适配，乃至运营推广策略，102章内容层层递进。无论是想深入钻研技术细节，还是探寻商城运营之道，本专栏都能提供从0到1的系统讲解，助力你打造独具竞争力的电商平台，开启电商实战
Flink sql-clinet 查询报错 lhfmqc sql-clinet 运行问题查询报错 flink
Flinksql-clinet查询报错运行后进行select'helloworld’报以下错误，couldnotexecutesqlstatementjava.net.NoRouteToHostException:Noroutetohost在关闭防火墙之后仍无法解决这个时候你需要进入flinkconf配置中查看flink-conf.yaml文件，查看jobmanager.rpc.address该地
不神话大模型，不做技术乌托邦，用"传统IT+AI积木"实现企业智能转型人工智能
一、开篇：AI革命的务实辩证法在技术狂热与落地鸿沟并存的AI时代，灵燕智能体开发平台提出"三轮驱动法则"：•不颠覆的智慧：MySQL、知识图谱库、MQ等传统中间件构成数字地基•不空想的创新：大模型仅承担"认知苦力"，在人类设计的思考链中定向发力•不取巧的工程：通过D2R映射、低代码工具、元数据治理实现可落地的智能装配二、核心价值：智能开发的工业流水线技术要素原子化拆解将复杂需求分解为可执行的"技术
程序代码篇---Pyqt的密码界面 Ronin-Lotus 程序代码篇上位机知识篇 pyqt 数据库 python ubuntu
文章目录前言一、代码二、代码解释2.1用户数据库定义2.2窗口初始化2.3认证逻辑2.5角色处理2.6错误处理优化2.7功能扩展说明2.7.1用户类型区分管理员普通用户其他用户2.7.2安全增强建议三、运行效果四、运行命令五、界面改进建议5.1密码显示5.2用户头像显示5.3输入框动画效果5.4加载进度显示5.5键盘快捷键前言本文简单介绍了在Ubuntu系统上使用Python的Pyqt创建密码登录
架构师必知必会系列：数据架构与数据管理 AI天才研究院 AI大模型企业级应用开发实战大数据人工智能语言模型 Java Python 架构设计
作者：禅与计算机程序设计艺术1.背景介绍数据架构与数据管理介绍数据架构是指用来定义企业数据的逻辑结构、物理存储结构和数据的流转过程。它由数据中心和IT平台、数据库、文件系统、网络、安全、计算资源等构成。其目的是为了满足业务需求、提升组织效率和降低成本。数据架构包括数据字典、元数据、数据模型、数据流、数据仓库、数据管道、数据服务等。在应用中，将数据按照其自身特性进行划分、分类、归档、清洗和加工，才能
如何进行PHP性能优化？破碎的天堂鸟 PHP学习 php 性能优化开发语言
PHP性能优化是一个复杂且多方面的过程，涉及从代码层面到服务器配置的多个方面。以下是一些关键的优化技巧和最佳实践：选择合适的数据结构（如数组、对象等）可以显著提高程序的运行效率。缓存是提升PHP性能的有效手段之一。可以通过页面缓存、数据缓存、内存缓存等方式来减少重复计算。例如，使用APC、Memcached或Redis进行内存缓存，或者利用文件系统进行数据缓存。使用索引、优化SQL查询语句以及使用
Spring事务失效的常见场景红云梦 spring java 数据库
1事务1.1数据库事务作为单个逻辑工作单元执行的一系列操作，要么完全执行，要么完全不执行1.2事务的四大特性（ACID）原子性(Atomicity)：要么成功，要么失败。一个事务内的所有SQL语句同步执行（依靠undo.log日志保证）一致性(Consistency)：事务前后总量不变，数据库完整性约束没有被破坏隔离性(Isolation)：一个事务执行不被其他事务干扰（锁+MVCC）持久性(Du
Rust + 时序数据库 TDengine：打造高性能时序数据处理利器涛思数据（TDengine）时序数据库 rust tdengine
引言：为什么选择TDengine与Rust？TDengine是一款专为物联网、车联网、工业互联网等时序数据场景优化设计的开源时序数据库，支持高并发写入、高效查询及流式计算，通过“一个数据采集点一张表”与“超级表”的概念显著提升性能。Rust作为一门系统级编程语言，近年来在数据库、嵌入式系统、分布式服务等领域迅速崛起，以其内存安全、高性能著称，与TDengine的高效特性天然契合，适合构建高可靠、高
时序数据库QuestDB在Winform窗体应用 ryan68888 时序数据库
以下是QuestDB在Winform使用的代码：//初始化privatevoidInit(){//创建数据库对象(用法和EFDappper一样通过new保证线程安全)SqlSugarClientDb=newSqlSugarClient(newConnectionConfig(){ConnectionString=“host=10.3.5.227;port=8812;username=admin;p
基于 MySQL 和 Spring Boot 的在线论坛管理系统设计与实现城南|阿洋-计算机从小白到大神 mysql spring boot 数据库
markdownCopy✌全网粉丝20W+,csdn特邀作者、博客专家、CSDN[新星计划]导师、java领域优质创作者,博客之星、掘金/华为云/阿里云/InfoQ等平台优质作者、专注于Java、pyhton、机器学习技术领域和毕业项目实战✌哈喽兄弟们，好久不见哦～最近整理了一下之前写过的一些小项目/毕业设计。发现还是有很多存货的，想一想既然放在电脑里面也吃灰，那么还不如分享出去，没准还可以帮助到
[开题报告]Springboot高校图书管理系统设计与实现lq627计算机毕业设计卓越计算机毕设课程设计
本项目包含程序+源码+数据库+LW+调试部署环境，文末可获取一份本项目的java源码和数据库参考。开题报告研究背景：随着高校图书馆的规模不断扩大和信息化程度的提高，传统的手工管理方式已经无法满足日益增长的图书馆资源管理需求。图书管理系统的设计与实现成为了解决这一问题的关键。通过引入计算机技术和信息管理系统，可以提高图书馆的管理效率和服务质量，为读者提供更便捷、高效的借阅体验。研究意义：图书管理系统
【最低2万搞定！】10万双枪充电桩平台神级配置：服务器成本直降80%+日志/数据库存储全拆解！慧知开源充电桩平台！！！必看攻略文慧的科技江湖更新日志 -(慧哥)慧知充电桩平台服务器数据库开源直流充电桩充电桩 spring cloud 架构
10万台充电桩设备双枪，需要最小的服务器配置？服务器费用控制2-3万，服务器日志产生多少g,数据库订单数据产生多少g!-慧知开源充电桩平台一、服务器配置方案及逻辑（阿里云）1.需求分析设备规模：10万台双枪充电桩，理论最大并发连接数为20万（每个枪独立通信）。请求类型：心跳包（高频）、充电启停、支付、状态上报等，假设平均每秒请求量约5,000QPS。费用目标：总成本控制在2-3万元/月（按包年包月
SQL自学：怎么创建视图 m0_74823471 面试学习路线阿里巴巴 sql 数据库
在SQL中，视图是一种虚拟表，它是基于一个或多个表的查询结果集。视图并不实际存储数据，而是在每次查询时动态生成结果。一、创建视图的语法（以MySQL为例）CREATEVIEWview_nameASSELECTcolumn1,column2,...FROMtable_nameWHEREcondition;view_name：是要创建的视图的名称。column1,column2,...：要在视图中显示
SQL数据更新小王Jacky 数据库学习 sql 数据库
1.插入数据**(1)插入单个元组**--向学生表S插入一条学生记录INSERTINTOS(SNO,SN,SEX,AGE,DEPT)VALUES('S001','张三','男',20,'计算机系');--向选课表SC插入一条选课记录INSERTINTOSC(SNO,CNO,SCORE)VALUES('S001','C001',85);**(2)插入多个元组**--向课程表C插入多条课程记录INSE
pythontype函数使用_Python astype(np.float)函数使用方法解析 weixin_39870238 pythontype函数使用
Pythonastype(np.float)函数使用方法解析我的数据库如图结构我取了其中的nameagenr，做成array，只要所取数据存在str型，那么取出的数据，全部转化为str型，也就是array阵列的元素全是str，不管数据库定义的是不是int型。那么问题来了，取出的数据代入公式进行计算的时候，就会类型不符，这是就用到astype(np.float)代码如下importpymysqlim
如何安全删除MySQL字段？从原理到实战的保姆级指南！小丁学Java 产品资质管理系统安全 mysql 数据库
从MyISAM到InnoDB：解锁MySQL在线删除字段的终极指南真实案例：一次失败的DDL操作引发的思考场景复现：某业务表invite_codes需要删除invitor字段，执行以下命令时触发报错：ALTERTABLEinvite_codesDROPCOLUMNinvitor,ALGORITHM=INPLACE;--报错信息：ALGORITHM=INPLACEisnotsupportedfort
mysql与mariadb版本对应_MySQL与MariaDB及各种版本杂谈 weixin_39616416
MySQL1.MySQLCommunityServer社区版本，开源免费，但不提供官方技术支持。(我们通常使用的MySQL版本)2.MySQLEnterpriseEdition企业版本，需付费，可以试用30天。3.MySQLCluster集群版，开源免费。可将几个MySQLServer封装成一个Server。4.MySQLClusterCGE高级集群版，需付费。5.MySQLWorkbench(G
SQL 错误 [1064] [42000] You have an error in your SQL syntax； check the manual that corresponds to yo web14786210723 sql 数据库
在为用户指定数据的时候，报错了，SQL错误[1064][42000]:YouhaveanerrorinyourSQLsyntax;checkthemanualthatcorrespondstoyoGRANTALLPRIVILEGESONjeecg-boot.*TO'jeecgoot'@'%';ERROR1064(42000):YouhaveanerrorinyourSQLsyntax;checkt
向量数据库技术系列三-Chroma介绍恰恰虎 chromadb 数据库向量
一、前言Chroma是一个开源的AI原生向量数据库，旨在帮助开发者更加便捷地构建大模型应用，将知识、事实和技能等文档整合进大型语言模型（LLM）中。它提供了简单易用的API，支持存储嵌入及其元数据、嵌入文档和查询、搜索嵌入等功能。主要有以下特点:轻量级：Chroma是一个基于向量检索库实现的轻量级向量数据库，不需要复杂的配置和大规模基础设施支持，非常适合小型或中型项目。易用性：提供简单的API，易
新手如何使用 Milvus 巴依老爷coder 数据库 milvus 向量数据库数据库
一文带你入门Milvus：详细指南新手如何使用Milvus：详细指南一、Milvus简介主要特点应用领域二、安装Milvus安装DockerCompose基于DockerCompose安装Milvus服务端安装attu-可视化界面工具三、快速入门安装PythonSDK连接数据库方式1方式2（方式1的封装）数据库操作核心概念集合操作数据操作插入数据精准查询数据-get条件查询数据-query查询数据
MariaDB 和 MySQL 版本关联 java我跟你拼了数据库笔记 mariadb mysql 数据库数据库篇版本关联
MariaDB和MySQL是两个常用的关系型数据库管理系统（RDBMS），它们在很多方面非常相似，因为MariaDB是MySQL的一个分支。MariaDB和MySQL之间的版本关联可以通过以下几个方面来理解：1.历史背景MySQL:MySQL是一个开源的数据库管理系统，由MySQLAB开发，后来被SunMicrosystems收购，再之后被Oracle收购。MariaDB:MariaDB是MySQ
因为mysql 8新的认证插件导致主从复制的IO线程失败库海无涯 mysql
1、错误信息Last_IO_Error:errorconnectingtomaster'[email protected]:3306'-retry-time:60retries:1message:Authenticationplugin'caching_sha2_password'reportederror:Authenticationrequiressecureconnection.2、
MySQL HA的全新篇章：Semisynchronous Replication迁移至InnoDB Cluster的实用指南库海无涯 MySQL mysql
1、概述临时接了一个搭建InnoDBCluster的活儿，客户给我说是有数据的，我当时想这不是非常简单吗？干活儿的时候，才发现并没有这么简单，接手的时候发现是SemisynchronousReplication的环境，然后把从库切换成InnoDBCluster的primary。2、环境复现2.1、从库5.140信息采集mysql>showreplicastatus\G***************
多线程编程之join()方法周凡杨 java JOIN 多线程编程线程
现实生活中，有些工作是需要团队中成员依次完成的，这就涉及到了一个顺序问题。现在有T1、T2、T3三个工人，如何保证T2在T1执行完后执行，T3在T2执行完后执行？问题分析：首先问题中有三个实体，T1、T2、T3，因为是多线程编程，所以都要设计成线程类。关键是怎么保证线程能依次执行完呢？ Java实现过程如下： public class T1 implements Runnabl
java中switch的使用 bingyingao java enum break continue
java中的switch仅支持case条件仅支持int、enum两种类型。用enum的时候，不能直接写下列形式。 switch (timeType) { case ProdtransTimeTypeEnum.DAILY: break; default: br
hive having count 不能去重 daizj hive 去重 having count 计数
hive在使用having count()是，不支持去重计数 hive (default)> select imei from t_test_phonenum where ds=20150701 group by imei having count(distinct phone_num)>1 limit 10; FAILED: SemanticExcep
WebSphere对JSP的缓存周凡杨 WAS JSP 缓存
对于线网上的工程，更新JSP到WebSphere后，有时会出现修改的jsp没有起作用，特别是改变了某jsp的样式后，在页面中没看到效果，这主要就是由于websphere中缓存的缘故，这就要清除WebSphere中jsp缓存。要清除WebSphere中JSP的缓存，就要找到WAS安装后的根目录。现服务
设计模式总结朱辉辉33 java 设计模式
1.工厂模式 1.1 工厂方法模式 (由一个工厂类管理构造方法) 1.1.1普通工厂模式(一个工厂类中只有一个方法) 1.1.2多工厂模式(一个工厂类中有多个方法) 1.1.3静态工厂模式(将工厂类中的方法变成静态方法) &n
实例：供应商管理报表需求调研报告老A不折腾 finereport 报表系统报表软件信息化选型
引言随着企业集团的生产规模扩张，为支撑全球供应链管理，对于供应商的管理和采购过程的监控已经不局限于简单的交付以及价格的管理，目前采购及供应商管理各个环节的操作分别在不同的系统下进行，而各个数据源都独立存在，无法提供统一的数据支持；因此，为了实现对于数据分析以提供采购决策，建立报表体系成为必须。业务目标 1、通过报表为采购决策提供数据分析与支撑 2、对供应商进行综合评估以及管理，合理管理和
mysql 林鹤霄
转载源：http://blog.sina.com.cn/s/blog_4f925fc30100rx5l.html mysql -uroot -p ERROR 1045 (28000): Access denied for user 'root'@'localhost' (using password: YES) [root@centos var]# service mysql
Linux下多线程堆栈查看工具(pstree、ps、pstack) aigo linux
原文：http://blog.csdn.net/yfkiss/article/details/6729364 1. pstree pstree以树结构显示进程$ pstree -p work | grep adsshd(22669)---bash(22670)---ad_preprocess(4551)-+-{ad_preprocess}(4552) &n
html input与textarea 值改变事件 alxw4616 JavaScript
// 文本输入框(input) 文本域(textarea)值改变事件 // onpropertychange(IE) oninput(w3c) $('input,textarea').on('propertychange input', function(event) { console.log($(this).val()) });
String类的基本用法百合不是茶 String
字符串的用法; // 根据字节数组创建字符串 byte[] by = { 'a', 'b', 'c', 'd' }; String newByteString = new String(by); 1,length() 获取字符串的长度 &nbs
JDK1.5 Semaphore实例 bijian1013 java thread java多线程 Semaphore
Semaphore类一个计数信号量。从概念上讲，信号量维护了一个许可集合。如有必要，在许可可用前会阻塞每一个 acquire()，然后再获取该许可。每个 release() 添加一个许可，从而可能释放一个正在阻塞的获取者。但是，不使用实际的许可对象，Semaphore 只对可用许可的号码进行计数，并采取相应的行动。 S
使用GZip来压缩传输量 bijian1013 java GZip
启动GZip压缩要用到一个开源的Filter：PJL Compressing Filter。这个Filter自1.5.0开始该工程开始构建于JDK5.0，因此在JDK1.4环境下只能使用1.4.6。 PJL Compressi
【Java范型三】Java范型详解之范型类型通配符 bit1129 java
定义如下一个简单的范型类， package com.tom.lang.generics; public class Generics<T> { private T value; public Generics(T value) { this.value = value; } }
【Hadoop十二】HDFS常用命令 bit1129 hadoop
1. 修改日志文件查看器 hdfs oev -i edits_0000000000000000081-0000000000000000089 -o edits.xml cat edits.xml 修改日志文件转储为xml格式的edits.xml文件，其中每条RECORD就是一个操作事务日志 2. fsimage查看HDFS中的块信息等 &nb
怎样区别nginx中rewrite时break和last ronin47
在使用nginx配置rewrite中经常会遇到有的地方用last并不能工作，换成break就可以，其中的原理是对于根目录的理解有所区别，按我的测试结果大致是这样的。 location / { proxy_pass http://test;
java-21.中兴面试题输入两个整数 n 和 m ，从数列 1 ， 2 ， 3.......n 中随意取几个数 , 使其和等于 m bylijinnan java
import java.util.ArrayList; import java.util.List; import java.util.Stack; public class CombinationToSum { /* 第21 题 2010 年中兴面试题编程求解：输入两个整数 n 和 m ，从数列 1 ， 2 ， 3.......n 中随意取几个数 , 使其和等
eclipse svn 帐号密码修改问题开窍的石头 eclipse SVN svn帐号密码修改
问题描述： Eclipse的SVN插件Subclipse做得很好，在svn操作方面提供了很强大丰富的功能。但到目前为止，该插件对svn用户的概念极为淡薄，不但不能方便地切换用户，而且一旦用户的帐号、密码保存之后，就无法再变更了。解决思路：删除subclipse记录的帐号、密码信息，重新输入
[电子商务]传统商务活动与互联网的结合 comsci 电子商务
某一个传统名牌产品，过去销售的地点就在某些特定的地区和阶层，现在进入互联网之后，用户的数量群突然扩大了无数倍，但是，这种产品潜在的劣势也被放大了无数倍，这种销售利润与经营风险同步放大的效应，在最近几年将会频繁出现。。。。如何避免销售量和利润率增加的
java 解析 properties-使用 Properties-可以指定配置文件路径 cuityang java properties
#mq xdr.mq.url=tcp://192.168.100.15:61618; import java.io.IOException; import java.util.Properties; public class Test { String conf = "log4j.properties"; private static final
Java核心问题集锦 darrenzhu java 基础核心难点
注意，这里的参考文章基本来自Effective Java和jdk源码 1)ConcurrentModificationException 当你用for each遍历一个list时，如果你在循环主体代码中修改list中的元素，将会得到这个Exception，解决的办法是： 1)用listIterator, 它支持在遍历的过程中修改元素， 2)不用listIterator, new一个
1分钟学会Markdown语法 dcj3sjt126com markdown
markdown 简明语法基本符号 *,-,+ 3个符号效果都一样，这3个符号被称为 Markdown符号空白行表示另起一个段落 `是表示inline代码，tab是用来标记代码段，分别对应html的code，pre标签换行单一段落( <p>) 用一个空白行连续两个空格会变成一个 <br> 连续3个符号，然后是空行
Gson使用二（GsonBuilder） eksliang json gson GsonBuilder
转载请出自出处：http://eksliang.iteye.com/blog/2175473 一.概述 GsonBuilder用来定制java跟json之间的转换格式二.基本使用实体测试类：温馨提示：默认情况下@Expose注解是不起作用的,除非你用GsonBuilder创建Gson的时候调用了GsonBuilder.excludeField
报ClassNotFoundException: Didn't find class "...Activity" on path: DexPathList gundumw100 android
有一个工程，本来运行是正常的，我想把它移植到另一台PC上，结果报： java.lang.RuntimeException: Unable to instantiate activity ComponentInfo{com.mobovip.bgr/com.mobovip.bgr.MainActivity}: java.lang.ClassNotFoundException: Didn't f
JavaWeb之JSP指令 ihuning javaweb
要点 JSP指令简介 page指令 include指令 JSP指令简介 JSP指令（directive）是为JSP引擎而设计的，它们并不直接产生任何可见输出，而只是告诉引擎如何处理JSP页面中的其余部分。 JSP指令的基本语法格式： <%@ 指令属性名="
mac上编译FFmpeg跑ios 啸笑天 ffmpeg
1、下载文件：https://github.com/libav/gas-preprocessor，复制gas-preprocessor.pl到/usr/local/bin/下，修改文件权限：chmod 777 /usr/local/bin/gas-preprocessor.pl 2、安装yasm-1.2.0 curl http://www.tortall.net/projects/yasm
sql mysql oracle中字符串连接 macroli oracle sql mysql SQL Server
有的时候，我们有需要将由不同栏位获得的资料串连在一起。每一种资料库都有提供方法来达到这个目的： MySQL: CONCAT() Oracle: CONCAT(), || SQL Server: + CONCAT() 的语法如下： Mysql 中 CONCAT(字串1, 字串2, 字串3, ...): 将字串1、字串2、字串3，等字串连在一起。请注意，Oracle的CON
Git fatal: unab SSL certificate problem: unable to get local issuer ce rtificate qiaolevip 学习永无止境每天进步一点点 git 纵观千象
// 报错如下： $ git pull origin master fatal: unable to access 'https://git.xxx.com/': SSL certificate problem: unable to get local issuer ce rtificate // 原因：由于git最新版默认使用ssl安全验证，但是我们是使用的git未设
windows命令行设置wifi surfingll windows wifi 笔记本wifi
还没有讨厌无线wifi的无尽广告么，还在耐心等待它慢慢启动么教你命令行设置笔记本电脑wifi： 1、开启wifi命令 netsh wlan set hostednetwork mode=allow ssid=surf8 key=bb123456 netsh wlan start hostednetwork pause 其中pause是等待输入，可以去掉 2、
Linux（Ubuntu）下安装sysv-rc-conf wmlJava linux ubuntu sysv-rc-conf
安装：sudo apt-get install sysv-rc-conf 使用：sudo sysv-rc-conf 操作界面十分简洁，你可以用鼠标点击，也可以用键盘方向键定位，用空格键选择，用Ctrl+N翻下一页，用Ctrl+P翻上一页，用Q退出。背景知识 sysv-rc-conf是一个强大的服务管理程序，群众的意见是sysv-rc-conf比chkconf
svn切换环境，重发布应用多了javaee标签前缀 zengshaotao javaee
更换了开发环境，从杭州，改变到了上海。svn的地址肯定要切换的，切换之前需要将原svn自带的.svn文件信息删除，可手动删除，也可通过废弃原来的svn位置提示删除.svn时删除。然后就是按照最新的svn地址和规范建立相关的目录信息，再将原来的纯代码信息上传到新的环境。然后再重新检出，这样每次修改后就可以看到哪些文件被修改过，这对于增量发布的规范特别有用。检出