我们经常需要汇总数据而不用把它们实际检索出来,为此MySQL提供了专门的函数。使用这些函数,MySQL查询可用于数据检索,以便分析和报表生成。
聚集函数:运行在行组上,计算和返回单个值的函数。
AVG()通过对表中的行数计数并计算特定列值之和,求得该列的平均值。AVG()可用来返回所有列的平均值,也可以返回特定列或行的平均值。
mysql> SELECT AVG(prod_price) AS avg_price FROM products;
+-----------+
| avg_price |
+-----------+
| 16.133571 |
+-----------+
1 row in set (0.00 sec)
COUNT()函数进行计数。可利用COUNT()确定表中行的数目或符合特定条件的行的数目。
COUNT()函数有两种使用方式。
mysql> SELECT COUNT(*) AS num_cust FROM customers;
+----------+
| num_cust |
+----------+
| 5 |
+----------+
1 row in set (0.00 sec)
mysql> SELECT COUNT(cust_email) AS num_cust FROM customers;
+----------+
| num_cust |
+----------+
| 3 |
+----------+
1 row in set (0.00 sec)
MAX()返回指定列中的最大值。MAX()要求指定列名
mysql> SELECT MAX(prod_price) AS max_perice FROM products;
+------------+
| max_perice |
+------------+
| 55.00 |
+------------+
1 row in set (0.00 sec)
MIN()的功能正好与MAX()功能相反,它返回指定列的最小值。
mysql> SELECT MIN(prod_price) AS max_perice FROM products;
+------------+
| max_perice |
+------------+
| 2.50 |
+------------+zong'ji
1 row in set (0.00 sec)
SUM()用来返回指定列值的和(总计)。
mysql> SELECT SUM(quantity) AS items_ordered FROM orderitems WHERE order_num = 20005;
+---------------+
| items_ordered |
+---------------+
| 19 |
+---------------+
1 row in set (0.00 sec)
以上5个聚集函数可以如下使用:
mysql> SELECT AVG(DISTINCT prod_price) as AVG_PRICE FROM products WHERE vend_id = 1003;
+-----------+
| AVG_PRICE |
+-----------+
| 15.998000 |
+-----------+
1 row in set (0.00 sec)
SELECT
可根据需要包含多个聚集函数。
mysql> select COUNT(*) AS num_items, MIN(prod_price) AS price_min,MAX(prod_price) AS price_max,AVG(prod_price) AS price_avg FROM products;
+-----------+-----------+-----------+-----------+
| num_items | price_min | price_max | price_avg |
+-----------+-----------+-----------+-----------+
| 14 | 2.50 | 55.00 | 16.133571 |
+-----------+-----------+-----------+-----------+
1 row in set (0.00 sec)
分组允许把数据分为多个逻辑组,以便能对每个组进行聚集计算。
分组是在SELECT
语句的GROUP BY
子句中建立的。
mysql> SELECT vend_id,COUNT(*) AS num_prods FROM products GROUP BY vend_id;
+---------+-----------+
| vend_id | num_prods |
+---------+-----------+
| 1001 | 3 |
| 1002 | 2 |
| 1003 | 7 |
| 1005 | 2 |
+---------+-----------+
4 rows in set (0.00 sec)
上面的SELECT语句指定了两个列,vend_id包含产品供应商的ID,num_prods为计算字段(用COUNT(*)函数建立)。GROUP BY
子句指示MySQL按vend_id拍寻分组数据。这导致对每个vend_id而不是整个表计算num_prods一次。
GROUP BY
子句可以包含任意数目的列。这使得能对分组进行嵌套,为数据分组提供更细致的控制。GROUP BY
子句中嵌套了分组,数据将在最后规定的分组上进行汇总。换句话说,在建立分组时,指定的所有列都一起计算(所以不能从个别的列取回数据)。GROUP BY
子句中列出的每个列都必须是检索列或有效的表达式(但不能是聚集函数)。如果在SELECT
中使用表达式,则必须在GROUP BY
子句中指定相同的表达式。不能使用别名。SELECT
语句中的每个列都必须在GROUP BY
子句中给出。NULL
值,则NULL
将作为一个分组返回。如果列中有多行NULL
值,它们将分为一组。GROUP BY
子句必须出现在WHERE
子句之后,ORDER BY
子句之前。MySQL允许过滤分组,规定包括哪些分组,排除哪些分组。我们知道了WHERE
子句的作用,但是在这个例子中WHERE
不能完成任务,因为WHERE
过滤指定的是行而不是分组。事实上,WHERE
没有分组的概念。MySQL为此目的提供了另外的子句,那就是HAVING
子句。目前为止所学过的所有类型的WHERE
子句都可以用HAVING
来替代。唯一的差别是WHERE
过滤行,而HAVING
过滤分组。
mysql> SELECT cust_id,COUNT(*) AS orders FROM orders GROUP BY cust_id HAVING COUNT(*) >=2 ;
+---------+--------+
| cust_id | orders |
+---------+--------+
| 10001 | 2 |
+---------+--------+
1 row in set (0.00 sec)
mysql> SELECT vend_id,COUNT(*) AS num_Prods FROM products WHERE prod_price >= 10 GROUP BY vend_id HAVING COUNT(*) >= 2;
+---------+-----------+
| vend_id | num_Prods |
+---------+-----------+
| 1003 | 4 |
| 1005 | 2 |
+---------+-----------+
2 rows in set (0.00 sec)
mysql> SELECT vend_id,COUNT(*) AS num_Prods FROM products GROUP BY vend_id HAVING COUNT(*) >= 2;
+---------+-----------+
| vend_id | num_Prods |
+---------+-----------+
| 1001 | 3 |
| 1002 | 2 |
| 1003 | 7 |
| 1005 | 2 |
+---------+-----------+
4 rows in set (0.01 sec)
虽然GROUP BY
和ORDER BY
经常完成相同的工作,但它们是非常不同的。
我们发现用GROUP BY
分组的数据确实是以分组顺序输出,但情况并不是这样,他并不是SQL规范所要求的。此外,用户也可能会要求以不同于分组的顺序排序,仅因你以某种方式分组数据,并不表示你需要以相同的方式排序输出。应该提供明确的ORDER BY
子句,即使其效果等同于GROUP BY
子句也是如此。
mysql> SELECT order_num, SUM(quantity*item_price) AS ordertotal FROM orderitems GROUP BY order_num HAVING SUM(quantity*item_price) >= 50;
+-----------+------------+
| order_num | ordertotal |
+-----------+------------+
| 20005 | 149.87 |
| 20006 | 55.00 |
| 20007 | 1000.00 |
| 20008 | 125.00 |
+-----------+------------+
4 rows in set (0.00 sec)
mysql> SELECT order_num, SUM(quantity*item_price) AS ordertotal FROM orderitems GROUP BY order_num HAVING SUM(quantity*item_price) >= 50 ORDER BY ordertotal;
+-----------+------------+
| order_num | ordertotal |
+-----------+------------+
| 20006 | 55.00 |
| 20008 | 125.00 |
| 20005 | 149.87 |
| 20007 | 1000.00 |
+-----------+------------+
4 rows in set (0.00 sec)