ORACLE 高级子查询

:: 写多列子查询
:: 在返回空值时描述并解释子查询的行为
:: 写一个在 FROM 子句中的子查询
:: SQL 中使用分级子查询
:: 描述能够用相关子查询解决的问题类型
:: 写相关子查询
:: 用相关子查询更新和删除行
:: 使用 EXISTS 和 NOT EXISTS 操作
:: 使用 WITH 子句

Lesson Aim
   In this lesson, you learn how to write multiple-column subqueries and subqueries in the FROM clause of a SELECT statement. You also learn how to solve problems by using scalar, correlated subqueries and the WITH clause.



			什么是子查询?

 一个子查询是一个嵌入 SELECT 语句中的另一个 SQL 语句的子句


				select ...
		   主查询---->  from   ...
				where  ...
 					(select ...
					 from   ...      <-----子查询
					 where  ...)


What Is a Subquery?

  A subquery is a SELECT statement that is embedded in a clause of another SQL statement, called the parent statement.

内查询把查询结果给外查询
 The subquery (inner query) returns a value that is used by the parent statement. Using a nested subquery is equivalent to performing two sequential queries and using the result of the inner query as the search value in the outer query (main query).

  Subqueries can be used for the following purposes: 
    :: To provide values for conditions in WHERE, HAVING, and START WITH clauses of SELECT statements
    :: To define the set of rows to be inserted into the target table of an INSERT or CREATE TABLE statement	/././././
    :: To define the set of rows to be included in a view or snapshot in a CREATE VIEW or CREATE SNAPSHOT statement	/./././同上
    :: To define one or more values to be assigned to existing rows in an UPDATE statement			////同上
    :: To define a table to be operated on by a containing query. (You do this by placing the subquery in the FROM clause. This can be done in INSERT, UPDATE, and DELETE statements as well.)		/,.,.FROM子查询

  Note: A subquery is evaluated once for the entire parent statement.
先执行内查询,返回值给外查询,再执行主查询.




				子查询

SELECT	select_list
FROM	table
WHERE	expr operator (SELECT select_list
		         FROM	  table);

:: 子查询 (内嵌查询) 在主查询中执行一次
:: 子查询的结果被用于主查询 (外查询)

Subqueries

  You can build powerful statements out of simple ones by using subqueries. Subqueries can be very useful when you need to select rows from a table with a condition that depends on the data in the table itself or some other table. Subqueries are very useful for writing SQL statements that need values based on one or more unknown conditional values.

 In the syntax:			././././.
	operator	includes a comparison operator such as >, =, or IN

  Note: Comparison operators fall into two classes: (比较运算)
	single-row operators (>, =, >=, <, <>, <=)
	multiple-row operators (IN, ANY, ALL). 

 The subquery is often referred to as a nested SELECT, sub-SELECT, or inner SELECT statement. The inner and outer queries can retrieve data from either the same table or different tables.(叫做内嵌select,子select,内部select)


			使用子查询

SELECT last_name
FROM   employees         10500
WHERE  salary >    <-------------|
                (SELECT salary   |
                 FROM   employees
                 WHERE  employee_id = 149) ;



Using a Subquery
  			//对上面的解释
  In the example in the slide, the inner query returns the salary of the employee with employee number 149. The outer query uses the result of the inner query to display the names of all the employees who earn more than this amount.


Example:

 Display the names of all employees who earn less than the average salary in the company.

   SELECT last_name, job_id, salary
   FROM   employees
   WHERE  salary < (SELECT AVG(salary)
                    FROM   employees);


			多列子查询

  主查询的每行与一个多行多列子查询的值比较

Main query				 <----------|
WHERE(manager_id,department_id) IN	<-------|   |
						|   |
					 <-----||   |
					       ||   |
		Subquery		       ||   |
		100	90		_______||   |
		102	60		________|   |
		124	50		____________|


Multiple-Column Subqueries

 So far you have written single-row subqueries and multiple-row subqueries where only one column is returned by the inner SELECT statement and this is used to evaluate the expression in the parent select statement. If you want to compare two or more columns, you must write a compound WHERE clause using logical operators(如果想要比较两行或更多行你要使用逻辑运算符写一个混合的WHERE子句). Using multiple-column subqueries, you can combine duplicate WHERE conditions into a single WHERE clause.(使用混合列子查询可以把多个WHERE条件合到一个WHERE子句.)

  Syntax	
	SELECT column,column, ...
	FROM table
	WHERE (column,column, ...) IN
				   (SELECT column,column, ...
				    FROM table
				    WHERE condition);


   The graphic in the slide illustrates that the values of the MANAGER_ID and DEPARTMENT_ID from the main query are being compared with the MANAGER_ID and DEPARTMENT_ID  values retrieved by the subquery. Since the number of columns that are being compared are more than one, the example qualifies as a multiple-column subquery.



				列比较

在一个多列子查询中的列比较能够被:
:: 成对地比较
:: 非成对的比较 

Pairwise versus Nonpairwise Comparisons成对,非成对比较
   
   Column comparisons in a multiple-column subquery can be pairwise comparisons or nonpairwise comparisons. 
			/././././.在select语句的每个条件行都要有相同的列.
   In the example on the next slide, a pairwise comparison was executed in the WHERE clause. Each candidate row in the SELECT statement must have both the same MANAGER_ID column and the DEPARTMENT_ID as the employee with the EMPLOYEE_ID 178 or 174. 
  
   A multiple-column subquery can also be a nonpairwise comparison. In a nonpairwise comparison, each of the columns from the WHERE clause of the parent SELECT statement are individually compared to multiple values retrieved by the inner select statement. The individual columns can match any of the values retrieved by the inner select statement. But collectively, all the multiple conditions of the main SELECT statement must be satisfied for the row to be displayed. The example on the next page illustrates a nonpairwise comparison.


			成对比较子查询

   显示雇员的细节,这些雇员被同一个经理管理,并且,工作在同一个部门,具有 EMPLOYEE_ID 178 或 174
SELECT	employee_id, manager_id, department_id
FROM	employees
WHERE  (manager_id, department_id) IN
                      (SELECT manager_id, department_id
                       FROM   employees
                       WHERE  employee_id IN (178,174))
AND	employee_id NOT IN (178,174);

SQL> select manager_id,department_id
  2  from employees
  3  where employee_id in (178,174);

MANAGER_ID DEPARTMENT_ID
---------- -------------
       149            80
       149

SQL> SELECT     employee_id, manager_id, department_id
  2  FROM       employees
  3  WHERE  (manager_id, department_id) IN
  4                        (SELECT manager_id, department_id
  5                         FROM   employees
  6                         WHERE  employee_id IN (178,174))
  7  AND        employee_id NOT IN (178,174);

EMPLOYEE_ID MANAGER_ID DEPARTMENT_ID
----------- ---------- -------------
        179        149            80
        177        149            80
        176        149            80
        175        149            80

SQL> SELECT     employee_id, manager_id, department_id
  2  FROM       employees
  3  WHERE  (manager_id, department_id) IN
  4                        (SELECT manager_id, department_id
  5                         FROM   employees
  6                         WHERE  employee_id IN (178,174))
  7  ;

EMPLOYEE_ID MANAGER_ID DEPARTMENT_ID
----------- ---------- -------------
        179        149            80
        177        149            80
        176        149            80
        175        149            80
        174        149            80			/./././.

Pairwise Comparison Subquery
		// 
  The example in the slide is that of a multiple-column subquery because the subquery returns more than one column(子查询返回值多于一行). It compares the values in the MANAGER_ID column and the DEPARTMENT_ID column of each row in the EMPLOYEES table with the values in the MANAGER_ID column and the DEPARTMENT_ID column for the employees with the EMPLOYEE_ID 178 or 174.

  First, the subquery to retrieve the MANAGER_ID and DEPARTMENT_ID values for the employees with the EMPLOYEE_ID 178 or 174 is executed. These values are compared with the MANAGER_ID column and the DEPARTMENT_ID column of each row in the EMPLOYEES table. If the values match, the row is displayed. In the output,  the records of the employees with the EMPLOYEE_ID 178 or 174 will not be displayed. The output of the query in the slide follows.



			非成对比较子查询

    显示被同一个经理管理,具有 EMPLOYEE_ID 174 或 141 的雇员;并且,工作在同一个部门,具有 EMPLOYEE_ID 174 或 141 的雇员的详细信息
SELECT  employee_id, manager_id, department_id 	3/
FROM    employees
WHERE   manager_id IN 
                  (SELECT  manager_id		1/
                   FROM    employees
                   WHERE   employee_id IN (174,141))	
AND     department_id IN 
                  (SELECT  department_id	2/
                   FROM    employees
                   WHERE   employee_id IN (174,141))
AND	employee_id NOT IN(174,141);

返回的department_id值和manager_id值与departments表中的每一行进行比较.
要两个值同时都满足才display.

Nonpairwise Comparison Subquery

   The example shows a nonpairwise comparison of the columns. It displays the EMPLOYEE_ID, MANAGER_ID, and DEPARTMENT_ID of any employee whose manager ID matches any of the manager IDs of employees whose employee IDs are either 174 or 141 and DEPARTMENT_ID match any of the department IDs of employees whose employee IDs are either 174 or 141.

   First, the subquery to retrieve the MANAGER_ID values for the employees with the EMPLOYEE_ID 174 or 141 is executed. Similarly, the second subquery to retrieve the DEPARTMENT_ID values for the employees with the EMPLOYEE_ID 174 or 141 is executed. the retrived values of the MANAGER_ID and DEPARTMENT_ID columns are compared with the MANAGER_ID and DEPARTMENT_ID column for each row in the EMPLOYEES table. If the MANAGER_ID column of the row in the EMPLOYEES table matches with any of the values of the MANAGER_ID retrieved by the inner subquery and if the DEPARTMENT_ID column of the row in the EMPLOYEES table matches with any of the values of the DEPARTMENT_ID retrieved by the second subquery, the record is displayed. The output of the query in the slide follows.

    EMPLOYEE_ID		MANAGER_ID		DEPARTMENT_ID
	    142		       124			 50
	    143		       124			 50
	    144		       124			 50
	    176		       149			 80



			在 FROM 子句中使用子查询

SELECT  a.last_name, a.salary, 		
        a.department_id, b.salavg	//必须是b 表中的'列'
FROM    employees a, (SELECT   department_id, 
                      AVG(salary) salavg
                      FROM     employees
                      GROUP BY department_id) b
WHERE   a.department_id = b.department_id
AND     a.salary > b.salavg;


Using a Subquery in the FROM Clause
 
   You can use a subquery in the FROM clause of a SELECT statement, which is very similar to how views are used. A subquery in the FROM clause of a SELECT statement is also called an inline view(内部视图). A subquery in the FROM clause of a SELECT statement defines a data source for that particular SELECT statement, and only that SELECT statement. The example on the slide displays employee last names, salaries, department numbers, and average salaries for all the employees who earn more than the average salary in their department. The subquery in the FROM clause is named b, and the outer query references the SALAVG column using this alias.
//注意表的别名.

				分级子查询表达式

:: 一个分级子查询表达式是一个从一行中返回确切的一个列值的子查询
:: 在 Oracle8i 中,分级子查询仅在一些有限情况的情况下被支持,例如:
     - SELECT 语句 (FROM 和 WHERE 子句)
     - 在一个 INSERT 语句中的VALUES 表中
:: 在 Oracle9i 中,分级子查询能够被用于:
     - DECODE  and CASE 的条件和表达式部分
     - 除 GROUP BY 以外所有的 SELECT 子句


Scalar Subqueries in SQL

   A subquery that returns exactly one column value from one row is also referred to as a scalar subquery(一个分级子查询表达式是一个从一行中返回确切的一个列值的子查询.) Multiple-column subqueries written to compare two or more columns, using a compound WHERE clause and logical operators, do not qualify as scalar subqueries.

././././././如果子查询返回0行,分级子查询表达式是NULL,如果子查询返回多行,Oracle Server返回ERROR. 
If the subquery returns 0 rows, the value of the scalar subquery expression is NULL. If the subquery returns more than one row, the Oracle Server returns an error.

  The value of the scalar subquery expression is the value of the select list item of the subquery. If the subquery returns 0 rows, the value of the scalar subquery expression is NULL. If the subquery returns more than one row, the Oracle Server returns an error. The Oracle Server has always supported the usage of a scalar subquery in a  SELECT statement. The usage of scalar subqueries has been enhanced in Oracle9i. You can now use scalar subqueries in:
   - Condition and expression part of DECODE and CASE
   - All clauses of SELECT except GROUP BY
   - In the left-hand side of the operator in the SET clause and WHERE clause of UPDATE statement 

  However, scalar subqueries are not valid expressions in the following places:
   - As default values for columns and hash expressions for clusters
   - In the RETURNING clause of DML statements
   - As the basis of a function-based index	基本函数索引
   - In GROUP BY clauses, CHECK constraints,  WHEN conditions///./././
   - HAVING clauses		./././
   - In START WITH and CONNECT BY clauses
   - In statements that are unrelated to queries, such as CREATE PROFILE


		分级子查询: 例子

  在CASE表达式的分级子查询.

SELECT employee_id, last_name,
       (CASE			20 <----|  
        WHEN department_id =		|
                (SELECT deaprtment_id FROM deaprtments
                 WHERE location_id=1800)
        THEN 'Canada' ELSE 'USA' END) location
FROM   employees;

...
...
EMPLOYEE_ID LAST_NAME                 LOCATI
----------- ------------------------- ------
        199 Grant                     USA
        200 Whalen                    USA
        201 Hartstein                 Canada
        202 Fay                       Canada
        203 Mavris                    USA
        204 Baer                      USA
...


   在 ORDER BY 子查询中的分级子查询
SELECT   employee_id, last_name
FROM     employees e			//两个表
ORDER BY	(SELECT department_name	//用deaprtments这个表的department_name排序
 	FROM departments d
 	WHERE e.department_id = d.department_id);


Scalar Subqueries: Examples

  The first example in the slide demonstrates(认证) that scalar subqueries can be used in CASE expressions. The inner query returns the value 20, which is the department ID of the department whose location ID is 1800. The CASE expression in the outer query uses the result of the inner query to display the employee ID, last names, and a value of Canada or USA, depending on whether the department ID of the record retrieved by the outer query is 20 or not.//是USA,or Canada取决于由外查询的department_id记录返回是不是20   /./././
//内连接是20 了,如果外连接是20则是Canada,如果不是20,则返回USA

The result of the preceding example follows:
  EMPLOYEE_ID	LAST_NAME	LOCATI
   	  100	King		USA
	  101	Kochhar		USA
	  102	De Haan		USA
	  103 	Hunold		USA
...
	  201	Hartstein	Canada
 	  202	Fay		Canada
	  206	Higgins		USA
	  206	Gietz 		USA


Scalar Subqueries: Examples (continued)

  The second example in the slide demonstrates that scalar subqueries can be used in the ORDER BY clause. The example orders the output based on the DEPARTMENT_NAME by matching the DEPARTMENT_ID from the EMPLOYEES table with the DEPARTMENT_ID from the DEPARTMENTS table. This comparison in done in a scalar subquery in the ORDER BY clause. The result of the the second example follows:




   The second example uses a correlated subquery. In a correlated subquery, the subquery references a column from a table referred to in the parent statement. Correlated subqueries are explained later in this lesson. 



			相关的子查询

   相关子查询被用于 row-by-row 处理。对外查询的每一行,每个子查询被执行一次./././.

				GET
		|---- >	candidate row from outer query    //从外查询中获得候选行.
		|		 |
		|	      EXECUTE
		|   inner query using candidate row value //从内查询中获得候选行.
		|		 |
 		|		USE
		|----values from inner query to qualify  //
			or disqualify candidate row

Correlated Subqueries

  The Oracle Server performs a correlated subquery when the subquery references a column from a table referred to in the parent statement. A correlated subquery is evaluated once for each row processed by the parent statement. The parent statement can be a SELECT, UPDATE, or DELETE statement. /././././

  Nested Subqueries Versus Correlated Subqueries
     With a normal nested subquery, the inner SELECT query runs first and executes once, returning values to be used by the main query(这是对于一般的查询). A correlated subquery, however, executes once for each candidate row considered by the outer query. In other words, the inner query is driven by the outer query.

  Nested Subquery Execution	/././.
    - The inner query executes first and finds a value.	先内查询
    - The outer query executes once, using the value from the inner query.//再外查询

  Correlated Subquery Execution	.,./././././
    - Get a candidate row (fetched by the outer query).//从外查询中获得行
    - Execute the inner query using the value of the candidate row. 
	用外查询获得的行执行内查询.
    - Use the values resulting from the inner query to qualify or disqualify the candidate.使用从内查询中返回的值限定或不限定行.
    - Repeat until no candidate row remains. //重复做直到没有行余下.


			相关子查询

SELECT 	column1,column2, ...
FROM 	table1 outer
WHERE 	column1 operator
		      ( SELECT	column1,column2
			FROM	table2
			WHERE	expr1=			//要有一个关联
				   outer.expr2);

    子查询参考在父查询中的表的一个列

Correlated Subqueries (continued)

   A correlated subquery is one way of reading every row in a table and comparing values in each row against related data. It is used whenever a subquery must return a different result or set of results for each candidate row considered by the main query. In other words, you use a correlated subquery to answer a multipart question whose answer depends on the value in each row processed by the parent statement. 

   The Oracle Server performs a correlated subquery when the subquery references a column from a table in the parent query. (当一个子查询参考父查询表返回的列.)

   Note: You can use the ANY and ALL operators in a correlated subquery. 


			使用相关子查询

   找出所有的雇员,他们挣的薪水高于该部门的平均薪水

SELECT last_name, salary, department_id
FROM   employees outer
WHERE  salary >		
	|--->	(SELECT AVG(salary)
		 FROM   employees
 		 WHERE  department_id =  
        		    outer.department_id) ;
		

					外查询中的行每被处理一次,内查询就求值一次


Using Correlated Subqueries

   The example in the slide determines which employees earn more than the average salary of their department. In this case, the correlated subquery specifically computes the average salary for each department.

   Because both the outer query and inner query use the EMPLOYEES table in the FROM clause, an alias is given to EMPLOYEES in the outer SELECT statement, for clarity. Not only does the alias make the entire SELECT statement more readable, but without the alias the query would not work properly, because the inner statement would not be able to distinguish the inner table column from the outer table column.

			使用相关子查询

    显示雇员的详细信息,这些雇员至少变换过两次工作

SELECT e.employee_id, last_name,e.job_id
FROM   employees e 
WHERE  2 <= (SELECT COUNT(*)
             FROM   job_history 
             WHERE  employee_id = e.employee_id);


Using Correlated Subqueries

			//对上面的例子进行分析
  The example in the slide displays the details of those employees who have switched jobs at least twice. The Oracle Server evaluates a correlated subquery as follows:
	1. Select a row from the table specified in the outer query. This will be the current candidate row.
	2. Store the value of the column referenced in the subquery from this candidate row. (In the example in the slide, the column referenced in the subquery is E.EMPLOYEE_ID.)	//从候选列中存储子查询中引用列的值,,子查询的引用列值:E.EMPLOYEE_ID

	3. Perform the subquery with its condition referencing the value from the outer query’s candidate row//计算内查询,将满足条件的count(*)找出来. (In the example in the slide, group function COUNT(*) is evaluated based on the value of the E.EMPLOYEE_ID column obtained in step 2.) e.employee_id的值从step 2得来.

	4. Evaluate the WHERE clause of the outer query on the basis of results of the subquery performed in step 3. This is determines if the candidate row is selected for output. (In the example, the  number of times an employee has switched jobs, evaluated by the subquery, is compared with 2 in the WHERE clause of the outer query. If the condition is satisfied, that employee record is displayed.)
//将选出来的count(*)与2对比,如果>=则显示,否则不显示.
	5. Repeat the procedure for the next candidate row of the table, and so on until all the rows in the table have been processed. 

  The correlation is established by using an element from the outer query in the subquery. In this example, the correlation is established by the statement EMPLOYEE_ID = E.EMPLOYEE_ID in which you compare EMPLOYEE_ID from the table in the subquery with the EMPLOYEE_ID from the table in the outer query.



			使用 EXISTS 操作

:: EXISTS 操作对在子查询的结果集中存在的行进行检验
:: 如果一个子查询行值被找到:
	- 在内查询中的搜索不再继续././././.
	- 条件被标记为 TRUE
:: 如果一个子查询行值未找到:
	- 条件被标记为 FALSE
	- 在内查询中的搜索继续


The EXISTS Operator

  With nesting SELECT statements, all logical operators are valid. In addition, you can use the EXISTS operator. This operator is frequently used with correlated subqueries to test whether a value retrieved by the outer query exists in the results set of the values retrieved by the inner query. If the subquery returns at least one row, the operator returns TRUE. If the value does not exist, it returns FALSE. 
如果子查询返回至少一行,则操作返回TRUE,如果没有值返回,则返回FALSE
Accordingly, NOT EXISTS tests whether a value retrieved by the outer query is not a part of  the results set of the values retrieved by the inner query.



			使用 EXISTS 操作

	查找至少有一个雇员的经理

SELECT employee_id, last_name, job_id, department_id
FROM   employees outer
WHERE  EXISTS ( SELECT 'X'	//如果返回X,则TRUE,否则FALSE.最后看是不是TRUE,即返回X
                 FROM   employees	的即为满足条件的.
                 WHERE  manager_id = 
                        outer.employee_id);
EMPLOYEE_ID LAST_NAME                 JOB_ID     DEPARTMENT_ID 
----------- ------------------------- ---------- ------------- 
        100 King                      AD_PRES               90
        101 Kochhar                   AD_VP                 90
        102 De Haan                   AD_VP                 90   
        103 Hunold                    IT_PROG               60  
        108 Greenberg                 FI_MGR               100 
        114 Raphaely                  PU_MAN                30
        120 Weiss                     ST_MAN                50
        121 Fripp                     ST_MAN                50
        122 Kaufling                  ST_MAN                50
        123 Vollman                   ST_MAN                50
        124 Mourgos                   ST_MAN                50
...

//   解析一下::
只要manager_id在employee_id中就显示,即是它的经理.

SQL> select distinct manager_id
  2  FROM   employees
  3  WHERE  manager_id IS NOT NULL;

MANAGER_ID
----------
       100
       101
       102
       103
       108
       114
       120
       121
       122
       123
       124
       145
       146
       147
       148
       149
       201
       205


SQL> select 'X' from dual;

'
-
X

   SELECT employee_id,last_name,job_id,department_id
   FROM   employees 
   WHERE  employee_id IN (SELECT manager_id
                          FROM   employees
                          WHERE  manager_id IS NOT NULL);

Using the EXISTS Operator
  
  使用下面的条件,当至少找到一个经理号和雇员号相匹配的记录时,EXISTS 操作确保在内查询中的搜索不再继续:
   WHERE manager_id = outer.employee_id.

  Note that the inner SELECT query does not need to return a specific value(内查询不必找到确切的值), so a constant(常量也可以选择) can be selected. From a performance standpoint, it is faster to select a constant than a column.

  Note: Having EMPLOYEE_ID in the SELECT clause of the inner query causes a table scan for that column. Replacing it with the literal X, or any constant, improves performance. This is more efficient than using the IN operator. 

  A IN construct can be used as an alternative for a EXISTS operator, as shown in the following example: 

   SELECT employee_id,last_name,job_id,department_id
   FROM   employees 
   WHERE  employee_id IN (SELECT manager_id
                          FROM   employees
                          WHERE  manager_id IS NOT NULL);


			使用 NOT EXISTS 操作

找出所有的没有任何雇员的部门

SELECT department_id, department_name
FROM departments d
WHERE NOT EXISTS (SELECT 'X'
                  FROM   employees
                  WHERE  department_id 	//要有一个联联
                         = d.department_id);

DEPARTMENT_ID DEPARTMENT_NAME
------------- ------------------------------
          120 Treasury
          130 Corporate Tax
          140 Control And Credit
...

可以得出以下是未选定行.
SQL> select * from employees
  2  where department_id=120;

未选定行

SQL> select * from employees
  2  where department_id=130;

未选定行
...

Using the NOT EXISTS Operator 

Alternative Solution 
 
 A NOT IN construct can be used as an alternative for a NOT EXISTS operator, as shown in the following example. 
   SELECT department_id, department_name
   FROM   departments 
   WHERE  department_id NOT IN (SELECT department_id
                                FROM   employees);

   However, NOT IN evaluates to FALSE if any member of the set is a NULL value. 
如果集合的任何成员是NULL值,NOT IN 的值是FALSE.因此,即使departments表中满足WHERE条件的行,你的查询将不会返回任何行.
Therefore, your query will not return any rows even if there are rows in the departments table that satisfy the WHERE condition. 


		相关 UPDATE

UPDATE table1 alias1
SET    column = (SELECT expression
                 FROM   table2 alias2
                 WHERE  alias1.column =    
                        alias2.column);

	用一个相关子查询来更新在一个表中的行,该表基于另一个表中的行

Correlated UPDATE
   In the case of the UPDATE statement, you can use a correlated subquery to update rows in one table based on rows from another table.


			相关UPDATE

:: 用一个附加的列来存储部门名称,反向规格化 EMPLOYEES 表
:: 用相关更新填充表

ALTER TABLE employees
ADD(department_name VARCHAR2(30));

UPDATE employees e
SET    department_name = 
              (SELECT department_name 
	       FROM   departments d
	       WHERE  e.department_id = d.department_id);	//要有一个关联


Correlated UPDATE (continued)

   The example in the slide denormalizes the EMPLOYEES table by adding a column to store the department_name and then populates the table by using a correlated update.

   Here is another example for a correlated update.

Problem Statement
  Use a correlated subquery to update rows in the EMPLOYEES table based on rows from the REWARDS table:

././././././
UPDATE employees
SET    salary = (SELECT employees.salary + rewards.pay_raise
                 FROM   rewards
                 WHERE  employee_id  =  employees.employee_id
                 AND   payraise_date = 
                      (SELECT MAX(payraise_date) 
                       FROM   rewards
                       WHERE  employee_id = employees.employee_id))
WHERE  employees.employee_id IN    
	(SELECT employee_id 
        FROM   rewards);

  This example uses the REWARDS table. The REWARDS table has the columns EMPLOYEE_ID, PAY_RAISE, and PAYRAISE_DATE. Every time an employee gets a pay raise, a record with the details of the employee ID, the amount of the pay raise, and the date of receipt of the pay raise is inserted into the REWARDS table. The REWARDS table can contain more than one record for an employee. The PAYRAISE _DATE column is used to identify the most recent pay raise received by an employee. 

 In the example, the SALARY column in the EMPLOYEES table is updated to reflect the latest pay raise received by the employee. This is done by adding the current salary of the employee with the corresponding pay raise from the REWARDS table. 



			相关 DELETE

DELETE FROM table1 alias1
 WHERE  column operator 
	(SELECT expression
 	 FROM   table2 alias2
 	 WHERE  alias1.column = alias2.column);
  用一个相关子查询删除表中的行,该表基于另一个表中的行

Correlated DELETE

  In the case of a DELETE statement, you can use a correlated subquery to delete only those rows that also exist in another table. If you decide that you will maintain only the last four job history records in the JOB_HISTORY table, then when an employee transfers to a fifth job, you delete the oldest JOB_HISTORY row by looking up the JOB_HISTORY table for the MIN(START_DATE)for the employee. The following code illustrates how the preceding operation can be performed using a correlated DELETE:

   DELETE FROM job_history JH
   WHERE  employee_id =
         (SELECT employee_id 
          FROM   employees E
          WHERE  JH.employee_id = E.employee_id	././././要有一个关联
          AND    start_date =
                (SELECT MIN(start_date)  
                 FROM   job_history JH
                 WHERE  JH.employee_id = E.employee_id)	//关联
                 AND 5 >  (SELECT COUNT(*)  
                           FROM   job_history JH
                           WHERE  JH.employee_id = E.employee_id	//关联
                           GROUP  BY employee_id
                           HAVING COUNT(*) >= 4));



			相关删除 DELETE

   用一个相关子查询删除哪些在 EMPLOYEES 表和 EMP_HISTORY 表中的 employee_id 列值相同的行

DELETE FROM employees E
WHERE employee_id =  
           (SELECT employee_id
            FROM   emp_history 
            WHERE  employee_id = E.employee_id);

Correlated DELETE (continued)

Example
Two tables are used in this example. They are:
	- The EMPLOYEES table, which gives details of all the current employees
	- The EMP_HISTORY table, which gives details of previous employees

  EMP_HISTORY contains data regarding previous employees, so it would be erroneous if the same employee’s record existed in both the EMPLOYEES and EMP_HISTORY tables. You can delete such erroneous records by using the correlated subquery shown in the slide.


				WITH子句

:: 当一个查询块在一个复杂的查询中出现多次时,使用 WITH 子句,能够在 SELECT 语句中多次使用相同查询块
:: WITH 子句取回查询块的结果,并且将它存在用户的临时表空间中
:: WITH 子句可以改善性能

The WITH clause

  Using the WITH clause, you can define a query block before using it in a query. The WITH clause (formally known as subquery_factoring_clause) enables you to reuse the same query block in a SELECT statement when it occurs more than once within a complex query. This is particularly useful when a query has many references to the same query block and there are joins and aggregations. 

  Using the WITH clause, you can reuse the same query when it is high cost to evaluate the query block and it occurs more than once within a complex query. Using the WITH clause, the Oracle Server retrieves the results of a query block and stores it in the user’s temporary tablespace. This can improve performance.

 WITH Clause Benefits
	- Makes the query easy to read
	- Evaluates a clause only once, even if it appears multiple times in the query, 	  	  thereby enhancing performance 
		

			WITH 子句: 例子

   用 WITH 子句,写一个查询来显示部门名称和该部门的合计薪水,哪些人的合计薪水高于各部门的平均薪水

WITH 
dept_costs  AS (
   SELECT  d.department_name, SUM(e.salary) AS dept_total
   FROM    employees e, departments d
   WHERE   e.department_id = d.department_id
   GROUP BY d.department_name),
avg_cost    AS (
   SELECT SUM(dept_total)/COUNT(*) AS dept_avg
   FROM   dept_costs)
SELECT * 
FROM   dept_costs 
WHERE  dept_total >
        (SELECT dept_avg 
         FROM avg_cost)
ORDER BY department_name;


  The  problem in the slide would require the following intermediate calculations:
1.	Calculate the total salary for every department, and store the result using a WITH clause.
2.	Calculate the average salary across departments, and store the result using a WITH clause.
3.	Compare the total salary calculated in the first step with the average salary calculated in the second step. If the total salary for a particular department is greater than the average salary across departments, display the department name and the total salary for that department. 


WITH Clause: Example (continued)

   The SQL code in the slide is an example of a situation in which you can improve performance and write SQL more simply by using the WITH clause. The query creates the query names DEPT_COSTS and AVG_COST and then uses them in the body of the main query. Internally, the WITH clause is resolved either as an in-line view or a temporary table. The optimizer chooses the appropriate resolution depending on the cost or benefit of temporarily storing the results of the WITH clause.


  Note: A subquery in the FROM clause of a SELECT statement is also called an in-line view.(内部视图)

 The output generated by the SQL code on the slide will be as follows:

DEPARTMENT_NAME                DEPT_TOTAL
------------------------------ ----------
Sales                              304500
Shipping                           156400


The WITH Clause Usage Notes
  -- It is used only with SELECT statements
  -- A query name is visible to all WITH element query blocks (including their subquery blocks) defined after it and the main query block itself (including its subquery blocks).
  -- When the query name is the same as an existing table name, the parser searches from the inside out, the query block name takes precedence over the table name.
  -- The WITH clause can hold more than one query. Each query is then separated by a comma.


			SUMMARY

在本课中, 您应该已经学会如何:
:: 返回多于一列的多列子查询
:: 多列比较可以成对或非成对地进行
:: 一个多列子查询也能够被用于一个 SELECT 语句的 FROM 子句
:: 分级子查询在 Oracle9i 中得到了增强
Summary

   You can use multiple-column subqueries to combine multiple WHERE conditions into a single WHERE clause. Column comparisons in a multiple-column subquery can be pairwise comparisons or non-pairwise comparisons. 

 You can use a subquery to define a table to be operated on by a containing query. 

  Oracle 9i enhances the the uses of scalar subqueries. Scalar subqueries can now be used in:
   : Condition and expression part of DECODE  and CASE
   : All clauses of SELECT except GROUP BY  
   : SET clause and WHERE clause of UPDATE statement


			小结
:: 无论何时一个子查询必须对每一个侯选行返回不同的结果,这时,相关子查询是有用的
:: EXISTS 操作是测试值的存在性的布尔操作
:: 相关子查询能够用于 SELECT, UPDATE, and DELETE 语句
:: 在 SELECT 语句中你能够通过 WITH 子句多次使用相同的查询块
Summary (continued)

   The Oracle Server performs a correlated subquery when the subquery references a column from a table referred to in the parent statement. A correlated subquery is evaluated once for each row processed by the parent statement. The parent statement can be a SELECT, UPDATE, or DELETE statement. Using the WITH clause, you can reuse the same query when it is costly to reevaluate the query block and it occurs more than once within a complex query.
			




			************分级取回数据**************

			目标

 完成本课后, 您应当能够执行下列操作:
:: 解释分级查询的概念
:: 创建一个树型结构的报告
:: 格式化分级数据
:: 从树型结构中去除分支 


Lesson Aim
  In this lesson, you learn how to use hierarchical queries to create tree-structured reports.(树型结构报告)


			 EMPLOYEES 表中的例子数据

Sample Data from the EMPLOYEES Table

   Using hierarchical queries, you can retrieve data based on a natural hierarchical relationship between rows in a table. A relational database does not store records in a hierarchical way关系型数据库不能以分等级的方式存储. However, where a hierarchical relationship exists between the rows of a single table, a process called tree walking enables the hierarchy to be constructed. A hierarchical query is a method of reporting, in order, the branches of a tree.

   Imagine a family tree with the eldest members of the family found close to the base or trunk of the tree and the youngest members representing branches of the tree. Branches can have their own branches, and so on. 

   A hierarchical query is possible when a relationship exists between rows in a table.
当在一个表中存在行行之间的关系,分等级查询是有可能的. For example, in the slide, you see that employees with the job IDs of AD_VP, ST_MAN, SA_MAN, and MK_MAN report directly to the president of the company. We know this because the MANAGER_ID column of these records contain the employee ID 100, which belongs to the president (AD_PRES). 

   Note: Hierarchical trees are used in various fields such as human genealogy (family trees), livestock (breeding purposes), corporate management (management hierarchies), manufacturing (product assembly), evolutionary research (species development), and scientific research.


				自然树结构

Natural Tree Structure
  
  The EMPLOYEES table has a tree structure representing the management reporting line. The hierarchy can be created by looking at the relationship between equivalent values in the EMPLOYEE_ID and MANAGER_ID columns. This relationship can be exploited by joining the table to itself(这些表自连接将可以使用.). The MANAGER_ID column contains the employee number of the employee’s manager.

  The parent-child relationship of a tree structure enables you to control:
	- The direction in which the hierarchy is walked
	- The starting point inside the hierarchy

  Note: The slide displays an inverted tree structure of the management hierarchy of the employees in the EMPLOYEES table.



			分等级查询

SELECT [LEVEL], column, expr...
FROM   table
[WHERE condition(s)]
[START WITH condition(s)]
[CONNECT BY PRIOR condition(s)] ;


WHERE条件:
expr comparison_operator(比较运算符) expr


Keywords and Clauses(子句和关键字)

  Hierarchical queries can be identified by the presence of the CONNECT BY and START WITH clauses.

  In the syntax:

	SELECT		Is the standard SELECT clause. 

	LEVEL 		For each row returned by a hierarchical query, the LEVEL 
			为每一行返回分等级查询,										pseudocolumn(列) returns 1 for a root row, 2 for a child of a 					root,and so on.

	/././././
	FROM table	Specifies the table, view, or snapshot containing the columns. 					You can select from only one table.(只能是一个表)

	WHERE		Restricts the rows returned by the query without affecting other 				rows of	the hierarchy. (返回没有影响其它分等级行的查询)

        condition	Is a comparison with expressions.(比较表达式条件)

	START WITH	Specifies the root rows of the hierarchy (where to start). This 				clause is required for a tree hierarchical query.
			指定层次的根行(从哪开始),这个子句对于树型分类查询是必须的.

	CONNECT BY	Specifies the columns in which the relationship between parent 					and child
			指定父子关系的列

	PRIOR		rows exist. This clause is required for a hierarchical query.
			行存在.对于分等级查询是必须的.
 The SELECT statement cannot contain a join or query from a view that contains a join.
SELECT操作不能包括连接,或从一个包含连接的视图中查询



				遍历树

起点

:: 指定必须满足的条件
:: 接受有效的条件

 START WITH column1=value

使用EMPLOYEES表,从名字是Kochhar的雇员开始
...START WITH last_name='Kochhar'


Walking the Tree(遍历树)

  The row or rows to be used as the root of the tree are determined by the START WITH clause(行,或行S将作为树根决定于START WITH子句). The START WITH clause can be used in conjunction with any valid condition.
START WITH子句在任何有效的条件连接使用

 Examples

 Using the EMPLOYEES table, start with King, the president of the company.
	... START WITH manager_id IS NULL

 Using the EMPLOYEES table, start with employee Kochhar. A START WITH condition can contain a subquery.
	... START WITH employee_id = (SELECT employee_id
                          	        FROM    employees
                        	        WHERE   last_name = 'Kochhar')

  If the START WITH clause is omitted(忽略), the tree walk is started with all of the rows in the table as root rows(树的遍历将从表中所有的行,作为根开始遍历). If a WHERE clause is used, the walk is started with all the rows that satisfy the WHERE condition. This no longer reflects a true hierarchy.
如果WHERE子句使用了,遍历将从所有的行开始,并且满足WHERE条件.这将不再返回树的层次.

 Note: The clauses CONNECT BY PRIOR and START WITH are not ANSI SQL standard.

  Instructor Note
    You may wish to add that multiple hierarchical outputs are generated if more than one row satisfies the START WITH condition.



			遍历树(有方向的查询)

CONNECT BY PRIOR column1=column2

从顶向下遍历,用employees表   父列是employee_id,子列是manager_id
...CONNECT BY PRIOR employee_id=manager_id

方向
	从顶向下	---->Column1=Parent Key 
			     Column2=Child Key	

	从底向上	---->Column1=Child Key
			     Column2=Parent Key 


Walking the Tree (continued)

   The direction of the query, whether it is from parent to child or from child to parent, is determined by the CONNECT BY PRIOR column placement. The PRIOR operator refers to the parent row(PRIOR操作涉及到父行). To find the children of a parent row, the Oracle Server evaluates the PRIOR expression for the parent row and the other expressions for each row in the table(为了找到父行中的子行,ORACLE SERVER将PRIOR作用于父行,并且其它的表达式操作将用于表中的其它行,行的条件为真则说明子行在父行中). Rows for which the condition is true are the children of the parent. The Oracle Server always selects children by evaluating the CONNECT BY condition with respect to a current parent row.

Examples

Walk from the top down(自顶向下) using the EMPLOYEES table. Define a hierarchical relationship in which the EMPLOYEE_ID value of the parent row is equal to the MANAGER_ID value of the child row.
	... CONNECT BY PRIOR employee_id = manager_id

 Walk from the bottom up using the EMPLOYEES table.
	 ... CONNECT BY PRIOR manager_id = employee_id

  The PRIOR operator does not necessarily need to be coded immediately following the CONNECT BY(PRIOR操作符并不是非要直接跟随在CONNECT BY后). Thus, the following CONNECT BY PRIOR clause gives the same result(与先前的例子有相同的结果) as the one in the preceding example.
	 ... CONNECT BY employee_id = PRIOR manager_id 

 Note: The CONNECT BY clause cannot contain a subquery. ././././.不能包含子查询



			遍历树:从底向上

SELECT employee_id, last_name, job_id, manager_id
FROM   employees
START  WITH  employee_id = 101
CONNECT BY PRIOR manager_id = employee_id ;	//从底向上

EMPLOYEE_ID LAST_NAME                 JOB_ID     MANAGER_ID
----------- ------------------------- ---------- ----------
        101 Kochhar                   AD_VP             100
        100 King                      AD_PRES			


Walking the Tree: From the Bottom Up 

  The example in the slide displays a list of managers starting with the employee whose employee ID is 101.

Example

  In the following example, EMPLOYEE_ID values are evaluated for the parent row and MANAGER_ID, and SALARY values are evaluated for the child rows. The PRIOR operator applies only to the EMPLOYEE_ID value.

	... CONNECT BY PRIOR employee_id = manager_id
					AND salary > 15000;

SQL>  SELECT employee_id, last_name, job_id, manager_id
  2   FROM   employees
  3   START  WITH  employee_id = 101
  4   CONNECT BY PRIOR manager_id = employee_id
  5  and salary>15000;

EMPLOYEE_ID LAST_NAME                 JOB_ID     MANAGER_ID
----------- ------------------------- ---------- ----------
        101 Kochhar                   AD_VP             100
        100 King                      AD_PRES

  To qualify as a child row, a row must have a MANAGER_ID value equal to the EMPLOYEE_ID value of the parent row and must have a SALARY value greater than $15,000.
为了取得子行,要有列MANAGER_ID的值=父行中employee_id的值../././


Instructor Note
:: In the context of the second paragraph, you may wish to include that additional conditions added to the CONNECT BY PRIOR clause potentially eliminated the whole of the branch(潜在的限制树支), hence the EMPLOYEE_ID AND SALARY are evaluated for the parent row to determine if it is to be part of the output.

SQL> SELECT  last_name||' reports to '|| PRIOR   last_name "Walk Top Down"
  2  FROM    employees
  3  start with employee_id=101
  4  connect by prior manager_id=employee_id;

Walk Top Down
--------------------------------//插一下
Kochhar reports to
King reports to Kochhar

SQL> SELECT  last_name||' reports to '|| last_name "Walk Top Down"
  2   FROM    employees
  3   start with employee_id=101
  4   connect by prior manager_id=employee_id;

Walk Top Down
--------------------------------------------------------------
Kochhar reports to Kochhar
King reports to King



		遍历树:  从顶向下

SELECT  last_name||' reports to '|| PRIOR   last_name "Walk Top Down"
FROM    employees
START   WITH last_name = 'King'
CONNECT BY PRIOR employee_id = manager_id ;


Walking the Tree: From the Top Down

  Walking from the top down, display the names of the employees and their manager. Use employee King as the starting point. Print only one column.

Walk Top Down
--------------------------------
King reports to
King reports to
Kochhar reports to King
Greenberg reports to Kochhar
Faviet reports to Greenberg
Chen reports to Greenberg
Sciarra reports to Greenberg
Urman reports to Greenberg
Popp reports to Greenberg
Whalen reports to Kochhar
Mavris reports to Kochhar

SELECT  last_name||' reports to '|| last_name "Walk Top Down"
FROM    employees
START   WITH last_name = 'King'
CONNECT BY PRIOR employee_id = manager_id ;

Walk Top Down
------------------------------------------
King reports to King
King reports to King
Kochhar reports to Kochhar
Greenberg reports to Greenberg
Faviet reports to Faviet
Chen reports to Chen
Sciarra reports to Sciarra
Urman reports to Urman
Popp reports to Popp
Whalen reports to Whalen
Mavris reports to Mavris
...

			用LEVEL伪列将行分等级

Ranking Rows with the LEVEL Pseudocolumn

  You can explicitly show the rank or level of a row in the hierarchy by using the LEVEL pseudocolumn(伪列). This will make your report more readable(这将使你的报告更容易读).  The forks where one or more branches split away from a larger branch are called nodes, and the very end of a branch is called a leaf, or leaf node. The diagram in the slide shows the nodes of the inverted tree with their LEVEL values. For example, employee Higgens is a parent and a child, while employee Davies is a child and a leaf.

  The LEVEL Pseudocolumn

Value	Level
1	A root node(根)
2 	A child of a root node(根的孩子)
3 	A child of a child, and so on(根的孩子的孩子...)

 
Note: A root node is the highest node within an inverted tree. A child node is any nonroot node. A parent node is any node that has children. A leaf node is any node without children. The number of levels returned by a hierarchical query may be limited by available user memory.

 In the slide, King is the root or parent (LEVEL = 1). Kochhar, De Hann, Mourgos, Zlotkey, Hartstein, Higgens, and Hunold are children and also parents (LEVEL = 2). Whalen, Rajs, Davies, Matos, Vargas, Gietz, Ernst, Lorentz, Abel, Taylor, Grant, and Fay are children and leaves. 
(LEVEL = 3 and LEVEL = 4)


			用 LEVEL 和 LPAD 格式化分级报告

  创建一个报告显示公司的管理层,从最高级别开始,缩进下面跟随的级别

COLUMN org_chart FORMAT A18
SELECT LPAD(last_name, LENGTH(last_name)+(LEVEL*2)-2,'_') 
       AS org_chart
FROM   employees 
START WITH last_name='King' 
CONNECT BY PRIOR employee_id=manager_id;


Formatting Hierarchical Reports Using LEVEL

 The nodes in a tree are assigned level numbers from the root. Use the LPAD function in conjunction with the pseudocolumn LEVEL to display a hierarchical report as an indented tree.(交错树状)

 In the example on the slide:

   : LPAD(char1,n [,char2]) returns char1, left-padded to length n with the sequence of characters in char2. The argument n is the total length of the return value as it is displayed on your terminal screen.
   : LPAD(last_name, LENGTH(last_name)+(LEVEL*2)-2,'_')defines the display format.
   : char1 is the LAST_NAME , n the total length of the return value, is length of the LAST_NAME +(LEVEL*2)-2 ,and  char2  is '_'.

 In other words, this tells SQL to take the LAST_NAME and left-pad it with the '_' character till the length of the resultant string is equal to the value determined by LENGTH(last_name)+(LEVEL*2)-2. 

  For King, LEVEL = 1. Hence, (2 * 1) - 2 = 2 - 2 = 0. So King does not get padded with any '_' character and is displayed in column 1. 

 For Kochhar, LEVEL = 2. Hence, (2 * 2) - 2 = 4 - 2 = 2 . So Kochhar gets padded with 2 '_' characters and is displayed indented.

 The rest of the records in the EMPLOYEES table are displayed similarly. 


Formatting Hierarchical Reports Using LEVEL (continued)			

King
__Kochhar
____Greenberg
______Faviet
______Chen
______Sciarra
______Urman
______Popp
____Whalen
____Mavris

ORG_CHART
--------------------
____Baer
____Higgins
______Gietz
__De Haan
____Hunold
______Ernst
______Austin
______Pataballa
______Lorentz


				修剪分支

	用 WHERE 子句 				用 CONNECT BY 子句
	去除一个结点(node)叶子还要		去除一个分支(node,叶子都不要了)
	
Where last_name !='Higgins'		CONNECT BY PRIOR
					employee_id=manager_id
					AND last_name !='Higgins'
	范围:小				    范围:大


Pruning Branches
  You can use the WHERE and CONNECT BY clauses to prune the tree; that is, to control which nodes or rows are displayed(控制哪些节点或行S不被显示). The predicate you use acts as a Boolean condition.

Examples

Starting at the root, walk from the top down, and eliminate employee Higgins in the result, but process(保留) the child rows.
     SELECT  department_id, employee_id,last_name, job_id, salary
     FROM    employees
     WHERE   last_name  != 'Higgins'
     START   WITH manager_id IS NULL
     CONNECT BY PRIOR employee_id = manager_id;

  Starting at the root, walk from the top down, and eliminate employee Higgins and all child rows.(除去整个分支)
     SELECT  department_id, employee_id,last_name, job_id, salary
     FROM    employees
     START   WITH manager_id IS NULL
     CONNECT BY PRIOR employee_id = manager_id
     AND     last_name != 'Higgins';
:: 写多列子查询
:: 在返回空值时描述并解释子查询的行为
:: 写一个在 FROM 子句中的子查询
:: SQL 中使用分级子查询
:: 描述能够用相关子查询解决的问题类型
:: 写相关子查询
:: 用相关子查询更新和删除行
:: 使用 EXISTS 和 NOT EXISTS 操作
:: 使用 WITH 子句

Lesson Aim
   In this lesson, you learn how to write multiple-column subqueries and subqueries in the FROM clause of a SELECT statement. You also learn how to solve problems by using scalar, correlated subqueries and the WITH clause.



			什么是子查询?

 一个子查询是一个嵌入 SELECT 语句中的另一个 SQL 语句的子句


				select ...
		   主查询---->  from   ...
				where  ...
 					(select ...
					 from   ...      <-----子查询
					 where  ...)


What Is a Subquery?

  A subquery is a SELECT statement that is embedded in a clause of another SQL statement, called the parent statement.

内查询把查询结果给外查询
 The subquery (inner query) returns a value that is used by the parent statement. Using a nested subquery is equivalent to performing two sequential queries and using the result of the inner query as the search value in the outer query (main query).

  Subqueries can be used for the following purposes: 
    :: To provide values for conditions in WHERE, HAVING, and START WITH clauses of SELECT statements
    :: To define the set of rows to be inserted into the target table of an INSERT or CREATE TABLE statement	/././././
    :: To define the set of rows to be included in a view or snapshot in a CREATE VIEW or CREATE SNAPSHOT statement	/./././同上
    :: To define one or more values to be assigned to existing rows in an UPDATE statement			////同上
    :: To define a table to be operated on by a containing query. (You do this by placing the subquery in the FROM clause. This can be done in INSERT, UPDATE, and DELETE statements as well.)		/,.,.FROM子查询

  Note: A subquery is evaluated once for the entire parent statement.
先执行内查询,返回值给外查询,再执行主查询.




				子查询

SELECT	select_list
FROM	table
WHERE	expr operator (SELECT select_list
		         FROM	  table);

:: 子查询 (内嵌查询) 在主查询中执行一次
:: 子查询的结果被用于主查询 (外查询)

Subqueries

  You can build powerful statements out of simple ones by using subqueries. Subqueries can be very useful when you need to select rows from a table with a condition that depends on the data in the table itself or some other table. Subqueries are very useful for writing SQL statements that need values based on one or more unknown conditional values.

 In the syntax:			././././.
	operator	includes a comparison operator such as >, =, or IN

  Note: Comparison operators fall into two classes: (比较运算)
	single-row operators (>, =, >=, <, <>, <=)
	multiple-row operators (IN, ANY, ALL). 

 The subquery is often referred to as a nested SELECT, sub-SELECT, or inner SELECT statement. The inner and outer queries can retrieve data from either the same table or different tables.(叫做内嵌select,子select,内部select)


			使用子查询

SELECT last_name
FROM   employees         10500
WHERE  salary >    <-------------|
                (SELECT salary   |
                 FROM   employees
                 WHERE  employee_id = 149) ;



Using a Subquery
  			//对上面的解释
  In the example in the slide, the inner query returns the salary of the employee with employee number 149. The outer query uses the result of the inner query to display the names of all the employees who earn more than this amount.


Example:

 Display the names of all employees who earn less than the average salary in the company.

   SELECT last_name, job_id, salary
   FROM   employees
   WHERE  salary < (SELECT AVG(salary)
                    FROM   employees);


			多列子查询

  主查询的每行与一个多行多列子查询的值比较

Main query				 <----------|
WHERE(manager_id,department_id) IN	<-------|   |
						|   |
					 <-----||   |
					       ||   |
		Subquery		       ||   |
		100	90		_______||   |
		102	60		________|   |
		124	50		____________|


Multiple-Column Subqueries

 So far you have written single-row subqueries and multiple-row subqueries where only one column is returned by the inner SELECT statement and this is used to evaluate the expression in the parent select statement. If you want to compare two or more columns, you must write a compound WHERE clause using logical operators(如果想要比较两行或更多行你要使用逻辑运算符写一个混合的WHERE子句). Using multiple-column subqueries, you can combine duplicate WHERE conditions into a single WHERE clause.(使用混合列子查询可以把多个WHERE条件合到一个WHERE子句.)

  Syntax	
	SELECT column,column, ...
	FROM table
	WHERE (column,column, ...) IN
				   (SELECT column,column, ...
				    FROM table
				    WHERE condition);


   The graphic in the slide illustrates that the values of the MANAGER_ID and DEPARTMENT_ID from the main query are being compared with the MANAGER_ID and DEPARTMENT_ID  values retrieved by the subquery. Since the number of columns that are being compared are more than one, the example qualifies as a multiple-column subquery.



				列比较

在一个多列子查询中的列比较能够被:
:: 成对地比较
:: 非成对的比较 

Pairwise versus Nonpairwise Comparisons成对,非成对比较
   
   Column comparisons in a multiple-column subquery can be pairwise comparisons or nonpairwise comparisons. 
			/././././.在select语句的每个条件行都要有相同的列.
   In the example on the next slide, a pairwise comparison was executed in the WHERE clause. Each candidate row in the SELECT statement must have both the same MANAGER_ID column and the DEPARTMENT_ID as the employee with the EMPLOYEE_ID 178 or 174. 
  
   A multiple-column subquery can also be a nonpairwise comparison. In a nonpairwise comparison, each of the columns from the WHERE clause of the parent SELECT statement are individually compared to multiple values retrieved by the inner select statement. The individual columns can match any of the values retrieved by the inner select statement. But collectively, all the multiple conditions of the main SELECT statement must be satisfied for the row to be displayed. The example on the next page illustrates a nonpairwise comparison.


			成对比较子查询

   显示雇员的细节,这些雇员被同一个经理管理,并且,工作在同一个部门,具有 EMPLOYEE_ID 178 或 174
SELECT	employee_id, manager_id, department_id
FROM	employees
WHERE  (manager_id, department_id) IN
                      (SELECT manager_id, department_id
                       FROM   employees
                       WHERE  employee_id IN (178,174))
AND	employee_id NOT IN (178,174);

SQL> select manager_id,department_id
  2  from employees
  3  where employee_id in (178,174);

MANAGER_ID DEPARTMENT_ID
---------- -------------
       149            80
       149

SQL> SELECT     employee_id, manager_id, department_id
  2  FROM       employees
  3  WHERE  (manager_id, department_id) IN
  4                        (SELECT manager_id, department_id
  5                         FROM   employees
  6                         WHERE  employee_id IN (178,174))
  7  AND        employee_id NOT IN (178,174);

EMPLOYEE_ID MANAGER_ID DEPARTMENT_ID
----------- ---------- -------------
        179        149            80
        177        149            80
        176        149            80
        175        149            80

SQL> SELECT     employee_id, manager_id, department_id
  2  FROM       employees
  3  WHERE  (manager_id, department_id) IN
  4                        (SELECT manager_id, department_id
  5                         FROM   employees
  6                         WHERE  employee_id IN (178,174))
  7  ;

EMPLOYEE_ID MANAGER_ID DEPARTMENT_ID
----------- ---------- -------------
        179        149            80
        177        149            80
        176        149            80
        175        149            80
        174        149            80			/./././.

Pairwise Comparison Subquery
		// 
  The example in the slide is that of a multiple-column subquery because the subquery returns more than one column(子查询返回值多于一行). It compares the values in the MANAGER_ID column and the DEPARTMENT_ID column of each row in the EMPLOYEES table with the values in the MANAGER_ID column and the DEPARTMENT_ID column for the employees with the EMPLOYEE_ID 178 or 174.

  First, the subquery to retrieve the MANAGER_ID and DEPARTMENT_ID values for the employees with the EMPLOYEE_ID 178 or 174 is executed. These values are compared with the MANAGER_ID column and the DEPARTMENT_ID column of each row in the EMPLOYEES table. If the values match, the row is displayed. In the output,  the records of the employees with the EMPLOYEE_ID 178 or 174 will not be displayed. The output of the query in the slide follows.



			非成对比较子查询

    显示被同一个经理管理,具有 EMPLOYEE_ID 174 或 141 的雇员;并且,工作在同一个部门,具有 EMPLOYEE_ID 174 或 141 的雇员的详细信息
SELECT  employee_id, manager_id, department_id 	3/
FROM    employees
WHERE   manager_id IN 
                  (SELECT  manager_id		1/
                   FROM    employees
                   WHERE   employee_id IN (174,141))	
AND     department_id IN 
                  (SELECT  department_id	2/
                   FROM    employees
                   WHERE   employee_id IN (174,141))
AND	employee_id NOT IN(174,141);

返回的department_id值和manager_id值与departments表中的每一行进行比较.
要两个值同时都满足才display.

Nonpairwise Comparison Subquery

   The example shows a nonpairwise comparison of the columns. It displays the EMPLOYEE_ID, MANAGER_ID, and DEPARTMENT_ID of any employee whose manager ID matches any of the manager IDs of employees whose employee IDs are either 174 or 141 and DEPARTMENT_ID match any of the department IDs of employees whose employee IDs are either 174 or 141.

   First, the subquery to retrieve the MANAGER_ID values for the employees with the EMPLOYEE_ID 174 or 141 is executed. Similarly, the second subquery to retrieve the DEPARTMENT_ID values for the employees with the EMPLOYEE_ID 174 or 141 is executed. the retrived values of the MANAGER_ID and DEPARTMENT_ID columns are compared with the MANAGER_ID and DEPARTMENT_ID column for each row in the EMPLOYEES table. If the MANAGER_ID column of the row in the EMPLOYEES table matches with any of the values of the MANAGER_ID retrieved by the inner subquery and if the DEPARTMENT_ID column of the row in the EMPLOYEES table matches with any of the values of the DEPARTMENT_ID retrieved by the second subquery, the record is displayed. The output of the query in the slide follows.

    EMPLOYEE_ID		MANAGER_ID		DEPARTMENT_ID
	    142		       124			 50
	    143		       124			 50
	    144		       124			 50
	    176		       149			 80



			在 FROM 子句中使用子查询

SELECT  a.last_name, a.salary, 		
        a.department_id, b.salavg	//必须是b 表中的'列'
FROM    employees a, (SELECT   department_id, 
                      AVG(salary) salavg
                      FROM     employees
                      GROUP BY department_id) b
WHERE   a.department_id = b.department_id
AND     a.salary > b.salavg;


Using a Subquery in the FROM Clause
 
   You can use a subquery in the FROM clause of a SELECT statement, which is very similar to how views are used. A subquery in the FROM clause of a SELECT statement is also called an inline view(内部视图). A subquery in the FROM clause of a SELECT statement defines a data source for that particular SELECT statement, and only that SELECT statement. The example on the slide displays employee last names, salaries, department numbers, and average salaries for all the employees who earn more than the average salary in their department. The subquery in the FROM clause is named b, and the outer query references the SALAVG column using this alias.
//注意表的别名.

				分级子查询表达式

:: 一个分级子查询表达式是一个从一行中返回确切的一个列值的子查询
:: 在 Oracle8i 中,分级子查询仅在一些有限情况的情况下被支持,例如:
     - SELECT 语句 (FROM 和 WHERE 子句)
     - 在一个 INSERT 语句中的VALUES 表中
:: 在 Oracle9i 中,分级子查询能够被用于:
     - DECODE  and CASE 的条件和表达式部分
     - 除 GROUP BY 以外所有的 SELECT 子句


Scalar Subqueries in SQL

   A subquery that returns exactly one column value from one row is also referred to as a scalar subquery(一个分级子查询表达式是一个从一行中返回确切的一个列值的子查询.) Multiple-column subqueries written to compare two or more columns, using a compound WHERE clause and logical operators, do not qualify as scalar subqueries.

././././././如果子查询返回0行,分级子查询表达式是NULL,如果子查询返回多行,Oracle Server返回ERROR. 
If the subquery returns 0 rows, the value of the scalar subquery expression is NULL. If the subquery returns more than one row, the Oracle Server returns an error.

  The value of the scalar subquery expression is the value of the select list item of the subquery. If the subquery returns 0 rows, the value of the scalar subquery expression is NULL. If the subquery returns more than one row, the Oracle Server returns an error. The Oracle Server has always supported the usage of a scalar subquery in a  SELECT statement. The usage of scalar subqueries has been enhanced in Oracle9i. You can now use scalar subqueries in:
   - Condition and expression part of DECODE and CASE
   - All clauses of SELECT except GROUP BY
   - In the left-hand side of the operator in the SET clause and WHERE clause of UPDATE statement 

  However, scalar subqueries are not valid expressions in the following places:
   - As default values for columns and hash expressions for clusters
   - In the RETURNING clause of DML statements
   - As the basis of a function-based index	基本函数索引
   - In GROUP BY clauses, CHECK constraints,  WHEN conditions///./././
   - HAVING clauses		./././
   - In START WITH and CONNECT BY clauses
   - In statements that are unrelated to queries, such as CREATE PROFILE


		分级子查询: 例子

  在CASE表达式的分级子查询.

SELECT employee_id, last_name,
       (CASE			20 <----|  
        WHEN department_id =		|
                (SELECT deaprtment_id FROM deaprtments
                 WHERE location_id=1800)
        THEN 'Canada' ELSE 'USA' END) location
FROM   employees;

...
...
EMPLOYEE_ID LAST_NAME                 LOCATI
----------- ------------------------- ------
        199 Grant                     USA
        200 Whalen                    USA
        201 Hartstein                 Canada
        202 Fay                       Canada
        203 Mavris                    USA
        204 Baer                      USA
...


   在 ORDER BY 子查询中的分级子查询
SELECT   employee_id, last_name
FROM     employees e			//两个表
ORDER BY	(SELECT department_name	//用deaprtments这个表的department_name排序
 	FROM departments d
 	WHERE e.department_id = d.department_id);


Scalar Subqueries: Examples

  The first example in the slide demonstrates(认证) that scalar subqueries can be used in CASE expressions. The inner query returns the value 20, which is the department ID of the department whose location ID is 1800. The CASE expression in the outer query uses the result of the inner query to display the employee ID, last names, and a value of Canada or USA, depending on whether the department ID of the record retrieved by the outer query is 20 or not.//是USA,or Canada取决于由外查询的department_id记录返回是不是20   /./././
//内连接是20 了,如果外连接是20则是Canada,如果不是20,则返回USA

The result of the preceding example follows:
  EMPLOYEE_ID	LAST_NAME	LOCATI
   	  100	King		USA
	  101	Kochhar		USA
	  102	De Haan		USA
	  103 	Hunold		USA
...
	  201	Hartstein	Canada
 	  202	Fay		Canada
	  206	Higgins		USA
	  206	Gietz 		USA


Scalar Subqueries: Examples (continued)

  The second example in the slide demonstrates that scalar subqueries can be used in the ORDER BY clause. The example orders the output based on the DEPARTMENT_NAME by matching the DEPARTMENT_ID from the EMPLOYEES table with the DEPARTMENT_ID from the DEPARTMENTS table. This comparison in done in a scalar subquery in the ORDER BY clause. The result of the the second example follows:




   The second example uses a correlated subquery. In a correlated subquery, the subquery references a column from a table referred to in the parent statement. Correlated subqueries are explained later in this lesson. 



			相关的子查询

   相关子查询被用于 row-by-row 处理。对外查询的每一行,每个子查询被执行一次./././.

				GET
		|---- >	candidate row from outer query    //从外查询中获得候选行.
		|		 |
		|	      EXECUTE
		|   inner query using candidate row value //从内查询中获得候选行.
		|		 |
 		|		USE
		|----values from inner query to qualify  //
			or disqualify candidate row

Correlated Subqueries

  The Oracle Server performs a correlated subquery when the subquery references a column from a table referred to in the parent statement. A correlated subquery is evaluated once for each row processed by the parent statement. The parent statement can be a SELECT, UPDATE, or DELETE statement. /././././

  Nested Subqueries Versus Correlated Subqueries
     With a normal nested subquery, the inner SELECT query runs first and executes once, returning values to be used by the main query(这是对于一般的查询). A correlated subquery, however, executes once for each candidate row considered by the outer query. In other words, the inner query is driven by the outer query.

  Nested Subquery Execution	/././.
    - The inner query executes first and finds a value.	先内查询
    - The outer query executes once, using the value from the inner query.//再外查询

  Correlated Subquery Execution	.,./././././
    - Get a candidate row (fetched by the outer query).//从外查询中获得行
    - Execute the inner query using the value of the candidate row. 
	用外查询获得的行执行内查询.
    - Use the values resulting from the inner query to qualify or disqualify the candidate.使用从内查询中返回的值限定或不限定行.
    - Repeat until no candidate row remains. //重复做直到没有行余下.


			相关子查询

SELECT 	column1,column2, ...
FROM 	table1 outer
WHERE 	column1 operator
		      ( SELECT	column1,column2
			FROM	table2
			WHERE	expr1=			//要有一个关联
				   outer.expr2);

    子查询参考在父查询中的表的一个列

Correlated Subqueries (continued)

   A correlated subquery is one way of reading every row in a table and comparing values in each row against related data. It is used whenever a subquery must return a different result or set of results for each candidate row considered by the main query. In other words, you use a correlated subquery to answer a multipart question whose answer depends on the value in each row processed by the parent statement. 

   The Oracle Server performs a correlated subquery when the subquery references a column from a table in the parent query. (当一个子查询参考父查询表返回的列.)

   Note: You can use the ANY and ALL operators in a correlated subquery. 


			使用相关子查询

   找出所有的雇员,他们挣的薪水高于该部门的平均薪水

SELECT last_name, salary, department_id
FROM   employees outer
WHERE  salary >		
	|--->	(SELECT AVG(salary)
		 FROM   employees
 		 WHERE  department_id =  
        		    outer.department_id) ;
		

					外查询中的行每被处理一次,内查询就求值一次


Using Correlated Subqueries

   The example in the slide determines which employees earn more than the average salary of their department. In this case, the correlated subquery specifically computes the average salary for each department.

   Because both the outer query and inner query use the EMPLOYEES table in the FROM clause, an alias is given to EMPLOYEES in the outer SELECT statement, for clarity. Not only does the alias make the entire SELECT statement more readable, but without the alias the query would not work properly, because the inner statement would not be able to distinguish the inner table column from the outer table column.

			使用相关子查询

    显示雇员的详细信息,这些雇员至少变换过两次工作

SELECT e.employee_id, last_name,e.job_id
FROM   employees e 
WHERE  2 <= (SELECT COUNT(*)
             FROM   job_history 
             WHERE  employee_id = e.employee_id);


Using Correlated Subqueries

			//对上面的例子进行分析
  The example in the slide displays the details of those employees who have switched jobs at least twice. The Oracle Server evaluates a correlated subquery as follows:
	1. Select a row from the table specified in the outer query. This will be the current candidate row.
	2. Store the value of the column referenced in the subquery from this candidate row. (In the example in the slide, the column referenced in the subquery is E.EMPLOYEE_ID.)	//从候选列中存储子查询中引用列的值,,子查询的引用列值:E.EMPLOYEE_ID

	3. Perform the subquery with its condition referencing the value from the outer query’s candidate row//计算内查询,将满足条件的count(*)找出来. (In the example in the slide, group function COUNT(*) is evaluated based on the value of the E.EMPLOYEE_ID column obtained in step 2.) e.employee_id的值从step 2得来.

	4. Evaluate the WHERE clause of the outer query on the basis of results of the subquery performed in step 3. This is determines if the candidate row is selected for output. (In the example, the  number of times an employee has switched jobs, evaluated by the subquery, is compared with 2 in the WHERE clause of the outer query. If the condition is satisfied, that employee record is displayed.)
//将选出来的count(*)与2对比,如果>=则显示,否则不显示.
	5. Repeat the procedure for the next candidate row of the table, and so on until all the rows in the table have been processed. 

  The correlation is established by using an element from the outer query in the subquery. In this example, the correlation is established by the statement EMPLOYEE_ID = E.EMPLOYEE_ID in which you compare EMPLOYEE_ID from the table in the subquery with the EMPLOYEE_ID from the table in the outer query.



			使用 EXISTS 操作

:: EXISTS 操作对在子查询的结果集中存在的行进行检验
:: 如果一个子查询行值被找到:
	- 在内查询中的搜索不再继续././././.
	- 条件被标记为 TRUE
:: 如果一个子查询行值未找到:
	- 条件被标记为 FALSE
	- 在内查询中的搜索继续


The EXISTS Operator

  With nesting SELECT statements, all logical operators are valid. In addition, you can use the EXISTS operator. This operator is frequently used with correlated subqueries to test whether a value retrieved by the outer query exists in the results set of the values retrieved by the inner query. If the subquery returns at least one row, the operator returns TRUE. If the value does not exist, it returns FALSE. 
如果子查询返回至少一行,则操作返回TRUE,如果没有值返回,则返回FALSE
Accordingly, NOT EXISTS tests whether a value retrieved by the outer query is not a part of  the results set of the values retrieved by the inner query.



			使用 EXISTS 操作

	查找至少有一个雇员的经理

SELECT employee_id, last_name, job_id, department_id
FROM   employees outer
WHERE  EXISTS ( SELECT 'X'	//如果返回X,则TRUE,否则FALSE.最后看是不是TRUE,即返回X
                 FROM   employees	的即为满足条件的.
                 WHERE  manager_id = 
                        outer.employee_id);
EMPLOYEE_ID LAST_NAME                 JOB_ID     DEPARTMENT_ID 
----------- ------------------------- ---------- ------------- 
        100 King                      AD_PRES               90
        101 Kochhar                   AD_VP                 90
        102 De Haan                   AD_VP                 90   
        103 Hunold                    IT_PROG               60  
        108 Greenberg                 FI_MGR               100 
        114 Raphaely                  PU_MAN                30
        120 Weiss                     ST_MAN                50
        121 Fripp                     ST_MAN                50
        122 Kaufling                  ST_MAN                50
        123 Vollman                   ST_MAN                50
        124 Mourgos                   ST_MAN                50
...

//   解析一下::
只要manager_id在employee_id中就显示,即是它的经理.

SQL> select distinct manager_id
  2  FROM   employees
  3  WHERE  manager_id IS NOT NULL;

MANAGER_ID
----------
       100
       101
       102
       103
       108
       114
       120
       121
       122
       123
       124
       145
       146
       147
       148
       149
       201
       205


SQL> select 'X' from dual;

'
-
X

   SELECT employee_id,last_name,job_id,department_id
   FROM   employees 
   WHERE  employee_id IN (SELECT manager_id
                          FROM   employees
                          WHERE  manager_id IS NOT NULL);

Using the EXISTS Operator
  
  使用下面的条件,当至少找到一个经理号和雇员号相匹配的记录时,EXISTS 操作确保在内查询中的搜索不再继续:
   WHERE manager_id = outer.employee_id.

  Note that the inner SELECT query does not need to return a specific value(内查询不必找到确切的值), so a constant(常量也可以选择) can be selected. From a performance standpoint, it is faster to select a constant than a column.

  Note: Having EMPLOYEE_ID in the SELECT clause of the inner query causes a table scan for that column. Replacing it with the literal X, or any constant, improves performance. This is more efficient than using the IN operator. 

  A IN construct can be used as an alternative for a EXISTS operator, as shown in the following example: 

   SELECT employee_id,last_name,job_id,department_id
   FROM   employees 
   WHERE  employee_id IN (SELECT manager_id
                          FROM   employees
                          WHERE  manager_id IS NOT NULL);


			使用 NOT EXISTS 操作

找出所有的没有任何雇员的部门

SELECT department_id, department_name
FROM departments d
WHERE NOT EXISTS (SELECT 'X'
                  FROM   employees
                  WHERE  department_id 	//要有一个联联
                         = d.department_id);

DEPARTMENT_ID DEPARTMENT_NAME
------------- ------------------------------
          120 Treasury
          130 Corporate Tax
          140 Control And Credit
...

可以得出以下是未选定行.
SQL> select * from employees
  2  where department_id=120;

未选定行

SQL> select * from employees
  2  where department_id=130;

未选定行
...

Using the NOT EXISTS Operator 

Alternative Solution 
 
 A NOT IN construct can be used as an alternative for a NOT EXISTS operator, as shown in the following example. 
   SELECT department_id, department_name
   FROM   departments 
   WHERE  department_id NOT IN (SELECT department_id
                                FROM   employees);

   However, NOT IN evaluates to FALSE if any member of the set is a NULL value. 
如果集合的任何成员是NULL值,NOT IN 的值是FALSE.因此,即使departments表中满足WHERE条件的行,你的查询将不会返回任何行.
Therefore, your query will not return any rows even if there are rows in the departments table that satisfy the WHERE condition. 


		相关 UPDATE

UPDATE table1 alias1
SET    column = (SELECT expression
                 FROM   table2 alias2
                 WHERE  alias1.column =    
                        alias2.column);

	用一个相关子查询来更新在一个表中的行,该表基于另一个表中的行

Correlated UPDATE
   In the case of the UPDATE statement, you can use a correlated subquery to update rows in one table based on rows from another table.


			相关UPDATE

:: 用一个附加的列来存储部门名称,反向规格化 EMPLOYEES 表
:: 用相关更新填充表

ALTER TABLE employees
ADD(department_name VARCHAR2(30));

UPDATE employees e
SET    department_name = 
              (SELECT department_name 
	       FROM   departments d
	       WHERE  e.department_id = d.department_id);	//要有一个关联


Correlated UPDATE (continued)

   The example in the slide denormalizes the EMPLOYEES table by adding a column to store the department_name and then populates the table by using a correlated update.

   Here is another example for a correlated update.

Problem Statement
  Use a correlated subquery to update rows in the EMPLOYEES table based on rows from the REWARDS table:

././././././
UPDATE employees
SET    salary = (SELECT employees.salary + rewards.pay_raise
                 FROM   rewards
                 WHERE  employee_id  =  employees.employee_id
                 AND   payraise_date = 
                      (SELECT MAX(payraise_date) 
                       FROM   rewards
                       WHERE  employee_id = employees.employee_id))
WHERE  employees.employee_id IN    
	(SELECT employee_id 
        FROM   rewards);

  This example uses the REWARDS table. The REWARDS table has the columns EMPLOYEE_ID, PAY_RAISE, and PAYRAISE_DATE. Every time an employee gets a pay raise, a record with the details of the employee ID, the amount of the pay raise, and the date of receipt of the pay raise is inserted into the REWARDS table. The REWARDS table can contain more than one record for an employee. The PAYRAISE _DATE column is used to identify the most recent pay raise received by an employee. 

 In the example, the SALARY column in the EMPLOYEES table is updated to reflect the latest pay raise received by the employee. This is done by adding the current salary of the employee with the corresponding pay raise from the REWARDS table. 



			相关 DELETE

DELETE FROM table1 alias1
 WHERE  column operator 
	(SELECT expression
 	 FROM   table2 alias2
 	 WHERE  alias1.column = alias2.column);
  用一个相关子查询删除表中的行,该表基于另一个表中的行

Correlated DELETE

  In the case of a DELETE statement, you can use a correlated subquery to delete only those rows that also exist in another table. If you decide that you will maintain only the last four job history records in the JOB_HISTORY table, then when an employee transfers to a fifth job, you delete the oldest JOB_HISTORY row by looking up the JOB_HISTORY table for the MIN(START_DATE)for the employee. The following code illustrates how the preceding operation can be performed using a correlated DELETE:

   DELETE FROM job_history JH
   WHERE  employee_id =
         (SELECT employee_id 
          FROM   employees E
          WHERE  JH.employee_id = E.employee_id	././././要有一个关联
          AND    start_date =
                (SELECT MIN(start_date)  
                 FROM   job_history JH
                 WHERE  JH.employee_id = E.employee_id)	//关联
                 AND 5 >  (SELECT COUNT(*)  
                           FROM   job_history JH
                           WHERE  JH.employee_id = E.employee_id	//关联
                           GROUP  BY employee_id
                           HAVING COUNT(*) >= 4));



			相关删除 DELETE

   用一个相关子查询删除哪些在 EMPLOYEES 表和 EMP_HISTORY 表中的 employee_id 列值相同的行

DELETE FROM employees E
WHERE employee_id =  
           (SELECT employee_id
            FROM   emp_history 
            WHERE  employee_id = E.employee_id);

Correlated DELETE (continued)

Example
Two tables are used in this example. They are:
	- The EMPLOYEES table, which gives details of all the current employees
	- The EMP_HISTORY table, which gives details of previous employees

  EMP_HISTORY contains data regarding previous employees, so it would be erroneous if the same employee’s record existed in both the EMPLOYEES and EMP_HISTORY tables. You can delete such erroneous records by using the correlated subquery shown in the slide.


				WITH子句

:: 当一个查询块在一个复杂的查询中出现多次时,使用 WITH 子句,能够在 SELECT 语句中多次使用相同查询块
:: WITH 子句取回查询块的结果,并且将它存在用户的临时表空间中
:: WITH 子句可以改善性能

The WITH clause

  Using the WITH clause, you can define a query block before using it in a query. The WITH clause (formally known as subquery_factoring_clause) enables you to reuse the same query block in a SELECT statement when it occurs more than once within a complex query. This is particularly useful when a query has many references to the same query block and there are joins and aggregations. 

  Using the WITH clause, you can reuse the same query when it is high cost to evaluate the query block and it occurs more than once within a complex query. Using the WITH clause, the Oracle Server retrieves the results of a query block and stores it in the user’s temporary tablespace. This can improve performance.

 WITH Clause Benefits
	- Makes the query easy to read
	- Evaluates a clause only once, even if it appears multiple times in the query, 	  	  thereby enhancing performance 
		

			WITH 子句: 例子

   用 WITH 子句,写一个查询来显示部门名称和该部门的合计薪水,哪些人的合计薪水高于各部门的平均薪水

WITH 
dept_costs  AS (
   SELECT  d.department_name, SUM(e.salary) AS dept_total
   FROM    employees e, departments d
   WHERE   e.department_id = d.department_id
   GROUP BY d.department_name),
avg_cost    AS (
   SELECT SUM(dept_total)/COUNT(*) AS dept_avg
   FROM   dept_costs)
SELECT * 
FROM   dept_costs 
WHERE  dept_total >
        (SELECT dept_avg 
         FROM avg_cost)
ORDER BY department_name;


  The  problem in the slide would require the following intermediate calculations:
1.	Calculate the total salary for every department, and store the result using a WITH clause.
2.	Calculate the average salary across departments, and store the result using a WITH clause.
3.	Compare the total salary calculated in the first step with the average salary calculated in the second step. If the total salary for a particular department is greater than the average salary across departments, display the department name and the total salary for that department. 


WITH Clause: Example (continued)

   The SQL code in the slide is an example of a situation in which you can improve performance and write SQL more simply by using the WITH clause. The query creates the query names DEPT_COSTS and AVG_COST and then uses them in the body of the main query. Internally, the WITH clause is resolved either as an in-line view or a temporary table. The optimizer chooses the appropriate resolution depending on the cost or benefit of temporarily storing the results of the WITH clause.


  Note: A subquery in the FROM clause of a SELECT statement is also called an in-line view.(内部视图)

 The output generated by the SQL code on the slide will be as follows:

DEPARTMENT_NAME                DEPT_TOTAL
------------------------------ ----------
Sales                              304500
Shipping                           156400


The WITH Clause Usage Notes
  -- It is used only with SELECT statements
  -- A query name is visible to all WITH element query blocks (including their subquery blocks) defined after it and the main query block itself (including its subquery blocks).
  -- When the query name is the same as an existing table name, the parser searches from the inside out, the query block name takes precedence over the table name.
  -- The WITH clause can hold more than one query. Each query is then separated by a comma.


			SUMMARY

在本课中, 您应该已经学会如何:
:: 返回多于一列的多列子查询
:: 多列比较可以成对或非成对地进行
:: 一个多列子查询也能够被用于一个 SELECT 语句的 FROM 子句
:: 分级子查询在 Oracle9i 中得到了增强
Summary

   You can use multiple-column subqueries to combine multiple WHERE conditions into a single WHERE clause. Column comparisons in a multiple-column subquery can be pairwise comparisons or non-pairwise comparisons. 

 You can use a subquery to define a table to be operated on by a containing query. 

  Oracle 9i enhances the the uses of scalar subqueries. Scalar subqueries can now be used in:
   : Condition and expression part of DECODE  and CASE
   : All clauses of SELECT except GROUP BY  
   : SET clause and WHERE clause of UPDATE statement


			小结
:: 无论何时一个子查询必须对每一个侯选行返回不同的结果,这时,相关子查询是有用的
:: EXISTS 操作是测试值的存在性的布尔操作
:: 相关子查询能够用于 SELECT, UPDATE, and DELETE 语句
:: 在 SELECT 语句中你能够通过 WITH 子句多次使用相同的查询块
Summary (continued)

   The Oracle Server performs a correlated subquery when the subquery references a column from a table referred to in the parent statement. A correlated subquery is evaluated once for each row processed by the parent statement. The parent statement can be a SELECT, UPDATE, or DELETE statement. Using the WITH clause, you can reuse the same query when it is costly to reevaluate the query block and it occurs more than once within a complex query.
			




			************分级取回数据**************

			目标

 完成本课后, 您应当能够执行下列操作:
:: 解释分级查询的概念
:: 创建一个树型结构的报告
:: 格式化分级数据
:: 从树型结构中去除分支 


Lesson Aim
  In this lesson, you learn how to use hierarchical queries to create tree-structured reports.(树型结构报告)


			 EMPLOYEES 表中的例子数据

Sample Data from the EMPLOYEES Table

   Using hierarchical queries, you can retrieve data based on a natural hierarchical relationship between rows in a table. A relational database does not store records in a hierarchical way关系型数据库不能以分等级的方式存储. However, where a hierarchical relationship exists between the rows of a single table, a process called tree walking enables the hierarchy to be constructed. A hierarchical query is a method of reporting, in order, the branches of a tree.

   Imagine a family tree with the eldest members of the family found close to the base or trunk of the tree and the youngest members representing branches of the tree. Branches can have their own branches, and so on. 

   A hierarchical query is possible when a relationship exists between rows in a table.
当在一个表中存在行行之间的关系,分等级查询是有可能的. For example, in the slide, you see that employees with the job IDs of AD_VP, ST_MAN, SA_MAN, and MK_MAN report directly to the president of the company. We know this because the MANAGER_ID column of these records contain the employee ID 100, which belongs to the president (AD_PRES). 

   Note: Hierarchical trees are used in various fields such as human genealogy (family trees), livestock (breeding purposes), corporate management (management hierarchies), manufacturing (product assembly), evolutionary research (species development), and scientific research.


				自然树结构

Natural Tree Structure
  
  The EMPLOYEES table has a tree structure representing the management reporting line. The hierarchy can be created by looking at the relationship between equivalent values in the EMPLOYEE_ID and MANAGER_ID columns. This relationship can be exploited by joining the table to itself(这些表自连接将可以使用.). The MANAGER_ID column contains the employee number of the employee’s manager.

  The parent-child relationship of a tree structure enables you to control:
	- The direction in which the hierarchy is walked
	- The starting point inside the hierarchy

  Note: The slide displays an inverted tree structure of the management hierarchy of the employees in the EMPLOYEES table.



			分等级查询

SELECT [LEVEL], column, expr...
FROM   table
[WHERE condition(s)]
[START WITH condition(s)]
[CONNECT BY PRIOR condition(s)] ;


WHERE条件:
expr comparison_operator(比较运算符) expr


Keywords and Clauses(子句和关键字)

  Hierarchical queries can be identified by the presence of the CONNECT BY and START WITH clauses.

  In the syntax:

	SELECT		Is the standard SELECT clause. 

	LEVEL 		For each row returned by a hierarchical query, the LEVEL 
			为每一行返回分等级查询,										pseudocolumn(列) returns 1 for a root row, 2 for a child of a 					root,and so on.

	/././././
	FROM table	Specifies the table, view, or snapshot containing the columns. 					You can select from only one table.(只能是一个表)

	WHERE		Restricts the rows returned by the query without affecting other 				rows of	the hierarchy. (返回没有影响其它分等级行的查询)

        condition	Is a comparison with expressions.(比较表达式条件)

	START WITH	Specifies the root rows of the hierarchy (where to start). This 				clause is required for a tree hierarchical query.
			指定层次的根行(从哪开始),这个子句对于树型分类查询是必须的.

	CONNECT BY	Specifies the columns in which the relationship between parent 					and child
			指定父子关系的列

	PRIOR		rows exist. This clause is required for a hierarchical query.
			行存在.对于分等级查询是必须的.
 The SELECT statement cannot contain a join or query from a view that contains a join.
SELECT操作不能包括连接,或从一个包含连接的视图中查询



				遍历树

起点

:: 指定必须满足的条件
:: 接受有效的条件

 START WITH column1=value

使用EMPLOYEES表,从名字是Kochhar的雇员开始
...START WITH last_name='Kochhar'


Walking the Tree(遍历树)

  The row or rows to be used as the root of the tree are determined by the START WITH clause(行,或行S将作为树根决定于START WITH子句). The START WITH clause can be used in conjunction with any valid condition.
START WITH子句在任何有效的条件连接使用

 Examples

 Using the EMPLOYEES table, start with King, the president of the company.
	... START WITH manager_id IS NULL

 Using the EMPLOYEES table, start with employee Kochhar. A START WITH condition can contain a subquery.
	... START WITH employee_id = (SELECT employee_id
                          	        FROM    employees
                        	        WHERE   last_name = 'Kochhar')

  If the START WITH clause is omitted(忽略), the tree walk is started with all of the rows in the table as root rows(树的遍历将从表中所有的行,作为根开始遍历). If a WHERE clause is used, the walk is started with all the rows that satisfy the WHERE condition. This no longer reflects a true hierarchy.
如果WHERE子句使用了,遍历将从所有的行开始,并且满足WHERE条件.这将不再返回树的层次.

 Note: The clauses CONNECT BY PRIOR and START WITH are not ANSI SQL standard.

  Instructor Note
    You may wish to add that multiple hierarchical outputs are generated if more than one row satisfies the START WITH condition.



			遍历树(有方向的查询)

CONNECT BY PRIOR column1=column2

从顶向下遍历,用employees表   父列是employee_id,子列是manager_id
...CONNECT BY PRIOR employee_id=manager_id

方向
	从顶向下	---->Column1=Parent Key 
			     Column2=Child Key	

	从底向上	---->Column1=Child Key
			     Column2=Parent Key 


Walking the Tree (continued)

   The direction of the query, whether it is from parent to child or from child to parent, is determined by the CONNECT BY PRIOR column placement. The PRIOR operator refers to the parent row(PRIOR操作涉及到父行). To find the children of a parent row, the Oracle Server evaluates the PRIOR expression for the parent row and the other expressions for each row in the table(为了找到父行中的子行,ORACLE SERVER将PRIOR作用于父行,并且其它的表达式操作将用于表中的其它行,行的条件为真则说明子行在父行中). Rows for which the condition is true are the children of the parent. The Oracle Server always selects children by evaluating the CONNECT BY condition with respect to a current parent row.

Examples

Walk from the top down(自顶向下) using the EMPLOYEES table. Define a hierarchical relationship in which the EMPLOYEE_ID value of the parent row is equal to the MANAGER_ID value of the child row.
	... CONNECT BY PRIOR employee_id = manager_id

 Walk from the bottom up using the EMPLOYEES table.
	 ... CONNECT BY PRIOR manager_id = employee_id

  The PRIOR operator does not necessarily need to be coded immediately following the CONNECT BY(PRIOR操作符并不是非要直接跟随在CONNECT BY后). Thus, the following CONNECT BY PRIOR clause gives the same result(与先前的例子有相同的结果) as the one in the preceding example.
	 ... CONNECT BY employee_id = PRIOR manager_id 

 Note: The CONNECT BY clause cannot contain a subquery. ././././.不能包含子查询



			遍历树:从底向上

SELECT employee_id, last_name, job_id, manager_id
FROM   employees
START  WITH  employee_id = 101
CONNECT BY PRIOR manager_id = employee_id ;	//从底向上

EMPLOYEE_ID LAST_NAME                 JOB_ID     MANAGER_ID
----------- ------------------------- ---------- ----------
        101 Kochhar                   AD_VP             100
        100 King                      AD_PRES			


Walking the Tree: From the Bottom Up 

  The example in the slide displays a list of managers starting with the employee whose employee ID is 101.

Example

  In the following example, EMPLOYEE_ID values are evaluated for the parent row and MANAGER_ID, and SALARY values are evaluated for the child rows. The PRIOR operator applies only to the EMPLOYEE_ID value.

	... CONNECT BY PRIOR employee_id = manager_id
					AND salary > 15000;

SQL>  SELECT employee_id, last_name, job_id, manager_id
  2   FROM   employees
  3   START  WITH  employee_id = 101
  4   CONNECT BY PRIOR manager_id = employee_id
  5  and salary>15000;

EMPLOYEE_ID LAST_NAME                 JOB_ID     MANAGER_ID
----------- ------------------------- ---------- ----------
        101 Kochhar                   AD_VP             100
        100 King                      AD_PRES

  To qualify as a child row, a row must have a MANAGER_ID value equal to the EMPLOYEE_ID value of the parent row and must have a SALARY value greater than $15,000.
为了取得子行,要有列MANAGER_ID的值=父行中employee_id的值../././


Instructor Note
:: In the context of the second paragraph, you may wish to include that additional conditions added to the CONNECT BY PRIOR clause potentially eliminated the whole of the branch(潜在的限制树支), hence the EMPLOYEE_ID AND SALARY are evaluated for the parent row to determine if it is to be part of the output.

SQL> SELECT  last_name||' reports to '|| PRIOR   last_name "Walk Top Down"
  2  FROM    employees
  3  start with employee_id=101
  4  connect by prior manager_id=employee_id;

Walk Top Down
--------------------------------//插一下
Kochhar reports to
King reports to Kochhar

SQL> SELECT  last_name||' reports to '|| last_name "Walk Top Down"
  2   FROM    employees
  3   start with employee_id=101
  4   connect by prior manager_id=employee_id;

Walk Top Down
--------------------------------------------------------------
Kochhar reports to Kochhar
King reports to King



		遍历树:  从顶向下

SELECT  last_name||' reports to '|| PRIOR   last_name "Walk Top Down"
FROM    employees
START   WITH last_name = 'King'
CONNECT BY PRIOR employee_id = manager_id ;


Walking the Tree: From the Top Down

  Walking from the top down, display the names of the employees and their manager. Use employee King as the starting point. Print only one column.

Walk Top Down
--------------------------------
King reports to
King reports to
Kochhar reports to King
Greenberg reports to Kochhar
Faviet reports to Greenberg
Chen reports to Greenberg
Sciarra reports to Greenberg
Urman reports to Greenberg
Popp reports to Greenberg
Whalen reports to Kochhar
Mavris reports to Kochhar

SELECT  last_name||' reports to '|| last_name "Walk Top Down"
FROM    employees
START   WITH last_name = 'King'
CONNECT BY PRIOR employee_id = manager_id ;

Walk Top Down
------------------------------------------
King reports to King
King reports to King
Kochhar reports to Kochhar
Greenberg reports to Greenberg
Faviet reports to Faviet
Chen reports to Chen
Sciarra reports to Sciarra
Urman reports to Urman
Popp reports to Popp
Whalen reports to Whalen
Mavris reports to Mavris
...

			用LEVEL伪列将行分等级

Ranking Rows with the LEVEL Pseudocolumn

  You can explicitly show the rank or level of a row in the hierarchy by using the LEVEL pseudocolumn(伪列). This will make your report more readable(这将使你的报告更容易读).  The forks where one or more branches split away from a larger branch are called nodes, and the very end of a branch is called a leaf, or leaf node. The diagram in the slide shows the nodes of the inverted tree with their LEVEL values. For example, employee Higgens is a parent and a child, while employee Davies is a child and a leaf.

  The LEVEL Pseudocolumn

Value	Level
1	A root node(根)
2 	A child of a root node(根的孩子)
3 	A child of a child, and so on(根的孩子的孩子...)

 
Note: A root node is the highest node within an inverted tree. A child node is any nonroot node. A parent node is any node that has children. A leaf node is any node without children. The number of levels returned by a hierarchical query may be limited by available user memory.

 In the slide, King is the root or parent (LEVEL = 1). Kochhar, De Hann, Mourgos, Zlotkey, Hartstein, Higgens, and Hunold are children and also parents (LEVEL = 2). Whalen, Rajs, Davies, Matos, Vargas, Gietz, Ernst, Lorentz, Abel, Taylor, Grant, and Fay are children and leaves. 
(LEVEL = 3 and LEVEL = 4)


			用 LEVEL 和 LPAD 格式化分级报告

  创建一个报告显示公司的管理层,从最高级别开始,缩进下面跟随的级别

COLUMN org_chart FORMAT A18
SELECT LPAD(last_name, LENGTH(last_name)+(LEVEL*2)-2,'_') 
       AS org_chart
FROM   employees 
START WITH last_name='King' 
CONNECT BY PRIOR employee_id=manager_id;


Formatting Hierarchical Reports Using LEVEL

 The nodes in a tree are assigned level numbers from the root. Use the LPAD function in conjunction with the pseudocolumn LEVEL to display a hierarchical report as an indented tree.(交错树状)

 In the example on the slide:

   : LPAD(char1,n [,char2]) returns char1, left-padded to length n with the sequence of characters in char2. The argument n is the total length of the return value as it is displayed on your terminal screen.
   : LPAD(last_name, LENGTH(last_name)+(LEVEL*2)-2,'_')defines the display format.
   : char1 is the LAST_NAME , n the total length of the return value, is length of the LAST_NAME +(LEVEL*2)-2 ,and  char2  is '_'.

 In other words, this tells SQL to take the LAST_NAME and left-pad it with the '_' character till the length of the resultant string is equal to the value determined by LENGTH(last_name)+(LEVEL*2)-2. 

  For King, LEVEL = 1. Hence, (2 * 1) - 2 = 2 - 2 = 0. So King does not get padded with any '_' character and is displayed in column 1. 

 For Kochhar, LEVEL = 2. Hence, (2 * 2) - 2 = 4 - 2 = 2 . So Kochhar gets padded with 2 '_' characters and is displayed indented.

 The rest of the records in the EMPLOYEES table are displayed similarly. 


Formatting Hierarchical Reports Using LEVEL (continued)			

King
__Kochhar
____Greenberg
______Faviet
______Chen
______Sciarra
______Urman
______Popp
____Whalen
____Mavris

ORG_CHART
--------------------
____Baer
____Higgins
______Gietz
__De Haan
____Hunold
______Ernst
______Austin
______Pataballa
______Lorentz


				修剪分支

	用 WHERE 子句 				用 CONNECT BY 子句
	去除一个结点(node)叶子还要		去除一个分支(node,叶子都不要了)
	
Where last_name !='Higgins'		CONNECT BY PRIOR
					employee_id=manager_id
					AND last_name !='Higgins'
	范围:小				    范围:大


Pruning Branches
  You can use the WHERE and CONNECT BY clauses to prune the tree; that is, to control which nodes or rows are displayed(控制哪些节点或行S不被显示). The predicate you use acts as a Boolean condition.

Examples

Starting at the root, walk from the top down, and eliminate employee Higgins in the result, but process(保留) the child rows.
     SELECT  department_id, employee_id,last_name, job_id, salary
     FROM    employees
     WHERE   last_name  != 'Higgins'
     START   WITH manager_id IS NULL
     CONNECT BY PRIOR employee_id = manager_id;

  Starting at the root, walk from the top down, and eliminate employee Higgins and all child rows.(除去整个分支)
     SELECT  department_id, employee_id,last_name, job_id, salary
     FROM    employees
     START   WITH manager_id IS NULL
     CONNECT BY PRIOR employee_id = manager_id
     AND     last_name != 'Higgins';

你可能感兴趣的:(oracle)