pass night

[Database] Database Notes

Introduction

Main content

The data models
SQL language and user interface
Key principles of a DBMS: architecture, query optimization, concurrency control, recovery
The security and integrity constraints of databases
Database design
New research and application fields

What is database

definition

A very large, integrated collection of data

Function

models a real-world enterprise

entities, e.g. students and courses
relationships, e.g. electives (which students take which courses)

Why use a DBMS

data independence and efficient access
reduced application development time
data integrity and security
uniform data administration
concurrent access, recovery from crashes

File vs. Database

application must stage large datasets between main memory and secondary storage
special code for different queries
must protect data from inconsistency caused by multiple concurrent users
crash recovery
security and access control

Concepts

Data are symbols that describe things in the real world. They are the form in which information exists.
A data model is a collection of concepts and definitions for describing data.
A schema is a description of a particular collection of data, using a given data model.
The relational model of data is the most widely used model today.
1. Relation: basically a table with rows and columns
2. Every relation has a schema, which describes the columns, or fields

The ANSI-SPARC architecture


Many views: views describe how users see the data
Conceptual schema: defines the logical structure
Physical schema: describes the files and indexes used

What is DBMS

definition

A software package designed to store and manage databases

History of DBMS

no data model
simple file operation
network data model
hierarchical data model
relational data model

database system

database system = applications + DBMS + database + DBA (database administrator)
The DBMS is the core of a database system
1. high level user interface
2. query processing and optimization
3. catalog management
4. concurrency control and recovery
5. integrity constraints checking
6. Access control

Data Models

Hierarchical Data Model

Basic idea

Because many things in the real world are organized hierarchically, the hierarchical model describes the real world as a tree structure.

Basic concepts

Record

Field

PCR (parent-child-relationship)

the most basic data relationship in hierarchical model

Hierarchical data schema

a hierarchical data schema consists of PCRs.
Every PCR expresses one 1:N relationship
every record type can only have one parent

Virtual record

A virtual record is used to represent situations that the hierarchical model cannot represent directly.

Network data model

Basic idea

The basic data structure is the "set", which represents a 1:N relationship between things in the real world. The "1" side is called the owner, and the "N" side is called the member.
One record type can be the owner of multiple sets and can also be a member of multiple sets. Many sets together form a network structure that expresses the real world.
This breaks through the limits of the hierarchical structure, so non-hierarchical data can be expressed more easily.
Records and data items: data items are similar to fields in the hierarchical model, but a data item can be a vector.
Link record type: used to represent self relationships and end-to-end relationships.

Set

basic unit for network data schema

Link record type

A link record type is used to represent relationships that sets cannot represent directly.

self relationship

end to end relationship

L represents LINK

Relation data model

basic idea

The basic data structure is the "table", or relation. Things in the real world and the relationships between them are all expressed as tables, so the model can be studied with rigorous mathematical methods. This raised database technology to the level of theory.

features

Based on set theory, with a high level of abstraction
Hides all lower-level details; simple, clear, and easy to understand
Supports a new algebra system: relational algebra
Non-procedural query language: SQL
Soft links: the essential difference from the earlier data models

Soft link

Some concepts

Attributes and domain

The features of an entity in the real world are expressed as attributes in the relational model.
The range of values an attribute may take is called its domain. Example: age is a positive integer and cannot be larger than 1000.

Relation and tuple

An entity of the real world can be expressed as one or more relations.
A relation is an n-ary relationship defined on the domains of all of its attributes: $R=(A_1, A_2, \dots, A_n)$
This is called the schema of R, and n, the number of attributes, is called the degree of R.

Primary key

A set of attributes is a candidate key for a relation if
1. no two distinct tuples can have the same values in this set of attributes, and
2. this is not true for any proper subset of this set of attributes. For example, id is unique, and id+name is also unique, but only id is minimal.
  1. Super key: a set of attributes that satisfies condition 1 only. id is a candidate key; "id+name" is a super key.
  2. If there is more than one candidate key for a relation, one of them is chosen to be the primary key, and the others are called alternate keys.
  3. If the primary key consists of all attributes of a relation, it is called an all-key.
A key determines a tuple uniquely. For example, sid is a key for Students, and the set {sid, gpa} is a super key.
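
The two conditions above can be checked mechanically on a sample of tuples. The following is a minimal sketch, assuming each tuple is a Python dict; the function names and the sample `students` data are made up for illustration. Note that a key is a property of the schema, so a sample can only show that an attribute set is not a key; it cannot prove that it is one.

```python
from itertools import combinations

def is_superkey(rows, attrs):
    """True if no two rows agree on every attribute in attrs."""
    seen = set()
    for row in rows:
        key = tuple(row[a] for a in attrs)
        if key in seen:
            return False
        seen.add(key)
    return True

def is_candidate_key(rows, attrs):
    """A superkey none of whose non-empty proper subsets is a superkey."""
    if not is_superkey(rows, attrs):
        return False
    return not any(
        is_superkey(rows, subset)
        for size in range(1, len(attrs))
        for subset in combinations(attrs, size)
    )

# Hypothetical sample instance of Students(sid, name, gpa)
students = [
    {"sid": 1, "name": "Ann", "gpa": 3.5},
    {"sid": 2, "name": "Bob", "gpa": 3.5},
    {"sid": 3, "name": "Ann", "gpa": 3.9},
]

print(is_candidate_key(students, ["sid"]))         # sid alone is minimal
print(is_candidate_key(students, ["sid", "gpa"]))  # unique, but not minimal
```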

Foreign key

A set of attributes in one relation that is used to "refer" to a tuple in another relation, like a logical pointer

ER Data Model

Entity (E): a real-world object distinguishable from other objects. An entity is described using a set of attributes.
Entity set: a collection of similar entities
1. all entities in an entity set have the same set of attributes
2. each entity set has a key
3. each attribute has a domain
4. composite and multi-valued attributes are permitted
Relationship (R): an association among two or more entities
1. a relationship can have attributes
Relationship set: a collection of similar relationships

Object-Oriented Data Model

Relational algebra

Basic operations

Selection ( $\sigma$ ): selects a subset of rows from a relation
Projection ( $\pi$ ): deletes unwanted columns from a relation
Cross-product ( $\times$ ): allows us to combine two relations
Set-difference ( $-$ ): tuples in reln. 1 but not in reln. 2
Union ( $\cup$ ): tuples in reln. 1 or in reln. 2

$\{\sigma, \pi, \cup, -, \times\}$ is a complete set of operations

the algebra is “closed”

Other operations

Condition join: $R \bowtie_C S = \sigma_C(R\times S)$
Division: $A/B = \{x \mid \forall y \in B,\ (x,y) \in A\} = \pi_x(A)-\pi_x((\pi_x(A)\times B)-A)$
Outer union ( $\underline{\cup}$ ): attribute values that do not exist in the original tuples are filled with NULL
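
The basic operations and the division identity can be tried out on small sets. This is a minimal sketch, assuming a relation is a Python set of tuples and attributes are addressed by position; the function names and the sample data are illustrative only. `divide` follows the formula $\pi_x(A)-\pi_x((\pi_x(A)\times B)-A)$ for a binary relation A(x, y) and a unary relation B(y).

```python
def select(rel, pred):
    """sigma: keep the tuples that satisfy pred."""
    return {t for t in rel if pred(t)}

def project(rel, idxs):
    """pi: keep only the columns at the given positions (duplicates vanish)."""
    return {tuple(t[i] for i in idxs) for t in rel}

def cross(r, s):
    """Cross-product: concatenate every tuple of r with every tuple of s."""
    return {a + b for a in r for b in s}

def divide(a, b):
    """A(x, y) / B(y): the x values paired in A with every y in B."""
    xs = project(a, [0])
    missing = cross(xs, b) - a      # (x, y) pairs that A lacks
    return xs - project(missing, [0])

# Hypothetical data: Reserves(sid, bid) and Boats(bid)
reserves = {(1, 101), (1, 102), (2, 101), (3, 101), (3, 102)}
boats = {(101,), (102,)}

print(divide(reserves, boats))  # sailors 1 and 3 reserved every boat
```

Every function takes relations and returns a relation, which is what "closed" means for the algebra.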

Relational Calculus

Relational algebra describes the procedure for computing a result, but relational calculus only describes the result that is wanted

Two flavors:
1. tuple relational calculus: variables range over tuples
2. domain relational calculus: variables range over domain elements

Tuple relational calculus

Example:

A query has the form:

$\{t[\langle \text{attribute list} \rangle] \mid P(t)\}$

t is called a tuple variable.

The answer includes all tuples t that make the formula P(t) true.

Example: find the names of all sailors whose rating is above 7 and who are younger than 50

$\{t[N] \mid t\in Sailors \land t.T>7 \land t.A<50\}$

Domain relational calculus

Example:
1. A query has the form: $\{\langle x_1, x_2, \dots, x_n \rangle \mid P(x_1, x_2, \dots, x_n, \dots, x_{n+m})\}$
2. $x_1, x_2, \dots, x_{n+m}$ are called domain variables; $x_1, x_2, \dots, x_n$ appear in the result
3. The answer includes all tuples $\langle x_1, x_2, \dots, x_n \rangle$ that make the formula $P(x_1, x_2, \dots, x_n, \dots, x_{n+m})$ true
4. A formula is defined recursively, starting with simple atomic formulas and building bigger formulas using the logical connectives

Formula

atomic formula

An atomic formula has one of the following forms:
$\langle x_1, x_2, \dots, x_n \rangle \in Rname$ , or X op Y, or X op constant, where op is one of $<, >, =, \le, \ge, \ne$

Definition

A formula is
an atomic formula,
or $\lnot p$, $p\land q$, $p\lor q$, where p and q are formulas,
or $\exists X(P(X))$ or $\forall X(P(X))$, where $X$ is free in $P(X)$. A variable to which a quantifier is applied is bound; a variable that is not bound is free.

Queries that have an infinite number of answers are called unsafe.

Example: find all sailors with a rating above 7

$\{\langle I, N, T, A \rangle \mid \langle I, N, T, A \rangle \in Sailors \land T>7\}$

Differences and Similarities between relational calculus and relational algebra

Differences:
1. relational algebra needs to specify the order of operations
2. relational calculus only needs to state the logical condition that the result must satisfy
Similarities:
1. they are equivalent in terms of expressive power
2. SQL can express any query that is expressible in relational algebra or relational calculus

User Interfaces and SQL Language

Content

query language
1. formal query language
2. tabular query language
3. graphic query language
4. limited natural language query language
interfaces and maintenance tools
APIs
class library

Important terms and concepts

base table
view
data type supported
null
unique
default
primary key
foreign key
CHECK integrity constraint

Conceptual evaluation strategy

The semantics of an SQL query are defined in terms of the following conceptual evaluation strategy:

Compute the cross-product of relation-list.
Discard resulting tuples if they fail qualifications.
Delete attributes that are not in target-list.
If DISTINCT is specified, eliminate duplicate rows.
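
The four steps can be followed literally in a few lines. This is only a sketch of the conceptual strategy, assuming rows are dicts whose keys carry the range variable as a prefix (such as `S.sid`); the function name, parameter names, and sample rows are made up. A real DBMS produces the same answer but almost never evaluates a query this way.

```python
from itertools import product

def evaluate(relation_list, qualification, target_list, distinct=False):
    result = []
    for combo in product(*relation_list):        # step 1: cross-product
        row = {}
        for part in combo:
            row.update(part)
        if not qualification(row):               # step 2: discard failing tuples
            continue
        result.append(tuple(row[a] for a in target_list))  # step 3: keep target-list
    if distinct:                                 # step 4: eliminate duplicates
        result = list(dict.fromkeys(result))
    return result

# Hypothetical instances of Sailors and Reserves
sailors = [{"S.sid": 22, "S.sname": "Dustin"}, {"S.sid": 31, "S.sname": "Lubber"}]
reserves = [
    {"R.sid": 22, "R.bid": 101},
    {"R.sid": 22, "R.bid": 103},
    {"R.sid": 31, "R.bid": 103},
]

# SELECT S.sname FROM Sailors S, Reserves R WHERE S.sid = R.sid
names = evaluate([sailors, reserves], lambda r: r["S.sid"] == r["R.sid"], ["S.sname"])
print(names)
```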

Levels of abstraction: ANSI-SPARC Architecture

views describe how users see the data
conceptual schema defines logical structure
physical schema describes the files and indexes used

Query Language

Basic SQL query

compute the cross-product of relation-list
discard resulting tuples if they fail qualifications
delete attributes that are not in target list
if DISTINCT is specified, eliminate duplicate rows

Union

definition

UNION can be used to compute the union of any two union-compatible set of tuples

example

Question: find the sids of sailors who have reserved a red or a green boat

Solution 1: use an OR condition:

SELECT S.sid FROM Sailors S, Boats B, Reserves R
WHERE S.sid=R.sid AND R.bid=B.bid AND (B.color='red' OR B.color='green')

Solution 2: use UNION:

(SELECT S.sid FROM Sailors S, Boats B, Reserves R
WHERE S.sid=R.sid AND R.bid=B.bid AND B.color='red')
UNION
(SELECT S.sid FROM Sailors S, Boats B, Reserves R
WHERE S.sid=R.sid AND R.bid=B.bid AND B.color='green')

Intersect

Question: find the sids of sailors who have reserved a red and a green boat

Solution 1: use an AND condition. A single boat cannot be both red and green, so the query needs two Boats/Reserves pairs:

SELECT S.sid
FROM Sailors S, Boats B1, Reserves R1, Boats B2, Reserves R2
WHERE S.sid=R1.sid AND R1.bid=B1.bid AND S.sid=R2.sid AND R2.bid=B2.bid
AND B1.color='red' AND B2.color='green'

Solution 2: use INTERSECT:

SELECT S.sid FROM Sailors S, Boats B, Reserves R WHERE S.sid=R.sid AND R.bid=B.bid AND B.color='red'
INTERSECT
SELECT S.sid FROM Sailors S, Boats B, Reserves R WHERE S.sid=R.sid AND R.bid=B.bid AND B.color='green'

Nested queries

IN:

SELECT S.sname FROM Sailors S WHERE S.sid IN (SELECT R.sid FROM Reserves R WHERE R.bid=103)

EXISTS:

SELECT S.sname FROM Sailors S WHERE EXISTS (SELECT * FROM Reserves R WHERE R.bid=103 AND S.sid=R.sid)

Division in SQL

Question: find sailors who have reserved all boats

Solution 1: use EXCEPT:

SELECT S.sname
FROM Sailors S
WHERE NOT EXISTS (
    (SELECT B.bid FROM Boats B)
    EXCEPT
    (SELECT R.bid FROM Reserves R WHERE R.sid=S.sid))

solution2:

SELECT S.sname
FROM Sailors S
WHERE NOT EXISTS
	(SELECT B.bid FROM Boats B
	WHERE NOT EXISTS
		(SELECT R.bid
		FROM Reserves R 
		WHERE R.bid=B.bid And R.sid=S.sid))
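
The double NOT EXISTS form reads as "there is no boat that this sailor has not reserved". A small runnable check, assuming SQLite through Python's `sqlite3` module and a few made-up rows (the column types and sample data are illustrative only):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Sailors (sid INTEGER PRIMARY KEY, sname TEXT);
CREATE TABLE Boats (bid INTEGER PRIMARY KEY, color TEXT);
CREATE TABLE Reserves (sid INTEGER, bid INTEGER);
INSERT INTO Sailors VALUES (22, 'Dustin'), (31, 'Lubber');
INSERT INTO Boats VALUES (101, 'red'), (102, 'green');
INSERT INTO Reserves VALUES (22, 101), (22, 102), (31, 101);
""")

# Sailors for whom no boat exists that they have not reserved
all_boats = conn.execute("""
SELECT S.sname FROM Sailors S
WHERE NOT EXISTS (
    SELECT B.bid FROM Boats B
    WHERE NOT EXISTS (
        SELECT R.bid FROM Reserves R
        WHERE R.bid = B.bid AND R.sid = S.sid))
""").fetchall()
print(all_boats)
```

Only Dustin has a reservation for both boats in this sample, so only that row is returned.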

Aggregate Operator

Aggregation Operators

COUNT(*)
COUNT([DISTINCT] A)
SUM([DISTINCT] A)
AVG([DISTINCT] A)
MAX(A)
MIN(A)

Example

Find those ratings for which the average age is the minimum over all ratings. Aggregate operations cannot be nested.

-- wrong: MIN(AVG(...)) nests two aggregates
SELECT S.rating
FROM Sailors S
WHERE S.age = (SELECT MIN(AVG(S2.age)) FROM Sailors S2)

-- right: compute the averages first, then take the minimum
-- (some systems only accept the second reference to Temp if Temp is defined with WITH)
SELECT Temp.rating
FROM (
    SELECT S.rating, AVG(S.age) AS avgage
    FROM Sailors S
    GROUP BY S.rating
) AS Temp
WHERE Temp.avgage = (SELECT MIN(Temp.avgage) FROM Temp)

Grouping

Find the age of the youngest sailor with age $\ge$ 18, for each rating with at least 2 such sailors

SELECT S.rating, MIN(S.age) AS minage
FROM Sailors S
WHERE S.age >= 18
GROUP BY S.rating
HAVING COUNT(*) > 1

Find the age of the youngest sailor with age $\ge$ 18, for each rating with at least 2 such sailors and with every such sailor aged 60 or under (the EVERY keyword)

SELECT S.rating, MIN(S.age) AS minage
FROM Sailors S
WHERE S.age >= 18
GROUP BY S.rating
HAVING COUNT(*) > 1 AND EVERY (S.age <= 60)

For each red boat, find the number of reservations for this boat (grouping over a join of two relations)

SELECT B.bid, COUNT(*) AS scount
FROM Boats B, Reserves R
WHERE R.bid = B.bid AND B.color='red'
GROUP BY B.bid

-- use HAVING in place of the WHERE condition on color;
-- B.color must then appear in GROUP BY
SELECT B.bid, COUNT(*) AS scount
FROM Boats B, Reserves R
WHERE R.bid = B.bid
GROUP BY B.bid, B.color
HAVING B.color='red'

Find the age of the youngest sailor with age > 18, for each rating with at least 2 sailors of any age (subquery in HAVING)

SELECT S.rating, MIN(S.age)
FROM Sailors S
WHERE S.age>18
GROUP BY S.rating
HAVING 1<(
	SELECT COUNT(*)
    FROM Sailors S2
    WHERE S2.rating = S.rating
)

Cast Expression

change the expression to the target data type
valid target type
use
1. match function parameters
2. change precision while calculating
3. assign a data type to NULL value

-- Students(name, school)
-- Soldiers(name, service)

CREATE VIEW prospects (name, school, service) AS
	SELECT name, school, CAST(NULL AS Varchar(20))
	FROM Students
UNION
	SELECT name, CAST(NULL AS Varchar(20)), service
	FROM Soldiers

Case Expression

SELECT name, CASE status 
				WHEN 1 THEN 'Active Duty'
				WHEN 2 THEN 'Reserve'
				WHEN 3 THEN 'Special Assignment'
				WHEN 4 THEN 'Retired'
				ELSE 'Unknown'
			 END AS status
		FROM Officers

subquery

Scalar sub-query: the result of the sub-query is a single value. It can be used wherever a value can occur.
Table expression: the result of the sub-query is a table. It can be used wherever a table can occur.
Common table expression: in some complex queries, a table expression may need to occur more than once in the same SQL statement. In this case, we define it once with the keyword WITH.

Scalar Sub-query

definition: the result of a sub-query is a single value. It can be used in the place where a value can occur

Find the names of the departments whose average bonus is higher than their average salary

SELECT d.deptname
FROM dept AS d
WHERE (SELECT avg(bonus) FROM emp WHERE deptno=d.deptno)
	>
	(SELECT avg(salary) FROM emp WHERE deptno=d.deptno)

List the deptno, deptname, and maximum salary of all departments located in New York

SELECT d.deptno, d.deptname,
(SELECT MAX(salary) FROM emp WHERE deptno=d.deptno) AS maxpay
FROM dept AS d
WHERE d.location='New York'

Table Expression

definition: the result of a sub query is a table. It can be used in the place where a table can occur

Find the departments whose total payment is greater than 200000

SELECT deptno, totalpay
FROM
(SELECT deptno, SUM(salary)+SUM(bonus) AS totalpay FROM emp GROUP BY deptno) AS payroll
WHERE totalpay>200000

Common Table Expression

Definition: in some complex queries, a table expression may need to occur more than once in the same SQL statement. It is defined once with the keyword WITH and can then be referenced several times.

Find the department that has the highest total payment

WITH payroll (deptno, totalpay) AS
	(SELECT deptno, sum(salary)+sum(bonus) FROM emp GROUP BY deptno)
SELECT deptno
FROM payroll
WHERE totalpay = (SELECT MAX(totalpay) FROM payroll)

Outer Join

An extension of the join operation
In a join, only matching tuples that satisfy the join condition are kept in the result; an outer join also keeps the unmatched tuples, and the vacant attributes are set to NULL
left outer join $(*\bowtie)$ : keep all tuples of left relation in the result
right outer join $(\bowtie*)$ : keep all tuples of right relation in the result
full outer join $(*\bowtie*)$ : keep all tuples of left and right relations in the result
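
The difference between an inner join and a left outer join shows up as soon as one sailor has no reservation. A minimal check, assuming SQLite through Python's `sqlite3` module and made-up rows. Only the left outer join is shown because older SQLite versions do not support right and full outer joins.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Sailors (sid INTEGER, sname TEXT);
CREATE TABLE Reserves (sid INTEGER, bid INTEGER);
INSERT INTO Sailors VALUES (22, 'Dustin'), (31, 'Lubber');
INSERT INTO Reserves VALUES (22, 101);
""")

# Inner join: Lubber has no matching Reserves tuple and disappears
inner = conn.execute("""
SELECT S.sname, R.bid FROM Sailors S
JOIN Reserves R ON S.sid = R.sid ORDER BY S.sid
""").fetchall()

# Left outer join: Lubber is kept and the vacant bid is NULL (None in Python)
left = conn.execute("""
SELECT S.sname, R.bid FROM Sailors S
LEFT OUTER JOIN Reserves R ON S.sid = R.sid ORDER BY S.sid
""").fetchall()

print(inner)
print(left)
```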

Recursion

If a common table expression uses itself in its definition, this is called recursion.

find all employees under the management of Hoover and whose salary is more than 100000

WITH agents (name, salary) AS
	((SELECT name, salary FROM FedEmp WHERE manager='Hoover') 
	UNION ALL
	(SELECT f.name, f.salary FROM agents AS a, FedEmp AS f WHERE f.manager=a.name))
SELECT name
FROM agents
WHERE salary>100000

Find how many rivets are used in one wing (recursive calculation)

WITH wingpart (subpart, qty) AS
	((SELECT subpart, qty FROM components WHERE part = 'wing')
	UNION ALL
	(SELECT c.subpart, w.qty*c.qty FROM wingpart w, components c WHERE w.subpart=c.part))
SELECT sum(qty) AS qty FROM wingpart WHERE subpart='rivet'

Find the lowest total cost route from SFO to JFK (recursive search)

WITH trips (destination, route, nsegs, totalcost) AS
	((SELECT destination, CAST(destination AS varchar(20)), 1, cost FROM flights WHERE origin='SFO')
	UNION ALL
	(SELECT f.destination, CAST(t.route||','||f.destination AS varchar(20)),
     t.nsegs+1, t.totalcost+f.cost FROM trips t, flights f WHERE t.destination=f.origin
     AND f.destination != 'SFO' AND f.origin != 'JFK' AND t.nsegs <= 3))
SELECT route, totalcost FROM trips WHERE destination='JFK' AND totalcost = (SELECT min(totalcost) FROM trips WHERE destination='JFK')
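
The rivet-counting query can be run on a small parts explosion. This sketch assumes SQLite through Python's `sqlite3` module, which spells the clause WITH RECURSIVE and does not accept parentheses around the branches of a UNION ALL; the `components` rows are invented for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE components (part TEXT, subpart TEXT, qty INTEGER);
INSERT INTO components VALUES
    ('wing', 'strut', 5), ('wing', 'aileron', 1), ('wing', 'rivet', 100),
    ('strut', 'rivet', 10),
    ('aileron', 'hinge', 2), ('aileron', 'rivet', 5),
    ('hinge', 'rivet', 4);
""")

rivets = conn.execute("""
WITH RECURSIVE wingpart (subpart, qty) AS (
    SELECT subpart, qty FROM components WHERE part = 'wing'
    UNION ALL
    SELECT c.subpart, w.qty * c.qty
    FROM wingpart w, components c
    WHERE w.subpart = c.part
)
SELECT SUM(qty) FROM wingpart WHERE subpart = 'rivet'
""").fetchone()[0]

# 100 direct + 5*10 via struts + 1*5 via the aileron + 1*2*4 via hinges
print(rivets)
```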

Data Manipulation Language

Insert: insert a tuple into a table
Delete: delete the tuples that satisfy the qualification
Update: update attribute values of the tuples that satisfy the qualification

View in SQL

General view
1. virtual tables derived from base tables
2. logical data independence
3. security of data
4. update problems of views
Temporary view and recursive query
1. WITH
2. RECURSIVE

Embedded SQL

To access a database from a program and process the query results further, SQL must be combined with a programming language.

Usage of Embedded SQL in C

Each embedded statement begins with EXEC SQL and ends with ;
Host variables transfer information between C and SQL. They are defined in a declare section that begins with EXEC SQL.
In SQL statements, a colon (:) is added before each host variable to distinguish it from SQL's own variables and attribute names.
In the host language, such as C, host variables are used like ordinary variables.
Host variables cannot be defined as structures.
A special host variable, SQLCA (SQL Communication Area), is brought in with EXEC SQL INCLUDE SQLCA
Use SQLCA.SQLCODE to judge the state of the result
Use an indicator (short int) to treat NULL in the host language

Example of host variables defining

EXEC SQL BEGIN DECLARE SECTION;
	char SNO[7];
	char GIVENSNO[7];
	char CNO[6];
	char GIVENCNO[6];
	float GRADE;
	short GRADEI; /* indicator of GRADE */
EXEC SQL END DECLARE SECTION;

// CONNECT
EXEC SQL CONNECT :uid IDENTIFIED BY :pwd;

// Execute DDL or DML Statements
EXEC SQL INSERT INTO SC(SNO,CNO,GRADE)
VALUES(:SNO, :CNO, :GRADE);

// Execute Query Statements
EXEC SQL SELECT GRADE
INTO :GRADE :GRADEI
FROM SC
WHERE SNO=:GIVENSNO AND
CNO=:GIVENCNO;

Cursor

// Define a cursor
EXEC SQL DECLARE <cursor name> CURSOR FOR
SELECT …
FROM …
WHERE …;

// Open the cursor
EXEC SQL OPEN <cursor name>;
// Fetch data from the cursor
EXEC SQL FETCH <cursor name>
INTO :hostvar1, :hostvar2, …;
// SQLCA.SQLCODE returns 100 when the end of the cursor is reached
// Close the cursor
EXEC SQL CLOSE <cursor name>;
    
// an example of query with cursor
EXEC SQL DECLARE C1 CURSOR FOR
SELECT SNO, GRADE
FROM SC
WHERE CNO = :GIVENCNO;
EXEC SQL OPEN C1;
if (SQLCA.SQLCODE < 0) exit(1); /* the query failed */
while (1) {
    EXEC SQL FETCH C1 INTO :SNO, :GRADE :GRADEI;
    if (SQLCA.SQLCODE == 100) break; /* end of cursor */
    /* process the fetched data (omitted) */
}
EXEC SQL CLOSE C1;

Dynamic SQL

// dynamic SQL executed directly
EXEC SQL BEGIN DECLARE SECTION;
char sqlstring[200];
EXEC SQL END DECLARE SECTION;


char cond[150];
strcpy(sqlstring, "DELETE FROM STUDENT WHERE ");
printf("Enter search condition: ");
fgets(cond, sizeof cond, stdin); /* reads the whole line, so the condition may contain spaces */
strcat(sqlstring, cond);

EXEC SQL EXECUTE IMMEDIATE :sqlstring;

// Dynamic SQL with dynamic parameters
EXEC SQL BEGIN DECLARE SECTION;
char sqlstring[200];
int birth_year;
EXEC SQL END DECLARE SECTION;

strcpy(sqlstring, "DELETE FROM STUDENT WHERE YEAR(BDATE) <= :y; ");
printf("Enter birth year for delete: ");
scanf("%d", &birth_year);
EXEC SQL PREPARE purge FROM :sqlstring;
EXEC SQL EXECUTE purge USING :birth_year;

// Dynamic SQL for query
EXEC SQL BEGIN DECLARE SECTION;
char sqlstring[200];
char SNO[7];
float GRADE;
short GRADEI;
char GIVENCNO[6];
EXEC SQL END DECLARE SECTION;


char orderby[150];
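strcpy(sqlstring, "SELECT SNO,GRADE FROM SC WHERE CNO= :c "); /* trailing space separates the appended clause */
printf("Enter the ORDER BY clause: ");
fgets(orderby, sizeof orderby, stdin); /* reads the whole line, e.g. ORDER BY GRADE */
strcat(sqlstring, orderby);
printf("Enter the course number: ");
scanf("%5s", GIVENCNO);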
EXEC SQL PREPARE query FROM :sqlstring;

EXEC SQL DECLARE grade_cursor CURSOR FOR query;
EXEC SQL OPEN grade_cursor USING :GIVENCNO;
if (SQLCA.SQLCODE < 0) exit(1); /* the query failed */
while (1) {
    EXEC SQL FETCH grade_cursor INTO :SNO, :GRADE :GRADEI;
    if (SQLCA.SQLCODE == 100) break; /* end of cursor */
    /* process the fetched data (omitted) */
}
EXEC SQL CLOSE grade_cursor;

// Stored procedure
EXEC SQL
CREATE PROCEDURE drop_student
(IN student_no CHAR(7),
OUT message CHAR(30))
BEGIN ATOMIC
DELETE FROM STUDENT
WHERE SNO=student_no;
DELETE FROM SC
WHERE SNO=student_no;
SET message = student_no || ' dropped';
END;
/* ... other code omitted ... */
EXEC SQL CALL drop_student(…); /* call this stored procedure later */

Database management System

DBMS process structure

Single-process structure: the application and the DBMS are compiled into a single .exe file
Multi-process structure: each application process corresponds to one DBMS core process
Multi-thread structure: there is only one DBMS process, and each application process corresponds to one DBMS core thread

Database Access Management

Access types

query all or most records of a file (>15%)
query one particular record
query some records (<15%)
range query
update

File Organization

Heap file: records are stored in insertion order and retrieved sequentially. This is the most basic and general form of file organization.
Direct file: the record address is computed by a hash function from the value of some attribute
Index file: index + heap file/cluster
Grid structure file: suitable for multi-attribute queries
Raw disk

Index Technique

B+ tree (very common)
Clustering index (common)
Inverted file
Dynamic hashing
Grid structure file and partitioned hash function
Bitmap index (used in data warehouses)
Others

Why do we use the B+ tree in a DBMS:

In a B+ tree, the internal nodes store only keys, which serve as the index, and the records are stored in the leaf nodes. In a B tree, each key is stored once, together with its record, at whatever level it occurs, so keys are not duplicated. In a B+ tree, the leaf nodes are linked to each other to provide sequential access; in a B tree, they are not.

The B+ tree is a balanced multiway search tree, not a binary one: it ensures that all leaf nodes stay at the same depth, and each node holds many keys, so the tree is shallow and a lookup needs few disk reads. Because the leaf nodes are linked in a list, a B+ tree supports random access as well as sequential access.

Query optimization

The query statement submitted by the user is first "rewritten", and then the most efficient operation methods and steps are decided

algebra optimization
operation optimization

Equivalent Transform

Commutative law of $\bowtie$ / $\times$: $E_1 \times E_2 \equiv E_2\times E_1$
Associative law of $\bowtie$ / $\times$: $E_1 \times (E_2\times E_3) \equiv (E_1\times E_2) \times E_3$
Cascade rule of $\Pi$: $\Pi_{A_1\dots A_n}(\Pi_{B_1\dots B_m}(E))\equiv \Pi_{A_1\dots A_n}(E)$ when $\{A_1,\dots,A_n\} \subseteq \{B_1,\dots,B_m\}$
…

Basic principle

push down the unary operations as low as possible
look for and combine the common sub-expression

operation optimization

Nested loop: scan the inner relation once for every tuple of the outer relation
Merge scan: sort relations R and S on the join attribute on disk beforehand, then merge them in one pass
Use an index or a hash to look up matching tuples
Hash join
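
To make the cost difference concrete, here is a minimal in-memory sketch of two of the join methods listed above, assuming relations are lists of tuples and the join attribute is given by position; the function names and sample rows are illustrative only. The nested loop compares every pair of tuples, while the hash join reads each relation once.

```python
from collections import defaultdict

def nested_loop_join(outer, inner, key_outer, key_inner):
    """Scan the whole inner relation once per outer tuple."""
    return [(o, i) for o in outer for i in inner if o[key_outer] == i[key_inner]]

def hash_join(build, probe, key_build, key_probe):
    """Build a hash table on one relation, then probe it with the other."""
    table = defaultdict(list)
    for b in build:
        table[b[key_build]].append(b)
    return [(b, p) for p in probe for b in table.get(p[key_probe], [])]

# Hypothetical Sailors(sid, sname) and Reserves(sid, bid)
sailors = [(22, "Dustin"), (31, "Lubber"), (58, "Rusty")]
reserves = [(22, 101), (58, 103), (22, 102)]

nl = nested_loop_join(sailors, reserves, 0, 0)
hj = hash_join(sailors, reserves, 0, 0)
print(sorted(nl) == sorted(hj))  # same tuples, possibly in a different order
```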

Recovery

Introduction

reduce the likelihood of failures (prevent)
recover from failures
1. redundancy
2. should inspect all possible failures

Periodical dumping

backup + log

Log: a record of all changes to the DB since the last backup was made
1. some transactions may be half done: they should be undone
2. some transactions have finished, but their results have not been written to the DB: they should be redone

Transaction

A transaction T is a finite sequence of actions on the DB exhibiting the following properties (ACID):

Atomicity: all or nothing
Consistency preservation: a transaction takes the DB from one consistent state to another
Isolation: concurrent transactions should run as if they were independent of each other
Durability: the effects of a successfully completed transaction are permanently reflected in the DB

Commit rule and log ahead rule

Some related structures
1. Active Transaction List (ATL): records the TIDs of all transactions that are executing and not yet committed
2. Transaction Identifier (TID)
3. Committed Transaction List (CTL): records the TIDs of all committed transactions
4. Before Image (BI), After Image (AI): can be viewed as a pair of files
5. Check Point (CP)
6. Message Manager (MM)
Commit rule: the AI must be written to non-volatile storage before the transaction is committed, so that even if a failure occurs after the transaction enters the commit stage, the recorded AI can still be used to redo the update. This ensures that the transaction satisfies the ACID properties.
Log-ahead rule: if the AI is written directly to the database before the transaction is committed, the corresponding BI must first be written to the log, so that the update can be undone when a failure occurs before the transaction enters the commit stage. This ensures that the transaction satisfies the ACID properties.
Recovery strategies: undo and redo are idempotent
1. undo(undo(…)) = undo()
2. redo(redo(…)) = redo()
Three types of update strategy
1. AI -> DB before commit
  1. TID -> active list
  2. BI -> log
  3. AI -> DB
  4. …
  5. TID -> commit list
  6. delete TID from active list (steps 5 and 6 are the commit procedure)
2. AI -> DB after commit
  1. TID -> active list
  2. AI -> log
  3. …
  4. TID -> commit list
  5. AI -> DB
  6. …
  7. delete TID from active list
3. AI -> DB concurrently with commit
  1. TID -> active list
  2. AI, BI -> log
  3. AI -> DB (partially done)
  4. …
  5. TID -> commit list
  6. AI -> DB (complete)
  7. delete TID from active list
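
Undo and redo can be illustrated with a toy log. This is a sketch under simplifying assumptions that are not part of the notes: the database is a dict, each log record is a tuple (TID, item, BI, AI), both images are always logged, and no two transactions touch the same item. All names and values are hypothetical.

```python
def recover(db, log, committed):
    """Undo uncommitted updates with the BI, redo committed ones with the AI."""
    for tid, item, before, _after in reversed(log):   # undo, newest first
        if tid not in committed:
            db[item] = before
    for tid, item, _before, after in log:             # redo, oldest first
        if tid in committed:
            db[item] = after
    return db

# State found on disk after a crash: T1 committed but B was never written;
# T2 wrote C before committing and then never committed.
crashed = {"A": 50, "B": 20, "C": 9}
log = [
    ("T1", "A", 100, 50),
    ("T2", "C", 5, 9),
    ("T1", "B", 20, 70),
]
committed = {"T1"}

recovered = recover(dict(crashed), log, committed)
print(recovered)
# Running recovery again on the recovered state changes nothing (idempotence)
print(recover(dict(recovered), log, committed) == recovered)
```

Because every step writes an absolute image rather than a delta, repeating recovery after a second crash is harmless, which is the point of undo(undo(…)) = undo() and redo(redo(…)) = redo().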

Concurrency Control

Introduction to Concurrency

In a multi-user DBMS, multiple transactions are permitted to access the database concurrently

Why

improving system utilization and response time
different transactions may access different parts of the database

Problems arising from concurrency

lost update
dirty read
unrepeatable read

How to avoid problems caused by concurrency

Solution: concurrency control methods such as locking and timestamps can be used

Serializability: the criterion for concurrency consistency

Definition:

Suppose $\{T_1,T_2,\dots,T_n\}$ is a set of transactions executing concurrently. If a schedule of $\{T_1,T_2,\dots,T_n\}$ produces the same effect on the database as some serial execution of this set of transactions, then the schedule is serializable.
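
Checking the definition directly would mean comparing against every serial order. A common sufficient test, which these notes do not name, is conflict serializability: draw an edge from one transaction to another whenever an earlier operation of the first conflicts with a later operation of the second on the same item, and accept the schedule if the graph has no cycle. The sketch below assumes a schedule is a list of (transaction, operation, item) triples; the function name and the sample schedules are illustrative.

```python
def conflict_serializable(schedule):
    """schedule: list of (tid, op, item) with op 'R' or 'W', in execution order."""
    edges = set()
    for i, (t1, op1, x1) in enumerate(schedule):
        for t2, op2, x2 in schedule[i + 1:]:
            # Two operations conflict if they come from different transactions,
            # touch the same item, and at least one of them is a write.
            if t1 != t2 and x1 == x2 and "W" in (op1, op2):
                edges.add((t1, t2))
    # The graph is acyclic iff we can keep removing nodes with no incoming edge.
    nodes = {t for t, _, _ in schedule}
    while nodes:
        free = {n for n in nodes
                if not any(b == n and a in nodes for a, b in edges)}
        if not free:
            return False
        nodes -= free
    return True

# Lost update: both transactions read A before either writes it
lost_update = [("T1", "R", "A"), ("T2", "R", "A"), ("T1", "W", "A"), ("T2", "W", "A")]
# Serial execution: T1 finishes before T2 starts
serial = [("T1", "R", "A"), ("T1", "W", "A"), ("T2", "R", "A"), ("T2", "W", "A")]

print(conflict_serializable(lost_update))
print(conflict_serializable(serial))
```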

Locking Protocol

Basic idea

Before a transaction operates on a data object, it sends a request to the system to lock that object. Once the lock request is granted, the transaction has a certain degree of control over the object. Until the transaction releases its lock, other transactions cannot obtain a conflicting lock on the object or operate on it. This avoids access conflicts and ensures the correct execution of concurrent transactions.

With a locking protocol, problems such as livelock and deadlock may arise. The one that must be solved is deadlock, which is caused by circular waiting between transactions.

Locking

Two-phase: in a transaction, if all locks precede all unlocks, the transaction is called a two-phase transaction. This restriction is called the two-phase locking (2PL) protocol.
Well-formed: a transaction is well-formed if it acquires a lock on an object before operating on it.
Serializability theorem: if S is any schedule of well-formed, two-phase transactions, then S is serializable.

X-Lock

only one type of lock, for both read and write
two-phase: all locks precede all unlocks
well-formed: acquire a lock on an object before operating on it

S,X lock

S lock: if read access is intended
X lock: if update access is intended

SUX lock

S lock: if read access is intended
X lock: if update access is intended
U lock: for an update access, the transaction first acquires a U lock and then promotes it to an X lock.

conclusions

well-formed + 2PL: serializable
well-formed + 2PL + unlock updates at EOT (end of transaction): serializable and recoverable
well-formed + 2PL + holding all locks until EOT: strict two-phase locking transaction

Dead Lock

Prevention

Timeout: if a transaction waits longer than some specified time, deadlock is assumed and the transaction is aborted
Detect deadlock with a wait-for graph: if there is a cycle in the graph, there is a deadlock
Request all locks at the start of the transaction
Request locks in a specified order of resources
Abort once a conflict occurs
Transaction retry
1. wait-die: $T_A$ waits if it is older than $T_B$; otherwise it "dies" and then retries with its original timestamp
2. wound-wait: $T_A$ waits if it is younger than $T_B$; otherwise it "wounds" $T_B$, and $T_B$ retries
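
Two of the techniques above fit in a few lines: cycle detection on a wait-for graph, and the wait-die and wound-wait decisions. This is a sketch, assuming the graph is a dict that maps each transaction to the set of transactions it waits for, and that a smaller timestamp means an older transaction; the function names are made up.

```python
def has_deadlock(wait_for):
    """Depth-first search for a cycle in the wait-for graph."""
    WHITE, GREY, BLACK = 0, 1, 2   # unvisited, on the current path, finished
    color = {}

    def visit(t):
        color[t] = GREY
        for u in wait_for.get(t, ()):
            c = color.get(u, WHITE)
            if c == GREY or (c == WHITE and visit(u)):
                return True        # reached a node already on the path
        color[t] = BLACK
        return False

    return any(color.get(t, WHITE) == WHITE and visit(t) for t in list(wait_for))

def wait_die(ts_requester, ts_holder):
    """The requester waits if it is older; otherwise it dies and retries."""
    return "wait" if ts_requester < ts_holder else "die"

def wound_wait(ts_requester, ts_holder):
    """The requester waits if it is younger; otherwise it wounds the holder."""
    return "wound holder" if ts_requester < ts_holder else "wait"

print(has_deadlock({"T1": {"T2"}, "T2": {"T1"}}))   # T1 and T2 wait for each other
print(has_deadlock({"T1": {"T2"}, "T2": {"T3"}}))   # a chain, no cycle
print(wait_die(1, 2), wound_wait(1, 2))             # the requester is the older one
```

In both retry schemes only the younger transaction is ever aborted, so the oldest transaction always makes progress.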

The Security and Integrity in Database

Introduction

Main reasons that data in a database may be damaged
1. system failure
2. inconsistency caused by concurrent access
3. deliberate destruction by people
4. incorrect input data, or an updating transaction that did not obey the rule of consistency preservation

Security of database

Protect the database from illegal access
1. view and query rewriting
2. access control
3. identification and authentication of users
4. authorization
5. role
6. data encryption
7. audit trail

Security of statistical Database

In many situations, statistical data is public while detailed individual data is secret, but some detailed individual data can be derived from the public statistical data.

Tracker

Individual tracker: suppose there is only one person who is male and whose occupation is programmer. A statistical query restricted to sex = male and occupation = programmer then reveals that individual's data. The basic idea is to use statistical information to infer individual information.
General tracker: basically the same as the individual tracker

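Integrity Constraints

Database Modification

If $\alpha$ is a foreign key in $r_2$ that references $K_1$ in $r_1$, then
$$
\begin{aligned}
&1.\ \Pi_\alpha(r_2) \subseteq \Pi_{K_1}(r_1) \text{ must hold (referential integrity)}\\
&2.\ t_2[\alpha]\in\Pi_{K_1}(r_1) \text{ must be checked when inserting } t_2 \text{ into } r_2 \text{, and also when updating } \alpha\\
&3.\ \sigma_{\alpha=t_1[K_1]}(r_2) \text{ must be examined when deleting } t_1 \text{ from } r_1 \text{ or updating its key: the tuples it selects still reference } t_1
\end{aligned}
$$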

Definition of Integrity Constraints

Specified by procedure: application programs are responsible for checking the integrity constraints
Specified with ASSERTION: defined with an assertion specification and checked by the DBMS automatically
Specified with a CHECK clause in the base table definition and checked by the DBMS automatically

-- CHECK clause
CREATE TABLE Reserves (
    sname CHAR(10),
    bid INTEGER,
    day DATE,
    PRIMARY KEY (bid, day),
    CONSTRAINT noInterlakeRes
    CHECK ('Interlake' != (SELECT B.bname FROM Boats B WHERE B.bid=bid))
)
-- constraint over multiple relations
-- this is wrong: when Sailors is empty the constraint is never checked, so the number of Boats tuples can be anything; nothing is checked when inserting into Boats either
-- but it is a good illustration
CREATE TABLE Sailors(
	sid INTEGER,
    sname CHAR(10),
    age REAL,
    PRIMARY KEY (sid),
    CHECK
    ((SELECT COUNT(S.sid) FROM Sailors S) + (SELECT COUNT(B.bid) FROM Boats B)<100)
)
-- the right solution: use assertion
CREATE ASSERTION smallClub
CHECK
((SELECT COUNT(S.sid) FROM Sailors S) + (SELECT COUNT(B.bid) FROM Boats B)<100)

Triggers

Definition: a procedure that starts automatically if specified changes occur to the database
Three parts
1. event: activates the trigger
2. condition
3. action
Execution of rules
1. immediate execution
2. deferred execution
3. decoupled or detached mode
4. cascading trigger

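-- syntax
CREATE TRIGGER <trigger name>
{BEFORE|AFTER} <trigger event>
ON <table name>
[REFERENCING <reference name>]
FOR EACH {ROW|STATEMENT}
WHEN <condition>
<action>

<trigger event> = INSERT|DELETE|UPDATE [OF <attribute list>]
<reference name> = OLD [ROW] [AS] <old tuple name>
<reference name> = NEW [ROW] [AS] <new tuple name>
<reference name> = OLD [TABLE] [AS] <old table name>
<reference name> = NEW [TABLE] [AS] <new table name>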

-- a creation example
CREATE TRIGGER insert_grade_check
AFTER INSERT ON enroll
REFERENCING NEW TABLE AS NE
FOR EACH STATEMENT
WHEN (EXISTS(SELECT * FROM NE WHERE grade<3.0))
INSERT
INTO failedcourse
SELECT * FROM NE
WHERE NE.grade<3.0
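
To try the idea without a DBMS that supports statement-level triggers and REFERENCING, the same rule can be written as a row-level trigger. The following is a minimal sketch with Python's `sqlite3` module; SQLite only offers FOR EACH ROW triggers and exposes the inserted tuple as `NEW`. The columns `sid` and `cid` and the function name `grade_trigger_demo` are assumptions, since the notes do not give the schema of `enroll`.

```python
import sqlite3

def grade_trigger_demo(rows):
    """Insert rows into enroll and return what the trigger copied into failedcourse."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE enroll (sid INTEGER, cid TEXT, grade REAL)")
    conn.execute("CREATE TABLE failedcourse (sid INTEGER, cid TEXT, grade REAL)")
    conn.execute("""
        CREATE TRIGGER insert_grade_check
        AFTER INSERT ON enroll
        FOR EACH ROW
        WHEN NEW.grade < 3.0          -- condition
        BEGIN                         -- action
            INSERT INTO failedcourse VALUES (NEW.sid, NEW.cid, NEW.grade);
        END
    """)
    conn.executemany("INSERT INTO enroll VALUES (?, ?, ?)", rows)  # event, once per row
    failed = conn.execute(
        "SELECT sid, cid, grade FROM failedcourse ORDER BY sid"
    ).fetchall()
    conn.close()
    return failed
```

The row-level form tests the condition once per inserted tuple, while the statement-level form above runs once per INSERT statement and sees all new tuples through the transition table NE.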

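-- another, much simpler example
-- NEW AS A names a single tuple, so the trigger must be row-level;
-- NULL must be tested with IS NULL, because "= NULL" is never true
create trigger insertRollback
before insert on accommodation
referencing new as A
for each row
when (A.check_out_date IS NULL)
rollback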
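
A runnable approximation of this trigger is sketched below with Python's `sqlite3` module. It assumes SQLite's dialect, where the rollback action is written `RAISE(ROLLBACK, ...)`. The column `guest`, the sample rows and the function name `accommodation_demo` are hypothetical.

```python
import sqlite3

def accommodation_demo():
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE accommodation (guest TEXT, check_out_date TEXT)")
    conn.execute("""
        CREATE TRIGGER insertRollback
        BEFORE INSERT ON accommodation
        FOR EACH ROW
        WHEN NEW.check_out_date IS NULL
        BEGIN
            SELECT RAISE(ROLLBACK, 'check_out_date is required');
        END
    """)
    conn.execute("INSERT INTO accommodation VALUES ('Ann', '2022-02-17')")
    conn.commit()  # Ann's row is now committed
    # Bob's row is uncommitted and shares a transaction with the next insert.
    conn.execute("INSERT INTO accommodation VALUES ('Bob', '2022-02-18')")
    try:
        conn.execute("INSERT INTO accommodation VALUES ('Eve', NULL)")
        rejected = False
    except sqlite3.DatabaseError:
        rejected = True  # the rollback undoes the whole open transaction
    guests = [g for (g,) in conn.execute("SELECT guest FROM accommodation ORDER BY guest")]
    # Comparing with "=" yields NULL (None in Python); IS NULL yields true (1).
    null_eq, null_is = conn.execute("SELECT NULL = NULL, NULL IS NULL").fetchone()
    conn.close()
    return rejected, guests, null_eq, null_is
```

The rollback discards every uncommitted change in the transaction, not only the offending row, which is why a trigger that merely rejects the statement is often the gentler choice.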

Database Design

Data Dependency and Normalization of Relational Schema

dependencies exist between the attributes of a relation
Functional dependency: the most basic kind of data dependency. The value of one attribute or a group of attributes determines the value of other attributes
Multi-valued dependency: the value of some attributes determines a set of values of some other attributes
Join dependency: the constraint that a relation can be decomposed with a lossless join
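
A functional dependency can be tested against a concrete relation instance. The following is a minimal Python sketch; the function name `fd_holds` and the representation of tuples as dictionaries are assumptions made for illustration.

```python
def fd_holds(rows, lhs, rhs):
    """Return True if the functional dependency lhs -> rhs holds in rows.

    rows is a list of dicts (one per tuple); lhs and rhs are lists of attribute names.
    """
    seen = {}
    for row in rows:
        key = tuple(row[a] for a in lhs)
        value = tuple(row[a] for a in rhs)
        # Two tuples that agree on lhs must also agree on rhs.
        if seen.setdefault(key, value) != value:
            return False
    return True
```

A dependency is a statement about every legal instance of a schema, so a check like this can refute a dependency with one counterexample but cannot prove it from sample data.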

NF

1NF

attribute of a relation must be atomic

2NF

R$\in$1NF, and no non-prime attribute is partially functionally dependent on a candidate key

problem

Insert anomaly: the information of students who have not selected any course cannot be inserted
Delete anomaly: if a student drops all courses, his basic information is also lost
Hard to update: because of redundancy, it is hard to keep the data consistent when updating
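
Partial dependencies can be found mechanically from a set of functional dependencies. The following is a minimal Python sketch that assumes the relation has exactly one candidate key; the function names and the example schema SC(sno, sname, cno, grade) are hypothetical, chosen to match the student and course anomalies listed here.

```python
from itertools import combinations

def closure(attrs, fds):
    """Attribute closure of attrs under fds, a list of (lhs, rhs) pairs of sets."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            if lhs <= result and not rhs <= result:
                result |= rhs
                changed = True
    return result

def partial_dependencies(key, fds, all_attrs):
    """Map each proper subset of the key to the non-key attributes it determines."""
    non_key = set(all_attrs) - set(key)
    found = {}
    for size in range(1, len(key)):
        for subset in combinations(sorted(key), size):
            determined = (closure(subset, fds) - set(subset)) & non_key
            if determined:
                found[subset] = determined
    return found
```

For SC with key (sno, cno) and the dependencies sno → sname and (sno, cno) → grade, the result reports that sno alone determines sname. Moving sno and sname into a separate relation removes that partial dependency.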


3NF

R$\in$2NF, and no non-prime attribute is transitively functionally dependent on a candidate key

problem

Insert anomaly: before the employees' salary levels are decided, the correspondence between salary level and salary cannot be entered
Delete anomaly: if a salary level has only one employee, the correspondence between that salary level and its salary is lost when the employee is deleted
Hard to update: because of redundancy, it is hard to keep the data consistent when updating
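
These anomalies disappear once the transitive dependency is given its own relation. The following is a minimal sketch with Python's `sqlite3` module; the table names `salary_level` and `employee`, their columns and the sample values are hypothetical, assuming the dependencies name → level and level → salary.

```python
import sqlite3

def salary_demo():
    conn = sqlite3.connect(":memory:")
    # level -> salary is stored once, independently of any employee.
    conn.execute("CREATE TABLE salary_level (level INTEGER PRIMARY KEY, salary REAL NOT NULL)")
    conn.execute(
        "CREATE TABLE employee (name TEXT PRIMARY KEY, "
        "level INTEGER REFERENCES salary_level(level))"
    )
    # No insert anomaly: levels can be recorded before anyone is assigned to them.
    conn.execute("INSERT INTO salary_level VALUES (1, 3000.0), (2, 4500.0)")
    conn.execute("INSERT INTO employee VALUES ('Ann', 1), ('Bob', 2)")
    # No delete anomaly: removing the only level-1 employee keeps the level-1 salary.
    conn.execute("DELETE FROM employee WHERE name = 'Ann'")
    # Easy update: the salary of a level is changed in exactly one row.
    conn.execute("UPDATE salary_level SET salary = 5000.0 WHERE level = 2")
    levels = conn.execute("SELECT level, salary FROM salary_level ORDER BY level").fetchall()
    # The original relation is recovered by a join on level.
    joined = conn.execute(
        "SELECT e.name, e.level, s.salary FROM employee e "
        "JOIN salary_level s ON s.level = e.level ORDER BY e.name"
    ).fetchall()
    conn.close()
    return levels, joined
```

The join on `level` shows that the decomposition loses no information about the employees that remain.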

Database Design method

ER Model and ER Diagram
Procedure oriented method

basically the same as the software engineering process

