XJTLU_CSE304(17/18) Multi-agent System

1 Agents and Objects? similarities and differences

2 Different architectures of agents

(1) practical reasoning

what are the functionalities of each component?

how do they interact with each other?

XJTLU_CSE304(17/18) Multi-agent System_第1张图片

commitments to ends and means

??? while not (empty(pi), or succeed(I, B), or impossible(I, B)) do

while not (empty(pi), or succeed(I, B), or believeimpossible(I, B)) do

(2) planner system

example-1 STRIPS planning problem

(3) BDI architecture

(4) subsumption architecture

(5) horizontal, vertical & hybrid architecture

example-1 Touring Machine

example-2 InterRRap

3 Specific case: Blocks World problem + vacuum machine problem

4 Payoff Matrices [WEEK 9 Multi-agent Interactions]

utilities and preference: u(w) >= u(w') <--> w )= w'

w - state of the world; u - utility function; a - actions, ai*aj --> w

(N, A, U) U - the set of players; A = A1*A2*...*An, Ai is the set of actions available to player i; U - the set of utility functions for each player, it can be enclosed in the payoff matrix.

XJTLU_CSE304(17/18) Multi-agent System_第2张图片

payoff matrix 合并

* if it is the case of prisoner dilemma, the smaller utility the better, otherwise, the larger the better.

(0) Dominant strategies

Si is the dominant strategy for i if no matter what strategy j choose, i will do at least as well as choosing other strategies.

XJTLU_CSE304(17/18) Multi-agent System_第3张图片

Dominant strategy 解法图示

(1) Nash equilibrium

s1 and s2 are in Nash equilibrium if: [1] under the assumption that agent i plays s1, agent j can do no better than playing s2 AND [2] under the assumption that agent j plays s2, agent i can do no better than playing s1.

XJTLU_CSE304(17/18) Multi-agent System_第4张图片

Nash equilibrium 解法图示

(2) Pareto optimal

if there is no other outcome that makes one agent better off without making another agent worse off.

XJTLU_CSE304(17/18) Multi-agent System_第5张图片

Pareto optimal 解法图示

(3) social welfare

XJTLU_CSE304(17/18) Multi-agent System_第6张图片

social welfare 解法图示

5 Coalition games [WEEK 10 Coalition,Voting, Power, and Computational Social Choice]

the core is the set of outcomes for the grand coalition to which no coalition objects.

a coalition C objects to an outcome if there is some outcome for them that makes all of them strictly better off.

if the core is non-empty then the grand coalition is stable, since nobody can benefit from defection.

(1) Shapley value (- best known attempt to define how to divide the benefits of cooperation fairly, taking into account the contributions of each agent)