Multiagent Systems

Multiagent Systems

作者：Yoav Shoham / Kevin Leyton-Brown

副标题：Algorithmic, Game-Theoretic, and Logical Foundations

出版社：Cambridge University Press

出版年：2008-12

ISBN：9780521899437

行业：其它

浏览数：9

内容简介

Multiagent systems combine multiple autonomous entities, each having diverging interests or different information. This overview of the field offers a computer science perspective, but also draws on ideas from game theory, economics, operations research, logic, philosophy and linguistics. It will serve as a reference for researchers in each of these fields, and be used as a text for advanced undergraduate or graduate courses. The authors emphasize foundations to create a broad and rigorous treatment of their subject, with thorough presentations of distributed problem solving, game theory, multiagent communication and learning, social choice, mechanism design, auctions, cooperative game theory, and modal logics of knowledge and belief. For each topic, basic concepts are introduced, examples are given, proofs of key results are offered, and algorithmic considerations are examined. An appendix covers background material in probability theory, classical logic, Markov decision processes and mathematical programming.

作者简介

Yoav Shoham

Stanford University, California

Kevin Leyton-Brown

University of British Columbia, Vancouver

目录

Credits and Acknowledgments

Introduction

1 Distributed Constraint Satisfaction

1.1 Defining distributed constraint satisfaction problems

1.2 Domain-pruning algorithms

1.3 Heuristic search algorithms

1.3.1 The asynchronous backtracking algorithm

1.3.2 A simple example

1.3.3 An extended example: the four-queen problem

1.3.4 Beyond the ABT algorithm

1.4 History and references

2 Distributed Optimization

2.1 Distributed dynamic programming for path planning

2.1.1 Asynchronous dynamic programming

2.1.2 Learning real-time A*

2.2 Action selection in multiagent MDPs

2.3 Negotiation, auctions and optimization

2.3.1 Introduction: from contract nets to auction-like optimization

2.3.2 The assignment problem and linear programming

2.3.3 The scheduling problem and integer programming

2.4 Social laws and conventions

2.5 History and references

3 Introduction to Noncooperative Game Theory: Games in Normal Form

3.1 Self-interested agents

3.1.1 Example: friends and enemies

3.1.2 Preferences and utility

3.2 Games in normal form

3.2.1 Example: the TCP user's game

3.2.2 Definition of games in normal form

3.2.3 More examples of normal-form games

3.2.4 Strategies in normal-form games

3.3 Analyzing games: from optimality to equilibrium

3.3.1 Pareto optimality

3.3.2 Defining best response and Nash equilibrium

3.3.3 Finding Nash equilibria

3.3.4 Nash's theorem: proving the existence of Nash equilibria

3.4 Further solution concepts for normal-form games

3.4.1 Maxmin and minmax strategies

3.4.2 Minimax regret

3.4.3 Removal of dominated strategies

3.4.4 Rationalizability

3.4.5 Correlated equilibrium

3.4.6 Trembling-hand perfect equilibrium

3.4.7 Epsilon-Nash equilibrium

3.5 History and references

4 Computing Solution Concepts of Normal-Form Games

4.1 Computing Nash equilibria of two-player, zero-sum games

4.2 Computing Nash equilibria of two-player, general-sum games

4.2.1 Complexity of computing a sample Nash equilibrium

4.2.2 An LCP formulation and the Lemke--Howson algorithm

4.2.3 Searching the space of supports

4.2.4 Beyond sample equilibrium computation

4.3 Computing Nash equilibria of n-player, general-sum games

4.4 Computing maxmin and minmax strategies for two-player, general-sum games

4.5 Identifying dominated strategies

4.5.1 Domination by a pure strategy

4.5.2 Domination by a mixed strategy

4.5.3 Iterated dominance

4.6 Computing correlated equilibria

4.7 History and references

5 Games with Sequential Actions: Reasoning and Computing with the Extensive Form

5.1 Perfect-information extensive-form games

5.1.1 Definition

5.1.2 Strategies and equilibria

5.1.3 Subgame-perfect equilibrium

5.1.4 Computing equilibria: backward induction

5.2 Imperfect-information extensive-form games

5.2.1 Definition

5.2.2 Strategies and equilibria

5.2.3 Computing equilibria: the sequence form

5.2.4 Sequential equilibrium

5.3 History and references

6 Richer Representations: Beyond the Normal and Extensive Forms

6.1 Repeated games

6.1.1 Finitely repeated games

6.1.2 Infinitely repeated games

6.1.3 "Bounded rationality": repeated games played by automata

6.2 Stochastic games

6.2.1 Definition

6.2.2 Strategies and equilibria

6.2.3 Computing equilibria

6.3 Bayesian games

6.3.1 Definition

6.3.2 Strategies and equilibria

6.3.3 Computing equilibria

6.3.4 Ex post equilibrium

6.4 Congestion games

6.4.1 Definition

6.4.2 Computing equilibria

6.4.3 Potential games

6.4.4 Nonatomic congestion games

6.4.5 Selfish routing and the price of anarchy

6.5 Computationally motivated compact representations

6.5.1 The expected utility problem

6.5.2 Graphical games

6.5.3 Action-graph games

6.5.4 Multiagent influence diagrams

6.5.5 GALA

6.6 History and references

7 Learning and Teaching

7.1 Why the subject of "learning" is complex

7.1.1 The interaction between learning and teaching

7.1.2 What constitutes learning?

7.1.3 If learning is the answer, what is the question?

7.2 Fictitious play

7.3 Rational learning

7.4 Reinforcement learning

7.4.1 Learning in unknown MDPs

7.4.2 Reinforcement learning in zero-sum stochastic games

7.4.3 Beyond zero-sum stochastic games

7.4.4 Belief-based reinforcement learning

7.5 No-regret learning and universal consistency

7.6 Targeted learning

7.7 Evolutionary learning and other large-population models

7.7.1 The replicator dynamic

7.7.2 Evolutionarily stable strategies

7.7.3 Agent-based simulation and emergent conventions

7.8 History and references

8 Communication

8.1 "Doing by talking" I: cheap talk

8.2 "Talking by doing": signaling games

8.3 "Doing by talking" II: speech-act theory

8.3.1 Speech acts

8.3.2 Rules of conversation

8.3.3 A game-theoretic view of speech acts

8.3.4 Applications

8.4 History and references

9 Aggregating Preferences: Social Choice

9.1 Introduction

9.1.1 Example: plurality voting

9.2 A formal model

9.3 Voting

9.3.1 Voting methods

9.3.2 Voting paradoxes

9.4 Existence of social functions

9.4.1 Social welfare functions

9.4.2 Social choice functions

9.5 Ranking systems

9.6 History and references

10 Protocols for Strategic Agents: Mechanism Design

10.1 Introduction

10.1.1 Example: strategic voting

10.1.2 Example: buying a shortest path

10.2 Mechanism design with unrestricted preferences

10.2.1 Implementation

10.2.2 The revelation principle

10.2.3 Impossibility of general, dominant-strategy implementation

10.3 Quasilinear preferences

10.3.1 Risk attitudes

10.3.2 Mechanism design in the quasilinear setting

10.4 Efficient mechanisms

10.4.1 Groves mechanisms

10.4.2 The VCG mechanism

10.4.3 VCG and individual rationality

10.4.4 VCG and weak budget balance

10.4.5 Drawbacks of VCG

10.4.6 Budget balance and efficiency

10.4.7 The AGV mechanism

10.5 Beyond efficiency

10.5.1 What else can be implemented in dominant strategies?

10.5.2 Tractable Groves mechanisms

10.6 Computational applications of mechanism design

10.6.1 Task scheduling

10.6.2 Bandwidth allocation in computer networks

10.6.3 Multicast cost sharing

10.6.4 Two-sided matching

10.7 Constrained mechanism design

10.7.1 Contracts

10.7.2 Bribes

10.7.3 Mediators

10.8 History and references

11 Protocols for Multiagent Resource Allocation: Auctions

11.1 Single-good auctions

11.1.1 Canonical auction families

11.1.2 Auctions as Bayesian mechanisms

11.1.3 Second-price, Japanese, and English auctions

11.1.4 First-price and Dutch auctions

11.1.5 Revenue equivalence

11.1.6 Risk attitudes

11.1.7 Auction variations

11.1.8 "Optimal" (revenue-maximizing) auctions

11.1.9 Collusion

11.1.10 Interdependent values

11.2 Multiunit auctions

11.2.1 Canonical auction families

11.2.2 Single-unit demand

11.2.3 Beyond single-unit demand

11.2.4 Unlimited supply: random sampling auctions

11.2.5 Position auctions

11.3 Combinatorial auctions

11.3.1 Simple combinatorial auction mechanisms

11.3.2 The winner determination problem

11.3.3 Expressing a bid: bidding languages

11.3.4 Iterative mechanisms

11.3.5 A tractable mechanism

11.4 Exchanges

11.4.1 Two-sided auctions

11.4.2 Prediction markets

11.5 History and references

12 Teams of Selfish Agents: An Introduction to Coalitional Game Theory

12.1 Coalitional games with transferable utility

12.1.1 Definition

12.1.2 Examples

12.1.3 Classes of coalitional games

12.2 Analyzing coalitional games

12.2.1 The Shapley value

12.2.2 The core

12.2.3 Refining the core: epsilon-core, least core, and nucleolus

12.3 Compact representations of coalitional games

12.3.1 Weighted majority games and weighted voting games

12.3.2 Weighted graph games

12.3.3 Capturing synergies: a representation for superadditive games

12.3.4 A decomposition approach: multi-issue representation

12.3.5 A logical approach: marginal contribution nets

12.4 Further directions

12.4.1 Alternative coalitional game models

12.4.2 Advanced solution concepts

12.5 History and references

13 Logics of Knowledge and Belief

13.1 The partition model of knowledge

13.1.1 Muddy children and warring generals

13.1.2 Formalizing intuitions about the partition model

13.2 A detour to modal logic

13.2.1 Syntax

13.2.2 Semantics

13.2.3 Axiomatics

13.2.4 Modal logics with multiple modal operators

13.2.5 Remarks about first-order modal logic

13.3 S5: An axiomatic theory of the partition model

13.4 Common knowledge, and an application to distributed systems

13.5 Doing time and an application to robotics

13.5.1 Termination conditions for motion planning

13.5.2 Coordinating robots

13.6 From knowledge to belief

13.7 Combining knowledge and belief (and revisiting knowledge)

13.8 History and references

14 Beyond Belief: Probability, Dynamics and Intention

14.1 Knowledge and probability

14.2 Dynamics of knowledge and belief

14.2.1 Belief revision

14.2.2 Beyond AGM: update, arbitration, fusion, and friends

14.2.3 Theories of belief change: a summary

14.3 Logic, games, and coalition logic

14.4 Towards a logic of "intention"

14.4.1 Some preformal intuitions

14.4.2 The road to hell: elements of a formal theory of intention

14.4.3 Group intentions

14.5 History and references

Appendices

A Probability Theory

A.1 Probabilistic models

A.2 Axioms of probability theory

A.3 Marginal probabilities

A.4 Conditional probabilities

B Linear and Integer Programming

B.1 Linear programs

B.2 Integer programs

C Markov Decision Problems (MDPs)

C.1 The model

C.2 Solving known MDPs via value iteration

D Classical Logic

D.1 Propositional calculus

D.2 First-order logic

Bibliography

Index

读书文摘

猜你喜欢

愧生丛录

幕后猎手·下册

幕后猎手·下册

年年夏日那片海

年年夏日那片海

民国梦与民国2

民国梦与民国2

烈土千瞳：卷二

烈土千瞳：卷二

迷宫饭世界导览冒险者权威指南

迷宫饭世界导览冒险者权威指南

点击查看