Introduction to Artificial Intelligence CSCE , Fall 2017 URL:

Similar documents
Title: Solving Problems by Searching AIMA: Chapter 3 (Sections 3.1, 3.2 and 3.3)

Title: Adverserial Search AIMA: Chapter 5 (Sections 5.1, 5.2 and 5.3)

Title: Local Search Required reading: AIMA, Chapter 4 LWH: Chapters 6, 10, 13 and 14.

GAME THEORY. Analysis of Conflict ROGER B. MYERSON. HARVARD UNIVERSITY PRESS Cambridge, Massachusetts London, England

CSE 520S Real-Time Systems

information it takes to make tampering with an election computationally hard.

Brigadier-General (Retired) David Jurkowski, CMM, CD Former Chief of Staff Joint Operations ( ) Canadian Armed Forces

Handout 6: Utilitarianism

Manipulative Voting Dynamics

The New Geography of Jobs. Enrico Moretti University of California at Berkeley

FAST-TRACK IMMIGRATION TO EUROPE: HUNGARY

Complexity of Manipulating Elections with Few Candidates

An Exploratory study of the Video Bloggers Community

PUBLIC OPINION. Monitor. the. contents. reflecting the mood and attitudes of British people

NP-Hard Manipulations of Voting Schemes

Traveller behaviour: threats & opportunities How global events are driving change in business travel

BIOMETRICS IN A HUMANITARIAN CONTEXT

Tabatha Yelós. Hi!

The economic impact of New Tech: some reflections on the Welfare State

HUMAN TRAFFICKING: Perception, Prevention and Protection

A B. Ø ÓÒ Left Right Suck NoOp

The New Geography of Jobs. Enrico Moretti University of California at Berkeley

Seeking Information About Court Automation a Ten Minute Online Survey

Computational social choice Combinatorial voting. Lirong Xia

NOVEMBER visioning survey results

WHAT WE HEARD : A REPORT ON CONSULTATIONS RELATING TO REPATRIATION IN NUNATSIAVUT

Deliberation in Planning and Acting

EFSA Brussels Liaison Office

Concurrent Programing: Why you should care, deeply. Don Porter Portions courtesy Emmett Witchel

ADOPTED REGULATION OF THE DEPARTMENT OF MOTOR VEHICLES. LCB File No. R Effective March 1, 2012

White Flint Implementation Advisory Committee March 14, 2011 Wall Local Park/MAC 7:30 p.m.

Relentless From Good To Great To Unstoppable

Threats to Life - Policy

The Procedural Requirements

Questions. Hobbes. Hobbes s view of human nature. Question. What justification is there for a state? Does the state have supreme authority?

Hobbes. Questions. What justification is there for a state? Does the state have supreme authority? What limits are there upon the state?

How Techno-Populism Is Undermining Innovation

City of Bellingham Residential Survey 2013

CS 886: Multiagent Systems. Fall 2016 Kate Larson

American Border Patrol 2160 E. Fry Blvd. Sierra Vista, AZ 85635

ISTANBUL SECURITY CONFERENCE 2018

Minnesota Association of Townships Information Library Document Number: TP6000 Revised: January 29, 2002 TOWN ORDINANCES. by Troy Gilchrist, Attorney

(67686) Mathematical Foundations of AI June 18, Lecture 6

EXPERT GROUP ON THE INTERNATIONAL TELECOMMUNICATION REGULATIONS

Should you elect non publication?

Chapter 10. The Manipulability of Voting Systems. For All Practical Purposes: Effective Teaching. Chapter Briefing

Police Firearms Survey

MINUTES OF THE ABILENE METROPOLITAN PLANNING ORGANIZATION TRANSPORTATION POLICY BOARD

A representation theorem for minmax regret policies

MATH4999 Capstone Projects in Mathematics and Economics Topic 3 Voting methods and social choice theory

IGS Tropospheric Products and Services at a Crossroad

Gaining Sex Offender Registrant Compliance

Consequentialist Ethics

Living in a Globalized World

Homework 7 Answers PS 30 November 2013

DIVISION E--INFORMATION TECHNOLOGY MANAGEMENT REFORM

Resource Allocation in Egalitarian Agent Societies

A THEORY OF JUSTICE. Revised Edition JOHN RAWLS

St. Pete Pier Committee Members,

Content. 01. Foreword Key findings Rise in risks to travellers... 4

AP United States Government and Politics Syllabus

Voting Systems. High School Circle I. June 4, 2017

Political Science: An Introduction, 11e

Washington Office 1211 Connecticut Avenue NW Suite 305 Washington, DC T F

Learning Expectations

closer look at Rights & remedies

11th Annual Patent Law Institute

SURVEY OF SUPERIOR COURT JUROR SATISFACTION KITSAP COUNTY, WASHINGTON

The Manipulability of Voting Systems. Check off these skills when you feel that you have mastered them.

Caucasus Barometer. Public Perceptions on Political, Social and Economic issues in South Caucasus Countries

Next- Generation National Security and Public Safety in Europe

QUALITY OF COURT PERFORMANCE: EXTERNAL EVALUATION

Project Update: September 2018 Public Outreach Executive Summary

Regulatory Governance of Network Industries: Experience and Prospects

Digital Billboard Review City Council Economic Development Committee June 16, 2014

Advocacy Involvement by Homeless Service Providers in Chicago: Research Findings. Jennifer E. Mosley, Ph.D University of Chicago

Ottawa River North Shore Parklands Plan PUBLIC CONSULTATION REPORT JULY 6 TO 24, 2017

PCGENESIS PAYROLL SYSTEM OPERATIONS GUIDE

The Role and Importance of Depositions

Strategic Reasoning in Interdependence: Logical and Game-theoretical Investigations Extended Abstract

NewsDiffs: Version Controlling the News

Social Choice & Mechanism Design

Matea Škaberna, Petar Kurečić. University North, Varaždin, Croatia

Annual Engagement Report

Economics Marshall High School Mr. Cline Unit One BC

1 Aggregating Preferences

THE CORPORATION OF THE TOWN OF INNISFIL BY-LAW NO

"Efficient and Durable Decision Rules with Incomplete Information", by Bengt Holmström and Roger B. Myerson

Product Description

ATTACHMENT: 4 REPORT TO GENERAL PLAN 2040 STEERING COMMITTEE

Preference Forms These tables may be useful for scratch work.

A Comparison of Asians, Hispanics, and Whites Restaurant Tipping

Assent Voting: Processes & Considerations for Local Governments in British Columbia. Ministry of Municipal Affairs and Housing

Customer Feedback Summary. Recent Reviews & Published Comments st Pl Bedford, OH (440) for

Agent Modeling of Hispanic Population Acculturation and Behavior

Preserving RMR. Managing Attrition. June 6,2018

The Image of China in Australia: A Conversation with Bruce Dover

DIVISION E INFORMATION TECHNOLOGY MANAGEMENT REFORM

Learning Expectations

Fast Fixes for Slow Markets

Transcription:

B.Y. Choueiry 1 Instructor s notes #4 Title: Intelligent Agents AIMA: Chapter 2 Introduction to Artificial Intelligence CSCE 476-876, Fall 2017 URL: www.cse.unl.edu/~choueiry/f17-476-876 Berthe Y. Choueiry (Shu-we-ri) (402)472-5444

B.Y. Choueiry 2 Instructor s notes #4 Intelligent Agents 1. Agents and environments 2. Rationality 3. PEAS Specifying the task environment: Performance measure, Environment, Actuators, Sensors 4. Types of environments 5. Types of Intelligent Agents

B.Y. Choueiry 3 Instructor s notes #4 Agent Anything that perceives its environment through sensors acts upon its environment through actuators Agents include: Humans, robots, software, etc. Sensors? Actuators? The agent function maps from percept sequences to actions: f : P A The agent program runs on the physical architecture to produce f

Vacuum-cleaner world A B Percepts: locations and contents, e.g., [A,dirty] Actions: Left, Right, Suck, NoOp B.Y. Choueiry 4 Instructor s notes #4

B.Y. Choueiry 5 Instructor s notes #4 A Vacuum-cleaner Agent Percept sequence [A, Clean] [A, Dirty] [B, Clean] [B, Dirty] [A, Clean],[A, Clean]. [A, Clean],[A, Clean],[A, Clean]. Action Right Suck Left Suck Right Right Function Reflex-Vaccuum-Agent ([location,status]]) returns an action if status = Dirty then return Suck else if location = A then return Right else if location = B then return Left

B.Y. Choueiry 6 Instructor s notes #4 Goal of AI Build rational agents. Rational =? What is rational depends on: 1. Performance measures (how, when) 2. The agents prior knowledge of the environment 3. The actions the agent can perform 4. Percept sequence to date (history): everything agent has perceived so far

B.Y. Choueiry 7 Instructor s notes #4 Performance meaure Fixed performance measure evaluates the environment sequence one point per square cleaned up in time t point per clean square per time step, minus one per move? penalize for > k dirty squares?

B.Y. Choueiry 8 Instructor s notes #4 Rationality A rational agent chooses whichever action maximizes the expected value of the performance measure given the percept sequence to date Rational omniscient, clairvoyant Rationality maximizes expected performance Perfection maximizes actual performance Rational = exploration, learning, autonomy After a sufficient experience of its environment, behavior of a rational agents becomes effectively independent of prior knowledge.

B.Y. Choueiry 9 Instructor s notes #4 PEAS To design a rational agent, we must specify the task environment Performance measure? Environment? Actuators? Sensors? Consider, e.g., the task of designing an automated taxi..

B.Y. Choueiry 10 Instructor s notes #4 PEAS: Automated taxi Performance measure: safety, destination, profits, legality, comfort,... Environment: US urban streets, freeways, traffic, pedestrians, stray animals, weather,... Actuators: steering, accelerator, brake, horn, speaker/display,... Sensors: video, accelerometers, gauges, engine sensors, keyboard, GPS,...

B.Y. Choueiry 11 Instructor s notes #4 Environment (1) 1. Fully Observable vs. Partially Observable 2. Deterministic vs. stochastic 3. Episodic vs. sequential 4. Static vs. dynamic 5. Discrete vs. continuous 6. Single agent vs. multiagent

B.Y. Choueiry 12 Instructor s notes #4 Environment (2) Fully/Partially Observable: sensors can detect all aspects of the world Effectively fully observable: relevant aspects Deterministic vs. stochastic: from the agent s view point Next state determined by current state and agents actions Partially observable + deterministic appears stochastic Episodic vs. sequential: Agent s experience divided into atomic episodes; subsequent episodes do not depend on actions in previous episodes

B.Y. Choueiry 13 Instructor s notes #4 Environment (3) Static vs. dynamic: Dynamic: Environment changes while agent is deliberating Semidynamic: environment static, performance scores dynamic Discrete vs. continuous: Finite number of precepts, actions Single agent vs. multiagent: B s behavior maximizes a performance measure whose value depends on A s behavior. Cooperative, competitive, communication. Chess? Taxi driving? hardest case?

B.Y. Choueiry 14 Instructor s notes #4 Environment (4) Hardest case: patially observable, stochastic, sequential, dynamic, continuous, and multiagent Observable Deterministic Episodic Static Discrete Single-agent Solitaire Backgammon Internet shopping Taxi Answers depend on how you define/interpret the case Episodic: chess tournament

B.Y. Choueiry 15 Instructor s notes #4 Environment types Solitaire Backgammon Internet shopping Taxi Observable Yes Yes No No Deterministic Yes No Partly No Episodic No No No No Static Yes Semi Semi No Discrete Yes Yes Yes No Single-agent Yes No Yes No (except auctions) The environment type largely determines the agent design The real world is (of course) partially observable, stochastic, sequential, dynamic, continuous, multi-agent

B.Y. Choueiry 16 Instructor s notes #4 Types of Agents Four, in order of increasing generality: 1. Simple reflex agents 2. Simple reflex agents with state 3. Goal-based agents 4. Utility-based agents 5. Learning agents All these can be turned into learning agents.

B.Y. Choueiry 17 Instructor s notes #4 Simple reflex agents Simple look-up table, mapping percepts to actions, is out of question (too large, too expensive to build) Many situations can be summarized by condition-action rules (humans: learned responses, innate reflexes) Agent Condition-action rules Rectangles: agent s internal state Sensors What the world is like now What action I should do now Actuators Implementation: easy; Applicability: narrow Environment Ovals: background information

B.Y. Choueiry 18 Instructor s notes #4 Simple reflex agents with state Sensory information alone is not sufficient Need to keep track of how the world evolves (evolution: independently of agent, or caused by agent s actions) Agent State How the world evolves What my actions do Condition-action rules Sensors What the world is like now What action I should do now Actuators How the world evolved: model-based agent Environment

B.Y. Choueiry 19 Instructor s notes #4 Goal-based agents State & actions don t tell where to go Need goals to build sequences of actions (planning) Agent State How the world evolves What my actions do Goals Sensors What the world is like now What it will be like if I do action A What action I should do now Actuators Goal-based: uses the same rules for different goals Reflex: will need a complete set of rules for each goal Environment

B.Y. Choueiry 20 Instructor s notes #4 Utility-based agents Several action sequences to achieve some goal (binary process) Need to select among actions & sequences. Preferences. Utility: State real number (express degree of satisfaction, specify trade-offs between conflicting goal) Agent State How the world evolves What my actions do Utility Sensors What the world is like now What it will be like if I do action A How happy I will be in such a state What action I should do now Actuators Environment

B.Y. Choueiry 21 Instructor s notes #4 Learning agents Agent operates in an initially unknown environment, and becomes more competent than its initial knowledge alone might allow Agent Performance standard feedback learning goals Critic Learning element Problem generator changes knowledge Sensors Performance element Actuators Learning: process of modification of each component of the agent to bring the components into closer agreement with the available feedback information, thus improving overall performance of the agent. Environment