Outline. From Pixels to Semantics Research on automatic indexing and retrieval of large collections of images. Research: Main Areas

Similar documents
The Pupitre System: A desk news system for the Parliamentary Meeting rooms

Cluster Analysis. (see also: Segmentation)

Interactive Functional Medical Image Analysis

The Social Web: Social networks, tagging and what you can learn from them. Kristina Lerman USC Information Sciences Institute

File Systems: Fundamentals

Web Mining: Identifying Document Structure for Web Document Clustering

Category-level localization. Cordelia Schmid

Probabilistic Latent Semantic Analysis Hofmann (1999)

Economy Profile 2017 Moldova

Estonian National Electoral Committee. E-Voting System. General Overview

CPSC 467b: Cryptography and Computer Security

SIERRA LEONE 2012 ELECTIONS PROJECT PRE-ANALYSIS PLAN: INDIVIDUAL LEVEL INTERVENTIONS

Abstract: Submitted on:

Case Study. MegaMatcher Accelerator

5222 E. Baseline Road, Suite 101 Gilbert, AZ 85234

Towards Tackling Hate Online Automatically

Fairsail. User. Benefits & Open Enrollment User Guide. Version 3.23 FS-BOE-XXX-UG R003.23

Fairsail Payflow Cookbook for CSV Record Downloads

CS 5523 Operating Systems: Intro to Distributed Systems

UNITED STATES [DISTRICT/BANKRUPTCY] COURT FOR THE DISTRICT OF DIVISION., ) ) Plaintiff, ) ) vs. ) Case No. ), ) Judge ) Defendant.

LIMITE EN/FR COUNCIL OF THE EUROPEAN UNION. Brussels, 15 May /09 ADD 2 LIMITE FRONT 28 COMIX 294 NOTE

Appendix: Supplementary Tables for Legislating Stock Prices

NASCIO Nomination eomis/justicexchange Digital Government: Government to Government

ABC and Integrated Border management

ENTERTAINMENT IDENTIFIER REGISTRY TERMS OF USE

IBM Cognos Open Mic Cognos Analytics 11 Part nd June, IBM Corporation

Accelerated Examination. Presented by Hans Troesch, Principal Fish & Richardson P.C. March 2, 2010

Recommendations For Reddit Users Avideh Taalimanesh and Mohammad Aleagha Stanford University, December 2012

An Integrated Tag Recommendation Algorithm Towards Weibo User Profiling

A Qualitative and Quantitative Analysis of the Political Discourse on Nepalese Social Media

Systems and methods for conducting jury research and training for estimating punitive damages

7th CIRCUIT ELECTRONIC DISCOVERY COMMITTEE PRINCIPLES RELATING TO THE DISCOVERY OF ELECTRONICALLY STORED INFORMATION. Second Edition, January, 2018

DATA PROTECTION (JERSEY) LAW 2005 CODE OF PRACTICE & GUIDANCE ON THE USE OF CCTV GD6

A Skeleton-Based Model for Promoting Coherence Among Sentences in Narrative Story Generation

A New Computer Science Publishing Model

SECURE REMOTE VOTER REGISTRATION

Kjell-Einar Anderssen. Country Manager Norway - Nutanix

Processes. Criteria for Comparing Scheduling Algorithms

Deep Learning Working Group R-CNN

Experiments on Data Preprocessing of Persian Blog Networks

Saturday, 7 November 15

Congress Lobbying Database: Documentation and Usage

The Issue Of Internet Polling

AMONG the vast and diverse collection of videos in

Judicial Branch IT Update to the North Carolina General Assembly s Justice and Public Safety Oversight Committee

process will save judges, sheriffs, clerks, and attorneys' time and money.

Legal Research for Non-Law Students

CHAPTER 1 RECORDS RETENTION AND DISPOSITION

PRIVACY STATEMENT - TERMS & CONDITIONS. For users of Princh printing, copying and scanning services PRIVACY STATEMENT

Luciano Nicastro

Testing the Waters: Working With CSS Data in Congressional Collections

The Border: Personal Reflections From Ireland, North And South

Lab 3: Logistic regression models

Are policy makers out of step with their constituency when it comes to immigration?

Subreddit Recommendations within Reddit Communities

BYTELINE STUDIO TERMS AND CONDITIONS TEMPLATE

ALBERTA OFFICE OF THE INFORMATION AND PRIVACY COMMISSIONER ORDER F December 8, 2016 UNIVERSITY OF LETHBRIDGE. Case File Number

Hoboken Public Schools. PLTW Introduction to Computer Science Curriculum

YOUR RIGHT TO USE ANY LBRF GRAPHIC MATERIAL IS SUBJECT TO YOUR FULL PAYMENT OF THE LICENSE AND THE RESTRICTIONS SET FORTH IN THIS AGREEMENT.

INFORMATION Note. No 008: CrB checks and when to use them. About IMSPA. First issued: December 2011

Why Biometrics? Why Biometrics? Biometric Technologies: Security and Privacy 2/25/2014. Dr. Rigoberto Chinchilla School of Technology

IC ARTICLE 39. REVISED UNIFORM FIDUCIARY ACCESS TO DIGITAL ASSETS ACT

ISi DATABASES INTERNET LICENSE AGREEMENT

Issues in Information Systems Volume 18, Issue 2, pp , 2017

Dimension Reduction. Why and How

International Biometrics & Identification Association

THE CENTRAL DISTRICT OF CALIFORNIA WESTERN DIVISION ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) )

Identifying Ideological Perspectives of Web Videos using Patterns Emerging from Folksonomies

Introduction-cont Pattern classification

LobbyView: Firm-level Lobbying & Congressional Bills Database

Open Data Kit (ODK) Mobile Data Collection, Aggregation, and Dissemination

FairCom Press Release Archive:

Users reading habits in online news portals

Research Article. ISSN (Print)

CS 5523: Operating Systems

Electronic Voting For Ghana, the Way Forward. (A Case Study in Ghana)

HOW NEW TECHNOLOGY HELPS THE TEXAS LEGISLATIVE COUNCIL DELIVER 20,000 DOCUMENTS* EACH SESSION

Title of Nomination: Justice Xchange Project/System Manager: Paul C. Brown/Kathy J. Gattin Title: Information Systems Administrator/Victim-Offender

Technology enhanced learning for inclusion and integration of migrants

Random Forests. Gradient Boosting. and. Bagging and Boosting

SPECIAL INSPECTOR GENERAL FOR AFGHANISTAN RECONSTRUCTION CHIEF FOIA OFFICER REPORT FISCAL YEAR 2010

Happiness and job satisfaction in urban China: a comparative study of two generations of migrants and urban locals

Creating and Managing Clauses. Selectica, Inc. Selectica Contract Performance Management System

NANOS. Ideas powered by world-class data. Liberals 39 Conservatives 28, NDP 20, Green 6, People s 1 in latest Nanos federal tracking

Research and strategy for the land community.

LEGAL TERMS OF USE. Ownership of Terms of Use

Natural Language Technologies for E-Rulemaking. Claire Cardie Department of Computer Science Cornell University

Freedom of Information Act 2000 (Section 50) Decision Notice

No. 69. An act relating to automated license plate recognition systems. (S.18) It is hereby enacted by the General Assembly of the State of Vermont:

Official Journal of the European Union

LICENCE AGREEMENT FOR THE SUPPLY OF EUROPEAN CENTRE for MEDIUM-RANGE WEATHER FORECASTS (ECMWF) PRODUCTS

WEBSITE USER AGREEMENT

The Senate Nobody Knows By Bernard Asbell

AT&T. End User License Agreement For. AT&T WorkBench Application

CSCU9Q5. Data Protection and Freedom of Information Acts

Structural Folds: Generative Disruption in Overlapping Groups. Balázs Vedres David Stark

CITY OF VANCOUVER DUTY TO ASSIST

SOFTWARE LICENCE. In this agreement the following expressions shall have the following meanings:

BREACHES OF INFORMATION SECURITY: A U.S. COMPANY S OBLIGATIONS

GUIDELINES FOR EXAMINATION OF INDUSTRIAL DESIGNS A REPORT ON CONSENSUS POINTS FOR EXAMINATION OF INDUSTRIAL DESIGN APPLICATIONS

Transcription:

From Pixels to Semantics Research on automatic indexing and retrieval of large collections of images James Z. Wang PNC Technologies Career Development Professorship School of Information Sciences and Technology Penn State University Automatic modeling and learning of concepts http://wang.ist.psu.edu 4/12/2002 J. Z. Wang, Penn State University 1 4/12/2002 J. Z. Wang, Penn State University 2 Research: Main Areas Multimedia information retrieval Image retrieval Image classification Biomedical informatics DNA/Protein sequence analysis Biodiversity informatics Information security Content-based security filtering Brief History of Our Work 1995: Stanford Art Library funded our project on searching of a Chicana Art image database 1996-2000: our work was funded by IBM QBIC group (used by SF Museum of F.A.) NEC AMORA group (used by the Getty Museum) SRI (DARPA-funded, stereo matching for battlefield images) 1999-2000: NSF DLI2 project, Stanford Use wavelets, statistical classification, and integrated regionbased approach for image retrieval; image security filtering 2000-now: Penn State University Automatic modeling and learning of concepts for image indexing 4/12/2002 J. Z. Wang, Penn State University 3 4/12/2002 J. Z. Wang, Penn State University 4 Chicana Art Project, 1995 1000+ high quality paintings Goal: help students and researchers to find similar paintings Used wavelet-based features [Wang+,1997] 4/12/2002 J. Z. Wang, Penn State University 5 Introduction: The Problem Content-based Image Retrieval The retrieval of relevant images from an image database on the basis of automatically -derived image features Applications Biomedicine (X-ray, pathology, CT, MRI, ) Government (radar, aerial, trademark, ) Commercial (fashion catalogue, journalism, ) Cultural (museums, art galleries, ) Education and training Entertainment, WWW, 4/12/2002 J. Z. Wang, Penn State University 6 1

Major Challenges (cont.) Size 1 million images 1000 GB of space 30 GB compressed Understandability & Vision meaning depend on the point-of-view Hard to translate contents and structure into linguistic terms dogs Kyoto Query formulation SIMILARITY: look similar OBJECT: contains a bike OBJECT RELATIONSHIP: contains a dog near a person MOOD: a happy picture TIME/PLACE: Yosemite sunset 4/12/2002 J. Z. Wang, Penn State University 7 4/12/2002 J. Z. Wang, Penn State University 8 Related Work Many image search engines IBM, VIRAGE, NEC, Interpix, scour.net, MIT, Stanford, Berkeley, Columbia, CMU, UCSB, Speed: None is capable of handling the images on the Web Accuracy: None is near the human level of accuracy An active research area Text-based Approach Index images using keywords or descriptions (e.g., google.com) + Easier to design and implement, fast execution + Surrounding text in the Web page + Accepted approach for high value pictures -- Often too expensive -- A picture is worth, and can require 1000 words -- Query word may NOT appear as a keyword -- Surrounding text may NOT describe the image 4/12/2002 J. Z. Wang, Penn State University 9 4/12/2002 J. Z. Wang, Penn State University 10 Feature-based Approach + Handles low-level semantic queries + Many features can be extracted -- Cannot handle higher-level queries (e.g.,objects) Region-based Approach Extract objects from images first + Handles object-based queries e.g., find images with objects that are similar to some given objects + Reduce feature storage adaptively -- Object segmentation is very difficult -- User interface: region marking, feature combination 4/12/2002 J. Z. Wang, Penn State University 11 4/12/2002 J. Z. Wang, Penn State University 12 2

UCSB NeTra [Ma+, 1997] UCB Blobworld [Carson+, 1999] 12/11/2001 J. Z. Wang, Penn State University 10 4/12/2002 J. Z. Wang, Penn State University 13 4/12/2002 J. Z. Wang, Penn State University 14 Motivations Observations: Human object segmentation relies on knowledge Precise computer image segmentation is a very difficult open problem Hypothesis: It is possible to build robust computer matching algorithms without first segmenting the images accurately 4/12/2002 J. Z. Wang, Penn State University 15 4/12/2002 J. Z. Wang, Penn State University 16 Our SIMPLIcity Work [Wang+, D-LIB, 1999][Wang+, PAMI, 2001] Wavelets SIMPLIcity system Semantics-sensitive Integrated Matching for Picture LIbraries Combine low-level statistical semantic classification with image retrieval Wavelet-based feature extraction for fast segmentation Integrated Region Matching (IRM) 4/12/2002 J. Z. Wang, Penn State University 17 4/12/2002 J. Z. Wang, Penn State University 18 3

Fast Image Segmentation IRM: Integrated Region Matching IRM defines an image-to-image distance as a weighted sum of region-to-region distances Weighting matrix is determined based on significance constrains and a MSHP greedy algorithm Partition an image into 4 4 blocks Extract wavelet-based features from each block Use k-means algorithm to cluster feature vectors into regions Compute the shape feature by normalized inertia 4/12/2002 J. Z. Wang, Penn State University 19 4/12/2002 J. Z. Wang, Penn State University 20 IRM: Major Advantages 1. Reduces the influence of inaccurate segmentation 2. Helps to clarify the semantics of a particular region given its neighbors 3. Provides the user with a simple interface Recent Extensions Scalable IRM: Indexing region-based feature space using statistical clustering [Wang+Du, JCDL, 2001] Fuzzy matching: Fuzzy region matching to further reduce sensitivity to the average number of regions segmented [Chen+Wang, PAMI, 2002] 4/12/2002 J. Z. Wang, Penn State University 21 4/12/2002 J. Z. Wang, Penn State University 22 Experiments and Results Speed 800 MHz Pentium PC with LINUX OS Database: 200,000 COREL image DB (60,000 photographs + 140,000 hand-drawn arts) Image indexing time: one second per image Image retrieval time: Without the scalable IRM, 1.5 seconds/query CPU time With the scalable IRM, 0.15 second/query CPU time External query: one extra second CPU time 4/12/2002 J. Z. Wang, Penn State University 23 4/12/2002 J. Z. Wang, Penn State University 24 4

RANDOM SELECTION Natural out-door scene: 23 related, out of 31 Query Results Current SIMPLIcity System 4/12/2002 J. Z. Wang, Penn State University 25 4/12/2002 J. Z. Wang, Penn State University 26 External Query 4/12/2002 J. Z. Wang, Penn State University 27 4/12/2002 J. Z. Wang, Penn State University 28 Searching Terracotta Warriors (with Simmons College) Objective Accuracy Test on Image Categorization Compare with EMD [Rubner+, 1999] Two setups: avg of 13.1 filled bins, 42.6 filled bins Subset of the COREL database 10 categories, each containing 100 pictures Africa, breach, buildings, buses, dinosaurs, elephants, flowers, horses, mountains, food 1000 queries were tested Average precision p within the best 100 matches is computed 4/12/2002 J. Z. Wang, Penn State University 29 4/12/2002 J. Z. Wang, Penn State University 30 5

Robustness to Image Alterations 10% brighten on average 8% darken Blurring with a 15x15 Gaussian filter 70% sharpen 20% more saturation 10% less saturation Shape distortions Cropping, shifting, rotation 4/12/2002 J. Z. Wang, Penn State University 31 4/12/2002 J. Z. Wang, Penn State University 32 4/12/2002 J. Z. Wang, Penn State University 33 Automatic Modeling and Learning of Concepts for Image Indexing Key observations: Human beings are able to build models about objects or concepts from images The learned models are stored in the brain and used in the recognition process Hypothesis: Computers can learn a large collection of concepts by 2D or 3D image-based training Trained Concepts: Basic building blocks in determining the semantic meanings of images Basic Object: flower, beach Object composition: building+grass+sky+tree Location:Asia, Venice Time: night sky, winter frost Abstract: sports, sadness 4/12/2002 J. Z. Wang, Penn State University 34 System Design Train statistical models of a dictionary of concepts using sets of training images 2D images are currently used 3D-image training can be much better Compare images based on model comparison Select the most statistical significant concept(s) to index images linguistically Initial experiment: 600 concepts, each trained with 40 images 15 minutes Pentium CPU time per concept, train only once highly parallelizable algorithm Initial Model: 2-D Wavelet MHMM Model: Inter-scale and intrascale dependence States: hierarchical Markov mesh, unobservable Features in SIMPLIcity: multivariate Gaussian distributed given states 4/12/2002 J. Z. Wang, Penn State University 35 4/12/2002 J. Z. Wang, Penn State University 36 6

2-D MHMM Preliminary Results Computer Prediction: people, Europe, man-made, water Building, sky, lake, landscape, Europe, tree People, Europe, female Capture cross-resolution and intra-resolution context information Statistical dependence across resolution is assumed to be Markovian 4/12/2002 J. Z. Wang, Penn State University 37 Food, indoor, cuisine, dessert Snow, animal, wildlife, sky, cloth, ice, people 4/12/2002 J. Z. Wang, Penn State University 38 Conclusions A robust integrated region-based image retrieval algorithm Implemented in our SIMPLIcity system Tested on 200,000 images Improved accuracy and robustness, compared with some systems Fast execution On-going: Automatic modeling and learning of semantic concepts 600 concepts can be learned automatically 4/12/2002 J. Z. Wang, Penn State University 39 4/12/2002 J. Z. Wang, Penn State University 40 Future Work Explore new methods for better accuracy refine statistical modeling of images learning from 3D refine matching schemes Apply these methods to special image databases (e.g., art, biomedicine) very large databases Acknowledgments NSF DLI2 The PNC Foundation SUN Microsystems Lockheed Martin Corp US Army NMTB (pending) Earlier funding: IBM QBIC, NEC AMORA, SRI AI, Stanford Lib/Math/Biomedical Informatics/CS 4/12/2002 J. Z. Wang, Penn State University 41 4/12/2002 J. Z. Wang, Penn State University 42 7

More Information Papers in PDF, 60,000-image DB download, demo, etc. http://wang.ist.psu.edu 4/12/2002 J. Z. Wang, Penn State University 43 8