Recognition of Natural Scenes from Global Properties: Seeing the Forest Without Representing the Trees
Michelle R. Greene & Aude Oliva. Submitted manuscript
Abstract: Human observers are able to rapidly and accurately categorize natural scenes, but the representation that mediates this is still unknown. Here we propose a framework of rapid scene categorization that does not need to segment a scene into objects and instead uses a vocabulary of global, ecological properties that describe spatial and functional aspects of scene space (such as navigability or mean depth).  In Experiment 1, we obtained ground-truth rankings on global properties for use in Experiments 2-4. To what extent do human observers use global property information when rapidly categorizing natural scenes? In Experiment 2, we found that global property resemblance was a strong predictor of both false alarm rates and reaction times in a rapid scene categorization experiment. To what extent is global property information alone a sufficient predictor of rapid natural scene categorization? In Experiment 3, we found that the performance of a classifier representing only these properties is indistinguishable from human performance in a rapid scene categorization task in terms of both accuracy and false alarms. To what extent is this high predictability unique to a global property representation? In Experiment 4, we compared two models that represented scene object information to human categorization performance and found that these models had far lower fidelity at representing the patterns of performance than the global property model. These results provide support for the hypothesis that rapid categorization of natural scenes may not be mediated primarily though objects and parts, but also through global properties of structure and affordance.