|PAGE 1 OF 3||  [ref]||DOWNLOAD PDF|
Reprinted from Proc. Royal Society B 171 179-296 with the kind permission of the Editor
[This paper develops the notion of perceptual hypotheses and also raises questions concerning the distinction to be made between 'analogue' and 'digital'. This is made in the context of the brain, and seems important if we are to be clear in how we use an engineering distinction. The point is: Should the distinction be regarded as one of engineering convenience (continuous or discontinuous mechanisms, or circuits) or should it be regarded as a deeper logical distinction? I incline to the latter view.]
An adequate theory of visual perception must explain how the fleeting patterns of light upon the retinas give knowledge of surrounding objects. The problem of how the brain 'reads' reality from images is acute, because images represent directly but few, and biologically unimportant, characteristics of objects. What matters biologically are such things as whether an object is poisonous or food, hard or soft, heavy or light, sharp or blunt, friend or foe. These are not properties of images. The owner of the eye cannot eat or be eaten by its images, and yet his life depends upon interpreting them in terms of quite different characteristics of objects. It follows that eyes are of little biological value unless there is an adequate brain to interpret their images; which raises the evolutionary problem: how did the eye-brain combination arise? (cf. Gregory, 1968c [REF 9]). To read reality from images is to solve a problem: a running set of very difficult problems throughout active life. Errors are illusions. Certain situations present special difficulty, giving rise to systematic errors: can these serve as clues to how the brain generally solves the problem of what objects are represented by which images?
Illusions can occur in any of the sense modalities, and they can cross the senses. A powerful illusion crosses from the seen size of an object to its apparent weight, as judged by lifting. Small objects feel up to fifty per cent heavier than larger objects of the same scale weight. Thus weight is evidently not judged simply by the input from the arm but also by prior expectation set by the previous handling of weights. When the density is unexpectedly great or small we suffer a corresponding illusion of weight.
There are, however, illusions of quite different types. There are purely optical illusions, where light from the object to the eye is bent by reflection (mirrors) or by refraction (the bent-stick-in-water effect, and mirages). There are also what we may call sensory illusions. The sense organs, the eyes, ears, touch and heat-sensitive nerve endings can all be upset, when they will transmit misleading information to the brain. They are upset by prolonged stimulation and by over-stimulation. For example, the 'waterfall effect' observed by Aristotle, is dramatically demonstrated by watching a rotating spiral (Fig. 1). If this is rotated on a turntable for ten or twenty seconds while the eyes are held at its centre, it will seem to contract or expand, depending on the direction of rotation. When stopped, there is a marked illusory movement in the opposite direction to the original movement. In the illusion, movement is seen but with no change of position.
Evidently a velocity detecting system has been adapted by prolonged stimulation to movement. As is well known, after-images occur after intense or prolonged stimulation of the retina by light: we see first a positive 'picture' which soon changes to a negative 'picture' which may persist for many minutes. This is due to local retinal adaptation, the brain receiving retinal signals essentially the same as for the normal image of an object but persisting beyond the physical stimulation of the retina. It is also possible for any sense organ to signal a quite wrong kind of stimulus. Pressure on the eye will send signals to the brain which are experienced as light; and electric current through the eye will produce the sensation of light, and through the ear sound. Fortunately, these sensory illusions seldom occur in normal conditions, though they are easily evoked in the laboratory.
The types of illusions which concern us here are, however, entirely different from either optical or sensory illusions. We may call these perceptual illusions. They arise from misinterpretation by the brain of sensory information.
Several perceptual illusions were known to the ancient Greeks, but they have only been studied experimentally for just over a century. The first scientific description in modern times was a letter by a Swiss naturalist, L. A. Necker, to Sir David Brewster (Necker, 1832) describing how a rhomboid reverses in depth, sometimes one face appearing the nearer, sometimes another. Necker correctly noted that changes of eye fixation could induce this change in perception, but that it would occur quite spontaneously. This famous effect is generally illustrated with an isometric skeleton cube (Fig. 2 a) rather than Necker's original figure (Fig. 2b). Perceptual reversals, or alternations (there can be several alternative perceptions) are not limited to vision. Repeated words, presented on an endless tape loop, give analogous auditory reversals (Warren and Gregory 1958 [REF 3]); A similar, even more striking effect, was noted by W. J. Sinsteden: that the rotating vanes of a windmill spontaneously reverse direction when it is not clear whether one is seeing the front or the back of the windmill (for references, see Boring, 1942). This effect is well shown by casting the shadow of a slowly rotating vane upon a screen, thus removing all information of which is the back and which the front. The shadow will also at times appear to expand and contract upon the plane of the screen. It is important to note that these effects are not perceptual distortions of the retinal image: they are alternative interpretations of the image, in terms of possible objects, and only one interpretation is correct.
The most puzzling visual illusions are systematic distortions of size or shape. These distortions occur in many quite simple figures; the distortions occurring in the same directions and to much the same extent in virtually all human observers and probably also in many animals. Their explanation presents a challenge which should be accepted, for a viable theory of normal perception must account for them and they could be important clues to basic perceptual processes.
The simplest distortion illusion was the first to be described: by the father of experimental psychology, Wilhelm Wundt (1832 - 1920), who was Hermann von Helmholtz's assistant at Heidelburg. Wundt described the 'horizontal - vertical' illusion - that a vertical line looks longer than the horizontal line of equal length. He attributed this distortion to asymmetry of the eye movement system. Although this has been invoked many times since to explain distortion illusions it must be ruled out, for the distortions occur in after-images, or in normal retinal images optically stabilized so as to remain stationary on the eye though it moves. In addition, distortions can occur in several directions at the same time, which could hardly be due to eye movements; and in any case it is difficult to see how curvature distortions could be related to eye movements. All the evidence points to the origin of the distortions being not in the eyes but in the brain.
Interest in the illusions became general upon the publication of several figures showing distortions which could produce errors in using optical instruments. This concerned physicists and astronomers a hundred years ago, when photographic and other ways of avoiding visual errors were not available. The first of the special distortion figures was the Poggendorff figure of 1860 (Fig. 3). This was followed by the Hering illusion (1861); its converse being devised by Wundt much later, in 1896 (Thiéry 1896) (FIG. 4). The most famous illusion of all is the Müller-Lyer arrow figure (FIG. 5). This was devised by F. C. Müller-Lyer and was first presented in fifteen variants (Müller-Lyer, 1889). This figure is so simple, and the distortion so compelling that it was immediately accepted as the primary target for theory and experiment. All sorts of theories were advanced: Wundt's eye movement theory (in spite of its inadequacy); that the 'wings' of the arrow heads drew attention away from the ends of the central line, or 'arrow shaft', to make it expand or contract; that the heads induced a state of empathy in the observer (though the distortion seems far too constant for such an explanation), that the distortion is a special case of a supposed general principle that acute angles tend to be overestimated and obtuse angles underestimated.
This was, however, left unexplained, and it is not clear why it should produce size changes without apparent changes of angle. All these theories had a common feature: they were attempts to explain the distortion in terms of the stimulus pattern without reference to its significance in terms of the perception of objects. There was, however, one quite different suggestion made by A. Thiéry (1896) that the distortions are related to perspective depth features. Thiéry regarded the Müller-Lyer arrows as drawings of such objects as a saw-horse, seen in three dimensions, with the legs going away from the observer in the acute-angled figure and towards him in the obtuse-angled figure. This suggestion has seldom been considered until recently, though the 'perspective theory' was described by R. H. Woodworth in 1938: 'In the Müller-Lyer figure the obliques readily suggest perspective and if this is followed one of the vertical lines appears farther away and therefore objectively longer than the other'. This quotation brings out the immediate difficulties of developing an adequate theory along these lines, for the distortion occurs even when the perspective suggestion is not followed up, for the figure generally appears flat and yet distorted; and there is no hint given of a modus operandi, or brain mechanism responsible for the size changes. An adequate theory following Thiéry's suggestion that perspective is somehow important must show how distortion occurs though the figures appear flat. It should also indicate the kind of brain mechanisms responsible.
|FIG. 6 Ponzo figure. The upper of the parallel lines is expanded with respect to the lower.||FIG. 7 The Ponzo illusion is seen in this photograph of receding (railway) lines. Here the perspective significance of converging lines is obvious.|
The idea that geometrical perspective - the converging of parallel lines with distance - has a bearing is at least borne out by the occurrence of these distortions in actual scenes. A simple example is the Ponzo illusion (FIG. 6). This is a skeleton drawing of typical perspective convergence of parallel lines with distance, as in the railway lines of Figure 7, (cf. Tausch, 1954). The upper superimposed rectangle, which would be more distant, is expanded. Similarly, the inside corner is expanded (FIG. 8a) and the outside corner shrunk (Fig. 8b) just as in the Müller-Lyer figures, which are like skeleton corners. In both cases, regions indicated by perspective as distant are expanded, while near regions are shrunk. The distortions are opposite to the normal shrinkings of the retinal image of objects with increased distance. Is this merely fortuitous, or is it a clue to the origin of the illusions? Before we come to grips with the problem of trying to develop an adequate perspective theory, it will be helpful to consider some curious features of ordinary pictures.
|FIG 8 (a) The outward-going fins of the Müller-Lyer figure are here seen in flat projection of a photograph of the inside corner of a room.||FIG 8 (b) The inward-going arrows are seen in the outside corner of the building. As in the Ponzo example, distance indicated by perspective is associated with illusory expansion.|