the geometry of color perception
This page describes how these different aspects of color perception fit together. For example, how is the physical stimulus (luminance) related to a measure of sensation (brightness)? How can we show that red is more similar to blue than to green? Does the brightness of a color change as its chroma increases?
Mapping the quantities of a physical stimulus onto color sensations is called psychophysics, and it was the earliest form of color specification. This geometrical approach was innovated by the 17th century naturalist Isaac Newton, who summarized his experiments in light and pigment mixing as a hue circle, the first geometrical model of color perception. This evolved into the painters' many color wheels and is incorporated into all modern color models.
Another important psychological model was proposed by a physiologist, Ewald Hering, who tried to deduce the physiology of color vision from subjective color experiences. For several decades his opponent process theory was seen as an unsatisfactory alternative to the trichromatic theory, but today both theories are included in models of color vision.
Hering's model, like Newton's, seems to assume a mind of geometrical regularity or symmetry. Unfortunately, the functional relationships between color stimuli, color receptors and color perceptions are not that simple. Tracking the changes from cones to colors shows the many complex adjustments that are necessary to create visual experience. The geometry of colormaking attributes reveals that no single geometry can adequately unify all the different ways to measure color.
Perhaps the ultimate color description occurs in the translation of color sensations into color words. Hering's four unique hues have been frequently used in linguistic research in the hope that the usage of common color words might uncover the fundamental reference points of human color experience. Color Similarity. If we think in terms of the visible spectrum, it seems obvious that yellow and green are more similar than red and green: "yellow" light is closer to "green" light in the spectrum band. Spectral closeness seems a reasonable way to judge the similarity among colors.
But ask yourself, which hue is more similar to red: green, or purple? Although purple is at the opposite end of the spectrum from red, most people would answer that purple and red are more similar.
which hue is more similar to red: green, or purple?
reflectance curves for quinacridone rose (PV19), quinacridone magenta (PR122), and manganese violet (PV16) see this page for an explanation of reflectance curves and how to interpret them Because the L cones are stimulated by red or purple surfaces, we judge these to be similar colors in the same way that yellow green seems similar to blue green, because the M cones are stimulated by both. |
||||||||||||||||||
| However, magenta and red violet, the bridging hues between red and blue violet, are not spectral lights. They cannot be found anywhere in the spectrum created by a prism or rainbow, even though "red" and "blue" light, when combined, produce those colors.
Newton's Opticks. This light mixing discovery was made by the English physicist and mathematician Isaac Newton (1642-1726), who first described the many optical and color experiments he performed in the late 1660's to the English Royal Society in 1672, and after an interval of more study and contentious public discussion published them in his Opticks of 1704. (That's the English edition; a scholarly Latin translation appeared in 1706.) Newton had experience grinding his own lenses and was attempting to understand the problem of chromatic aberration that introduces fringe colors into optical images. He knew of the "celebrated Phenomena of Colours" described in experiments with pentahedral (triangular in cross section) prisms performed decades earlier by Guido Scarmiglioni, Thomas Harriot and Marcus Marci. But he pursued these studies much further in experimental thoroughness, observational precision and logical clarity. The revolutionary aspect of Newton's work was his emphasis on keeping separate the physical and sensory (mathematical and psychological) aspects of color a scientific strategy advanced in the treatises of Galileo Galilei. Newton demonstrated that each spectral hue has a unique and measurable refrangibility or angle of refraction when passed through a lens or prism. In his Experimentum crucis ("decisive experiment") and variations of it, he showed that this characteristic of light remains constant even if the light is blended by a lens, sent through multiple prisms, or passed through a colored filter. He could not explain this refrangibility, which arises from the wave properties of light, but he concluded that the sensation of color was both separate from it yet intimately related to it. He first defined refrangibility and color in parallel terms: The Light whose Rays are all alike Refrangible, I call Simple, Homogeneal, and Similar; and ... The Colours of Homogeneal Lights, I call Primary, Homogeneal and Simple. and then, in a famous passage, he declared that color is a sensation in the viewer's mind, not a property of the light itself or of the materials illuminated by the light: If at any time I speak of Light and Rays as coloured or endued with Colours, I would be understood to speak not philosophically [scientifically] and properly, but grossly, and according to such Conceptions as vulgar [uneducated] People in seeing all these Experiments would be apt to frame. For the Rays to speak properly are not coloured. In them there is nothing else than a certain Power and Disposition to stir up a Sensation of this or that Colour. ... So Colours in the Object are nothing but a Disposition to reflect this or that sort of Rays more copiously than the rest; in the Rays they are nothing but their Dispositions to propagate this or that motion into the Sensorium; and in the Sensorium they are Sensations of those Motions under the Forms of Colours. [Book One, Part II]
Newton frequently pointed to the similar sensory properties of light and sound and, in imitation of the diatonic musical scale (produced by the white keys of a piano), he (or rather a sharp eyed assistant) identified seven "primary or simple" colors of light in the spectrum red, orange, yellow, green, blue, indigo and violet. |
||||||||||||||||||
| In Newton's view, spectral "orange" or "indigo" light was just as primary, homogeneal and simple as "red" or "green" light, because none of these hues by itself could be altered or separated by refraction into any other colors.
But he found they could be mixed. In particular, he discovered a region of new colors that were known in isolated natural exemplars such as gems and flowers but that did not appear in a prismatic spectrum. Instead, these extraspectral hues "purple" and "red violet" (magenta) appeared by overlapping the "red" and "violet" ends of two separate spectrums (right). And he found that recombining three or four colors of the spectrum through a lens reconstituted the original "white" color of sunlight.
The Hue Circle. Newton summarized these striking observations as an ingenious new color model: he joined the "red" and "violet" ends of the spectrum to create a hue circle. This circle still shows the spectrum as a continuous gradation of color from red to violet, but now red is joined to violet, via the "artificial" or mixed colors carmine, magenta and purple.
newton's hue circle the original "color circle" (1704) superimposed on the spectral and extraspectral hues (there is no "magenta" or "purple" light in the spectrum); capital letters refer to notes of the diatonic scale In this diagram, the center of the circle (O) is "white" (colorless) light, and the circumference represents the "fiery" (saturated) colors of every spectral hue. The distance from the center to the edge indicates the range of unsaturated colors the drab or whitish colors of the everyday world between white and homogeneal hues. |
![]() extraspectral magenta |
|||||||||||||||||
| The hue circle is not just an artifice to make a home for extraspectral purple. Newton explained that it provides a geometrical method to calculate the chromaticity (hue and saturation) of any light mixture. He declared that two or more spectral "primaries" combined in specific quantities or "weights" would produce a mixture color that was located at the "center of gravity" (weighted average) among them all. The method is like finding the balance point of a circular cake pan with different sized weights placed around the rim (right).
In Newton's diagram (above), the small circles underneath each color name indicate the varying quantities or "weights" of each spectral color that might contribute to a color mixture: large amounts of red, orange and yellow, small amounts of "blew", indigo and violet. The color located at "Z", the average of them all, is the unsaturated red orange color that results. And, to close the circle, Newton observed that: If the point Z fall in or near the line OD, the main ingredients being red and violet, the Colour compounded shall not be any of the prismatick Colours, but a purple inclining to red or violet, accordingly as the point Z lieth on the side of the line DO towards E or towards C, and in general the compounded violet is more bright and more fiery than the uncompounded. Newton emphasized that the hue circle only applies to light mixtures. Pigment mixtures would not depend on the proportional weights or quantities of the pigments in a mixture, but on "the quantities of the Lights reflected from them." He verified that colors arise in pigments and in natural surfaces because they reflect some spectral colors and absorb others. The Analysis of White. Newton explored pigment mixtures at length, using the artists' primaries red, yellow, green and blue (as the pigments carmine lake, orpiment, verdigris and bremen blue) to explore the origin of white or gray mixtures: All colour'd Powders do suppress and stop in them a very considerable Part of the Light by which they are illuminated. For they become colour'd by reflecting the Light of their own Colours more copiously, and that of all other colours more sparingly, and yet they do not reflect the Light of their own Colours so copiously as white Bodies do. ... And therefore by mixing such Powders, we are not to expect a strong and full White, such as is that of Paper, but some dusky obscure one, such as might arise from a Mixture of Light and Darkness, or from white and black, that is, a grey, or dun, or russet brown, such as are the Colours of a Man's Nail, or of a Mouse, or Ashes, of ordinary Stones, of Mortar, of Dust and Dirt in High-ways, and the like. And such a dark white I have often produced by mixing colour'd Powders. ... Now, considering that these gray and dun Colours may also be produced by mixing Whites and Blacks, and by consequence differ from perfect Whites, not in Species of Colours, but only in degree of Luminousness, it is manifest that there is nothing more requisite to make them perfectly white than to increase their Light sufficiently. ... and this Newton accomplished by illuminating the gray mixtures with a single beam of sunlight in a darkened room, or by comparing the sunlit mixtures to a shaded piece of white paper. These mixing experiments with pigments and lights reveal Newton's fundamental preoccupation with the analysis of "white" into the compounding of different primary colors. Newton's hue circle implied that different combinations and proportions of primary colors would equivalently mix to white; white (or gray, in pigments) could be created by many different combinations of spectral light. Complementary Mixtures. Newton's play with "white" mixtures led to one of his most intriguing insights: two hues on opposite sides of the hue circle could create a near neutral color if mixed in the right proportions: If only two of the primary Colours which in the circle are opposite to one another be mixed in an equal proportion, the point Z shall fall upon the center O, and yet the color compounded of those two shall not be perfectly white, but some faint anonymous [diluted and unnameable] Colour. This is the origin of the idea of complementary colors and the "mix to white" criterion for identifying them. But the passage reveals a few points of confusion. For starters, Newton admitted he could not create a pure "white" by mixing two or three "primary" colors, but he apparently did not grasp that this was because he was mixing broad sections of the spectrum as a single color, rather than mixing single wavelengths. (That is, his hue circle works only if the calculations are made on the physical attribute of "refrangibility" and not on the psychological categories of "homogeneal color", those wide bands of seven spectral hues.) Newton also did not explain that some spectral hues are more luminous or have a higher tinting strength than others, which means they have inherently more weight in color mixtures and should be either located farther from the "white" center of the hue circle (as they are in some chromaticity diagrams) or overweighted in the "center of gravity" calculations. Finally, Newton's parallel experiments with pigments and lights misled many readers into thinking that pigments and lights mix in the same way, which only confused 18th century "color theorists". Newton's Legacy. It is difficult from a modern perspective to grasp the extraordinary originality of Newton's color theory. Perhaps the most important breakthrough in the Opticks is the use of geometry the circle to explain color mixtures, not as a medieval symbol of completeness or unity, but as the framework for a mathematical analysis of hues in a color mixture. This linkage between physical quantities and sensory qualities is the essential principle of psychophysics. According to Newton, at least seven spectral hues or "simple" colors of light were necessary to describe all color mixtures. The arbitrary distinction between "real" and "illusory" color is swept away: all color arises in mixtures of light, no matter whether the light comes from a prism or the reflectance qualities of colored powders. These ideas went far beyond the 17th century dyer's lore that three primary colored paints or dyes defined color mixing.
A key aspect of Newton's work, as his contemporaries saw it, was that he refuted the color theory inherited from Aristotle, in which "light" and "dark" were the two antagonistic primitives that mysteriously combined, like an oil slick on water, to create the iridescence of color. Newton showed just the reverse was true: white is not the acme of all color phenomena but a murky muddled mixture, no purer than dust or dirt; many different mixtures of three or more spectral colors would produce the same "faint, anonymous" or near white light. Black was not a "color" at all but merely the absence of light.
|
![]() newton's geometrical weighting the location of each color is measured on perpendicular x,y dimensions the x,y values for each color are multiplied by the "weight" (luminance or brightness) of the color the weighted x and y values for all colors are added together the total x and y values are divided the resulting average x,y location is the centroid (C) or mixture point |
|||||||||||||||||
| Newton insisted that color was a sensation in conscious experience or the "Sensorium". But this clearly implies that the hue circle is a mental structure, not a physical property of light. The psychologist Roger Shepard demonstrated this by asking college students to choose a numerical rating that described the apparent similarity between every pair of colors in a set of 14 Munsell color chips whose dominant wavelengths were matched to spectral hues. (He excluded extraspectral purple and magenta to avoid inducing a circle where none existed.) He then used a multidimensional scaling program to translate these similarity judgments into a map, where dissimilarity is represented as distance (diagram, right). This procedure not only recreated the ordering in the hue circle but also the perceptual spacing and complementary opponency between hues, as these are defined in modern color models.
Newton's book was intended for an audience of Enlightenment naturalists, and it was widely read. The English mathematician and perspective theorist Brook Taylor (1685-1731), who was also a talented amateur painter, pointed out in his New Principles of Linear Perspective (1719) that "the knowledge of this theory may be of great use in painting", and then explained in general terms how to apply it to mixing paints though Newton had cautioned that his circle only applied to mixtures of light. From this misconception all artists' color wheels have branched and bloomed.
Despite the fact that some Continental scientists found Newton's color mixing experiments difficult to replicate, or refused to accept his refutation of ancient color theory, Newton's book had a decisive (and hotly debated) impact on 18th century color concepts. After centuries of futile and confused speculation about the origins of color, his book separated the mental from the physical attributes of color and laid the foundations for its scientific study. |
![]() a hue circle formed from from Shepard (1962) |
|||||||||||||||||
| The German polymath Hermann Grassmann (1809-1877) contributed to this work by revisiting Newton's hue circle. Commenting on recent experimental work by Helmholtz, and adopting principles implicit in Newton's "center of gravity" method of predicting color mixtures, Grassmann proposed that:
1. Every color sensation may be analyzed into three mathematically determinable elements: the hue, the brightness, and the brightness of the intermixed white. 2. If one of two mixed lights is continuously altered while the other light remains constant, the appearance of the mixed light is also continuously changed. 3. Two colors, both of which have the same hue and the same proportion of intermixed white, also give identical mixed colors, no matter of what monochromatic lights they may be composed. 4. The total luminance of any light mixture is the sum of the luminances of the lights mixed. From these principles, Grassmann was able to prove an important corollary: For every hue of monochromatic light, there is another monochromatic color which, when mixed with it, gives colorless light. Grassmann clarified the principle that mixtures of light are additive in hue, brightness and chroma (mixture with "white" light), and therefore that color sensations are related to light mixtures according to quantitative principles. He also showed that every monochromatic [homogeneous] hue has a monochromatic complementary color or (a point Grassmann missed) a mixture of extraspectral "red" and "violet" hues which produce "white" when mixed together (diagram, right). Today these principles, in a more algebraic version, are known as Grassmann's Laws. They formed the theoretical framework for color experiments by Helmholtz and James Clerk Maxwell in the 1850's. These established the trichromatic model of color mixture, and the modern study of color perception. By the 1850's this anecdotal and speculative approach to color was largely displaced by psychophysics or the quantitative study of stimulus and sensation. This discipline emerged in Germany, through the work of Ernst Weber (1795-1878), Gustav Fechner (1801-1887), Wilhelm Wundt (1832-1920), Hermann von Helmholtz and others. Psychophysicists developed the experimental methods and mathematical equations necessary to link the intensity of a basic sensation to quantitative units of the physical stimulus weight in relation to mass, brightness in relation to light intensity, pitch in relation to frequency of vibration, and so on. At the same time, biologists developed the laboratory and dissection methods necessary to understand the physiology of sense organs and the nervous system. As a result the sensory and biological structure of perception began to be viewed as an integrated mechanism obeying basic laws that quantify the connection between mind and matter. This is the historical moment in which Bohemian physiologist Ewald Hering (1834-1918) launched a new defense of subjective color experience and color antagonism, published as the monograph Zur Lehre vom Lichtsinne (On a Theory of the Light Sense) in 1874 and as a book in 1878. Hering's Urfarben. Hering's strategy was to start with color experience and from that attempt to deduce color physiology. He pointed out that the trichromatic theory advocated by Helmholtz, Arthur König and others, which postulated three types of nerve excitations produced by three kinds of receptor cells, could not explain several well established observations in color experience. Hering noted that: yellow seems to be psychologically just as basic as the trichromatic red, green or blue, yet does not seem to contain any of those colors dichromats, who lack a "red" or "green" cone, cannot see either a red or green color yet still can see yellow deuteranopia (green colorblindness, caused by a lack of the M cones) does not cause the point of maximum spectral luminance to shift from "green" wavelengths toward "red" most colors of the spectrum seem to shift in hue as they brighten or darken (the Bezold-Brücke effect), but these shifts do not appear in a pure blue, green or yellow. Based on clues of this kind, Hering asserted the perceptual primacy of four Urfarben or "primordial colors", known in the USA as the four unique hues: red, yellow, green and blue. He conjectured that they were produced by visual substances or processes located somewhere in the visual system outside the retina. He was vague about the physiology but precise about the "pure" form of the unique hues, which he equated with the color of monochromatic lights at 470, 500, 570 and 700 nm. Hering's Opponent Processes. Hering then turned to color mixtures. He observed that light or surface colors can produce a sensation of red mixed with yellow (orange) or red mixed with blue (purple), but never create the sensation of red mixed with green ("reddish green" or "greenish red") or yellow mixed with blue ("yellowish blue" or "bluish yellow"). This proved to Hering that the visual substances were organized as antagonistic or opponent processes. In one process assimilation of visual substance produced the red sensation and dissimilation produced the green sensation; the other process produced the opponent sensations of yellow and blue. When the visual substances were neutralized or in balance, both hues associated with that opponent process disappeared. Hering also observed that we commonly lose color vision at night, yet still see light and dark, which convinced him that luminosity is separate from hue as a color vision process. So he proposed two additional "colors", white and black, that create the perception of brightness or lightness and also affect color chromatic purity by mixing with any of the unique hues to create less intense (desaturated) colors across the complete range of shades, tones and tints which Hering called veiled colors. However, he conceded that yellow and red had an "inherent brightness", and green and blue an "inherent darkness", which mingled perceptions of hue and luminosity, and that white and black did not disappear at balance but produced the positive sensation of gray, which meant that black and white are not linked as opponent processes. In summary, Hering proposed there are six fundamental color processes that are arranged as three visual contrasts, including two opponent processes. They are:
The achromatic sensation of middle gray results when all the substances and opponent processes are in balance.
Hering took pains to fit his theory to Newton's hue circle. Roughly half the hues on the circle can be described as containing some yellow, and the other half some blue; these semicircles overlap with two opposing spans of red and green, oriented perpendicular to the yellow and blue like four hue compass points. Binary mixtures of any two neighbor unique hues (one hue each from the two opponent processes) would explain all spectral hues between them, including the extraspectral purples between red and blue.
hue circle explained by two opponent processes
Although Hering did not describe a complete color system, his color writings suggested the basic geometry for the Swedish NCS color model.
As explained in the next section, opponent contrasts have proven to be fundamental to color vision. So it is ironic that most of Hering's specific explanations of colorblindness and color perception have turned out to be incorrect, or the data he relied on flawed or incomplete. His concept of "visual substances" directly causing conscious sensations and working in chemical opponency, like acid and base, is wrong in several respects. The opponent contrasts that do underlie color vision are not anchored on the unique hues. Opponent contrasts can be identified in neural pathways only between the retina and visual cortex, not in brain areas associated with color recognition or color naming. Hering was an innovator who uncovered important and valid general principles despite a flawed interpretation of the facts.
The Opponent Functions. For several decades after the publication of Hering's ideas, Hering and Helmholtz, and then their descendant partisans, disputed the theoretical mechanisms and research evidence for the trichromatic and opponent theories. The trichromatic theory and the opponent process theory were usually seen as incompatible explanations of color vision.
By the middle of the 20th century, researchers had concluded that both theories were necessary to explain the physiological processes of color perception. These hybrid models were first suggested by Johannes von Kries and G.E. Müller in the 1890's and were definitively reformulated by Müller in 1930 as a zone theory of color vision. Zone theories derive the opponent dimensions from the trichromatic retinal responses by a step by step transformation or recoding across three or more stages or zones. In 1949 D.B. Judd specified Müller's physiological theory as testable equations based on cone fundamentals of the CIE standard observer. Today these opponent dimensions are the foundation for most modern color appearance models.
recoding opponent dimensions from trichromatic outputs
In this procedure, either the viewers or the experimenters chose four wavelengths of light to match Hering's unique hues. (A mixture of "red" and "violet" wavelengths must be used to produce a unique red.) The intensity of each light was then adjusted so that equal parts of red and green matched the brightness of unique yellow, and equal parts of yellow and blue matched the brightness of unique green.
Next, a fifth wavelength was chosen as the test color. Viewers used a light mixing apparatus to blend the test light with the two unique hues that were nearest complementary colors (opposite on the hue circle) to the test light, until the mixture produced a pure "white" light. This was repeated for test lights at regular intervals across the spectrum.
|
![]() Grassmann's diagram of dotted line shows a complementary mixture of "yellow" and "indigo" light; letters denote Fraunhofer spectral absorption lines from Grassmann (1853) |
|||||||||||||||||
| Finally, the proportions (relative luminances) of the two unique hues necessary to cancel a third monochromatic light at each wavelength were combined as two chromatic response functions. Representative curves produced by two individual subjects are shown at right; theoretical curves, based on the CIE 10° standard observer, are shown below.
opponent functions in spectral hues
The y/b opponent function is represented by the yellow/blue curve. The area where the yellow curve is above the horizontal line, with a maximum in "yellow green" wavelengths, indicates colors that appear to contain some yellow. The area where the blue curve is below the line, with a maximum in "blue violet", indicates hues that contain some blue. This y/b function contrasts the long and short wavelength ends of the spectrum.
The r/g opponent function (red/green curve) codes for hues that appear to contain some red (the areas where the red curve is above the horizontal line, with peaks in "blue violet" and "scarlet" wavelengths) as opposed to hues that contain some green (the area where the green curve is below the line, with a maximum in "middle green"). This r/g function contrasts the middle wavelengths from both ends of the spectrum.
The unique hues appear at any point where one opponent function is at the zero line where it cannot bias or alter the hue created by the other opponent function. Thus, unique yellow appears at the long wavelength balance point in the r/g opponent function, at about 573 nm, and unique blue at the short wavelength balance point around 472 nm. Similarly, unique green appears at the single balance point in the y/b function, at about 492 nm. (The exact location of these unique hues varies across individuals.)
What about unique red? There is no point where the y/b contrast crosses the neutral line a second time in the visible spectrum both contrasts taper toward neutral as the spectrum fades to invisibility, and near infrared "red" light (beyond 700 nm) actually appears more yellow (scarlet) as the wavelength increases to 900 nm. To produce unique red from monochromatic lights, a small amount of "violet" must be added to spectral "red" light, to neutralize the yellow hue. (This is one reason for the S cone input to the r/g opponent coding.)
Finally, Hurvich and Jameson adopted the photopic sensitivity function (gray curve) to represent the whiteness/blackness or w/k opponent function. Procedurally, they equated the w/k curve with the quantity of "white" light produced by the neutralizing mixture of spectral and complementary opponent hues. But this curve glosses over several problems. It does not account for the additivity failures in spectral mixtures, and blackness is not a color sensation created by isolated color mixtures it arises from the luminance contrast between a color stimulus and the color area around it. Luminance perception is actually structured as a brightness/blackness (b/k) opponent dimension: white is its neutral value.
|
![]() opponent function curves for after Jameson & Hurvich, 1955 |
|||||||||||||||||
| What Are the Opponent Functions? On a previous page I mentioned that a few studies have analyzed the reflectance curves of a large number of color samples from the Munsell color system. When the correlations among all these reflectance profiles are statistically simplified, they result in three weighting functions (right) that reconstruct the original reflectance profiles with 95% to 99% accuracy, depending on the study. The pattern of peaks and crossings of these basis functions is extremely efficient in the sense that every surface color can be defined by a unique location in just three dimensions.
These curves recognizably correspond to the general form of the three chromatic response functions: curve 1 is the w/k function, curve 2 is the y/b function, and curve 3 is the r/g function. There are discrepancies in the exact shape and crossover points of the curves (especially in function 1), but these discrepancies can be attributed to four factors: (1) reflectance curves are a physical specification of the color stimulus, not a perceptual specification via the cone fundamentals; (2) reflectance curves define the color independent of any light source, whereas the human opponent functions are optimized for chromatic adaptation along the dimensions of chromatic variation in the daylight phases; (3) the reflectance variations in the Munsell color samples are constrained to some degree by the limited number of pigments used to manufacture them; and (4) by far the largest proportion of natural surface colors are weakly chromatic and rather dark, which means the Munsell system contains a disproportionately large number of saturated and light valued colors.
These issues aside, if we plot the size of the weights assigned to saturated Munsell hues on the second and third weighting functions, the color samples once again arrange themselves in a hue circle (right) with an accurate pairing of complementary hues yellow green across from deep blue, and red across from blue green. This is persuasive evidence that, as we would expect, the perceptual structure of color vision is strongly tuned to the reflectance structure of natural surfaces.
|
![]() weighting functions that predict the spectral profiles of 1270 Munsell color samples hue circle created by the spectral weighting functions from Lenz, Osterberg et al. (1996) |
|||||||||||||||||
| Transforming the trichromatic signals into the opponent functions would not be difficult for a neural network that could flexibly sum, difference and reweight the cone signals. For example, the diagram (right) shows the Hurvich and Jameson opponent functions (color) overlaid with curves derived as simple weighted sums of the linear, population weighted cone fundamentals (gray); numbers indicate the weight and sign applied to each cone output.
Note the heavy weighting of the S cone outputs necessary to match the y/b opponent function estimated from hue cancellation experiments. This demonstrates the perceptual overweighting of S cones in comparison both to their impact on the opponent dimensions and to their sparse numbers among the much more numerous rod, L and M photoreceptor cells. All this implies that the wavelength locations of the unique hues are a byproduct of the opponent dimensions and not the other way around. Thus, the fact that very different sets of opponent functions, derived from statistical reflectance analyses or hue cancellation experiments, can produce the same hue circle, implies that the orientation of the oppponent dimensions 3 the location of the unique hues is of secondary importance. And in fact there are large individual differences in the choice of unique hues, which strongly suggests the hues are not structurally basic to color vision in the way the cone fundamentals are.
An exception may be the yellow balance between L and M cones. If the M cone outputs are approximately doubled (to compensate for the 2 to 1 majority of L cones in the retina) then unique yellow is located at the null point (L2M = 0). The S cones cannot upset this balance because their sensitivity in "green" to "red" wavelengths (above 525 nm) is nearly zero, and because the S cone peak response (at 445 nm) is the complementary "violet" hue to unique yellow (between 565 nm to 575 nm, depending on the color temperature of the light used to define "white"). Yellow around 570-575 nm is (not coincidentally) the direction of color shifts in daylight chromaticity and of the yellow tint that accumulates in the lens with age. By anchoring the y/b contrast on this yellow, both the internal constraints of lens yellowing and S cone influence, and the external constraints of natural light variations, can be compensated for by a single mechanism. The r/g dimension would adapt to provide an approximately independent contrast to the anchoring y/b dimension.
|
![]() weighted sums of the cone fundamentals that match the r/g and y/b opponent functions |
|||||||||||||||||
| Finally, the spectral hues where the r/g or y/b opponent functions are in balance, as shown in the CIELUV chromaticity diagram (right), can be contrasted with the white point or balance point between two photoreceptor outputs that emerge in various types of colorblindness. In trichromatic vision two areas of reduced hue purity appear at around 490 nm ("cyan") and 570 nm ("yellow"), and the color white appears as a single point inside the chromaticity diagram. In tritanopia (missing S cone), the white line extends from 570 nm to 460 nm in normal vision a very near visual complement to unique yellow. In protanopia (missing L cone), the white point changes into a confusion line across the chromaticity diagram from 494 nm ("cyan") to c494 nm: all colors on this line appear neutral or achromatic. In deuteranopia (missing M cone), the white line extends from 499 nm ("cyan") to c499 nm. Both confusion lines are close to the trichromatic unique red at c495 nm. This implies that unique red is a somewhat more reliable anchor for the r/g dimension, as seems verified by the much larger individual differences in the perception of unique green in comparison to unique yellow or red.
This problem can be explored with graphical data analysis, which has been used by multivariate statisticians for over a century. It lets us examine complex mathematics by means of pictures rather than numbers, and allows us to show complex transformations in simple geometrical terms. By making graphical adjustments to the color space changing the length of color dimensions, rotating our view of the dimensions, adjusting the angle between dimensions we can see the transformations from cones to colors as explicit sculptural steps. A Geometrical Color Standard. To start we require a graphical standard for the geometrical pattern we expect to find in the color appearance of color samples that have been correctly transformed from the light excitation produced in the cones. Since the 1920's, the most commonly used standard in color research has been the Munsell Color System. The Munsell system can be used to define colors that are perceived to be evenly spaced in hue and chroma across equal value (lightness or gray scale) differences. In addition, the Munsell hues 5R, 5Y, 5G and 5B approximately locate the Hering unique hues red, yellow, green and blue.
The Munsell system arranges colors along equally spaced, radial hue angles and circular chroma levels from the achromatic point at each lightness level. This ideal geometry is most recognizable when viewed in the orientation shown in the diagram below, which displays all hues within the chroma range typical of artist's paints and at the two Munsell values 4 and 8.
distribution of munsell color samples This pattern summarizes the expected relationships between color appearance and measurements of hue, chroma or lightness: chroma differences are equal across all chroma levels and for all hues; lines of constant hue are radial, straight and equally spaced around the hue circle; the "compass point" colors are Munsell hues 5Y, 10R, 5PB and 10BG; and these relationships are the same across all lightnesses. Another relationship, not visible in the diagram, is that each hue/chroma plane is perpendicular to the achromatic axis (like a wheel on an axle). To make these graphical comparisons, the color diagrams must represent height and width in equal scale units, turn the color space so that our view is directly down the achromatic axis (the gray scale), and place light yellow (5Y) at the top of the hue circle and red (5R) on the right. They also depend on the (certainly false) assumption that the Munsell color system distributes colors with perfect accuracy. Cone Excitation Space. The first stage in all zone theories corresponds to the transduction of light by the cone fundamentals, which define the relative amount of photoreceptor excitation produced by light of different wavelengths. The total stimulation is calculated by multiplying at each wavelength the L, M and S cone sensitivity by the color's spectral emittance curve, then adding the products across all wavelengths in the visible spectrum. This produces a triplet of cone excitations Le, Me and Se that produce the geometrically irregular and curved cone excitation space that was introduced earlier to describe the stimulation of the cones by monochromatic lights. It is not possible to measure cone excitations directly. Instead, a transformation function is applied to the XYZ tristimulus values to estimate the L, M and S values for each color sample.
This space has very different characteristics when displayed in terms of surface colors, as shown below for Munsell color samples within the range typical of artists' paints (Munsell values 1 to 9 across chroma levels /2 to /12).
munsell color system in the cone excitation space munsell value steps 1 to 9 In this view the cone excitation space is a cube, with the L, M and S cone excitations represented as the depth, horizontal and vertical distances from the back left corner (0); the cube is tilted forward slightly to make differences in the L direction easier to see. The color stimuli inhabit a small region of the total cubical space, forming a narrow, elliptical cone with apex at the origin (zero excitation in all three cones). This is the fundamental topology of human color vision. Hues are oriented with yellow downward (as yellows produce nearly zero stimulation in the S cones). A plot of the original XYZ values produces a nearly indistinguishable color distribution.
Each value plane is almost perfectly flat, hue angles are all approximately straight and radial from the achromatic point, and complementary hues are radially opposite each other. The curvature of the color space produced by monochromatic color stimuli is not visible in surface colors. |
![]() lines of balanced (zero) values on after Burns et al. (1984) |
|||||||||||||||||
| However, the distribution of colors displays some quirks, which are apparent when viewed from the side or above (diagram, right). The achromatic axis (white line, above), which intersects the pure gray value at each lightness, forms a rising diagonal across the space, while the luminosity function (green line, above) lies in the L,M plane. That is, the S cone contributes to chromaticity but not to luminance; whiteness is not the same as brightness.
Each Munsell value plane is perpendicular to the L,M plane, so that the projections of all colors onto the plane form a series of lines across the luminosity function (gray lines, above), but the hue planes cross both the luminosity function and the achromatic axis at oblique angles.
Next we orient the colors at values 4 and 8 to match the Munsell ideal standard (above). To do this, the space must be turned or rotated so that the achromatic axis is aligned with the direction of view, then turned on its head to place yellow at the top, as shown below.
munsell color samples in cone excitation space The amount of rotation depends on how the cone fundamentals are weighted (equivalently, how the white point is defined), but approximately a 45° turn around the S dimension and then a 30° tilt downward around the new LM dimension will bring all the gray points (wp) together over the origin. In this orientation, the fundamental two part geometry of the cone excitation space is visible: the L+M excitations define the horizontal dimension, and the S axis the vertical dimension. Now we can itemize the asymmetries in the cone excitation space that must be remedied in order to produce the geometry of equally spaced color samples: (1) At lighter values, larger increments in cone excitation are necessary to shift the lightness to the next Munsell value, although there is a constant ratio between the cone excitations at adjacent value levels. (See diagram, above right.) (2) At lighter values, larger increments in cone excitation are necessary to produce equal chroma changes, as shown by the increased diameter of the hue plane and of the innermost (chroma = 2) ring at value 8 compared to value 4. (3) The hue/chroma planes are not circular but elliptical: the excitation distances between chroma values are much smaller in the L and M direction than in the S direction. (4) Hue angles are not equally spaced; instead hue differences are much smaller among some hues (yellows, oranges and blues) than others (magentas and greens). (5) Each hue/chroma plane of colors having equal lightness or Munsell value is not perpendicular to the luminosity dimension or to the achromatic axis. Note that the distribution of surface colors in the cone excitation space is far more regular than the distribution of monochromatic lights. Color vision is adapted to perceive surface colors accurately, not spectral hues or rainbows. Cone Contrast Space. The next step in color vision actually embraces several color transformations. In modern color appearance models and chromaticity diagrams, cone excitations must be adapted through response compression, the definition of preliminary opponent dimensions, and a complete separation of the luminosity and chromaticity dimensions. Physiological and psychophysical studies over the past few decades have clarified additional early transformations, in particular the recoding and normalization of cone outputs as a cone contrast space anchored on the background or adaptation values. Response Compression. The photoreceptor outputs actually signal changes in light in relation to the baseline cell excitation. Photoreceptors do not signal an absolute level of light energy but a relative or proportional difference or contrast to the amount of light that has come before. Thus, the cone responses already include some part of luminance adaptation to the average luminance, and chromatic adaptation to the average chromaticity, of the visual field. Further adjustments are also made at later stages of the visual process. This is specified as a second triplet of cone values La, Ma and Sa which quantify the baseline response of the cones. (In the early CIE color models, which are based directly on XYZ tristimulus values, the adaptation triplet is the illuminant specification XwYwZw, where Yw = 100 always.) The contrast outputs LΔ, MΔ and SΔ are then the contrast ratio of color excitation over adaptation excitation:
These contrast values can be either positive or negative; the null point or origin of the space represents the La, Ma and Sa values. The most important feature of these contrast or change ratios is that they constitute a comparison between two different stimuli (the color and the adaptation background), which defines a Weber fraction. Fechner's Law states that a stimulus contrast will produce an equal perceived color contrast whenever the color stimulus is a constant proportional difference (ΔL) from the baseline stimulus (La). This contrast space disguises the absolute color and brightness of the visual field (because all adaptation states are set equal to zero), but standard practice is to use the white point to represent adaptation to an achromatic background perceived as "white" or "gray" (for surface colors) at a specific luminance level. A thornier problem is that the dimensions of the space represent relative increases or decreases in cone stimulation, which are different for different observers, adaptation backgrounds, spatial or temporal variations in the stimulus, or amount of color contrast. A common procedure is to make each dimension proportional to the maximum possible stimulation possible in each situation.
In modern color appearance models, which deal directly with stimulus quantities (in the form of colorimetric tristimulus values) rather than cone excitations, the response compression is defined as a power transformation or exponent applied to the tristimulus values before any other transformations are made. |
![]() side and top view of munsell colors |
|||||||||||||||||
| Different exponents have been used in different color models and as applied to chromatic or luminance responses. The diagram (right) illustrates the problem in terms of the luminance factor (CIE Y) of a surface in relation to its perceived lightness (Munsell V). At dark values, a relatively small increase in the luminance factor produces a very large change in the perceived lightness, but at light values a large change in the luminance factor produces an increasingly smaller change in perceived lightness. As shown, applying an exponent to the curved stimulus quantities Y causes them approximately to match the linear perceived quantities V. (CIECAM uses the exponent 0.43; CIELAB uses the exponent 0.33 and then, for the lightness L scale, adjusts the intercept and slope of the resulting line.)
effect of response compression on chroma spacing As shown above, the compression exponent (0.33) has a more complex effect on the hue/chroma planes. It gives the chroma rings approximately equal diameter at each lightness level, and somewhat reduces the elliptical shape of the chroma contours, primarily by changing the relative proportions along the vertical (S cone) dimension. This changes the geometrical distribution of colors from an elliptical cone to an elliptical cylinder; but it increases the rightward tilt of the chroma boundaries. Note that the chroma spacing increases substantially in yellow hues at all lightness levels, primarily due to the exponent effect on the extremely small S cone values. This implies that all cones paradoxically have their largest relative impact on color discrimination (chroma and hue differences) when their signals are close to zero. The same effect is visible in the nearly vertical lightness slope of the Y function at near zero luminance factor (diagram, right). Postreceptor Opponent Contrasts. The cone contrast space defined by the LΔ, MΔ and SΔ provides the framework for the definition of a preliminary opponent contrasts of the trichromatic outputs. This opponent recoding was the second stage in early zone theories. It pools the contrast information early in the visual system in order to preserve its signal accuracy and to create separate visual channels for luminance (brightness) and chromaticity (hue/chroma) information. These postreceptor opponent contrasts have been measured in early visual pathways and have very different response and adaptation behavior to visual contrasts across space or time. They are usually defined as:
|
![]() exponential compression of |
|||||||||||||||||
| The version proposed in 1984 by Derrington, Krauskopf & Lennie (known as the DKL opponent space) has been especially influential in the neurophysiological stream of color vision research. It is based on an isoluminant chromaticity diagram devised by Donald MacLeod and Robert Boynton (diagram, right).
This chromaticity space standardizes all values on the luminance Lum (L+M) response and gives all three cone responses equal weight. Some color spaces double the M response to the red/green contrast (L2M). The Lum and LM dimensions are created by a single orthogonal rotation of about 43° around the S axis of the cone excitation space, which aligns the luminosity axis with the direction of view toward the color samples. These new dimensions are simply weighted sums of the original cone excitation dimensions, as follows:
This procedure also forces a new distinction between the luminosity and achromatic axes. If the Lum (L+M) dimension includes any S output, then a second rotation of about 26° around the LM dimension has been applied to align the achromatic points with the direction of view. This defines a whiteness/blackness or lightness induction dimension (W). The innovations here are that the cone excitations have been combined according to specific proportions that establish contrast, priority and independence among the cone signals; and that four distinct quantities Lum, W, LM and LumS are in play.
Separating Chromaticity from Lightness. The final adjustment addresses the oblique angles between each hue/chroma plane and the achromatic and luminosity axes (see diagram above, right). These angles remain even after the previous transformations have been applied, as shown below in the opponent dimensions of response compressed, rotated cone excitation space. (In the original XYZ tristimulus values, the XZ hue/chroma planes are already perpendicular to the luminance Y and achromatic axes, so adjustment of L+M+S is not required.)
separating chromaticity from lightness The tilt or slope (orange line) along the LM dimension arises because the L cones are weighted more heavily than the M cones in luminance than in chromatic perception; and on the L+MS dimension because the S cones are weighted more heavily in chromatic than luminance perception. The remedy is simple: the separate contributions of the L, M and S cones to the L+M+S (whiteness) dimension are adjusted by small slope factors (0.32 and 0.47) so that lightness values remain constant across changes in the values of the LM and L+MS dimensions. The solution for this "perpendicular white" (Wp) looks like this:
This effectively weights the L cone output to about 2 times the M output and over 50 times the S output in the definition of the whiteness L+M+S dimension. Called an oblique rotation, it orients the hue/chroma planes perpendicular to the achromatic axis, and aligns the luminosity and achromatic dimensions along the LM dimension: saturated red and green are roughly equal in lightness. In either space, a residual correlation between lightness and chromaticity remains along the L+MS (YZ) dimension, visible above as the leaning shape of the color distribution. These are not removed, because it corresponds to the perceptual fact that dark purples are much more saturated than dark blue greens, and dark yellows and light blue violets are not strongly saturated.
Perceptual Opponent Space. The third step in zone theories stipulates that the separate cone contrasts are combined into perceptual opponent dimensions. These dimensions are presumed to describe the conscious structure of color perception in other words, the distribution of colors in the Munsell color system. In modern color models this stage constitutes a final adjustment of the postreceptor opponent dimensions (the "preliminary" ab dimensions in CIECAM). |
![]() macleod boynton isoluminant chromaticity diagram |
|||||||||||||||||
| Where have the postreceptor opponent dimensions taken us? After the cone excitations are response compressed, rotated, made perpendicular to and normalized on the maximum value on the achromatic (Wp) dimension, the spectrum locus appears as shown (diagram, right). The opponent dimensions place "cyan" (495 nm) on the L+MS null line and "unique yellow" (570 nm) on the LM null line; these are the commonly ascribed balance points for the yellow/blue and red/green opponent contrasts.
The dimensions puts the maximum value of LM at a the warmest "red orange" hue (608 nm) and the minimum at "unique green" (522 nm); and puts the maximum value of L+MS at middle "green" (545 nm) and the minimum at "blue violet" (445 nm). These dimensions are related to the equal area cone fundamentals as:
The large weights given to the LM contrast are necessary to overcome the narrow elliptical shape of the color space. The diagram (right) shows the spectral profile of the chromatic postreceptor opponent functions. The recognition features are the absence of positive "violet" hue coefficients on the LM curve, and the very large negative "violet" hue coefficients on the L+MS curve.
The diagram below shows the hue/chroma distribution of surface colors. The L+MS null line is placed at a distinctive yellow green (leaf green) and a blue violet (approximately Munsell 5GY and 7.5P). The LM null line is placed at blue green and magenta, at Munsell 7.5BG and 10R. These "balance hues" do not correspond to those at the spectrum locus, especially in the placement of unique yellow (Munsell 5Y): we expect this because human vision is adapted to perceive surfaces, not spectral lights.
munsell color samples in perceptual opponent space Four distinct problems are still unresolved in the postreceptor opponent contrasts: Most important, a pronounced elliptical distortion remains in the chroma rings: the blue greens are too compressed in comparison to the yellows, and the hue angles are not equally spaced around the hue circle (compressed in yellow and blue, expanded in green and magenta). The elliptical distortion is itself distorted in the direction of yellow green and red, and this distortion increases at higher chroma levels, so that the near neutral chroma rings are roughly elliptical but the saturated chroma rings are clearly wedge shaped. Yellows with a distinctly green content are on the "red" side, and blues with a pronounced red content are on the "green" side, of the LM (red/green) null line. (That is, the vertical "compass points" in the geometrical standard are rotated clockwise), However, this cannot be corrected by rotation, as the horizontal "compass points" (10R and 7.5BG) are approximately correct. The chroma intervals are not equivalent at different lightness levels; comparing Munsell values 4 and 8, the disparity is especially large in yellows and blues. Necessary Additional Transformations. These four problems remain the most significant obstacles to accurate color scaling of color samples viewed in isolation (that is, colors that are not significantly affected by color context). They typically require additional transformations in quantitative color models that do not, as yet, have a clear connection to the neural machinery of color vision we can't explain why they are necessary. The current level of model sophistication is exemplified by the dimensions of brightness (Q), lightness (J) and the hue/chroma plane (aC and bC) in CIECAM. These are approximately defined as:
A computational trick was used to derive these weights: the starting or preadaptation R, G and B values in CIECAM are set to 1.0, with the values on the remaining two cones set to zero, and these values are passed through the CIECAM mechanism; the weights above are the final values of these "colors". These weights will change according to the luminance of the color and the definition of the viewing context, and the CIECAM cone fundamentals are different from those used in the previous examples; but the weights do illustrate the modifications necessary to "improve" the postreceptor opponent space. These are: a decrease in the L cone weight, as a proportion of the M cone weight, in all dimensions a substantial decrease in the negative S cone contribution to the bC (L+MS) dimension the addition of a substantial negative S cone contribution to the aC (LM) dimension separation of the achromatic dimension (or lightness, J) from the luminosity dimension (or brightness, Q) produced by a decrease in the relative S cone contribution.
In other words, the CIECAM color space depends on the extensive redistribution of the S cone outputs (equivalently, the effect of short wavelength light) in the definition of the opponent dimensions. A similar redistribution is silently effected in CIELAB by the fact that the "red" (X) and "green" (Y) tristimulus values include a substantial "violet" content. |
![]() the postreceptor opponent space (top) plot of spectrum locus; (bottom) opponent hue coefficients |
|||||||||||||||||
| The diagram (right) shows the spectrum locus and hue coefficients in CIECAM. The key differences with the postreceptor opponent functions are primarily in the "yellow", "red" and "violet" (and extraspectral) hues, and in the sharply curtailed "violet" end of the curve. The recognition features are the positive hue coeffients (and second null point) on the aC (LM) dimension in "violet" wavelengths which is small in CIECAM but much larger in, for example, the Hurvich & Jameson perceptual opponent dimensions or in CIELAB; and the relatively small negative values for "violet" hue coefficients on the bC (L+MS) dimension.
The distribution of surface colors (below) shows a much more circular outline in the chroma rings, and better placement of the yellow and blue violet colors in relation to the vertical null line. However, there is still significant irregularity in the chroma contours, visible as the "wedge" exaggeration of chroma distances in the yellow green and red directions, and in the crowded hue spacing in yellow greens, reds and blue greens and the exaggerated hue spacing in greens.
munsell color samples in CIECAM ab space At this point, the divergence of the color distribution from the ideal, perfectly circular distribution runs up against a variety of methodological issues. One is that the accuracy of Munsell scaling has been questioned; for example, it is generally agreed that the Munsell hue spacing across green colors is too large, and the chroma scaling seems to be inconsistent across hues and values.
Nevertheless, the problem with the chroma scaling likely represents a genuine aspect of color vision. We get a better picture of this problem if look first at the raw data: the original X,Z tristimulus values in a single hue/chroma plane, with the X dimension centered on Y and expanded to produce a near circular ring at chroma 6.
distribution of munsell samples on x,z tristimulus values
The geometrical problem appears more clearly in the deviation of each hue sample from the circular placement in its chroma level. This produces the diagram below; value 6 is used because it is the only hue/chroma plane that completely covers all hues out to chroma 14.
deviations from circular chroma |
![]() the CIECAM perceptual opponent space (top) plot of spectrum locus; (bottom) opponent hue coefficients |
|||||||||||||||||
| The hues where the maximum deviation occurs in each quadrant shift slightly as the chroma increases, but the basic pattern is relatively stable across different lightness levels, different compression exponents (when applied uniformly to all colors), and different locations of the achromatic center. The pattern is primarily determined by the cone fundamentals or color matching functions used to define the cone excitation space, and by the numerical definition of the opponent dimensions. The axes of the deviations are also not exactly perpendicular, as shown in the diagram (right).
The correction for these deviations is probably a more nuanced specification of the compression exponent. There are at least three different problems to solve. The first is the increased, eventually linear compression of chroma spacing along the zero S (=Z) line for hues between 7.5G to 7.5YR at chroma above 8. This is clearly visible as the flat chroma boundary across the yellow part of the color distribution in the x,z and opponent spaces (diagrams above and right). The second problem is the spike of inflated b+ chroma values, which is due to the peak values of the Lum (L+M) sum where they cannot be moderated by subtracting S values near zero. The effect disappears across the yellow orange and orange hues because these colors are substantially less luminous. This implies a smaller exponent (a larger chroma deduction) across the yellow green hues. The third variation, the gm+ and g+m deviations, suggests a third exponent operating in relation to the sign of both opponent dimensions: the exponent produces a relatively smaller compression for hues where the sign product of the ab dimensions is negative (greens or purples), and a higher compression when the sign product is positive (oranges and blues). Perhaps the most significant themes to emerge in this discussion of undefined additional corrections is the regulatory importance of the S cones in the specification of an opponent color space, and the complex interplay of different compression, rotation and normalizing steps necessary to reach an approximately circular scaling of perceived chroma. As mentioned earlier, many of the problems described above are minimized by direct use of XYZ tristimulus values, because these already redistribute the S ("violet") content to the other dimensions so that the "red" (X) dimension is actually a magenta, the hue/chroma planes are already perpendicular to the luminosity dimension, and the exponent compressed dimensions form nearly circular chroma rings. But this means that many of the necessary transformation steps are "precooked" into the XYZ values; alternately, that the XYZ values must be "uncooked" (transformed or rotated) to get back to the (theoretically) primitive LMS values. Starting from the LMS values displays the essential information processing steps more clearly, and shows that color vision is a tightly integrated, precisely balanced process. It also confirms that current color models do not yet describe it accurately. Measuring Perceptual Discrimination. The perceptual geometries of vision are identified through judgments of color difference by comparing two colors that are different along one or more of the colormaking attributes, while holding the other attributes constant. The kinds of questions addressed in this way are: What is the smallest stimulus difference we can perceive on each of the three colormaking attributes? Does discrimination ability change as the physical properties or intensity of the stimulus changes? How does change on one colormaking attribute affect color appearance on another colormaking attribute? All these questions concern related colors one color seen in contrast to beside another. As a bridge between the realm of color discrimination and the colorimetric definitions of color matching, I open each discrimination section with the color geometry predicted by standard chromaticity diagrams, which describe unrelated colors or colors perceived in isolation.
The "Just Noticeable Difference". For starters we need a method of measuring discrimination that can apply to any color attribute we can manipulate by changing a physical stimulus. The metric discovered in 19th century psychophysics and used frequently since then is the just noticeable difference or jnd.
|
![]() dimensions of eccentricity in the |
|||||||||||||||||
| As shown at right, a stimulus is presented at a starting quantity Q of a stimulus attribute (luminance, chromaticity, or hue for colors). Once the viewer is adapted to this stimulus, the attribute is increased by varying amounts and each time the subject is asked, yes or no, if there is a difference in the stimulus. After repeated trials the increment T that produces a perceptible difference is identified, or the increment that produces a correct response 75% of the time is used to compensate for the effects of guessing.
This defines a quantity of stimulus change T, known as the difference threshold or 1 jnd. When the starting quantity is the stimulus zero value (complete absence of the stimulus), then the first jnd value T is called the absolute threshold Q0. This is the minimum stimulus quantity necessary to produce any sensation. The process can be repeated by using the previous value Q as the new starting value, increasing the stimulus intensity again until a new difference appears, and taking this new value of T as a difference threshold at that stimlus intensity. By repeating this procedure across incremental increases in the stimulus quantity, the subjective sensation produced by any physical attribute can be measured off using the "yardstick" of the just noticeable difference. The Weber Fraction (ΔQ/Q). At each step the jnd is the smallest detectable increase T in the stimulus, given the starting value of the stimulus. It turns out that this increase is often a constant proportional increase (k) in the stimulus intensity, which is defined by the Weber fraction (pronounced Veybur): k = T/(Q + Q0) or equivalently ΔQ/(Q + Q0) Weber's law asserts that the ratio k is constant across a wide range of stimulus quantities, although the value of k is different for different sensory domains or types of stimulus. If this is assumed to be true, then Fechner's Law allows the estimation of the sensory intensity (S) from the stimulus quantity, as: S = a + b*log[Q + Q0] where a and b are applied to remove negative log values and match the stimulus units to the perceptual quantities. It is now, especially in color vision research, typical to express this relationship as a power function, known as Steven's Law, where the exponent 1/e is typically less than 1 (e=2 or 1/2 is the square root) S = b*[Q + Q0]1/e where b is used to fit the stimulus units to a practical perceptual scale. Similar power functions occur in all sensory domains vision, taste, hearing, touch, smell, pain and muscular contraction (sensations of weight or effort) although the exponent in each domain is different. In fact, log or power functions similar to Fechner's Law or Steven's law are used in many scales related to energetic quantities stellar luminances in magnitude, sound in decibels, earthquakes on the Richter scale, ion ratios on the pH scale, and so on. I should also mention a generic formula that is used to model the depletion of photopigment by light (derived from an enzyme kinetics curve):
where R is the response as a proportion of the maximum possible response Rmax, L is the stimulus luminance (or retinal illuminance), σ50 is the luminance value that produces a 50% response (the half saturation value), and n is the response compression exponent (usually around 0.70). (Zero is the assumed minimum response.) The implications of this function are discussed in the section on luminance adaptation mechanisms.
The Power Function in Lightness. What is the effect of the Weber fraction on perception? The relationship between the perceived lightness of a surface color and its reflectance or luminance factor (luminance as a proportion of the luminance of an ideal "white" surface) shows the perceptual geometry very clearly.
lightness as a function of reflectance
Underlying these sensory power functions is the common sensory property of response compression. Fundamentally, response compression represents a compromise between the physical limitations of an organism and the enormous range of a real world stimulus. In effect, the organism experiences an increase in the stimulus as a sensory change proportional to the remaining sensory response which becomes smaller as the sensation approaches its physiological upper limit. |
![]() just noticeable difference in a stimulus of quantity Q |
|||||||||||||||||
| The graphic (right) illustrates the perceptual effect of response compression in lightness. Equal increments of the stimulus intensity become less and less perceptually potent as the intensity of the stimulus increases, just as a single lightbulb adds considerably to the illumination of a candlelit room but in broad daylight makes no impact at all. To raise lightness or brightness by equal perceptual amounts, the luminance or reflectance must be raised by a greater absolute amount at each step. Thus an equal interval luminance or reflectance scale (equal Lum) produces a brightness/lightness scale (B/L) skewed toward lighter values, while an equal interval brightness/lightness scale (equal B/L) arises from a reflectance or luminance distribution (Lum) skewed toward lower values. The effect is to divide the lightness distribution into two parts light grays and dark grays with the proportionately best lightness discrimination in the light grays (medium to high luminances).
"Less" and "more" luminance is always relative to our light adaptation around an average luminance (which is usually a 19% luminance factor for the "middle gray" used in graphic arts but is 10% to 13% for the "gray" luminance used to meter photographic exposures). For illumination levels above a few lux the subjective effects of response compression are remarkably consistent. The sensory response compression displayed in psychophysical power functions is the single most important asymmetry in color vision. Its effect throughout the visual system is to create curvatures or nonlinearities in stimulus intensities across steps of perceptual difference that appear to us as evenly spaced "just noticeable differences".
Luminance Discrimination. Response compression shows that increasing quantities of light produce decreasing sensory changes as the light intensity becomes very high. However, this tradeoff is not consistent across all luminance levels. Instead, the Weber fraction or "constant proportion" between the starting luminance and a just visible increase in luminance varies with the average light intensity.
|
![]() two sides of response compression as defined from |
|||||||||||||||||
| In fact, changes in the Weber fraction are apparent just within the range of luminances that characterize surface reflectances (diagram, right). The first jnd below a pure "white" surface represents about a 2.6% decrease in luminance; but at low luminances (low reflectances, or very dark surfaces) the incremental change in luminance rises to 10% and above. This imposes a hard "floor" or minimum sensitivity on lightness gradations, which we do not experience because nearly all surfaces have reflectances greater than 5%.
If we consider difference thresholds in the brightness of two lights viewed side by side, in the Weber fraction (ΔL/L) similarly changes across luminance levels from .0001 to 10,000 cd/m2 (equivalent to light intensity or illuminance on a white surface of about .0003 to 30,000 lux), as shown below.
weber fraction across luminance levels
The kink in the curve at about 0.005 cd/m2 is due to the different response compressions in cone and rod photoreceptors. It indicates that the cone mediated luminance response persists to lower light levels than the cone mediated chromatic response, which is usually said to disappear at around 0.1 cd/m2 or 0.02 lux. On either side of this kink the Weber fraction shifts as two different exponential functions, one for scotopic vision, and the other for mesopic vision where both rods and cones are active.
At very low light intensities, it is estimated that the eye when completely dark adapted can perceive one or a few photons at a time, so the visual threshold is very close to the physical zero value of the stimulus. Very high light intensities can completely saturate the photopigments in the eye and make any changes in intensity impossible to see. This saturation occurs at luminance levels somewhere above 100,000 cd/m2 the exact value depends on the duration of the exposure but the chart shows visual sensitivity already starts to decline above 1000 cd/m2.
At peak sensitivity the eye still responds to a limited luminance range, roughly 1:100 around any specific luminance adaptation value. Photometric or lightness scales are usually divided into 100 units because, at photopic light levels, the eye is able to discriminate at most many steps in surface reflectance when the eye is adapted to the "white" luminance.
50 perceptually equal luminance steps
For example, the illustration shows 50 luminance steps, from black to white, that should appear perceptually equal across the entire range. (The contrasts you are able to see depend on the brightness and contrast settings on your computer monitor.) Note that the differences between adjacent steps are barely visible with edge enhancement, and become indistinguishable when separated by a black line.
Now let's step outside the visual range and look at the scope of vision across all light levels. the graph shows the same compression function across familiar illumination levels.
equal perceptual differences across illuminance levels
Finally, the sensitivity of the eye to luminance or lightness changes depends on the apparent size of the color area. Contrast sensitivity peaks for surface areas subtending a visual angle of about 0.2° (the apparent width of a #2 pencil seen from 7 feet, or less than half the width of the full moon). Sensitivity declines slightly but remains good for areas of much smaller size (such as the lines forming the letters of this page), and more rapidly for visual areas subtending 1° or more (a USA dime at 3 feet).
Hue Discrimination. Consider next the ability of normal trichromats to see the difference between two similar hues, a test of hue discrimination.
As a template of the hue discrimination we expect to find, the diagram below shows the hue angle distance between adjacent wavelengths at the spectrum locus, as measured at the white point, using three different chromaticity diagrams. The log hue angle difference is used because the hue angle is made smaller by increased chroma (distance from the white point), and chroma shows response compression as chroma increases.
Again, the chromaticity diagrams represent unrelated colors as produced entirely by the L, M and S cone outputs. So they represent the hypothesis that the cone outputs define our hue discrimination abilities. If actual hue discrimination does not match these curves, then hue discrimination must be affected by additional visual processes.
hypothetical spectral hue discrimination
These curves are hard to visualize as a color circle, so the diagram below presents the CIELUV predicted spectral hue angles projected to an equal radius (as shown below for the spectral location of "green" at 540 nm). This shows the predicted perceived spacing of spectral hues as the radial distance between hue markers.
spacing of monochromatic hues in CIELUV
Now here is observed hue discrimination performance, measured as the minimum wavelength difference necessary to make two monochromatic lights, presented one above another at equal luminance, appear just visibly different. The three curves represent hue discrimination ability at three illuminance levels (in trolands) for three stimulus sizes (angular widths).
spectral hue discrimination
The areas of best and worst hue discrimination differ from the chromaticity predictions. Best discrimination is around 490 nm ("cyan") and 595 nm ("orange"), which are almost exact visual complementary hues; the "green" peak of weaker discrimination is closer to 540 nm. There is also a second hump of poor discrimination at around 450 nm, bounded by a third area of good hue discrimination at around 420 nm; these are completely absent from the predicted curves.
|
![]() the weber fraction is not constant across luminance changes in lightness perception |
|||||||||||||||||
| However, a generally very good match is obtained in a calculation from the absolute change between two spectral wavelengths in scores on the y/b, r/g and w/k opponent functions combined (right). The log of this absolute change represents the "orange" and "cyan" basins approximately correctly, gets the two peaks of poor "blue" and "green" discrimination, and also catches the third area of good hue discrimination around 420 nm.
There are a few details where the opponent dimensions diverge from the experimentally observed hue discrimination curves. But the overall fit is close enough to suggest that the opponent dimensions, not the cone fundamentals, define the geometry of hue discrimination.
In the previous section I explained that the unique hues do not define the opponent dimensions. A logical corollary is that the unique hues do not explain hue discrimination, which is easy enough to test. |
![]() log hue discrimination computed from opponent functions |
|||||||||||||||||
| First the opponent dimensions must be transformed to show each hue as a proportional mixture of two unique hues, as Hering required. (This also minimizes the effect of changing brightness or saturation in the monochromatic lights.) After this adjustment, the opponent functions appear as shown at right, and their values are called hue coefficients or hue proportions. This shows that equal mixtures of unique yellow and unique green, or unique yellow and unique red, are quite close to unique yellow, rather than halfway toward the other hues.
The unequal spacing occurs because the results are shown along a nanometer scale, which corresponds to the visual spacing of hues in a diffraction grid spectrum. This is remedied by respacing the spectrum to correspond to equal changes on the hue coefficients (the proportional mixtures of two unique hues). As a result, some parts of the spectrum are perceptually expanded while others are greatly compressed (diagram below). This new spacing is hue discrimination acuity predicted from the unique hues.
spectral hues spaced along a hue coefficient scale It's intriguing that the predicted hue discrimination curves from hue coefficients and CIELUV are almost indistinguishable. Again the best acuity is located around 485 nm and 573 nm, with worst acuity at around 520 nm and at the spectrum extremes. The proportional change in hue discrimination from best to worst is too large (except for the spectrum ends), and the third area of acuity at around 420 nm is missed completely. To summarize: Neither the cone fundamentals (chromaticity diagrams) nor the unique hues predict observed hue discrimination ability as well as a log change score on the three opponent functions combined. The flaw is that the log opponent curve includes the w/k function or brightness change, which is excluded from chromaticity diagrams and was partially excluded from the experimental results. In fact, predicting hue discrimination is a complex problem that requires several added assumptions to work satisfactorily. (The discussion in chapter 8, "Chromatic Discrimination" of Color Vision, 2nd ed. by Peter Kaiser and Robert Boynton provides an excellent overview.) However, hue flexes and shifts in response to viewing or contextual factors, including color luminance, chroma, color contrast, chromatic adaptation and cognitive interpretation of the scene. It is reasonable to expect that hue discrimination cannot be explained by a single zone or mechanism in color vision though the opponent dimensions, provided they include luminance (the w/k contrast), do an excellent job. Hue Purity Discrimination. As explained in the introduction to chromaticity diagrams, hue purity (chroma or saturation) can be defined as the distance on a chromaticity diagram between a color and the white point, when all colors are standardized to have the same brightness. The diagram below shows chroma measured in chromaticity diagrams from an equal energy white point to the spectrum locus at each spectral wavelength. As before, the curves are based on three different chromaticity diagrams, expressed in log values because chroma shows response compression as chroma increases. The underlying measurement units were different across the three chromaticity diagrams, so all have been normalized to the same maximum and minimum values (which occur in the "violet" and "yellow" wavelengths).
These curves represent unrelated colors as produced entirely by the L, M and S cone outputs. So they represent the hypothesis that cone outputs alone define our chroma discrimination. If actual chroma discrimination does not match these curves, then it must be affected by additional visual processes.
relative maximum saturation of spectral hues The first point to note is the poor agreement among the three chromaticity diagrams. All (especially the foveal model) suggest a single minimum value at around 570 nm ("yellow"), where the spectrum locus comes closest to the white point. Both the CIELUV and 10° curves indicate a second chroma minimum or inflection in the curve at around 483 nm ("cyan"), and reduced "green" saturation in between. All models show elevated chroma at both spectrum ends, but the wide field (10°) and CIELUV curves indicate "violet" chroma is higher. What do the data say? Several testing methods have been used to assess saturation intensity or saturation discrimination in lights, but many are related to Helmholtz's method for measuring Sättigung (excitation purity) by the mixture of "white" and monochromatic lights of equal luminance. In these tasks, viewers: judge the amount of "white" content in a monochromatic light ("yellow" contains much white, "blue violet" none at all); count how many "just noticeable" steps of added white light it takes to make the spectral color disappear completely (about 5 for "yellow"; 20 or more for "blue violet"); or measure the minimum quantity of monochromatic light necessary to produce a just noticeable tint in white light (a lot of "yellow", very little "blue violet").
These tests (especially the last) are analogous to the painter's tinting test and clearly illustrate the overweighting of B cones in color vision. The geometry below shows the quantity of pure hue necessary to tint a pure "white" light, or the tinting strength of monochromatic light at constant luminance.
a tinting test using light mixtures These curves show reasonable agreement with the foveal (2°) chromaticity diagram, implying that the overlap in cone sensitivity curves primarily determines our chromatic sensitivity (or that Priest and Brickwedde used a foveal presentation of the color stimuli). The data allow some quantitative conclusions: a 15:1 mixture of "white" and monochromatic "yellow" light is necessary to produce a just noticeable tint in white, but for "red" light the mixture ratio is around 170:1, and for "violet" light it may go as high as 1000:1! By this measure, monochromatic "violet" light has over 60 times the tinting strength of "yellow" light.
These curves only show the first chromaticity step from the white point, not the overall geometry of chromatic intensity across the entire range of saturation values. They also do not indicate the response in surface colors, which are determined by more complex perceptual processes.
|
![]() plot of spectral hue coefficients from Hurvich (1997) |
|||||||||||||||||
| In surface colors, any simple relationship to cone outputs is lost. As a simple illustration, the diagram (right) plots total chromaticity signal (computed as the ratio between the L, M and S chromatic and achromatic responses at each lightness, summed across all three cones) against the CIECAM measure of saturation for five cardinal Munsell hues (5Y, 5R, 5P, 5B and 5G), across the physically possible chroma range at Munsell values 4, 6 and 8, into equal area cone fundamentals. The cone outputs were derived as the transformed tristimulus values for all colors. The basic feature is a relatively linear response between saturation and cone excitation across the red, yellow and green hues, but substantial response compression in the blue and violet hues.
The cone responses also show a very large initial increment in saturation in relation to the chromatic signal, a jump that is too large to be traced in detail by the Munsell chroma intervals. This indicates that the visual system is especially sensitive to small chroma differences from the | ||||||||||||||||||