A few days ago, Don/51studebaker brought up some concerns about the topic of color identification:
https://goscf.com/t/44769&whichpage=2#380164I would like to explain what I have learned concerning the development of a color identification method utilizing consumer scanners.
Computers are perfect at one thing, consistency. The output of a program will always be the same if the input is always the same. The major problem in using them for color identification is is that achieving consistent input has many variables. The next problem to be solved is to write a program to quantitatively analyze the input and then output a color name that is meaningfully accurate. However, accurately comparing stamp colors via scans can be done with only minimal expense in the absence of a special program.
So far in my testing, 16 bit per color channel consumer grade scanners are sensitive enough to provide useable input. However, no two scanners are exactly the same. There will be at least minor variation even between two pulled off an assembly line at the same time. Then there are differences between different makes and models in how they physically acquire the scan.
The most significant differences can be reduced to an acceptable amount by utilizing a color calibration target (~$20). Maintaining a consistent work flow that correctly utilizes color calibration is tedious when done manually. However, it could be made to be quite painless by having the entire workflow in a single program. Incorporating the open source program LPROF (
http://lprof.sourceforge.net/) with correct presets would require of the user to only click on 4 landmark points of a color calibration target image.
Some of the variation in scans will still remain due to things such as different light sources. The most significant one is due to color metamerism, i.e. colors not appearing the same under different light sources:
https://en.wikipedia.org/wiki/Metam..._%28color%29 . I haven't done enough testing to determine if this is a significant issue or not. If it is, then metamerism can be eliminated by only using scanners which use a specific light source. It seems that the usage of LEDs is superior and is being adopted as standard. Image sensor type is another variable. You can read more about some of the issues if you like here:
http://www.imageaccess.de/?page=Whi...persCCDOrCIS . Even if only one subset of scanner type is suitable, many makes and models are available for purchase at affordable prices (below $300). As time goes on, the quality of scanners will increase and their prices will likely decrease.
Following the above, we now have acceptable input data. More specifically, by color calibrating, we now have a scan that is in the
Absolute Color Space. The concept of the ACS is why any further variation introduced by a video card, monitor, etc. does not matter. Any image that is in the ACS will display exactly the same as any other image in the ACS on every single monitor on the planet (see note 1 below for an uncommon caveat to this). Even without a specialized program, their similarities or differences will be accurate if both are viewed in the same program (Firefox, Internet Explorer, Photoshop, etc.). If the monitor you are using displays all images as slightly bluish, then identical colors in the ACS will appear as identical on that monitor. Similarly, all non-identical colors in the ACS will appear as non-identical on that same monitor.
At this point, no special program is needed. A color calibrated image is all that is needed to accurately compare one stamp image to another color calibrated stamp image using your eyes. It is here that different people's ability to perceive color differences and the quality of the monitor can be an issue. However, none of that matters if you use an image manipulation program's (Photoshop, paint.net, GIMP, etc.) color picker tool to compare the numerical RGB or HSV values of each stamp with each other.
With a color calibrated image, an individual's color perception doesn't matter to a program that goes over each pixel in an image and does meaningfully accurate analysis. This will require the program to do some statistical analysis, compare the results to numerical thresholds, then output a color name. These thresholds will need to be derived from analysis of many color calibrated scans of stamps that have already been identified as a particular shade. This is the most difficult part of computer driven color identification development.
I've been working with Jack Daley (3cent1861.com) who has had good success in shade identification of the 1861 3c series by analyzing 9600 dpi scans (close to 1 pixel per ink particle). He subjectively identifies (expert eyeballs) shades based on the the ratios of the handful of pigment particle types. These particles vary by size and also color categories such as reddish, orangish, bluish, and greenish. I'm working on integrating computational quantization to further utilize his findings. In essence, I am trying to computerize his color perception and knowledge. Unfortunately, our progress is at a snail's pace due to our limited availability.
From what I can tell, Jack Daley can't place every single stamp into a particular shade category. Stamps can be faded or stained. During production there are day to day ink formula variations, containers of ink not thoroughly mixed each time they are accessed, one side of the plate receiving more ink than the other, etc. that all lead to how every stamp has a slightly different appearance. Some stamps simply have a color that is between two shades. However, just like Mike McClung, he has found identifiable patterns correlating to the production dates.
In conclusion, 30 years ago the concept of today's smartphone was inconceivable. Today's consumer grade scanners are now accurate and affordable enough to help bring definitive color identification to philately's masses. The use of color calibrated scanners (~95% accuracy) will prevent accurate identification of a small percentage of stamps, but it would allow for orders of magnitude better accessibility than now. We have everything we need except for a sufficient amount of data and a program to simplify the process for the average user.
There are many more details than discussed here and many of the concepts might need more simplification to understand. Ask away, but it may take me a long while to reply.
Note 1.However it is only true if every computer recognizes and applies an image's color profile correctly. A color profile defines how a computer correctly displays an image. To explain this further, look a few weeks back to this thread:
https://goscf.com/t/43719 . The reason why the image appeared differently on Don's two computers is because nl1947's image did not have a set color profile. One of his computer's programs defaulted to one color space and the other computer defaulted to a different color space. This doesn't occur if an image has a color profile applied and the viewing program (Internet Browser, image viewer, Photoshop, etc.) utilizes the profile correctly. This tedious part of color management can be eliminated by having one program perform the entire chain from scan acquisition to analysis.
Further reading:http://postalmuseum.si.edu/research...orimetry.pdf Philatelic Shade Discrimination Based on Measured Color
Some scanners utilize infrared light to detect dust. This feature could aid the shade identification process:
http://www.heritagesciencejournal.c...7445-2-8.pdf Infrared False Color for Art Examination
http://chsopensource.org/2013/02/13...up-workflow/ Identification of pigments by multispectral imaging; a flowchart method