Hi Don,
Excellent job!
I had started doing something similar with the Tolman railroad cancels a few years back, not expanding the listings, but putting them into an online searchable database. I had intended, once done, to start on the insurance cancels. As with many of my projects, time became an issue. The novelty and development wore off and all that remained was the grunt work (image manipulation and data entry), which at the end of a long work day or work week isn't what I want to spend my time doing.
You can see the initial attempt here:
http://www.revenue-collector.com/railroad.shtmlNot as polished or capable as your DB.
A couple of comments and suggestions:
1. I agree that having a separate number for every single variant, e.g., inverted day, month, and/or year slugs, seems excessively granular. Those can be treated as a "v" or "var" suffix to the base type, and if you really wanted to, use v1, v2, v3, etc. At least then you preserve the link to the base cancel type.
2. When Eric Jackson and I met at Chicagopex last year, he asked me if I was going to continue working on mine, because he actually used it periodically for the cancel text search. Mine differs from yours in that (1) I include everything but the date in the data to be searched, including the city and state, and (2) I do not do any whole word matching; I allow fragments and don't parse whitespace (I include whitespace in the data entry for readability, but when searching for a match, I strip all whitespace and punctuation from both search query and target. Why? Because frequently these cancels are found as partial strikes rather than complete cancels, especially on smaller denomination stamps, so you don't know what part of the cancel the collector has in front of them. Being able to search for any small portion is a boon to the collector.
Your database currently comes close, but it does use whitespace as a limiter, which on many cancels can be open to interpretation. Also, you don't include the city and state text in the searchable cancel verbage, which I think is an omission, as that text does appear as part of the cancel and if you only have a fragment, that could be important.
Using the Abington Mutual Fire Insurance Co. (A3) cancel as an example, in my opinion the search queries "NGTON MU" and "NGTONMU" should return positive results. The former does, the latter does not.
Also, doing searches for "NGTON MASS" or "NGTONMASS" should also return positive results as that is part of the cancel text as well, and could be a fragment that shows up on a stamp.
Minor quibbles that presumably can be tweaked by the search processing.
I can happily not feel guilty about pursuing my incarnation. :) Well done!
P.S. There are similar line drawings of ship cancels that Tolman did in a TAR article (which is downloadable from my website).
I'd love to see this approach extended to mining, proprietary, banking, etc.... a lot of work to be done on those, however, since there are no Tolmanesque volumes to work from that have the line drawings already done. Quite a few could be gleaned from images on my website though, by processing them through retroreveal. Lot of grunt work...
P.P.S. Bill purchased quite a large number of insurance cancels from me last year. There were still quite a few that I wasn't willing to part with just because of aesthetics or the stamps in question. When you get to the point of adding unlisted cancels, please let me know if any of the ones in the various collections I have imaged en masse but not yet processed, would be useful and I can pull them and make hi-res images.
http://www.revenue-collector.com/insurancecoll/http://www.revenue-collector.com/railroadmisc/http://www.revenue-collector.com/chicagopex/http://www.revenue-collector.com/2016cancels/