![]() |
||
|
The previous article in this series discussed gathering all of the purchase data from your past lotteries and merging it together to create a buyer history database. This installment introduces two techniques for analyzing this information - RFM analysis and experience rating. Why rate your buyers? Why should you rate your buyers? There are many reasons. One important reason is simple economics. If you are running a mature lottery, and have not been keeping track of lapsed buyers, you may be spending a huge amount of money for postage and materials on people who are not responding. You may want to know, year by year, how certain groups of people are responding. For example, how many new buyers are coming back the next year. You may want to send targeted letters to various classes of buyers at the start of your campaign. Everybody knows, at least instinctively, that the most responsive lottery players are those that have bought tickets in the past. You know, because you see their names every year. But have you created a historical database of all the purchases? Can you look someone up in the database, and instantly see their purchase history? Can you send thank-you letters to your best customers? Lottery players who spend $100 on a ticket are people who make a conscious, informed decision to buy. Its not like the 6/49 or a scratch-and-win ticket. A dollar is easy to spend without thinking about it. A hundred dollars is not. This is a good thing. It means that the people on your mailing list decided, at some point in time, that the cost of the ticket was money well spent. And this means that they are, as a group, somewhat predictable. Using RFM Analysis RFM Analysis (Recency, Frequency, Monetary value) has been described in many places, including the pages of Canadian Fundraiser. For a quick recap: the theory goes that your most recent buyers will be the most likely to purchase again. You take your database, sort it by recency, split it into 5 equal parts ("quintiles"), and assign the numbers 5 to 1 to the quintiles; 5 being most recent, and 1 being least recent. The same goes for frequency. Within each recency quintile, the most frequent buyers are most likely to buy again. Take each of the 5 recency quintiles, split them again into 5 equal parts, and assign the numbers 5 to 1 in order of frequency. Do the same again for the monetary value. What you wind up with is 125 cells, coded as 555,554,... down to 111. Each cell has the same number of records, or close to the same. People coded as 555 are usually your best customers, whereas people coded as 111 are generally the least likely to buy again. When applied to lotteries, Recency equates to which lottery they purchased in; Frequency to the number of previous lotteries the person has bought in; and Monetary represents the total number of tickets bought. Because most people buy a single ticket per lottery, and each ticket costs the same, Frequency and Monetary values have a very strong correlation. RFM coding is easy to implement if you have a reasonably proficient programmer or database administrator available. However, it is not useful by itself until your raffle has had at least 7 campaigns. Most write-ups about RFM analysis say that once your database is coded, you should do a test mailing to determine the response rates prior to your rollout. With a ticket raffle, however, you simply do not have time for that. Once you get your lottery license, the clock is ticking rapidly. You cannot afford to spend time waiting for responses to trickle in before selecting who you will mail to. That's where experience rating comes into play. Experience rating Experience rating will tell you, historically, what the response rates were for various groups of people, based on their buying patterns. It will let you answer all the following questions and more: 1. How many first-time buyers from Lottery 2 responded in Lottery 3? What percentage? How does this compare to the first-time buyers from Lottery 1? 2. How many consistent, long-term buyers do you have? What percentage bought tickets in the last lottery? 3. How much does the response rate drop if people do not buy a ticket in one campaign? Two campaigns? Having software experience-rate your existing database, or contracting the job to an outside company to do it for you will help you estimate your direct mail response rates before you start your campaign. The exact results will vary, of course, depending on many factors, not the least of which are the quality of your prize package, marketing efforts, and competition. Sign here, please Experience-rating can be done in all sorts of ways; our preferred method is to generate a signature for each buyer, using their purchase history. What is a signature? Its simply a set of characters that represent whether or not a purchase was made. Each position represents one lottery campaign. Start with a blank field. For the first lottery a person buys in, record a Y in the field. From then on, append either a Y or an N for each subsequent lottery they bought or did not buy in. You will end up with a series of Y's and N's, which together are the buyer's signature. Take a hypothetical example. Assume you have had three past lotteries - Lottery 1, Lottery 2, and Lottery 3. If 'Joe' bought in every one of the events, then his signature would be YYY. If he missed Lottery 2, then his signature would be YNY. If he only bought in Lottery 2, then his signature would be YN. Every person on the master file has a signature. By analyzing these signatures appropriately, experience rating computes the historical response rates for each signature class. The historical rates may also be used to estimate what the next campaign's response rates will be, with certain limitations. The calculations and assumptions required are beyond the scope of this article. In our work, we generally calculate the response rates for groups based on whether they are single or multiple ticket buyers, and an overall total. However, given sufficient data, groups can be broken down further by other factors such as gender or age. There are solid indications of gender and age differentiation in response rates. However, we have not performed a complete analysis on this. Nonetheless, we think it's important to track your buyers' age and gender. The following table represents our hypothetical example. It shows the response rates for buyers from Lottery 1 over the course of three campaigns. Please note these figures are for illustration purposes only. Comparison of response rates for our example
Looking at the table, you can see that the most recent and frequent buyers have the highest response rates. Additionally, compared to the overall total, multiple-ticket buyers have significantly higher response rates. However, remember that only perhaps 10% of buyers will purchase more than one ticket (at least with the standard $100 tickets). Looking at these numbers from another angle, you can see that of new buyers (signature 'Y'), 78% don't purchase in the next lottery. That's not to say they will never purchase; it may be a year, or 3 years down the road, but the vast majority of those that do not purchase again do not repeat ever. The last column of the table represents the percentage of buyers from Lottery 1 and where they ended up. For instance, only 9.5% of the original buyers continued to buy in every lottery. Mining for gold To illustrate these concepts in more detail, we projected the hypothetical information an extra few years to create a model of a 'mature' lottery. We wanted to compare RFM analysis with experience rating, and used our model to calculate the average response rates for each RFM cell. The following graph shows the experience-rated response rates (bars), and the percentage of the total predicted response (line) compared to the percentage of the total mailout. Each bar represents one RFM cell. That is; each bar contains 1/125 of the mailing list.
To show this chart in full detail, click here. Note that the RFM codes in the graph above were chosen only by sorting the data according to recency, frequency, and monetary value, with no regard to the experience rated response rates. As expected, the most recent & frequent buyers have high response rates. That they show such a strong correlation illustrates the potential of simply coding your database for RFM. The following table shows the relationship between mailing to the top 'n' percentage of the mailing list, versus the percentage of the expected response. In our example, the top 10% of the mailing list generates 40% of the direct mail orders.
What to do with the data Probably the first thing you should do once your database has been experience-rated is to identify all the lapsed buyers. That is, all the buyers that are no longer profitable to mail to. You can eliminate these people from the standard campaign mailings and save perhaps thousands of dollars in postage and materials. You can also use the information to start targeting your correspondence. You will know which buyers are beginning to lapse in their support of the lottery, so you could try to bring them back with a special mailing. Or you could send thank you letters to your core supporters. The possibilities are endless. Mailing list misconceptions You might think that as time goes on and your mailing list grows, that there will come a point where the responses from your mailing list alone will be enough to sell out your lottery. We believe this to be virtually impossible. In our most optimistic study models the mailing list alone produces at most 70%-80% of the total ticket sales. Under real-life conditions, we believe that a normal mailing list would produce at most 50% of the total sales. The only way that we could imagine a mailing list alone producing a higher percentage of sales would be to either buy a list much larger than the number of tickets in your lottery; or to engage in a year-round campaign to add interested people to your mailing list. We don't know whether or not this would be allowed under your lottery regulations, nor are we qualified to determine this. Choosing a method We recommend using both experience rating and RFM when analyzing your data. RFM codes alone do not tell you much, unless you track results year over year and determine your response rates. The standard 125-cell breakdown is not useful until your lottery has been running long enough to create significant differentiation in your buyer profiles. Experience rating can be used by itself if you can set up your data appropriately to that you can select buyers based on their projected response rates. Again, you need enough historical data to work with. Costs Unless you have expertise in-house, the process of building a buyer history, experience rating the data, and coding the buyers is not cheap. The costs must be compared to both the immediate savings in postage and materials, and the longer-term improvements you can expect from being able to create better relationships with your buyers. Savings Depending on your mailing list, you could save up to $50,000 or more in postage and materials. It depends entirely on how many lapsed buyers are identified and removed from the mailout. Final Words The data, response rates, and graphs used in this article are generated from models that we use for estimating response rates and should not be used to infer any specific information about your own mailing lists. Each lottery has its own unique characteristics and the data we presented here is developed using a general, non-specific model that attempts to illustrate basic industry patterns.
Richard Vandenberg is President of Vandenberg Systems Inc., a company that develops administration and ticket processing software for lotteries across Canada. His company also performs a variety of data analysis and processing, including creating buyer histories, RFM analysis and experience rating. He can be reached at (888) 228-1187, by e-mail to Richard@vansys.com, or via the Internet at www.vansys.com.
|