{"id":722,"date":"2019-11-26T18:25:37","date_gmt":"2019-11-27T02:25:37","guid":{"rendered":"https:\/\/loopinput.com\/?p=722"},"modified":"2020-01-14T23:38:10","modified_gmt":"2020-01-14T23:38:10","slug":"why-i-hate-nps-with-a-passion","status":"publish","type":"post","link":"https:\/\/archive.blogs.harvard.edu\/loopinput\/2019\/11\/26\/why-i-hate-nps-with-a-passion\/","title":{"rendered":"Why I Hate NPS With a Passion"},"content":{"rendered":"<h1>Why I Hate NPS With a Passion<\/h1>\n<p><a href=\"https:\/\/loopinput.com\/why-i-hate-nps-with-a-passion\/\">Blog With Images<\/a><\/p>\n<h2>The Tale of Two Products<\/h2>\n<p>After purchasing a product on an online e-commerce store, customers are asked the typical NPS survey question \u201cOn a scale of 0-10, how likely is it that you would recommend [product name] to your friends, family or business associates?\u201d for two different products.<\/p>\n<p>Product A Customer Ratings: [6,6,6,6,6,6,6,6,10,10] (Average Customer Rating 6.8)<\/p>\n<p>Product B Customer Ratings: [0,0,0,0,9] (Average Customer Rating 1.8)<\/p>\n<p>Their ratings are listed above. Look at the two data sets. Consider carefully. Which product do you think has a higher NPS (Net Promoter Score).<\/p>\n<p>The answer is their NPS is exactly the same.<\/p>\n<p>Wait that\u2019s unbelievable\u2026 the first product has an average customer rating of 6.8\/10 while the second has an average customer rating of 1.8\/10. How is that possible?<\/p>\n<figure><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/loopinput.com\/wp-content\/uploads\/2019\/11\/image2-1.png\" alt=\"Spray Cans\" width=\"250\" height=\"327\" \/><figcaption>Pictured: Product A and Product B<\/figcaption><\/figure>\n<h2>What is NPS?<\/h2>\n<p>Welcome to the wonderful world of NPS, one of the most misguided scoring calculations ever invented.<\/p>\n<p>Before we start, what is NPS (Net Promoter Score) and how is it calculated?<\/p>\n<figure><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/loopinput.com\/wp-content\/uploads\/2019\/11\/image1-1024x553.png\" alt=\"Net Promoter Score\" width=\"800\" height=\"432\" \/><figcaption><\/figcaption><\/figure>\n<p>Take your data set of ratings between 0-10. Divide these ratings into three groups. Anyone with a rating of 7 or 8 is a \u201cPassive.\u201d Throw these scores out. Anyone with a score of 9 or 10 is a \u201cpromoter\u201d and anyone with a score of 0-6 is a \u201cdetractor.\u201d Take your % of detractors out of the total ratings and subtract it from your % of promoters out of the total ratings. Whatever % comes out of that, negative or positive, turns into your NPS score.<\/p>\n<p>That\u2019s it, you\u2019re done.<\/p>\n<p>So, if 50% of respondents were promoters and 10% were detractors, your NPS is 40. Well sounds straightforward enough\u2026 what\u2019s wrong with that?<\/p>\n<h2>NPS Example<\/h2>\n<p>In the above example, product A has two 10s (promoters) and eight 6s (detractors). So you subtract the % of detractors (80%) from the % of promoters (20%) to get -60% as a total which turns into an NPS of -60.<\/p>\n<p>Product B has one 9 (promoter) and four 0s (detractors). So you subtract the % of detractors (80%) from the % of promoters (20%) to get -60.0% as a total which turns into an NPS of -60.<\/p>\n<p>So product A has a final NPS of -60 and product B has a final NPS of -60. The products have the exact same NPS.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/loopinput.com\/wp-content\/uploads\/2019\/11\/image1-1.jpg\" alt=\"Girl Staring\" width=\"800\" height=\"445\" \/><\/p>\n<h2>Why NPS Sucks<\/h2>\n<h2>Arbitrary First-In Data Grouping<\/h2>\n<p>When working with ratings you will almost always have to lose some accuracy to achieve an average \u201crating.\u201d Often this involves rounding a final number.<\/p>\n<p>Rounding is a form of data grouping. When you are saying 49.469 will be rounded to 49.47 you are grouping data based on\u00a0 a set of parameters or rules. Usually this is done at the very last possible stage to avoid, as much as possible, abstracting out the detail in data too early to prevent inaccuracies.<\/p>\n<p>The sooner in the process you group the data, the more divergent comparable data sets will become in the final calculation because you stack small inconsistencies into bigger and bigger ones, creating a cascading chain of building inaccuracies. That is why most scoring systems round at the last possible step before producing a score.<\/p>\n<p>Additionally, the wider the range of this \u201cgrouping\u201d the more inaccurate, and less reflective of the original data set, your final rating becomes.<\/p>\n<p>NPS has one of the widest groupings imaginable. Every rating between 0-6 is considered exactly the same. That\u2019s right, 60% of all possible ratings are grouped and considered \u201cequivalent.\u201d A further, less egregious, grouping happens when 7 is considered equivalent to 8 and 9 is considered equivalent to 10.<\/p>\n<h2>No Consideration for Predictive Accuracy or Volume of Data Set<\/h2>\n<p>NPS does not employ any mechanism of considering the volume or number of data points when creating a calculation. So a data set with one 0 and one 10 is considered exactly equivalent to a data set with one million 0\u2019s and one million 10\u2019s. This is also a weakness shared with traditional averaging. The average of the above two data sets (one with two data points and the other with two million data points) will also come out to be exactly the same.<\/p>\n<figure><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/loopinput.com\/wp-content\/uploads\/2019\/11\/image4.png\" alt=\"\" width=\"511\" height=\"327\" \/><figcaption>With traditional averaging, the first product would be considered superior to the second<\/figcaption><\/figure>\n<p>What is the solution to this? The solution is to produce two scores, one an average that does not take into consideration the number of data points and the other a Wilson score which does weight the number of data points.<\/p>\n<p>If you want a more thorough explanation of Wilson scores here is a great blog on the topic: <a href=\"http:\/\/www.evanmiller.org\/how-not-to-sort-by-average-rating.html\">http:\/\/www.evanmiller.org\/how-not-to-sort-by-average-rating.html<\/a> but the jist of it is that a Wilson score takes into consideration the number of data points and predictive accuracy into consideration.<\/p>\n<p>But if using Wilson scores is too complicated for you, the solution is to just use averages.<\/p>\n<h2>In Defense of NPS<\/h2>\n<figure><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/loopinput.com\/wp-content\/uploads\/2019\/11\/bigknight-233x300.jpg\" alt=\"big knight\" width=\"233\" height=\"300\" \/><figcaption>An NPS defender<\/figcaption><\/figure>\n<h2>NPS is about net detractors and net promoters! It\u2019s about the intangibles!<\/h2>\n<p>Argument: The entire point of NPS is that it is about net promoters and net detractors. Net promoters will spread the good word about your product and net detractors will spread the bad word about your product. The net neutrals will say nothing so they have no influence!<\/p>\n<p>Response: This is predicated on the assumption that someone that rates your business or product a 6 will be just as much of a vocal detractor as someone that rates your business or product a 0. So instead of assuming that the amount that someone promotes or detracts your business is correlated linearly with their rating, you are putting people into an arbitrary bracket and assuming that those brackets map to peoples\u2019 behavior non-linearly with no evidence to back that fact.<\/p>\n<h2>NPS is easy<\/h2>\n<p>Argument: Everyone uses NPS because it is easy to calculate! None of this complicated Wilson score mumbo jumbo!<\/p>\n<p>Response: So is an average rating. When in doubt go with an average rating. NPS provides a poor return on complexity by both simultaneously adding complexity and butchering data sets. Calculating an average is both easier than NPS (you just add every rating together and divide by the number of ratings) and does not butcher the data set.<\/p>\n<h2>NPS is powerful because of the survey question. \u201cOn a scale of 0-10, how likely is it that you would recommend [product name] to your friends, family or business associates?\u201d<\/h2>\n<p>Argument: the true power of NPS is that you are calculating your viral coefficient! The survey question is the critical piece!<\/p>\n<p>Response: There is no reason that you can\u2019t ask this question and calculate your scores from the results in a different way. Just because you ask this specific survey question, you do not have to use NPS to calculate the results.<\/p>\n<p>Additionally, NPS survey question does not add anything that other loyalty-related questions cannot provide. According to a study by Hayes (<a href=\"https:\/\/businessoverbroadway.com\/wp-content\/uploads\/2011\/01\/QP_June_2008_True_Test_Of_Loyalty.pdf\">https:\/\/businessoverbroadway.com\/wp-content\/uploads\/2011\/01\/QP_June_2008_True_Test_Of_Loyalty.pdf<\/a>), there is no evidence that the &#8220;likelihood to recommend&#8221; question is a better predictor of business growth than other customer-loyalty questions (e.g., overall satisfaction, likelihood to purchase again) and the &#8220;likelihood to recommend&#8221; question does not measure anything different from other conventional loyalty-related questions.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Why I Hate NPS With a Passion Blog With Images The Tale of Two Products After purchasing a product on an online e-commerce store, customers are asked the typical NPS survey question \u201cOn a scale of 0-10, how likely is it that you would recommend [product name] to your friends, family or business associates?\u201d for [&hellip;]<\/p>\n","protected":false},"author":9783,"featured_media":1742,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[923],"tags":[2316],"class_list":["post-722","post","type-post","status-publish","format-standard","hentry","category-blog","tag-marketing"],"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/archive.blogs.harvard.edu\/loopinput\/wp-json\/wp\/v2\/posts\/722","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/archive.blogs.harvard.edu\/loopinput\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/archive.blogs.harvard.edu\/loopinput\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/loopinput\/wp-json\/wp\/v2\/users\/9783"}],"replies":[{"embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/loopinput\/wp-json\/wp\/v2\/comments?post=722"}],"version-history":[{"count":2,"href":"https:\/\/archive.blogs.harvard.edu\/loopinput\/wp-json\/wp\/v2\/posts\/722\/revisions"}],"predecessor-version":[{"id":1798,"href":"https:\/\/archive.blogs.harvard.edu\/loopinput\/wp-json\/wp\/v2\/posts\/722\/revisions\/1798"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/loopinput\/wp-json\/"}],"wp:attachment":[{"href":"https:\/\/archive.blogs.harvard.edu\/loopinput\/wp-json\/wp\/v2\/media?parent=722"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/loopinput\/wp-json\/wp\/v2\/categories?post=722"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/archive.blogs.harvard.edu\/loopinput\/wp-json\/wp\/v2\/tags?post=722"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}