Reinventing the Newsroom

Greg Harmon on Web Traffic Numbers

Posted in Branding, Communities, Paid Content by reinventingthenewsroom on October 15, 2009

My post earlier this week led to an interesting phone conversation with Greg Harmon of Belden Interactive. We discussed the apparent disconnect between publishers and audiences shown in the ITZ Belden presentation given at last month’s API confab, but mostly we talked about Web traffic numbers.

I’m math-challenged, so Harmon was leading me into deeper waters than I’d like, but I think I followed him without getting too lost. His contention is that newspapers’ counts of unique visitors are drastically inflated by measurement errors. That’s bad news for publishers who like big numbers when discussing traffic and reach — but Harmon sees it as good news for advertisers and folks trying to figure out how to turn online newspapers into a business. First off, they need reliable data. Second, there’s a silver lining to the measurement errors: Harmon thinks they substantially undercount visits per user, meaning those smaller audiences are much more engaged and valuable than papers have been led to believe.

Harmon says his efforts to get better traffic numbers date back five years or so, to work he did for Lee Enterprises. Lee was getting numbers indicating there were 400,000 monthly unique visitors to the Billings (Mont.) Gazette, which seemed odd given that the print Gazette’s circulation was around 50,000 and its total readership was probably 80,000. Was non-local readership really that high?

Harmon — with help from Alan Hubbard, a professor of biostatistics at Berkeley — took apart that data. The page-view and visit counts seemed sound, so they analyzed the numbers based on what they knew about how many page views an average visit generates — a figure that’s fairly standard across newspaper sites, though it differs between weekdays and weekends. Running those numbers yielded an estimate of average visitors per day that they thought was more accurate, and from which they could extrapolate. To discover where site visitors were coming from, they surveyed them through Web-site interstitials, which let them categorize readers by how often they visited and whether or not they were local. Harmon’s conclusions: The Gazette’s average monthly traffic was really around 40,000, and half of that came from Billings.
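
For the similarly math-challenged, here’s a rough Python sketch of that estimation step. All of the inputs are hypothetical stand-ins rather than Belden’s actual figures; the mechanics are what matter: divide measured page views by a blended pages-per-visit rate to back out visits, then average over the month.

```python
# Back-of-the-envelope version of the visits estimate described above.
# All inputs are hypothetical; the real Gazette figures aren't public.

monthly_page_views = 3_000_000        # from the site's logs (assumed)

# Pages generated by an average visit. Harmon notes this rate is fairly
# standard across newspaper sites but differs on weekdays vs. weekends;
# both values here are assumptions.
pages_per_weekday_visit = 4.0
pages_per_weekend_visit = 3.0
weekdays, weekend_days = 22, 8        # days in a typical month

# Blend the two rates by their share of days in the month.
avg_pages_per_visit = (
    pages_per_weekday_visit * weekdays + pages_per_weekend_visit * weekend_days
) / (weekdays + weekend_days)

estimated_monthly_visits = monthly_page_views / avg_pages_per_visit
estimated_visits_per_day = estimated_monthly_visits / (weekdays + weekend_days)

print(f"Estimated visits per day: {estimated_visits_per_day:,.0f}")
```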

Harmon said the news brought cheers. Why? Because while 400,000 monthly unique visitors was impressive, it was useless for selling ads: It made the print salespeople’s 80,000 readers look puny, and stuck the Web salespeople with a number that advertisers doubted and that they couldn’t explain. The new numbers, he says, let the Gazette present a unified marketing message, and gave potential Web advertisers a realistic number to consider when thinking about their potential return on investment.

But why were Belden’s numbers so different? Harmon looked through data from some 60 sites Belden Interactive had worked with, and used surveys to focus on potential factors. Site visitors were asked how many computers they’d used to access a site in a given month. The average answer was two, but that wasn’t enough to account for the discrepancy.

Harmon’s suspicions fell on cookies. Were site visitors clearing them? He suspected so, even though newspaper executives and techies disagreed, arguing that people didn’t habitually clear cookies and most had never heard of them. (I was skeptical myself. And the big audience-measurement firms would say that cookies don’t affect their metrics.) Harmon says when questions about cookies were added to Belden’s surveys, about 50% of respondents had indeed never heard of them. But 25% of respondents said they cleared cookies weekly, and 10% said they did so daily.

Such small percentages can drive big errors, he explains. Say you have 1,000 readers who visit a paper’s Web site once per day. According to Harmon’s survey results, 500 of them have never heard of cookies, and so will be recorded as 500 monthly visitors. Another 150 have heard of cookies but don’t mess with them. They count as 150 monthly visitors. But 250 visitors clear their cookies weekly — and so they show up as 1,000 monthly visitors. Then there are 100 visitors who clear their cookies daily. They show up as 3,000 monthly visitors. Add them together, and 1,000 monthly visitors have become 4,650, even though two-thirds of the audience never messes with cookies.
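
To make that arithmetic easy to check, here’s the same example as a few lines of Python, using the survey percentages above. The mechanism: every time a cookie is cleared, the next visit mints a fresh cookie, and each fresh cookie gets counted as a brand-new monthly unique.

```python
# Harmon's example: 1,000 daily readers, split by their cookie habits.
# (segment label, number of people, cookies each generates per month)
segments = [
    ("never heard of cookies", 500, 1),     # counted once
    ("know of them, don't clear", 150, 1),  # counted once
    ("clear weekly", 250, 4),               # ~4 fresh cookies a month
    ("clear daily", 100, 30),               # ~30 fresh cookies a month
]

apparent_uniques = sum(people * cookies for _, people, cookies in segments)
real_audience = sum(people for _, people, _ in segments)

print(real_audience, "real readers look like", apparent_uniques, "uniques")
# 1000 real readers look like 4650 uniques
```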

Harmon says he sees evidence of this in the relationship between page views and unique visitors reported by papers of all sizes: Figures from paper after paper show that each unique visitor generates around 2.5 visits — a measure that’s stayed the same for years, and that’s bedeviled papers that think readers no longer engage with them. But Harmon thinks that figure is a product of faulty measurements. He says papers’ loyal readers report they visit around 40 times a month, a figure that should make papers feel better about their connection with the audience. But if those readers visit from two computers, their monthly visits will be recorded as 20, not 40. And if they clear cookies once a week, the number of apparent visits drops to five. The effect of all this? Papers think they have many more unique visitors than they do — and engaged, repeat visitors look like multiple drive-by readers. Harmon thinks that’s a fundamental misunderstanding that has, in turn, distorted the advertising market — big, amorphous audiences are of interest to Madison Avenue, not local businesses. And when page views get commoditized, CPMs go down.
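
The engagement side of the argument runs on the same arithmetic, just in reverse. A sketch, again using Harmon’s illustrative numbers:

```python
# A loyal reader who really visits 40 times a month...
real_monthly_visits = 40

# ...splits those visits across two computers, so each machine's cookie
# records only 20 of them...
visits_per_machine = real_monthly_visits / 2      # 20

# ...and weekly cookie-clearing splits each machine's history into ~4
# short-lived cookies, so every apparent "unique" shows about 5 visits.
apparent_visits_per_unique = visits_per_machine / 4

print(apparent_visits_per_unique)  # 5.0 -- one devoted reader looks like
                                   # eight casual drive-bys
```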

Harmon says that publishers haven’t wanted to believe him, but now that the industry is looking aggressively at paid models, his argument is getting a new hearing.

“It’s kind of a beautiful moment for us,” he says.

In the wake of the API panel, Alan Mutter discussed why publishers need real numbers as they consider for-pay plans — a plan that assumes 15% of visitors will pay for news is going to end rather badly if a large proportion of those visitors don’t actually exist. As Harmon puts it, “using incorrect information is actively dangerous to your enterprise.” (The comments on Mutter’s post also introduced me to the term scalejacking — marketers’ obsession with big audience numbers and exposure at the expense of small audience numbers and engagement — more from Seth Godin here.)
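
Mutter’s point reduces to simple arithmetic. Here’s a sketch with made-up figures; the audience gap echoes the Billings example above, and none of these are Mutter’s or Harmon’s actual projections:

```python
# Hypothetical paid-content projection; the audience figures echo the
# Billings Gazette example above but are illustrative, not real data.
conversion_rate = 0.15            # assume 15% of visitors will pay

reported_uniques = 400_000        # cookie-inflated monthly count
real_uniques = 40_000             # corrected, Harmon-style count

print("Projected subscribers:", int(conversion_rate * reported_uniques))  # 60000
print("Plausible subscribers:", int(conversion_rate * real_uniques))      # 6000
```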

Harmon is inclined to agree that engagement matters more than sheer exposure. His research tells him the real audience is much smaller. But those readers also visit more frequently, are far more engaged, and are more local than they appear when diffused amid a sea of phantoms.

“We have all been enamoured of the big number, and we’ve always suspected it was wrong,” Harmon says, adding that “as long as all we were doing was selling commodity page views, it really didn’t matter. It was a shiny gold badge. But if we really want to go paid, the shiny gold badge doesn’t work anymore.”

8 Responses

  1. […] Greg Harmon on Web Traffic Numbers « Reinventing the Newsroom reinventingthenewsroom.wordpress.com/2009/10/15/harmon_on_traffic_numbers […]

  2. Perry Gaskill said, on October 18, 2009 at 11:36 am

    Thanks for the followup. Also a size-large attaboy to Greg Harmon for expanding on the original ITZ-Belden presentation. For example, the 40,000 UV count with 20,000 local from Billings, Montana (~90,000 population) makes a lot more sense and is roughly in line with NAA-Nielsen numbers, which seem to indicate UVs of 20-25 percent in the aggregate among metro dailies.

    For the ink-stained wretches among us who come from the news side, crunching these kinds of numbers tends to prompt a thousand-yard stare, but the local UVs are important to the business side of things if you consider that there seem to be two divergent directions the business of journalism is taking as it transits online. One direction is that the media “play” should be targeted at a global audience, where the scale dictates that what counts most is sheer page hits. The New York Times, for example, in its effort to maintain its position as a national newspaper of record, falls into this category, which, for want of a better term, can be called the top-down model.

    At the other end of the spectrum are local efforts such as the West Seattle Blog, Voice of San Diego, or Howard Owens’ Batavian, which are coming at things from a bottom-up model. And one of the bigger problems faced by smaller news orgs now spooling up is that without accurate anticipated visitor numbers it’s difficult to gauge the threshold at which a community’s size can support an initially adequate yet growable and sustainable effort. Is there a potential business model for small local operations? Maybe, if you consider a couple of somewhat rhetorical questions.

    Here’s the first question: Why would a Billings, Montana hardware store owner expect to sell a hammer to a kid in Bulgaria?

    And here’s another one: Is it better to sell ads against 200,000 page hits at a $.50 CPM, or is it better to sell against 20,000 localized page hits at a $20 CPM?

    It also seems to me that one of the larger points of confusion about local traffic is that current analysis tools tend to be geared toward the top-down business model. It shouldn’t be that difficult, for example, to use a combination of server logs with IP addresses, cookies, and required-for-story-comments registered logins to get some fairly accurate numbers on locals. And those numbers don’t have to be perfect; they just need to be in the ballpark. They also need to be simple enough for a local merchant to understand without a 30-minute eye-glazing technical explanation. Having such a localized analytical tool also raises a couple of other potentials:

    The first is that it gives local merchants a means to budget for an ad spend on a regular basis. Local content is also probably going to have fewer traffic spikes, with the exception of the rare man-bites-dog story.

    Yet another potential is that although there are proponents of the idea that citizen journalism is the future of news content, the reality is that it has never really gotten much traction. And one reason is that writers need to be paid.

    In a story that ran in Forbes late last month, tech entrepreneur Neil Senturia said he is launching U.S. Local News Network, which would attempt to bring yet another national-based model into the local sphere.

    “Our business model is Starbucks, not Pizza Hut,” Senturia is quoted as saying.

    What was also interesting was that Senturia said he would be eliminating a previous “goofball” writer payment model in which “reporters were paid with stipends plus an 80 percent cut of the ad revenue generated by the pages on which their stories appear.”

    Forbes never asked Senturia why it was a goofball idea, and you’re left to wonder whether the payment model was fundamentally flawed or whether the problem was something simpler, such as the rate being too high.

  3. reinventingthenewsroom said, on October 19, 2009 at 4:17 pm

    That’s all well said, Perry, thank you for it.

    And glad you appreciated the followup. Took a while to come together because I was in deep water mathematically, but I hope it added something to the larger discussion.

  4. […] loyal users who want the kind of journalism in which Slate specializes. That took me back to my recent discussion with Greg Harmon about traffic numbers and how newspaper audiences may be a lot smaller than […]

  5. […] Jason Fry does a great job breaking down the numbers with Greg Harmon in this piece on Reinventing the Newsroom. Harmon’s analysis examines just how unique those visitors truly are and how a relatively small portion of users who delete their browser cookies can wreak havoc on the accuracy of the number. […]

  6. […] few over the empty many. (See also Slate’s David Plotz on core readers vs. drive-bys, and my own conversation about traffic stats with Greg […]

  7. […] Jason Fry does a great job breaking down the numbers with Greg Harmon in this piece at Reinventing the Newsroom. Harmon’s analysis examines just how unique those visitors truly are and how a relatively small portion of users who delete their browser cookies can wreak havoc on the accuracy of the number. […]

  8. […] goes right back to my conversation in the fall with Belden Interactive’s Greg Harmon. The first place Harmon tried to make sense […]

