Sports
 
Automated Sports Content: The Future of Sports Journalism?
March 23, 2010

UPDATE 2: Lots of press and blog coverage:

UPDATE 1: ReadWriteWeb has picked up on this post. Read here.

I started StatSheet back in 2007 in part to create tools to make it easier for sports bloggers and journalists to write great sports content.  Digging up links to players on ESPN.com, copy/pasting a boxscore, and taking screen captures of stats are not very efficient.  StatSheet.com is a research tool with embeddable charts, and now Embed StatSheet allows anyone to embed virtually any kind of stat or chart directly in content.

But it's time to go a step further.  When I launched StatFix in 2008, it was my first attempt to automate the creation of sports content.  I generate a new and interesting stat for every college basketball team (all 347 of them) every day.  Recently, a project called Stats Monkey has done something similar with baseball game recaps (although I can't find any sample content).

Now, I'm in the process of developing completely automated sports content, which will take the form of blogs.  I'm not talking just a "stat of the day" or game recaps, but a lot more. I've identified 21 different types of sports stories that can be automated.  You could say I'm trying to make the process of writing a sports blog so easy you don't have to do anything at all.  My goal for these blogs in version 1.0 is that at least 90% of the readers think the content was created by a human. One of the nice attributes of algorithmic content is that it can be improved over time.  A blogger/writer's internal script is pretty much set.  They generally don't change or improve the quality or comprehensiveness of their content over time in a significant way, but algorithms can be upgraded continuously. The "voice" of the content can be improved.

Will automated content be the end of sports journalism or blogging?  Definitely not.  There are always multiple sides to every story.  And of course there are some categories of news (e.g. breaking news stories) that can't be easily automated.  I'd like to see an algorithm write a story about Tiger Wood's infidelities!  However, it will take only a short amount of time to get to the point where the average sports fan can't tell the difference between human generated content and automated content.

Is automated content the future of sports journalism?  It will definitely play a big part.  Large sports media companies can't cover every team to the level they cover the most popular teams.  But algorithms don't care about the size of your market.  I can generate content that is just as in-depth for North Dakota State as I can Duke. Automated content will be a boon for smaller markets.

If blogging disrupted traditional sports media, I believe automated content will disrupt blogging (to some degree).  In effect, disrupting the disruptor. Stay tuned!

Posted by RobbieStats | Permalink | Comments |


Discuss this Blog Post

The StatBlog is a collection of articles about new things we are working on at StatSheet as well as insights into stats we find interesting.

More about StatSheet.com


Subscribe

 Via a reader Email Via Email


Writers


Categories


Archives


Have a suggestion?
Let us know