(This post provides the content for a presentation I recently gave as part of O’Reilly Media’s “Tools of Change in Publishing” conference. It builds on a talk I initially gave last October at the Internet Archive’s “Books in Browsers” conference. A screencast that includes the presentation visuals has been posted on Vimeo. It runs about 23 minutes).
For the last couple of years I’ve been writing about a set of publishing topics – piracy, disruptive innovation, print on demand, workflow and content strategy, among others – that I started to think were connected by a common theme.
I first called that theme “a unified field theory of publishing”, more than a mouthful, but I think “context first” is a better and more helpful description. In that spirit, my talk today addresses the damage done by what I call the “container model of publishing”.
My idea in a nutshell is this: book, magazine and newspaper publishing is unduly governed by the physical containers we have used for centuries to transmit information. Those containers define content in two dimensions, necessarily ignoring that which cannot or does not fit.
Worse, the process of filling the container strips out context – the critical admixture of tagged content, research, footnoted links, sources, audio and video background, even good old title-level metadata – that is a luxury in the physical world, but a critical asset in digital ones. In our evolving, networked world – the world of “books in browsers” – we are no longer selling content, or at least not content alone. We compete on context.
I propose today that the current workflow hierarchy – container first, limiting content and context – is already outdated. To compete digitally, we must start with context and preserve its connection to content.
We need to think about containers as an option, not the starting point. Further, we must start to open up access, making it possible for readers to discover and consume our content within and across digital realms.
Without a shift in mindset, we are vulnerable to a range of current and future disruptive entrants. Containers limit how we think about our audiences. In stripping context, they also limit how audiences find our content.
Here, scale is not our friend. It may well be the enemy. As Clay Christensen first outlined in 1997, disruptive technologies don’t look or feel like what we typically value. Often enough, they are cheaper, simpler, smaller and more convenient than their traditional analogues.
Already, smaller, more nimble digital upstarts have reversed the paradigm. They start with context, vital to digital discoverability and trial, and use it to strengthen content. Many startups forego containers, or they create them only as a rendering of personal (consumer) preference.
Think Craiglist. Think Monster. Think Cookstr, a born-digital food site that started with and continues to evolve its taxonomy. Context first.
As barriers to entry have fallen, I’ve started to think more about how traditional book, magazine and newspaper publishers can survive in a digital era. There are both new and non-traditional established entrants across most publishing segments. Their successes have pushed traditional publishers to look at ways to change business models and organize around customers.
It is time to see our publishing brethren – newspapers and magazines – as part of a disrupted continuum that affects us all. Digital makes convergence not only possible; digital has made convergence inevitable. Marketers have become publishers; publishers are marketing arms; new entrants are a bit of both. Customers have become alternately competitors, partners and suppliers.
As I prepared this talk, I was reminded of a passage from Salman Rushdie’s 1990 book, Haroun and the Sea of Stories.
In the book, Haroun sets off to find stories for his father, who has lost his ability to tell tales. Along the way, Haroun comes across Iff, the Water Genie, who at first does not treat Haroun kindly. But at a low point, the Water Genie relents and starts to tell Haroun …
“… about the Ocean of the Streams of Story, and even though he was full of a sense of hopelessness and failure, the magic of the Ocean began to have an effect on Haroun. He looked into the water and saw that it was made up of a thousand thousand thousand and one different currents, each one a different color, weaving in and out of one another like a liquid tapestry of breathtaking complexity …”
I’ll stop there. We’ll return to this story in a bit, but for the moment I’d like to use it as a jumping-off point, a call for us to:
Imagine a world in which content authoring and editing tools are cheap, or even free.
Imagine a world in which storage is plentiful, even virtual.
And imagine a world in which content can be disseminated in a range of formats, at the figurative or literal push of a button.
That world exists today, with literally dozens of credible, widely accessible tools and resources. These authoring, repository and distribution tools and resources make it possible for anyone to create, manage and disseminate digital as well as physical content.
The thing is, while that world is already here, it is far from evenly distributed.
The typical winners are in the upper right – genres, like cooking, that have many components, or “chunks”, and a higher probably of being recombined or reused.
Our problem is, we’re not the only ones looking at these markets.
While publishers think of agile workflows as an opportunity to drive down the cost of making content for containers, a newer breed of “born-digital” competitors have started with context. These new entrants are developing taxonomies and tools so that they can invade the same niches we thought we were making more efficient.
The challenge is not just being digital; it’s being demonstrably relevant to the audiences who now turn first to digital to find content.
New entrants – our real competition – start with the customer. They develop contextual frameworks that help them differentiate both readers and themselves. The new guys like the new tools because they are cheap, scalable and open-source. In fact, they are already exploiting tools that many traditional publishers lament are “just too hard to learn”.
How did we get here? There’s a reason.
In their physical forms, newspapers, magazines and books establish the boundaries of both content and context. Historically focused on containers, we have become stuck using them as the primary source for digital content.
Only after we fill the physical container do we turn our attention to rebuilding the digital roots of content: the context, including tags, links, research and unpublished material, that can get lost on the cutting-room floor.
Most of that context never makes it back. We have taken to using things like title-level metadata, some search engine optimization and occasionally effective use of syndication as proxies for something contextually rich.
Competing as we are against the “born-digital”, that’s not nearly enough.
Further, we treat readers as if their needs can be defined by containers. But in a digital world, search takes place before physical sampling, much more often than the reverse. Readers may at times look for a specific product, but more often they search for an answer, a solution, a spark that turns into an interest and perhaps a purchase.
Publishers are in the business of linking content to markets, but we’re hamstrung at search because we’ve made context the last thing we think about.
When content scarcity was the norm, we could live with a minimum of context. In a limited market, our editors became skilled in making decisions about what would be published. Now, in an era of abundance, editors have inherited a new and fundamentally different role: figuring out how “what is published” will be discovered.
To serve that new role, we must reverse our publishing paradigm. We need to start with context and develop and maintain rich, linked, digital content.
We also need to use the tools we have (as well as ones we have yet to develop) to make containers an output of digital workflows, not the source of content in those workflows. This is a fundamental change in our approach, but it is the only way that I see to compete in a digital-first, content-abundant universe.
And I don’t think that this change in mindset (or workflow) will come easily.
Over time, we have adopted a series of mental models that constrain our ability to change. The long history of using physical containers to distribute content, for example, has led us to conflate “format” with “brand”.
Perhaps there was a time when the physical nature of content products – their look and feel – dominated. But in a digital era, I think that its time has passed.
In a similar way, we often speak of digital content as a derived or secondary use. The recent debate about e-book rights underscores how deeply this bias runs. Who “owns” e-book rights is a different topic, but the Open Road and Wylie dust-ups were telling for the question that was not asked: who owns the context that drives discoverability, use and value in a digital realm?
In a digital era, context supports discoverability, use and re-use. Investing in context is now a requirement.
Unfortunately, our product focus and an obsession with scale lead us to worry more about finding ways to reduce costs. We think of making the physical object incrementally better, optimizing the creation, production and delivery of content in a single package.
Along the way, we miss opportunities to create agile, discoverable and accessible content.
I call this situation “container myopia”, paying homage to Ted Levitt’s 1960 article, “Marketing myopia”. In the article, Levitt called on marketers to shift from a product-centered to a customer-centered paradigm. He famously showed how railroad companies failed to see that they were in the transportation business, much as publishers have struggled to see that they are in the content solutions business.
In a digital realm, true content solutions are increasingly built with open APIs, something containers are pretty bad at. APIs – application programming interfaces – provide users with a roadmap that lets them customize their content consumption.
The physical forms of books, magazines and newspapers have analog forms of APIs. We’ve all figured out how to access the information contained in these physical products. But, the physical form itself does not always make for a good API, something that Craigslist, the Huffington Post, Cookstr and others have capitalized on.
Open up your API, I contend, or someone else will.
Many current audiences (and all future ones) live in an open and accessible environment. They expect to be able to look under the hood, mix and match chunks of content and create, seamlessly, something of their own. Failure to meet those needs will result in obscurity, at best.
To illustrate that point, I want to bring you to perhaps the most hierarchical, inaccessible, closed environment I know of: an American public high school. In particular, I’d like to take you to Columbia High School in Maplewood, New Jersey, where our youngest son, Charlie, is now a junior. The school opened in 1927, and it has not changed much since then.
Last summer, Charlie learned (happily) that he had earned a 5 on the AP Art History exam. This made him eligible to serve as a sort of teaching assistant for this year’s Art History class. All he needed to do was align his free period with the scheduled slot for Art History.
I don’t know how many of you have tried to parse a high-school scheduling API. It seems to rely on green-screen devices, stacks of forms and a queuing process that means you won’t have your new schedule in hand until two weeks after the start of the school year.
On a Friday in July, Charlie came home to find his junior-year schedule in the mail. His free period did NOT align. Charlie has seen his brother and sister fight the powers that be at Columbia High School, at times unsuccessfully, and he decided to pursue a different course.
Lacking access to the master schedule, he went to a free resource – Facebook – posted his schedule there and asked anyone who attended Columbia High School to do the same.
By Sunday morning, he had gathered enough data to compile his own master schedule. With this information in hand, he rearranged his classes, filled out a home-made “change form” and sent it to the high school on Monday morning. “Please give me this schedule”, it said. Problem solved.
Stories like this one, as well as everything Kirk Biglione says about DRM, have led me to see piracy as the consequence of a bad API. 16 years olds expect access, or they invent it. The future of content involves giving readers access to the rules, tools and opportunities of contextually rich content, so that they can engage with it on their own terms.
And whether they say it just like this or not, readers WANT good APIs.
Content is no longer just a product. It’s part of a value chain that solves readers’ problems.
Readers expect publishers to point them to the outcomes or answers they want, where and when they want them. We’re interested in content solutions that don’t waste our time, a precious commodity for all of us.
Perhaps most daunting: readers expect that their content solutions will improve over time. They don’t care that much (or at all) about how it happens.
Companies that are good at aggregating solutions will reduce the time and hassle involved in finding and buying something. Those firms have a leg up on their competitors.
Drawn from the prescient “lean consumption” model that James Womack and Daniel Jones debuted half a decade ago, these ideas are evident in aggregators like Amazon. They’re embodied in services like Kobo and Kindle. They’re not just products; they’re solutions.
So, if containers are now an option, and content must be made accessible, what is the role of context?
First, let’s establish a context of our own: Freed from physical constraints, we no longer have to write to length. We can link; we can expand; we can annotate.
As low- or no-cost authoring, repository and distribution tools and resources become freely available, it is axiomatic that ours has become and will remain an era of content abundance.
Simply: content abundance is the precursor to the development (and maintenance) of context.
When there was only the Gutenberg Bible, we didn’t need Dewey. When booksellers were smaller and largely independent, we didn’t have much need for BISAC codes. And before online sales made almost every book in print evident and available, ONIX was an unattended luxury.
Digital abundance is pushing us to create much more than title-level metadata. To manage abundance, we can (and do) use blunt instruments, like verticals, or somewhat more elegant tools, like search engines.
But when it comes to discovery, access and utility, nothing substitutes for authorial and editorial judgment, as evidenced in the structural and contextual tags applied to our content.
Context can’t be just a preference or an afterthought any more. Early and deep tagging is a search reality. In structural terms, our content fits search conventions, or it will not be referenced.
And in contextual terms, our content needs to be deeply and consistently tagged, or it will face an increasingly tough time being found.
We can’t afford to build context into content after the fact. Doing so irrevocably truncates the deep relationships that authors and editors create and often maintain until the day, hour or minute that containers render them impotent. Building back those lost links is redundant, expensive and ultimately incomplete.
This isn’t a problem of standards. At Indiana University, Jenn Riley and Devin Becker have vividly illustrated our abundance of contextual frameworks. The problem we face, the one we avoid at our peril, is implementing these standards.
Ultimately, that’s a function of workflow.
If strategy is a head, I liken workflow to a circulatory system. We all know how hard it can be to change organizational direction, but in practice, it’s a matter of coordination. Decide you want to go somewhere else, and your head tells your arms and legs to swing one way or another.
If you want to change workflow, though, you are looking at the publishing equivalent of a heart transplant. And starting with context requires publishers to make a fundamental change in their content workflows.
At a time when we struggle to create something as simple as a clean ONIX feed, planning for and preserving connections to content is a challenge of significant proportion. And we don’t have much time to get this new challenge right.
Although the precise changes in workflow will vary by publisher, certain principles apply. I think moving from a mindset of “product” to “service” or “solutions” means at least four things for publishers:
- Our content must become open, accessible and interoperable. Adherence to standards will not be an option;
- Because we compete on context, we’ll need to focus more clearly on using it to promote discovery;
- Because we’re competing with businesses that already use low- and no-cost tools, trying to beat them on the cost of content is a losing proposition. We need to develop opportunities that encourage broader use of our content; and
- We will distinguish ourselves if we can provide readers with tools that draw upon context to help them manage abundance.
Clearly, we’ll need new skill sets to compete in an era of abundance. We’ll probably have to add a lot more training than we have ever done internally. But those aren’t the toughest challenges. Changing workflow is.
I want to leave you on a stronger, happier note than that, though. Change can be hard, and we all need reasons to try something different or new. A short while ago, I asked you to leave Haroun and join me in a leap of imagination.
I’d like to travel back to the Sea of Stories, where the Water Genie is explaining to Haroun that …
“… these were the Streams of Story, and that each colored strand represented and contained a single tale. Different parts of the Ocean contained different sorts of stories, and as all the stories that had ever been told and many that were still being invented could be found here, the Ocean of the Streams of Story was in fact the biggest library in the universe. And because the stories were held here in fluid form, they retained the ability to change, to become new versions of themselves, to join up with other stories and so become yet other stories; so that unlike a library of books, the Ocean of the Streams of Story was much more than a storeroom of yarns. It was not dead, but alive.”
Like Haroun, we in publishing can sometimes become filled with a sense of hopelessness and failure.
And like Haroun, we’re perched atop a tapestry of breathtaking complexity. It is a time of remarkable opportunity in publishing, one in which we are able to find and build upon those strands of stories, in context.
Yes, we face a significant challenge preparing for a very different world, but it is a challenge I think we have the insight and experience to meet. What we choose to do now will begin to determine which stories get told, as well as who writes – and publishes – them.
With that, I’ll close my tag. While this story is ending, I hope that it spurs some stories of your own. As it does, I ask that you all think more about context, and that you continue to imagine.
I appreciate the opportunities that the Internet Archive and O’Reilly Media provided in its development. I also appreciate the feedback and direction offered by a number of colleagues and friends, as well as the presentation design created for this talk by our son, Frank O’Leary.
(With special thanks to Peter Brantley, Kirk Biglione, Laura Dawson, Kassia Krozser, Don Linn and Hugh McGuire for their feedback on various drafts of this presentation, as well as Frank O’Leary for his excellent work preparing a visual story to accompany these remarks.)