Book Publishing & Technology: The ONIX 3 Debacle

November 5th, 2025

Beware: This post talks about metadata for books, and it talks about ONIX, the data format used to convey metadata for the retailing of books.

It would be impossible to recap the whole story of metadata for books here, except in the clumsiest of fashions, and so if this is a topic that mystifies, or that you disdain, move on. (Or, alternately, have a read of John Warren’s “Zen and the Art of Metadata Maintenance,” probably the best intro out there.)

A publisher literally cannot sell their books at retail, particularly online retail, without ONIX metadata. I mean literally. Actually, totally. Publishers cannot upload a book for sale to Amazon without ONIX metadata. Amazon will not accept the upload. And so ONIX metadata is not a nice-to-have, it’s a must-have. Most publishing personnel don’t want to know about metadata at all — they leave it to the techies, or the marketing people. But it’s essential. Nonfiction books don’t have to have an index any more, and often don’t. But they do need ONIX metadata.

picture of the Onix character in Pokemon

“Onix: It is a very aggressive Pokémon that will constantly attack humans”

What is this beast called ONIX? Think of it as just a form that publishers fill out for new books. At its most basic level ONIX metadata is just the basics about a book: title, subtitle, author, publishers, pub date, retail price, ISBN, BISAC codes, and so on.

(Parenthetical: The ONIX standard is maintained by EDItEUR, a UK-based membership organization, with participation from virtually the entire publishing industry worldwide.)

The first version of ONIX appeared in 2000. Version 2 was published in 2001, and revised in 2003. The latest version, ONIX 3, was published in April 2009, sixteen years ago.

Here’s the debacle:

ONIX 2.1 was ‘sunsetted’ (no longer actively supported) by EDItEUR at the end of 2014, nearly 12 years ago — it has not been revised or improved since. ONIX 3, meanwhile, is so much more robust than 2.1 ever was. There is all kinds of information about a book that you can communicate in ONIX 3 that could never be expressed in 2.1. Using the data contained in an ONIX 3 file, publishers can sell more books than they can using 2.1. If you already know how to use 2.1, ONIX 3 is not technically difficult to implement.

BISG used to publish a sort of shame list showing who was supporting each version of ONIX. The last copy of the list I’ve got is a decade old. At the time only 40% of respondents were using ONIX 3. I know it’s a much higher adoption now. But still…

Chart showing different ONIX timeline events

The ONIX Timeline

Last week the Book Industry Study Group (BISG) published a post, “Time to Act: The ONIX 3 Transition is Actually Here.” It’s written by Claire Holloway, the hard-working chairperson of BISG’s Metadata Committee. Claire is, as always, reasonable in her outlook and fully supportive of the publishing community. But the story she’s telling is, to me, chilling, in the way it reminds of ONIX’s slow journey across two-dozen years.

Apparently Amazon is finally going to force the publishing industry to transition to ONIX 3 — it has set a deadline of March 2026 for a full conversion.

But Claire points out also that the transition to ONIX 3 “isn’t just a technical upgrade—it’s becoming a legal necessity. The European Accessibility Act (EAA) requires metadata about an ebook’s accessibility features to be provided and communicated throughout the supply chain for any ebook being sold in the EU.”

So, in 2026, will the book publishing industry finally fully embrace a sixteen-year-old standard that can make their businesses more profitable? We’ll see.

——————————————————

But this post is really about how technology is adopted into the book publishing industry. I’m now looking at a different technology, far more powerful and profound than metadata enhancement. It can help publishers become more efficient and also to sell more books. AI’s potential impact is a hundred times more profound than moving from ONIX 2.1 to ONIX 3.

Yes, I’m very conscious of the reasons for disliking, no, even hating AI technology. I talk about these frequently, in my book, my presentations, and in my blog posts. But the universe isn’t waiting for the book publishing industry to forgive AI’s original sin, or to come to terms with the many threats posed by the technology.

Illustration of a bookish man screaming "I hate AI" while the Titanic sinks behind him.

Typically understated illustration, created by ChatGPT.

Last week Adobe released the top-line results of a survey over 16,000 “content creators” from around the world. Of course Adobe’s idea of a content creator is different from ours — Adobe discusses content “such as images, video, audio, and design.” But 76% of respondents report that AI has accelerated the growth of their business or follower base. 81% say AI helps them create content they otherwise couldn’t have made. 85% believe AI has positively impacted the creator economy.

Several weeks ago I shared an early look at an income survey commissioned by ALLi of the self-published author community. They added a few questions on AI use, and found that over 25% of reporting authors were using AI tools, and nearly 75% of those report that the technology is having either some, or a strong positive impact on their income or productivity.

A survey from Gotham Ghostwriters and Josh Bernoff, released yesterday, featured surprising top-line numbers, such as “a surprising 61% use AI tools.” But if you focus just on the book writer responses, the numbers were in the 25% range (23% of fiction writers reported daily use of AI tools), consistent with the ALLi survey numbers.

Meanwhile, book publishing, steeped in tradition and trade practices, remains as technology-averse as ever. A BISG survey from last summer revealed lots of dabbling with AI tools (45%), but less than a qurter said that they are incorporating AI into existing workflows.

I went through the jobs classifieds on Publishers Marketplace and PW this morning and found nary a mention of AI skills. The marketing manager of one of the largest publishing imprints in the U.S. requires experience “in creating decks and presentations and presenting in front of large groups” but no proficiency with AI tools. The most demanding technical qualifications for an advertising associate at one of the Big 5 publishers is “Microsoft Excel, Word, PowerPoint, and Outlook.”

Job posting for a publicist at Celadon Books (Macmillan)

First time!

A senior publicist role at a Big 5 imprint requires… news flash: “Understanding of AI models to be used for research and development of campaigns as well as a curiosity around new uses of AI for workflow optimization is a plus.” That’s the first time I’ve seen AI mentioned in a publishing industry job post!

A couple of recent stats feature bleak vistas of the future of book publishing. After noting a 4.1% sales increase in 2024, the Association of American Publishers (AAP) reported sales in the publishing industry down 4.4% for the month of August, and down 2.8% year-to-date.

More troubling is the new study from the University of Florida and University College London, which found that daily reading for pleasure in the U.S. is “in free fall,” having declined by more than 40% over the last 20 years. Without readers, where is publishing?

——————————————————

As I was completing this draft I found a link to a just-published article by Ted Underwood about AI adoption in education, “AI is the Future. Higher Ed Should Shape It.” (Underwood shared a link to the article.) The article might be summed up as an invitation for educators to work with AI, not against it. AI is “a factor in education whether it finds a place in the curriculum or not.”

I’d invite you to read the article, swapping out ‘colleges and universities’ with “book publishers” and ‘education’ with “publishing.” You’ll end up with sentences that like this:

But adoption of AI in publishing has mostly not been driven by explicit partnerships between book publishers and AI companies. Rather, individual publishing staff have quietly started to use it in their work. To reject AI, publishers would have to prohibit staff from using it…. That sort of outright prohibition is so implausible that I’m not sure it’s worth debating….

If publishers want to stay at the forefront of knowledge production, we have no choice but to take responsibility for shaping AI and fitting it to our needs. If we don’t play that role, we will effectively cede intellectual leadership to the organizations that do.

Will the management of publishing companies figure out ways to harness the power of AI, while mitigating its drawbacks? There are some positive early indicators. But publishing’s historic discomfort with technology seems likely to block the industry from robustly integrating AI into current workflows. The implications are concerning.