You’re reading Read Max, a twice-weekly newsletter that tries to explain the future to normal people. Read Max is supported entirely by paying subscribers. If you like it, find it useful, and want to support its mission, please upgrade to a paid subscription! Greetings from Read Max HQ! In today’s issue, responding to some good recent human-generated writing on A.I.-generated writing. A reminder: Read Max is a subscription newsletter whose continued existence depends on the support and generosity of paying readers. If you find the commentary at all enlightening, entertaining, or otherwise important to your weekly life, please consider upgrading to a paid subscription, an act which will allow you to enjoy Read Max’s weekly recommendations for overlooked books, movies, and music, and give you a sense of pride and accomplishment and supporting independent journalism, such as it is. Will A.I. writing ever be good?Sam Kriss has an excellent new piece in The New York Times Magazine examining the actual style of “A.I. voice.” It’s a great rundown of the many formal quirks of A.I.-generated text, and I’m glad that Kriss was able to appreciate the strangeness of even the ever-more-coherent writing produced by frontier large language models, which is “marked by a whole complex of frankly bizarre rhetorical features”:
Even as L.L.M.s get better at producing fluid and plausibly human text, these persistent stylistic tics remain interestingly abrasive--in a single short answer, presented to you in a vacuum, A.I. text is as smooth as can be, but when you’re confronted with an overwhelming amount of it, the strangeness that’s been fine-tuned out really begins to re-assert itself. Kriss argues (in part) that one reason A.I. writing remains so (in aggregate) weird and waffly is that L.L.M.s “can’t ever actually experience the world”:
But I wonder if it’s true that the lack of a “world model” is what pushes L.L.M. text toward metaphorical drivel: It seems just as likely that chatbots over-rely on this kind of sensory-immaterial conjunction because, as Kriss says, it’s a “cheap literary effect” that impresses people passing superficially over a text--exactly the kind of fake-deep crowd-pleaser for which L.L.M. output is being fine-tuned. These satisfyingly plausible folk-technical explanations come up often when people are trying to describe the limitations of A.I.-generated writing. One well-rehearsed account blames A.I.’s stylistically uninteresting output on next-token prediction: Large language models, this argument goes, intrinsically cannot generate truly great writing, or truly creative writing, because they’re always following paths of less resistance, and regurgitating the most familiar and most probable formulations. This is a satisfying argument, not least because it’s easily comprehensible, and for all we know it’s even a true one. But we don’t actually know that it’s right, because we’ve never really tried to make an L.L.M. that’s great at writing. I appreciated Nathan Lambert’s recent piece at Interconnects “Why AI writing is mid,” which argues that the main roadblocks to higher-quality writing are as much economic as technical: There simply isn’t enough demand for formally ambitious (or even particularly memorable) writing to be worth the expense or resources necessary to train a model to produce it.
As Lambert points out, much of what we dislike about A.I.-generated text from a formal perspective--it’s generally cautious, inoffensive, anodyne, predictable, neutral, unmemorable and goes down smooth--is a product not of some inherent L.L.M. “voice” but of the training and fine-tuning processes imposed by A.I. companies, which are incentivized to make their chatbots sound as annoying and bland as possible. No one is out there actually trying to create Joycebot (or whatever), and for good reason: The saga of Microsoft’s Bing and its “alter-ego,” Sydney, is in a broad sense the best fictional story yet produced by an L.L.M. chatbot, but it was also an unmitigated disaster for the company. To the extent that their output is pushed into “mid-ness” by economic circumstance, L.L.M.s are not unprecedented. In a real sense, “why A.I. is writing mid” and “why most professional writing is mid” have the same explanation: “Good writing,” whether authored wholly by humans or generated by an L.L.M., requires capacious resources (whether in time and education and editing or in compute and training and fine-tuning) to create an idiosyncratic (and likely polarizing) voice for which there usually isn’t economically sufficient demand.¹ I sometimes think that it’s more helpful to think about large language models as equivalent not to individual writers in the specific but to whole systems or institutions of which writing is an end-product. A given L.L.M. is less akin to, say, a replacement-level magazine writer than it is to “the entire magazine industry at its peak,” if you imagine the magazine industry as a giant, complex, unpredictable machine for producing a wide variety of texts. Just as that industry, as a whole, once was able to generate text to a certain degree of predictability and at a relatively high floor of quality, to varying client specifications and structured by its own internal systems and incentives, so too do Claude or ChatGPT.² I bring up magazines in particular as a point of comparison because I’ve been struck for a while at the similarity between the voice deployed in the latest generation of chatbots and what a friend calls “F.O.B. voice,” or the smooth, light, savvy, vaguely humorous tone that once reigned in magazine front-of-book sections:
“Mid,” as the magazine industry knew, and as L.L.M.s “know,” is a rewarding zone to be in: It’s what people find easiest to consume, and what advertisers feel most comfortable appearing adjacent. Of course, the magazine industry generated more than just reams and reams of smooth placeholder text; it also produced New Journalism, the modern short story, “Eichmann in Jerusalem,” the Hillary planet, etc. But these were positive externalities, not inevitabilities, driven more by cultural prerogatives than by financial necessity. To get something similar from an L.L.M. would likely require a lot of not-necessarily-profitable groundwork. 1 Another way of thinking about it might be: A parallel timeline where an A.I. was pumping out great novels would be an improvement to our own, because it’d suggest that there was enough demand for genuinely great novels to make it worth training an L.L.M. to do so. 2 Not to get too whatever about it, but it’s good to note that that magazine articles (or, even moreso, Hollywood movies) are the products of many humans operating within larger systems and frameworks. Do we think of those articles as “magazine industry-generated,” or major-studio movies as being “Hollywood-generated”? I’m not saying we should, necessarily, but I suspect that if and whenever A.I. is able to create great (or even non-slop) writing, we will come to think of it less as “A.I.-generated” and more as authored by the prompter, or the prompter in concert with the model creators at various levels. Invite your friends and earn rewardsIf you enjoy Read Max, share it with your friends and earn rewards when they subscribe. |

