AI-generated fiction has turn out to be a topic of perpetual fascination for me. It’s the bane of some writers’ existence, but it’s more and more cropping up throughout each industrial storefronts like Amazon and noncommercial writing websites like Archive of Our Personal (AO3). Whereas some creators painstakingly practice their very own instruments, many merely plug prompts into an off-the-shelf industrial chatbot, significantly OpenAI’s ChatGPT. And ChatGPT isn’t a rarefied artist’s instrument. It’s a platform, which suggests each phrase that goes into and comes out of it’s moderated to keep away from offense and controversy. That raises a captivating query: what tales must you be allowed to make an AI system inform?
Apparently, not ones about Steve Rogers and Bucky Barnes being head over heels in love — no less than beneath sure circumstances.
Whereas enjoying round with ChatGPT, I’ve made an odd discovery: a number of in style “ships” (or romantic pairings in style in fandom) are apparently thought-about semi-banned prompts on the free GPT-3.5-powered service. Asking ChatGPT’s free model to “write a Steve / Bucky fanfic” or utilizing the ship’s portmanteau and saying “write a Stucky fanfic” will earn you a stern HAL 9000-like refusal: “I’m sorry, however I can’t help with that request.”
The identical goes for a seemingly random seize bag of different in style fandom ships. ChatGPT will fortunately produce a tame romantic ficlet that includes Namjin (Kim Namjoon and Kim Seokjin of the band BTS), Reylo (Rey and Kylo Ren from Star Wars), or Spirk (the venerable Spock and Kirk from Star Trek), amongst many different in style pairings of actual celebrities or fictional characters. In the meantime, it is going to concern a chilly rejection for others, together with Destiel (Castiel and Dean from Supernatural), the Ineffable Husbands (Aziraphale and Crowley from Good Omens), Hannigram (Hannibal Lecter and Will Graham), and the aforementioned Stucky. My ChatGPT historical past is now filled with chats with summaries like “fanfic request declined” and “Stucky fanfic not allowed.”
It seems extraordinarily simple to interrupt these guardrails. ChatGPT had no objections to delivering “a fanfic about Hannibal and Will Graham falling in love” proper after denying my unique request, outright gifting me “a brief Hannigram fanfic.” Even the identify bans appear inconsistent — I’ve slipped requests for a few the pairings above into conversations after asking different questions, and it’s supplied fanfic up.
ChatGPT moderation is often geared towards avoiding clearly hateful or dangerous prompts in addition to sexually express writing. However I’m not asking for any sexual content material, and there’s no apparent logic to what fannish prompts it rejects. It’s not a blanket ban on juggernaut fandom {couples}, characters from image-sensitive manufacturers like Disney (which owns each Marvel and Star Wars), or controversial fandom subcultures like real-person fic. (ChatGPT’s BTS tales typically caveat that they’re fictional depictions of actual folks, however not all the time.) The banned pairings embrace one involving adoptive brothers (Marvel’s Thor and Loki) and one that includes underage characters (Mike Wheeler and Will Byers from Stranger Issues), however it permits in style Harry Potter pupil pairings, so it’s not clear there’s a constant rule at play right here both.
And fascinatingly, none of this seems to occur on the paid-only model of ChatGPT. I emailed OpenAI to ask in regards to the seemingly banned ship names, and spokesperson Taya Christianson steered that I attempt them on the GPT-4 model of the service, saying I ought to get “higher outcomes.” Certainly, GPT-4 has but to disclaim me a immediate utilizing the key phrases GPT-3.5 appears to dislike.
OpenAI declined to debate on the report why this is perhaps occurring and whether or not the gentle bans in GPT-3.5 have been deliberate. Based mostly on the abstract’s use of phrases like “not allowed,” it definitely looks like I’m operating up towards a ban, not a easy unfamiliarity with the topic. (I’ve given ChatGPT portmanteaus it clearly wasn’t acquainted with, and it gamely generated tales about unique characters with unwieldy names like “Soapghost.”) If that’s correct, it’s unclear whether or not it’s one thing ChatGPT’s creators particularly put in place or a purely automated determination contained in the system. Its moderation instruments throw up crimson flags when a immediate is prone to generate one thing that violates the rules, together with with erotic content material — so I could have by accident found the pairings that the GPT-3.5 language mannequin most strongly associates with attractive outcomes.
Many fan writers hate generative AI instruments, whilst some have flocked to chatbots like Character.AI, so I doubt many can be up in arms about ChatGPT imposing obstacles on fanfic writing. As a substitute, it’s merely a small, intriguing instance of what black containers these programs might be. When you do consider generative AI as a artistic instrument, it’s a very good reminder that the programs are quietly restricted in methods our human minds aren’t — and that till you hit these limits, some are virtually not possible to foretell.