Reddit Inc. is weighing feedback from early meetings with potential investors in its initial public offering that it should consider a valuation of at least $5 billion, according to people familiar with the matter, even as it is estimated below that figure in the volatile market for shares of private companies.
Most LLMs have tonnes of NSFW data in their training.
Typically, if this wants to be blocked, a secondary RAG or LORA is run overtop to act as a filtering mechanism to catch, block, and regenerate explicit responses.
Furthermore, output allowed lexicon is a whole thing.
Unfiltered LLMs without these layers added on are actually quite explicit and very much capable of generating extremely NSFW output by default.