Content Moderation Archives

November 8, 2024

Mistral’s New AI Moderation API

During this week's MMO CMO AI Transformation Summit, one key topic was the pressing need for ways to control and moderate LLM output, so Mistral's latest announcement is timely. The company has launched a new moderation API, powered by its fine-tuned Ministral 8B model, designed to classify content across nine categories, including violence, self-harm, and personally identifiable information. Continue Reading →

November 30, 2022

Twitter Content Moderators Reassigned

According to transparency.twitter.com, "Effective November 23, 2022, Twitter is no longer enforcing the COVID-19 misleading information policy." I'm not sure when this was posted, but I learned about it this morning. Continue Reading →

November 16, 2022

The Ultimate Content Moderation Challenge

CNN obtained an internal memo to Meta's (formerly Facebook's) content moderation team that noted that “political speech is ineligible for fact-checking. This includes the words a politician says as well as photo, video, or other content that is clearly labeled as created by the politician or their campaign.” The memo was in direct response to requests for guidance from Meta's internal and external fact checkers. Continue Reading →
