Policy Management

The Policies feature allows you to define, manage, and apply moderation policies directly within the Moderation Dashboard. This feature is designed to enhance transparency, compliance, and efficiency in your content moderation process. By integrating policies into both automated rules and manual actions, you can ensure a consistent and accountable approach to content moderation.


Key Features

  • Define and Manage Policies: Create and organize your platform's moderation policies and sub-policies, each with a unique name, api_value, and description, for tracking.
  • Integration with Rules and Actions: Apply policies to automated rules and manual actions, ensuring each moderation decision is justified and documented in the moderation log.
  • DSA Compliance: Explore and add standardized Digital Services Act (DSA) policies to your application, crucial for tracking and reporting actions in line with regulatory requirements. DSA policies follow the structure required for reporting to the DSA Transparency Database

How to use Policies

Adding Policies

  1. Navigate to the Policies Page:
  2. Create Policies and Sub-Policies: Define your policies, including a unique api_value, display name, and description.
    1. API Value: Theapi_valueis a unique identifier for each policy and sub-policy, returned in the action request's policy object whenever a policy is applied. This value helps you track and integrate with policies within your system. Refer to the API Response below to view how Policies are returned
    2. Description: Describe your Policy and Sub-policy in more detail. This description will also be returned in the API Response and can be used to directly communicate policy infractions to your end user.
  3. Explore DSA Policies:
    1. Add standardized DSA policies to your application to ensure compliance with the Digital Services Act, which regulates online platforms to prevent illegal activities and protect user safety.
    2. These policies follow the same API structure required when reporting to the DSA Transparency Database
      Learn more about DSA Transparency Database here.

Applying Policies

  1. In Automated Rules: When configuring rules, select policies that are being moderated by the rule and these policies will automatically be applied and tracked when the rule is fired on a post or user.
  2. In Manual Actions: Moderators can select applicable policies and sub-policies during the review process, ensuring clear documentation of decisions and transparency for the content publisher.

Example Action Request (with Policies applied)

{
  "user": "user_id_123",
  "post": "post_id_123",
  "policies": [
    {
      "parent_policy": {
        "api_value": "VIOLENCE",
        "display_name": "Violence (Sample Policy)",
        "description": "Our platform maintains a zero-tolerance policy towards violent content to ensure the safety and well-being of our users. We strictly forbid violent speech deemed highly severe and likely to cause harm. Such content will be removed, and repeated offenses may lead to account restrictions or suspension. This includes explicit statements that:\n\nThreaten Violence: Direct threats to inflict physical harm, including threats of killing, torture, sexual assault, or other bodily injury. This extends to threats against civilian homes, shelters, or essential infrastructure.\nExpress Desires for Harm: Wishing, hoping, or expressing a desire for others to suffer harm, such as death, illness, tragedy, or physical injury.\n\nIncite Violence: Encouraging or promoting acts of violence or harm, including self-harm or participation in atrocities like crimes against humanity, war crimes, or genocide.\n\nGlorify Violence: Praising or celebrating violent acts where harm occurred, including expressing gratitude for someone's suffering or lauding animal abuse or cruelty.\n\nViolent speech is also prohibited in highly visible areas on this platform, such as live video, profile information, header images, or community cover photos.We tailor our response to the severity and likelihood of harm. In some cases, we may limit the visibility of violent content on this platform if we assess that:"
      },
      "sub_policies": [
        {
          "api_value": "GUN_VIOLENCE",
          "display_name": "Gun Violence",
          "description": "Gun Violence"
        },
        {
          "api_value": "BOMBS",
          "display_name": "Bombs",
          "description": "Bombs"
        }
      ]
    },
    {
      "parent_policy": {
        "api_value": "hate_speech",
        "display_name": "Hate Speech",
        "description": "This is the description for Hate Speech"
      },
      "sub_policies": [
        {
          "api_value": "bullying",
          "display_name": "Bullying",
          "description": "This is Bullying"
        }
      ]
    }
  ]
}