trust-and-safety

Here are 52 public repositories matching this topic...

roostorg / osprey

Automate the obvious and investigate the ambiguous. High-performance safety rules engine for real-time event processing at scale.

trust-and-safety roost-tools

Updated Apr 4, 2026
Python

roostorg / awesome-safety-tools

Star

Directory of open source tools for online safety

safety trust-and-safety online-safety

Updated Apr 3, 2026

roostorg / model-community

Star

Making open safety AI models accessible and beneficial to the safety community

trust-and-safety

Updated Apr 3, 2026
Jupyter Notebook

tattle-made / Uli

Sponsor

Star

Software and Resources for Mitigating Online Gender Based Violence in India

nlp machine-learning ml browser-extension india social-impact sdg indic-languages indic indian-languages trust-and-safety gender-based-violence extension-chrome content-moderation ogbv sdg-10 sdg-5

Updated Apr 7, 2026
Elixir

swicg / activitypub-trust-and-safety

Star

ActivityPub Trust and Safety Taskforce

activitypub fediverse trust-and-safety

Updated Apr 1, 2026
HTML

roostorg / coop

Star

Review and moderation, your way. Online safety dashboard, queues, routing and automatic enforcement rules, and integrations.

trust-and-safety child-safety content-safety

Updated Apr 8, 2026
TypeScript

haileyok / phoebe

Star

A trust and safety agent that interacts with Osprey for investigation, real-time analysis, and prevention implementations

agent trust-and-safety atproto

Updated Feb 7, 2026
Python

disciplinedware / swiftward

Star

Self-hosted Trust & Safety policy engine with A/B testing, replay, and full audit trails

golang yaml self-hosted deterministic ai-safety fraud-detection policy-engine audit-trail trust-and-safety content-moderation

Updated Feb 9, 2026

roostorg / community

Star

Documentation and policies for the ROOST organization and open source community. File non-technical or ROOST-wide issues here.

trust-and-safety

Updated Apr 6, 2026
CSS

haileyok / gopdq

Star

A Go implementation of Facebook's PDQ

trust-and-safety pdq

Updated Jan 15, 2026
Go

gian-gg / sabot

Star

Your third-party safety layer for verified, transparent, and scam-free online transactions.

security typescript ai nextjs smart-contracts blockchain p2p transactions web3 fraud-prevention escrow trust-and-safety

Updated Jan 29, 2026
TypeScript

crispthinking / PdqHash

Star

A .NET implementation of the PDQ hashing algorithm to make integrating Trust and Safety tools for digital service providers easier.

hashing security dotnet trust-and-safety pdq

Updated Apr 8, 2026
C#

prysaic-labs / OpenSiteTrust

Star

OpenSiteTrust is an open, explainable, and reusable website scoring ecosystem

open-data browser-extension crowdsourcing open-api trust-and-safety explainable-ai privacy-by-design security-headers phishing-detection reputation-system risk-scoring trust-score url-analysis brand-impersonation community-moderation scam-detection misinformation-detection domain-intelligence website-trust

Updated Aug 20, 2025
Python

collingeorge / Reflection-on-Truth

Star

A Reflection on Truth, Power, and the Misdiagnosis of Awareness

surveillance cybersecurity misinformation disinformation trust-and-safety algorithmic-bias digital-health ethics-in-ai hybrid-warfare cyberpsychology whistleblower-protection clinical-blind-spots digital-psychiatry geopolitics-of-ai

Updated Sep 9, 2025

jordanstarrk / mcp-preflight

Star

ls -la for MCP servers. See tools, resources, and risky capabilities before you connect or trust a server.

mcp developer-tools inspection ai-agents trust-and-safety model-context-protocol mcp-server agent-infrastructure

Updated Feb 15, 2026
Python

w-henderson / ProjectPositiveVibes

Star

🤝 Using large language models to seamlessly help content moderators make better decisions, faster.

trust-and-safety content-moderation gpt-3

Updated Mar 29, 2023
TypeScript

Combat fake news with cryptographic image verification. Origin Lens analyzes C2PA Content Credentials and EXIF metadata to detect AI-generated content, verify digital signatures, and reveal complete edit history. Privacy-first open source iOS app with on-device verification. (arXiv:2602.03423)

Updated Mar 7, 2026
Dart

PRADUMAN-KR / Multimodal-Lip-Sync-Deepfake-Detection-System

Star

Production-ready Multimodal Lip Sync Detection & Deepfake Detection System. Detects audio-video synchronization mismatches using deep learning (PyTorch) with a scalable FastAPI-based inference pipeline. Optimized for real-time processing,low false positives, and robust performance on noisy speech segments. Built for video forensics,synthetic media

computer-vision neural-network detection pytorch resnet multimodal-learning trust-and-safety ai-security deepfakes content-moderation temporal-modeling cross-modal-learning forensic-tools media-forensics real-time-inference lip-sync-detection audio-video-sync

Updated Mar 25, 2026
Python

crispthinking / athena-python-client

Star

A Python Client for the Athena CSAM detection service

trust-and-safety csam-detection

Updated Apr 8, 2026
Python

sp2023lab / ChatShieldNLP

Star

Desktop app for detecting inappropriate or unsafe messages using PyQt6, OCR, and NLP (rule-based + DistilBERT).

desktop-app python nlp ocr trust-and-safety pyqt6

Updated Aug 21, 2025
Python

Improve this page

Add a description, image, and links to the trust-and-safety topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the trust-and-safety topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

trust-and-safety

Here are 52 public repositories matching this topic...

roostorg / osprey

roostorg / awesome-safety-tools

roostorg / model-community

tattle-made / Uli

swicg / activitypub-trust-and-safety

roostorg / coop

haileyok / phoebe

disciplinedware / swiftward

roostorg / community

haileyok / gopdq

gian-gg / sabot

crispthinking / PdqHash

prysaic-labs / OpenSiteTrust

collingeorge / Reflection-on-Truth

jordanstarrk / mcp-preflight

w-henderson / ProjectPositiveVibes

aloth / origin-lens

PRADUMAN-KR / Multimodal-Lip-Sync-Deepfake-Detection-System

crispthinking / athena-python-client

sp2023lab / ChatShieldNLP

Improve this page

Add this topic to your repo