Presidio — PII Detection & Anonymization

Microsoft's open-source PII detection and anonymization engine supporting 50+ entity types across text, images, and structured data.

Installs: 198k
Rating: ★ 4.8
Reviews: 66

Microsoft Presidio provides fast, contextual analysis and anonymization of personally identifiable information (PII) in text and images. It powers data-privacy compliance in LLM pipelines, ETL workflows, and document processing systems.

Key Features

  • 50+ built-in recognizers: names, emails, phone numbers, SSNs, credit cards, IBANs, IP addresses, medical identifiers, and more
  • Custom recognizer support (regex, spaCy NER, Stanza, transformer models)
  • Anonymization operators: redact, replace, hash, encrypt, mask, synthetic data substitution
  • Image redaction module (DICOM, PDF, raster)
  • REST API (presidio-analyzer + presidio-anonymizer as microservices)

Quick Start

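from presidio_analyzer import AnalyzerEngine
from presidio_anonymizer import AnonymizerEngine

text = "My phone is 212-555-1234"

# Detect PII spans; the default engine loads a spaCy English model
analyzer = AnalyzerEngine()
results = analyzer.analyze(text=text, language="en")

# With no operators given, each span is replaced by its entity-type placeholder
anonymizer = AnonymizerEngine()
anonymized = anonymizer.anonymize(text=text, analyzer_results=results)
print(anonymized.text)  # anonymize() returns a result object; .text holds the string
# Output: My phone is <PHONE_NUMBER>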

Install

npx ai-supply add presidio-pii-anonymizer

Curated mirror of the open-source Presidio (MIT). Get it from the source.
