Microsoft Presidio Misses 1 in 2 Sensitive Entities. Limina Misses 1 in 12.
On real privacy data, Presidio recall is 0.386. Limina's is 0.919. Free to deploy, expensive to rely on—Presidio was built as a starting point, not a production-grade privacy layer. Limina was. And like Presidio, it runs entirely on-premises. Unlike Presidio, it actually catches what's there.

.png)
.png)
.png)
.png)
.png)

.png)












.png)
.png)
.png)
.png)
.png)

.png)











91.94%
+52.59%
50+
0
Where Limina Wins
A direct comparison across the dimensions that matter most to engineering and compliance teams.
Feature by Feature
Limina vs. AWS Comprehend across the full capability set.
Built for Teams with Real Exposure
The organizations that move from Presidio to Limina are the ones where a 52% recall gap has consequences.
Healthcare & Life Sciences
Presidio has no PHI entity types—no dosage, no medical condition, no injury classification. Limina covers all 18 HIPAA identifiers and the full PHI set, on-premises, with expert determination-ready output. For teams processing clinical notes, radiology reports, and patient recordings, the coverage gap isn't marginal.
Financial Services
Presidio provides partial PCI coverage. Disfluent card numbers—spoken aloud, misformatted, or split across a transcript—pass through undetected. Limina catches them. For contact center and transaction data at scale, the difference between partial and full coverage is the difference between compliant and exposed.
Global Enterprises
A single-language tool isn't a global privacy strategy. Presidio supports one language natively; additional languages require training investment. Limina supports 50+ languages with the same accuracy benchmark, so EMEA, APAC, and LATAM teams operate at the same standard as English-language deployments.
AI & LLM Initiatives
Presidio was built as an engineering baseline. As data drift occurs, regulations change, and new entity types emerge, maintenance effort compounds. Limina handles updates, language coverage, and regulation changes—so engineering teams build on a production-grade foundation rather than maintaining one.
Regulated & High-security Environments
Both Limina and Presidio run on-premises. The difference is what happens at scale—Presidio's 1M character limit and 38.62% recall are production constraints, not theoretical ones. Limina runs without input limits and catches what Presidio misses, with deterministic output your auditors can verify.
The Honest Comparison
A missed entity isn't a classification error. It's data exposure. Here's how the two products actually compare.
"Presidio is free."
True. The tradeoff is accuracy and maintenance. Presidio recall on real privacy data is 0.386. In other words, more than half of sensitive entities go undetected. As data drift occurs, regulations change, and new entity types emerge, the engineering effort required to keep Presidio production-grade compounds quickly. The licence cost is zero. The total cost is not.
"Presidio runs on-premises."
Also true, but so does Limina. On-premises deployment isn't a differentiator between these two products. Recall, language coverage, entity coverage, and input limits are.
"Presidio supports custom entity recognition."
It does, with training data and engineering investment. Limina ships 50+ entity types out of the box, including all 18 HIPAA identifiers, full PCI coverage, and GDPR Article 9 sensitive categories that Presidio doesn't cover. Presidio supports 33 conceptual entity types, with partial HIPAA and partial PCI coverage.
"Presidio's output is deterministic."
Correct. So is Limina's. Deterministic output is a baseline requirement, not a differentiator.
"Presidio is benchmarked on standard datasets."
Limina's benchmarks run on the ai4privacy 500k dataset—publicly available, multi-domain, spanning finance, healthcare, and legal text. Limina has not trained on any split of it. Labels from both solutions were mapped to a common schema for a direct comparison. The evaluation code and datasets are available on request. We'd welcome a head-to-head on your data.
See Limina on Your Data
Most teams know within a single proof of concept whether Limina fits. We'll run it against your formats, your languages, your edge cases — so the comparison is real, not theoretical.