Presidio contains predefined recognizers for PII entities.
This page describes the different entities Presidio can detect and the method Presidio employs to detect those.
A crypto wallet number. Currently, only Bitcoin addresses are supported.
Pattern match, context and checksum
DATE_TIME
Absolute or relative dates or periods or times smaller than a day.
Pattern match and context
EMAIL_ADDRESS
An email address identifies an email box to which email messages are delivered.
Pattern match, context and RFC-822 validation
IBAN_CODE
The International Bank Account Number (IBAN) is an internationally agreed system of identifying bank accounts across national borders to facilitate the communication and processing of cross border transactions with a reduced risk of transcription errors.
Pattern match, context and checksum
IP_ADDRESS
An Internet Protocol (IP) address (either IPv4 or IPv6).
Pattern match, context and checksum
NRP
A person's nationality, religious or political group.
Custom logic and context
LOCATION
Name of a politically or geographically defined location (cities, provinces, countries, international regions, bodies of water, mountains).
Custom logic and context
PERSON
A full person name, which can include first names, middle names or initials, and last names.
Custom logic and context
PHONE_NUMBER
A telephone number
Custom logic, pattern match and context
MEDICAL_LICENSE
Common medical license numbers.
Pattern match, context and checksum
URL
A URL (Uniform Resource Locator) is a unique identifier used to locate a resource on the Internet.
Pattern match, context and top-level URL validation
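The detection methods listed above (pattern match, context and checksum) can be illustrated with a minimal, standard-library sketch for IBAN_CODE. This is not Presidio's implementation: the regular expression is a simplified IBAN shape, the context words and score values are hypothetical, and context is checked against the whole text rather than nearby tokens. Only the mod-97 check follows the published IBAN rule.

```python
import re

# Simplified IBAN shape: country code, two check digits, 11-30 alphanumerics.
IBAN_PATTERN = re.compile(r"\b[A-Z]{2}[0-9]{2}[A-Z0-9]{11,30}\b")

# Hypothetical context words and scores, chosen for illustration only.
CONTEXT_WORDS = ("iban", "bank", "account")
BASE_SCORE = 0.5
CONTEXT_BOOST = 0.35


def iban_checksum_ok(iban: str) -> bool:
    """Mod-97 check: move the first four characters to the end, map
    letters to numbers (A=10 ... Z=35) and require a remainder of 1."""
    rearranged = iban[4:] + iban[:4]
    digits = "".join(str(int(ch, 36)) for ch in rearranged)
    return int(digits) % 97 == 1


def find_ibans(text: str) -> list[tuple[str, float]]:
    """Return (match, score) pairs for checksum-valid IBAN candidates."""
    has_context = any(word in text.lower() for word in CONTEXT_WORDS)
    results = []
    for match in IBAN_PATTERN.finditer(text):
        candidate = match.group()
        if not iban_checksum_ok(candidate):
            continue  # the pattern matched, but the checksum rejects it
        score = BASE_SCORE + (CONTEXT_BOOST if has_context else 0.0)
        results.append((candidate, score))
    return results


print(find_ibans("Please wire to IBAN GB82WEST12345698765432 today"))
```

The checksum acts as a filter that discards false positives from the pattern, while context words only raise confidence in a candidate that has already passed.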
USA
Entity Type
Description
Detection Method
US_BANK_NUMBER
A US bank account number is between 8 and 17 digits.
A Unique Entity Number (UEN) is a standard identification number for entities registered in Singapore.
Pattern match, context, and checksum
Australia
Entity Type
Description
Detection Method
AU_ABN
The Australian Business Number (ABN) is a unique 11-digit identifier issued to all entities registered in the Australian Business Register (ABR).
Pattern match, context, and checksum
AU_ACN
An Australian Company Number is a unique nine-digit number issued by the Australian Securities and Investments Commission to every company registered under the Commonwealth Corporations Act 2001 as an identifier.
Pattern match, context, and checksum
AU_TFN
The tax file number (TFN) is a unique identifier issued by the Australian Taxation Office to each taxpaying entity.
Pattern match, context, and checksum
AU_MEDICARE
The Medicare number is a unique identifier issued by the Australian Government that enables the cardholder to receive rebates on medical expenses under Australia's Medicare system.
Pattern match, context, and checksum
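The checksum step for the Australian entities can be sketched as follows. This is a minimal illustration using the publicly documented weighting schemes, not Presidio's own code. The function names are hypothetical, and the sketch assumes the 9-digit TFN form and the 10-digit Medicare form with a leading digit between 2 and 6.

```python
def _digits(value: str) -> list[int]:
    """Keep only ASCII digits, so spaces and other separators are ignored."""
    return [int(ch) for ch in value if ch in "0123456789"]


def _weighted_sum(weights: list[int], digits: list[int]) -> int:
    return sum(w * d for w, d in zip(weights, digits))


def abn_is_valid(abn: str) -> bool:
    d = _digits(abn)
    if len(d) != 11:
        return False
    d[0] -= 1  # the ABN rule subtracts 1 from the leading digit first
    weights = [10, 1, 3, 5, 7, 9, 11, 13, 15, 17, 19]
    return _weighted_sum(weights, d) % 89 == 0


def acn_is_valid(acn: str) -> bool:
    d = _digits(acn)
    if len(d) != 9:
        return False
    remainder = _weighted_sum([8, 7, 6, 5, 4, 3, 2, 1], d[:8]) % 10
    return (10 - remainder) % 10 == d[8]  # ninth digit is the check digit


def tfn_is_valid(tfn: str) -> bool:
    d = _digits(tfn)
    if len(d) != 9:  # assumption: only the 9-digit form is handled
        return False
    return _weighted_sum([1, 4, 3, 7, 5, 8, 6, 9, 10], d) % 11 == 0


def medicare_is_valid(number: str) -> bool:
    d = _digits(number)
    if len(d) != 10 or not 2 <= d[0] <= 6:
        return False
    # The ninth digit is the check digit over the first eight.
    return _weighted_sum([1, 3, 7, 9, 1, 3, 7, 9], d[:8]) % 10 == d[8]


print(abn_is_valid("51 824 753 556"), acn_is_valid("004 085 616"))
```

All four checks share the same shape (a weighted digit sum reduced by a modulus), which is why a single helper covers them.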
India
Entity Type
Description
Detection Method
IN_PAN
The Indian Permanent Account Number (PAN) is a unique 10-character alphanumeric identifier issued to all business and individual entities registered as taxpayers.
Pattern match, context
IN_AADHAAR
A unique 12-digit individual identity number issued by the Indian government.
Pattern match, context, and checksum
IN_VEHICLE_REGISTRATION
A vehicle registration number issued by the Indian government for transport vehicles (government, personal, diplomatic, defence).
Pattern match, context, and checksum
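For IN_AADHAAR, the checksum is commonly described as a Verhoeff check digit in the last position. The sketch below is a standard-library illustration of that algorithm, assuming the Verhoeff scheme applies and that the leading digit is never 0 or 1. The function names are hypothetical and the code is not taken from Presidio.

```python
# Verhoeff tables: dihedral-group multiplication (D), position-dependent
# permutation (P) and group inverse (INV).
D = [
    [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
    [1, 2, 3, 4, 0, 6, 7, 8, 9, 5],
    [2, 3, 4, 0, 1, 7, 8, 9, 5, 6],
    [3, 4, 0, 1, 2, 8, 9, 5, 6, 7],
    [4, 0, 1, 2, 3, 9, 5, 6, 7, 8],
    [5, 9, 8, 7, 6, 0, 4, 3, 2, 1],
    [6, 5, 9, 8, 7, 1, 0, 4, 3, 2],
    [7, 6, 5, 9, 8, 2, 1, 0, 4, 3],
    [8, 7, 6, 5, 9, 3, 2, 1, 0, 4],
    [9, 8, 7, 6, 5, 4, 3, 2, 1, 0],
]
P = [
    [0, 1, 2, 3, 4, 5, 6, 7, 8, 9],
    [1, 5, 7, 6, 2, 8, 3, 0, 9, 4],
    [5, 8, 0, 3, 7, 9, 6, 1, 4, 2],
    [8, 9, 1, 6, 0, 4, 3, 5, 2, 7],
    [9, 4, 5, 3, 1, 2, 6, 8, 7, 0],
    [4, 2, 8, 6, 5, 7, 3, 9, 0, 1],
    [2, 7, 9, 3, 8, 0, 6, 4, 1, 5],
    [7, 0, 4, 6, 9, 1, 3, 2, 5, 8],
]
INV = [0, 4, 3, 2, 1, 5, 6, 7, 8, 9]


def _is_ascii_digits(value: str) -> bool:
    return value != "" and all(ch in "0123456789" for ch in value)


def verhoeff_check_digit(payload: str) -> int:
    """Compute the digit to append to `payload` (digits only)."""
    c = 0
    for i, ch in enumerate(reversed(payload)):
        c = D[c][P[(i + 1) % 8][int(ch)]]
    return INV[c]


def verhoeff_is_valid(number: str) -> bool:
    """True if the last digit of `number` is a correct Verhoeff check digit."""
    if not _is_ascii_digits(number):
        return False
    c = 0
    for i, ch in enumerate(reversed(number)):
        c = D[c][P[i % 8][int(ch)]]
    return c == 0


def aadhaar_looks_valid(value: str) -> bool:
    """Assumed rules: 12 digits, first digit not 0 or 1, Verhoeff passes."""
    digits = value.replace(" ", "").replace("-", "")
    return len(digits) == 12 and digits[0] not in "01" and verhoeff_is_valid(digits)


print(verhoeff_is_valid("2363"))
```

Unlike a plain weighted sum, the Verhoeff scheme also catches adjacent transpositions, which makes it a common choice for long hand-entered identifiers.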
Finland
Entity Type
Description
Detection Method
FI_PERSONAL_IDENTITY_CODE
The Finnish Personal Identity Code (Henkilötunnus) is a unique 11-character individual identity number.
Pattern match, context and custom logic.
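The "custom logic" for FI_PERSONAL_IDENTITY_CODE can be sketched as a date check plus a control-character check. This is a minimal illustration of the published format (DDMMYY, century marker, three-digit individual number, control character), not Presidio's code. It assumes only the original century markers `+`, `-` and `A`, and the function name is hypothetical.

```python
import re
from datetime import date

# The control character is the 9-digit number DDMMYYNNN modulo 31,
# used as an index into this 31-character alphabet.
CONTROL_CHARS = "0123456789ABCDEFHJKLMNPRSTUVWXY"

# Assumption: only the original century markers are handled here.
CENTURY = {"+": 1800, "-": 1900, "A": 2000}

HETU_PATTERN = re.compile(
    r"^([0-9]{2})([0-9]{2})([0-9]{2})([+\-A])([0-9]{3})([0-9A-Y])$"
)


def hetu_is_valid(code: str) -> bool:
    match = HETU_PATTERN.match(code.upper())
    if match is None:
        return False
    dd, mm, yy, century, individual, control = match.groups()
    try:
        # Reject impossible birth dates such as 31 November.
        date(CENTURY[century] + int(yy), int(mm), int(dd))
    except ValueError:
        return False
    remainder = int(dd + mm + yy + individual) % 31
    return CONTROL_CHARS[remainder] == control


print(hetu_is_valid("131052-308T"))
```

The date check is what separates this from a pure checksum: a code can carry a correct control character and still be rejected because its birth date does not exist.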
Adding a custom PII entity
See this documentation for instructions on how to add a new Recognizer for a new type of PII entity.
Complementing Presidio with Azure AI Language PII
Azure AI Language PII is a cloud-based service that provides Natural Language Processing (NLP) features for detecting PII in text.
A list of the entities supported by Azure AI Language PII can be found here.