Combined Entities
In addition to the entities described in the Eduction Grammar Reference, the IDOL PII Package includes grammar files that contain "combined" entities. These files are named combined_*.ecr
(or combined_*_cjkvt.ecr
for Japan) and the entities match addresses, dates, driving license numbers, and so on, from multiple countries.
- The entities that end in
/all
match data for any supported non-CJKVT country or language. - The entities that end in
/all_cjkvt
match data for any supported CJKVT country. - The entities that end in
/gdpr
match data for any supported country or language subject to GDPR.
For example:
- Using
pii/address/all
fromcombined_address.ecr
matches a postal address from any non-CJKVT country. This is similar to using theaddress.ecr
grammar file and extractingpii/address/??
. - Using
pii/address/gdpr
fromcombined_address.ecr
matches a postal address from any country subject to GDPR. This is similar to using theaddress.ecr
grammar file and extractinggdpr/address/??
. - Using
pii/date/dob/context/all
fromcombined_date.ecr
matches a date of birth written numerically or using words in any language. This is similar to using thedate.ecr
grammar file and extractingpii/date/dob/context/???
.
The combined (/all
, /all_cjkvt
and /gdpr
) entities provide a significant improvement in processing speed when you extract matches for all countries or languages.
The combined grammar files might produce fewer matches, because (by default) only a single match is returned in cases where the same characters in the input text would match multiple countries or languages.
TIP: If you need all matches, you can turn on the AllowMultipleResults
configuration option. This option slows down the matching process because it does not stop after a single match, but is generally still faster than using the individual grammars.
File | Entity |
---|---|
combined_address.ecr | pii/address/all |
pii/address/gdpr | |
pii/address/streetlocation/context/all | |
pii/address/streetlocation/context/gdpr | |
pii/address/city/context/all | |
pii/address/city/context/gdpr | |
pii/address/country/context/all | |
pii/address/country/context/gdpr | |
pii/address/postcode/context/all | |
pii/address/postcode/context/gdpr | |
combined_address_cjkvt.ecr | pii/address/all_cjkvt |
pii/address/address1/context/all_cjkvt | |
pii/address/region/context/all_cjkvt | |
pii/address/country/context/all_cjkvt | |
pii/address/postcode/context/all_cjkvt | |
combined_banking.ecr |
pii/banking/account_number/context/all |
pii/banking/account_number/context/gdpr | |
pii/banking/account_number/landmark/all | |
pii/banking/account_number/landmark/gdpr | |
pii/banking/account_number/nocontext/all | |
pii/banking/account_number/nocontext/gdpr | |
pii/banking/context/all | |
pii/banking/context/gdpr | |
pii/banking/iban/context/all | |
pii/banking/iban/context/gdpr | |
pii/banking/iban/landmark/all | |
pii/banking/iban/landmark/gdpr | |
pii/banking/iban/nocontext/all | |
pii/banking/iban/nocontext/gdpr | |
pii/banking/landmark/all | |
pii/banking/landmark/gdpr | |
pii/banking/nocontext/all | |
pii/banking/nocontext/gdpr | |
pii/banking/roll_number/context/all | |
pii/banking/roll_number/context/gdpr | |
pii/banking/roll_number/landmark/all | |
pii/banking/roll_number/landmark/gdpr | |
pii/banking/roll_number/nocontext/all | |
pii/banking/roll_number/nocontext/gdpr | |
pii/banking/routing_number/context/all | |
pii/banking/routing_number/context/gdpr | |
pii/banking/routing_number/landmark/all | |
pii/banking/routing_number/landmark/gdpr | |
pii/banking/routing_number/nocontext/all | |
pii/banking/routing_number/nocontext/gdpr | |
pii/banking/swiftcode/context/all | |
pii/banking/swiftcode/context/gdpr | |
pii/banking/swiftcode/landmark/all | |
pii/banking/swiftcode/landmark/gdpr | |
pii/banking/swiftcode/nocontext/all | |
pii/banking/swiftcode/nocontext/gdpr | |
combined_banking_cjkvt.ecr | pii/banking/account_number/context/all_cjkvt |
pii/banking/account_number/landmark/all_cjkvt | |
pii/banking/account_number/nocontext/all_cjkvt | |
pii/banking/context/all_cjkvt | |
pii/banking/landmark/all_cjkvt | |
pii/banking/nocontext/all_cjkvt | |
pii/banking/routing_number/context/all_cjkvt | |
pii/banking/routing_number/landmark/all_cjkvt | |
pii/banking/routing_number/nocontext/all_cjkvt | |
pii/banking/swiftcode/context/all_cjkvt | |
pii/banking/swiftcode/landmark/all_cjkvt | |
pii/banking/swiftcode/nocontext/all_cjkvt | |
combined_date.ecr | pii/date/dob/context/all |
pii/date/dob/landmark/all | |
pii/date/dob/context/gdpr | |
pii/date/dob/landmark/gdpr | |
pii/date/nocontext/all | |
pii/date/nocontext/gdpr | |
combined_date_cjkvt.ecr | pii/date/dob/context/all_cjkvt |
pii/date/dob/landmark/all_cjkvt | |
pii/date/nocontext/all_cjkvt | |
combined_driving.ecr | pii/driving/context/all |
pii/driving/nocontext/all | |
pii/driving/landmark/all | |
pii/driving/context/gdpr | |
pii/driving/nocontext/gdpr | |
pii/driving/landmark/gdpr | |
combined_driving_cjkvt.ecr | pii/driving/context/all_cjkvt |
pii/driving/nocontext/all_cjkvt | |
pii/driving/landmark/all_cjkvt | |
combined_health.ecr | pii/health/ehic/context/gdpr |
pii/health/ehic/nocontext/gdpr | |
pii/health/ehic/landmark/gdpr | |
pii/health/id/context/all | |
pii/health/id/nocontext/all | |
pii/health/id/landmark/all | |
pii/health/id/context/gdpr | |
pii/health/id/nocontext/gdpr | |
pii/health/id/landmark/gdpr | |
combined_health_cjkvt.ecr | pii/health/id/context/all_cjkvt |
pii/health/id/nocontext/all_cjkvt | |
pii/health/id/landmark/all_cjkvt | |
combined_medical_terms.ecr | pii/medical_terms/all |
pii/medical_terms/gdpr | |
pii/medical_terms/blood_test/all | |
pii/medical_terms/blood_test/gdpr | |
pii/medical_terms/disability/social_security/all | |
pii/medical_terms/disability/social_security/gdpr | |
pii/medical_terms/disease_condition/all | |
pii/medical_terms/disease_condition/gdpr | |
pii/medical_terms/drug_brand/all | |
pii/medical_terms/drug_brand/gdpr | |
pii/medical_terms/drug_generic/all | |
pii/medical_terms/drug_generic/gdpr | |
pii/medical_terms/lab_test/all | |
pii/medical_terms/lab_test/gdpr | |
pii/medical_terms/lifestyle/all | |
pii/medical_terms/lifestyle/gdpr | |
pii/medical_terms/medication/all | |
pii/medical_terms/medication/gdpr | |
pii/medical_terms/specialty/all | |
pii/medical_terms/specialty/gdpr | |
pii/medical_terms/surgical_procedure/all | |
pii/medical_terms/surgical_procedure/gdpr | |
combined_medical_terms_cjkvt.ecr | pii/medical_terms/all_cjkvt |
combined_name.ecr | pii/name/all |
pii/name/gdpr | |
combined_name_cjkvt.ecr | pii/name/all_cjkvt |
pii/name/latin/all_cjkvt | |
pii/name/cjkvt/all_cjkvt | |
combined_national_id.ecr | pii/id/context/all |
pii/id/nocontext/all | |
pii/id/landmark/all | |
pii/id/context/gdpr | |
pii/id/nocontext/gdpr | |
pii/id/landmark/gdpr | |
combined_national_id_cjkvt.ecr | pii/id/context/all_cjkvt |
pii/id/nocontext/all_cjkvt | |
pii/id/landmark/all_cjkvt | |
combined_passport.ecr | pii/passport/context/all |
pii/passport/nocontext/all | |
pii/passport/landmark/all | |
pii/passport/context/gdpr | |
pii/passport/nocontext/gdpr | |
pii/passport/landmark/gdpr | |
combined_passport_cjkvt.ecr | pii/passport/context/all_cjkvt |
pii/passport/nocontext/all_cjkvt | |
pii/passport/landmark/all_cjkvt | |
combined_postcode.ecr | pii/postcode/context/all |
pii/postcode/nocontext/all | |
pii/postcode/landmark/all | |
pii/postcode/context/gdpr | |
pii/postcode/nocontext/gdpr | |
pii/postcode/landmark/gdpr | |
combined_postcode_cjkvt.ecr | pii/postcode/context/all_cjkvt |
pii/postcode/nocontext/all_cjkvt | |
pii/postcode/landmark/all_cjkvt | |
combined_telephone.ecr | pii/telephone/context/all |
pii/telephone/nocontext/all | |
pii/telephone/landmark/all | |
pii/telephone/context/gdpr | |
pii/telephone/nocontext/gdpr | |
pii/telephone/landmark/gdpr | |
combined_telephone_cjkvt.ecr | pii/telephone/context/all_cjkvt |
pii/telephone/nocontext/all_cjkvt | |
pii/telephone/landmark/all_cjkvt | |
combined_tin.ecr | pii/tin/context/all |
pii/tin/nocontext/all | |
pii/tin/landmark/all | |
pii/tin/context/gdpr | |
pii/tin/nocontext/gdpr | |
pii/tin/landmark/gdpr | |
combined_tin_cjkvt.ecr | pii/tin/context/all_cjkvt |
pii/tin/nocontext/all_cjkvt | |
pii/tin/landmark/all_cjkvt | |
combined_voter_id.ecr | pii/voter_id/context/all |
pii/voter_id/nocontext/all | |
pii/voter_id/landmark/all |