Combined Entities

In addition to the entities described in the Eduction Grammar Reference, the IDOL PII Package includes grammar files that contain "combined" entities. These files are named combined_*.ecr (or combined_*_cjkvt.ecr for Japan) and the entities match addresses, dates, driving license numbers, and so on, from multiple countries.

  • The entities that end in /all match data for any supported non-CJKVT country or language.
  • The entities that end in /all_cjkvt match data for any supported CJKVT country.
  • The entities that end in /gdpr match data for any supported country or language subject to GDPR.

For example:

  • Using pii/address/all from combined_address.ecr matches a postal address from any non-CJKVT country. This is similar to using the address.ecr grammar file and extracting pii/address/??.
  • Using pii/address/gdpr from combined_address.ecr matches a postal address from any country subject to GDPR. This is similar to using the address.ecr grammar file and extracting gdpr/address/??.
  • Using pii/date/dob/context/all from combined_date.ecr matches a date of birth written numerically or using words in any language. This is similar to using the date.ecr grammar file and extracting pii/date/dob/context/???.

The combined (/all, /all_cjkvt and /gdpr) entities provide a significant improvement in processing speed when you extract matches for all countries or languages.

The combined grammar files might produce fewer matches, because (by default) only a single match is returned in cases where the same characters in the input text would match multiple countries or languages.

TIP: If you need all matches, you can turn on the AllowMultipleResults configuration option. This option slows down the matching process because it does not stop after a single match, but is generally still faster than using the individual grammars.

File Entity
combined_address.ecr pii/address/all
pii/address/gdpr
pii/address/streetlocation/context/all
pii/address/streetlocation/context/gdpr
pii/address/city/context/all
pii/address/city/context/gdpr
pii/address/country/context/all
pii/address/country/context/gdpr
pii/address/postcode/context/all
pii/address/postcode/context/gdpr
combined_address_cjkvt.ecr pii/address/all_cjkvt
pii/address/address1/context/all_cjkvt
pii/address/region/context/all_cjkvt
pii/address/country/context/all_cjkvt
pii/address/postcode/context/all_cjkvt
combined_banking.ecr

pii/banking/account_number/context/all

pii/banking/account_number/context/gdpr
pii/banking/account_number/landmark/all
pii/banking/account_number/landmark/gdpr
pii/banking/account_number/nocontext/all
pii/banking/account_number/nocontext/gdpr
pii/banking/context/all
pii/banking/context/gdpr
pii/banking/iban/context/all
pii/banking/iban/context/gdpr
pii/banking/iban/landmark/all
pii/banking/iban/landmark/gdpr
pii/banking/iban/nocontext/all
pii/banking/iban/nocontext/gdpr
pii/banking/landmark/all
pii/banking/landmark/gdpr
pii/banking/nocontext/all
pii/banking/nocontext/gdpr
pii/banking/roll_number/context/all
pii/banking/roll_number/context/gdpr
pii/banking/roll_number/landmark/all
pii/banking/roll_number/landmark/gdpr
pii/banking/roll_number/nocontext/all
pii/banking/roll_number/nocontext/gdpr
pii/banking/routing_number/context/all
pii/banking/routing_number/context/gdpr
pii/banking/routing_number/landmark/all
pii/banking/routing_number/landmark/gdpr
pii/banking/routing_number/nocontext/all
pii/banking/routing_number/nocontext/gdpr
pii/banking/swiftcode/context/all
pii/banking/swiftcode/context/gdpr
pii/banking/swiftcode/landmark/all
pii/banking/swiftcode/landmark/gdpr
pii/banking/swiftcode/nocontext/all
pii/banking/swiftcode/nocontext/gdpr
combined_banking_cjkvt.ecr pii/banking/account_number/context/all_cjkvt
pii/banking/account_number/landmark/all_cjkvt
pii/banking/account_number/nocontext/all_cjkvt
pii/banking/context/all_cjkvt
pii/banking/landmark/all_cjkvt
pii/banking/nocontext/all_cjkvt
pii/banking/routing_number/context/all_cjkvt
pii/banking/routing_number/landmark/all_cjkvt
pii/banking/routing_number/nocontext/all_cjkvt
pii/banking/swiftcode/context/all_cjkvt
pii/banking/swiftcode/landmark/all_cjkvt
pii/banking/swiftcode/nocontext/all_cjkvt
combined_date.ecr pii/date/dob/context/all
pii/date/dob/landmark/all
pii/date/dob/context/gdpr
pii/date/dob/landmark/gdpr
pii/date/nocontext/all
pii/date/nocontext/gdpr
combined_date_cjkvt.ecr pii/date/dob/context/all_cjkvt
pii/date/dob/landmark/all_cjkvt
pii/date/nocontext/all_cjkvt
combined_driving.ecr pii/driving/context/all
pii/driving/nocontext/all
pii/driving/landmark/all
pii/driving/context/gdpr
pii/driving/nocontext/gdpr
pii/driving/landmark/gdpr
combined_driving_cjkvt.ecr pii/driving/context/all_cjkvt
pii/driving/nocontext/all_cjkvt
pii/driving/landmark/all_cjkvt
combined_health.ecr pii/health/ehic/context/gdpr
pii/health/ehic/nocontext/gdpr
pii/health/ehic/landmark/gdpr
pii/health/id/context/all
pii/health/id/nocontext/all
pii/health/id/landmark/all
pii/health/id/context/gdpr
pii/health/id/nocontext/gdpr
pii/health/id/landmark/gdpr
combined_health_cjkvt.ecr pii/health/id/context/all_cjkvt
pii/health/id/nocontext/all_cjkvt
pii/health/id/landmark/all_cjkvt
combined_medical_terms.ecr pii/medical_terms/all
pii/medical_terms/gdpr
pii/medical_terms/blood_test/all
pii/medical_terms/blood_test/gdpr
pii/medical_terms/disability/social_security/all
pii/medical_terms/disability/social_security/gdpr
pii/medical_terms/disease_condition/all
pii/medical_terms/disease_condition/gdpr
pii/medical_terms/drug_brand/all
pii/medical_terms/drug_brand/gdpr
pii/medical_terms/drug_generic/all
pii/medical_terms/drug_generic/gdpr
pii/medical_terms/lab_test/all
pii/medical_terms/lab_test/gdpr
pii/medical_terms/lifestyle/all
pii/medical_terms/lifestyle/gdpr
pii/medical_terms/medication/all
pii/medical_terms/medication/gdpr
pii/medical_terms/specialty/all
pii/medical_terms/specialty/gdpr
pii/medical_terms/surgical_procedure/all
pii/medical_terms/surgical_procedure/gdpr
combined_medical_terms_cjkvt.ecr pii/medical_terms/all_cjkvt
combined_name.ecr pii/name/all
pii/name/gdpr
combined_name_cjkvt.ecr pii/name/all_cjkvt
pii/name/latin/all_cjkvt
pii/name/cjkvt/all_cjkvt
combined_national_id.ecr pii/id/context/all
pii/id/nocontext/all
pii/id/landmark/all
pii/id/context/gdpr
pii/id/nocontext/gdpr
pii/id/landmark/gdpr
combined_national_id_cjkvt.ecr pii/id/context/all_cjkvt
pii/id/nocontext/all_cjkvt
pii/id/landmark/all_cjkvt
combined_passport.ecr pii/passport/context/all
pii/passport/nocontext/all
pii/passport/landmark/all
pii/passport/context/gdpr
pii/passport/nocontext/gdpr
pii/passport/landmark/gdpr
combined_passport_cjkvt.ecr pii/passport/context/all_cjkvt
pii/passport/nocontext/all_cjkvt
pii/passport/landmark/all_cjkvt
combined_postcode.ecr pii/postcode/context/all
pii/postcode/nocontext/all
pii/postcode/landmark/all
pii/postcode/context/gdpr
pii/postcode/nocontext/gdpr
pii/postcode/landmark/gdpr
combined_postcode_cjkvt.ecr pii/postcode/context/all_cjkvt
pii/postcode/nocontext/all_cjkvt
pii/postcode/landmark/all_cjkvt
combined_telephone.ecr pii/telephone/context/all
pii/telephone/nocontext/all
pii/telephone/landmark/all
pii/telephone/context/gdpr
pii/telephone/nocontext/gdpr
pii/telephone/landmark/gdpr
combined_telephone_cjkvt.ecr pii/telephone/context/all_cjkvt
pii/telephone/nocontext/all_cjkvt
pii/telephone/landmark/all_cjkvt
combined_tin.ecr pii/tin/context/all
pii/tin/nocontext/all
pii/tin/landmark/all
pii/tin/context/gdpr
pii/tin/nocontext/gdpr
pii/tin/landmark/gdpr
combined_tin_cjkvt.ecr pii/tin/context/all_cjkvt
pii/tin/nocontext/all_cjkvt
pii/tin/landmark/all_cjkvt
combined_travel.ecr pii/travel/context/all
pii/travel/nocontext/all
pii/travel/landmark/all
combined_voter_id.ecr pii/voter_id/context/all
pii/voter_id/nocontext/all
pii/voter_id/landmark/all