Entities supported for the country jp
The following entities are supported for the country jp (Japan)
| entity | description | example |
|---|---|---|
| address | A postal address. In general, a score of one is given to an address that includes a numbered, common format street address (for example "23 North Road"), a known city (for example "London"), and a postal code in a viable format for the country (for example "SW1A 2AA"). Deviations from this form lead to score penalties. The ordering of these elements varies by country. OpenText recommends that you use prefiltering to improve the performance for this grammar. See Configure Prefiltering. This entity returns the addresses in a normalized format by default. The normalized form standardizes apartment and house numbers, expands shortened forms of region names, removes additional punctuation, and converts the text to uppercase. For example "ABIDEI HURRIYET CD TANER PALAS APT 9, KAT:7, D:9, 34437 ISTANBUL". The exact order depends on the country. You can turn off normalization by setting normalize_addresses=false in the address_stoplist.lua script. This option can improve performance when you do not need normalization. |
1-2 Echizen, Nanao-shi, Ishikawa-ken 926-8611 1-2-34, Yaesu 4-Chome, Nanae, Atsuta, Hekinan, Kagoshima, 123-4567, Japan 2-3-4 Kotoni-1-jo, Nishi-ku, Sapporo-shi, Hokkaido 4-2 Kita-1-jo Nishi, Chuo-ku, Sapporo-shi 6-22, Aza Uwadaira Oaza Nagahama, Kazamaura, Ashigarashimo, Toyama 8-1 Hongo-dori Higashi, Chuo-ku, Sapporo-shi P.O. Box 595, Ginza post Office, Tokyo 100-8692, Japan 新潟県新潟市江南区楚川甲180-1 日本、〒123-4567神奈川県津島市城南区月形町八重洲四丁目1番2-34号 |
| address/country/context | A country, with context. | Country: Japan 国: 日本 |
| address/country/landmark | A country landmark. | Country 国 |
| address/country/nocontext | A country, without context. | Japan 日本 |
| address/country/nocontext/cjkvt | A country in CJKVT native script, without context. | 日本 |
| address/country/nocontext/latin | A country in romanized text, without context. | Japan |
| address/landmark | A postal address landmark. | address 住所 |
| address/postcode/context | A postal code, with context. | 〒: 123-4567 |
| address/postcode/landmark | A postal code landmark. | 〒 |
| address/postcode/nocontext | A postal code, without context. | 123-4567 |
| address/region/context | A region, with context. For Taiwan, this is a county (縣 xiàn) or municipality (市 shì). | Prefecture: Kagoshima 都道府県: 神奈川県 |
| address/region/landmark | A region landmark. | Prefecture 都道府県 |
| address/region/nocontext | A region in CJKVT native script or romanized text, without context. | Kagoshima 神奈川県 |
| address/region/nocontext/cjkvt | A region in CJKVT native script, without context. | 神奈川県 |
| address/region/nocontext/latin | A region in romanized text, without context. | Kagoshima |
| address/settlement/context | A settlement, with context. For example, in Japan a town or city, or in Taiwan a district (區 qū) or township (鎮 zhèn/鄉 xiāng). | City/Ward/Town/Village: Abuta gun City/Ward/Town/Village: Nanae Village, Atsuta, Hekinan City/Ward/Town/Village: Nanae, Atsuta, Hekinan 市区町村: 津島市城南区月形町 |
| address/settlement/landmark | A settlement landmark. | City/Ward/Town/Village 市区町村 |
| address/settlement/nocontext | A settlement in CJKVT native script or romanized text, without context. | Abuta gun Nanae Village, Atsuta, Hekinan Nanae, Atsuta, Hekinan 津島市城南区月形町 |
| address/settlement/nocontext/cjkvt | A settlement in CJKVT native script, without context. | 三島郡 津島市城南区上北山村 津島市城南区月形町 |
| address/settlement/nocontext/latin | A settlement in romanized text, without context. | Abuta gun Nanae Village, Atsuta, Hekinan Nanae, Atsuta, Hekinan |
| address/streetlocation/context | A street location (house number and street name), with context. | Address: 1-2 Echizen, Nanao-shi Address: 1-2-34, Yaesu 4-Chome Address: 2-3-4 Kotoni-1-jo Address: 4-2 Kita-1-jo Nishi Address: 6-22, Aza Uwadaira Oaza Nagahama, Kazamaura, Ashigarashimo Address: 8-1 Hongo-dori Higashi Address: P.O. Box 595, Ginza Post Office 住所: 八重洲四丁目1番2-34号 住所: 甲180-1 住所: 神奈川県津島市城南区月形町八重洲四丁目1番2-34号 |
| address/streetlocation/landmark | A street location landmark. | Address 住所 |
| address/streetlocation/nocontext | A street location (house number and street name), without context. | 1-2 Echizen, Nanao-shi 1-2-34, Yaesu 4-Chome 2-3-4 Kotoni-1-jo 4-2 Kita-1-jo Nishi 6-22, Aza Uwadaira Oaza Nagahama, Kazamaura, Ashigarashimo 8-1 Hongo-dori Higashi P.O. Box 595, Ginza Post Office 八重洲四丁目1番2-34号 甲180-1 神奈川県津島市城南区月形町八重洲四丁目1番2-34号 |
| address/streetlocation/nocontext/cjkvt | An address first line in CJKVT native script, without context. | 七尾市 北藤橋町 ロ部 天 私書箱123 八重洲四丁目1番2-34号 南二十二条西十六丁目2番地 厚別中央二条十丁目3番6号 大通西二十丁目2番18号 天草郡蔵札内村4の5の10 田原市[大字奈川字津島小字月南]998 甲180-1 神奈川県津島市城南区月形町八重洲四丁目1番2-34号 |
| address/streetlocation/nocontext/latin | An address first line in romanized text, without context. | 1-2 Echizen, Nanao-shi 1-2-34, Yaesu 4-Chome 2-3-4 Kotoni-1-jo 4-2 Kita-1-jo Nishi 5-3, Chikusa-ku, Nishio-shi 6-22, Aza Uwadaira Oaza Nagahama, Kazamaura, Ashigarashimo 8-1 Hongo-dori Higashi P.O. Box 595, Ginza Post Office |
| banking/account_number/context | An individual bank account number, with context. | バンク: 12345678 |
| banking/account_number/landmark | An individual bank account number landmark. | バンク |
| banking/account_number/nocontext | An individual bank account number, without context. | 12345678 |
| banking/context | Bank account numbers sufficient to identify an individual account, with context. | 銀行口座番号: 2556345 1234567 |
| banking/landmark | A bank account landmark. | 銀行口座番号 |
| banking/nocontext | Bank account numbers sufficient to identify an individual account, without context. | 2556345 1234567 |
| banking/routing_number/context | A routing number, usually used to identify a bank branch, with context. | 支店番号: 8926-789 |
| banking/routing_number/landmark | A routing number landmark. | 支店番号 |
| banking/routing_number/nocontext | A routing number, usually used to identify a bank branch, without context. | 8926-789 |
| banking/swiftcode/context | A SWIFT/BIC code, with context. | SWIFT Number: XXXXJP00BBB |
| banking/swiftcode/nocontext | A SWIFT/BIC code, without context. | XXXXJP00BBB |
| driving/context | A driving license number with context. This entity matches both the driving license number, and the personal number or driver number, if present. On the standard European driving license, these are fields 5 and 4d.1 | 運転免許: 901103864930 |
| driving/landmark | A driving license landmark. | 運転免許 |
| driving/nocontext | A driving license number, without context.2 | 901103864930 |
| health/id/context | A health number with context. This entity gives a lower score to matches with more ambiguous landmarks (such as 番号 and "Number"). However, if two matches occur together (for example "記号 21700023 番号 21"), the entity can match both with a higher score. | Business Reference Number: 21700 介護保険: 1234567890 保険者番号: 21700023 労災保険: 12-3-45-123456-123 被保険者証番号: 21 |
| health/id/landmark | A health number landmark. | Business Reference Number 介護保険 保険者番号 労災保険 被保険者証番号 |
| health/id/nocontext | A health number, such as a British NHS number, French Carte Vitale number or Japanese Health Insurance Card number, without context. | 12-3-45-123456-123 1234567890 21 21700 21700023 |
| id/context | A national identity number with context. For information about the supported ID numbers, see Supported National ID Numbers. | Jukikādo bangō: 12345678901 マイナンバー: 654327654322 在留番号: AB12345678YZ |
| id/employment_insurance/context | A Japanese unemployment insurance number, with context. | 雇用保険番号: 1234-123456-1 |
| id/employment_insurance/landmark | A Japanese unemployment insurance number landmark. | 雇用保険番号 |
| id/employment_insurance/nocontext | A Japanese unemployment insurance number, without context. | 1234-123456-1 |
| id/landmark | A national identity number landmark.3 | Jukikādo bangō マイナンバー 在留番号 |
| id/nocontext | A national identity number without context. For information about the supported ID numbers, see Supported National ID Numbers.4 | 12345678901 AB12345678YZ 654327654322 |
| id/pension/context | A Japanese pension number, with context. | Nenkin bangō: 123456123456 |
| id/pension/landmark | A Japanese pension number landmark. | Nenkin bangō |
| id/pension/nocontext | A Japanese pension number, without context. | 123456123456 |
| internet/email/context | An email address, with context. | 電子メール: jsmith@mailserver.com |
| internet/email/landmark | An email address landmark. | 電子メール |
| internet/email/nocontext | An email address, without context. | jsmith@mailserver.com |
| name | A full personal name. Romanized names can be in title case or upper case, and can be in the order given name surname or surname given name. CJKVT native script names must be surname given name. For Japanese, either form can include honorifics.5 | Hiroshi Tanaka-san Minamoto no Yasuhiro Mrs K. Inoue 山田直樹 |
| name/cjkvt | A full personal name in CJKVT native script. | 山田直樹様 |
| name/foreign | A full foreign person name in Katakana. | キーラ・セジウィック マイケル・J・フォックス君 |
| name/given_name/context | A given name with context. For CJKVT the context landmark should be in CJKVT native script. | 名前: Keiko 名前: 恵 |
| name/given_name/context/cjkvt | A given name in CJKVT native script, with context. | 名前: 恵 |
| name/given_name/context/cjkvt/foreign | A non-Japanese given name in Katakana, with context. | 下の名前ウィリアム |
| name/given_name/context/latin | A romanized given name, with a context landmark in CJKVT native script. | 名前: Keiko |
| name/given_name/landmark | A given name landmark. For CJKVT it will be in native script. | 名前 |
| name/given_name/nocontext | A given name, without context. | Keiko 恵 |
| name/given_name/nocontext/cjkvt | A given name in CJKVT native script, without context. | 恵 |
| name/given_name/nocontext/cjkvt/foreign | A non-Japanese given name in Katakana, without context. | ロバート |
| name/given_name/nocontext/cjkvt_spaced | A given name in CJKVT native script, separated by spaces, and without context. This entity is primarily to allow you to create patterns that match alternative name formats. | 和 彦 |
| name/given_name/nocontext/cjkvt_spaced/foreign | A non-Japanese given name in spaced Katakana, without context. | ジ ョ ー ジ |
| name/given_name/nocontext/latin | A romanized given name, without context. | Keiko |
| name/landmark | A full name landmark. | 名前 |
| name/latin | A romanized full personal name. | Hiroshi Tanaka-san Minamoto no Yasuhiro Mrs K. Inoue Takumi Muraoka QC |
| name/post_title/nocontext | A title that follows a name in romanized text, or CJKVT native script. | QC さん |
| name/post_title/nocontext/cjkvt | A title that follows a name in CJKVT native script. | さん |
| name/post_title/nocontext/latin | A title that follows a name in romanized text. | QC |
| name/pre_title/nocontext | A title that precedes a name in romanized text. | Mrs |
| name/surname/context | A surname with context. For CJKVT the landmark will be in native script | 名字: Inoue 名字: 山田 |
| name/surname/context/cjkvt | A surname in CJKVT native script, with context. | 名字: 山田 |
| name/surname/context/cjkvt/foreign | A non-Japanese surname in Katakana, with context. | 名字: ヤング |
| name/surname/context/latin | A romanized surname, with a context landmark in CJKVT native script. | 名字: Inoue |
| name/surname/landmark | A surname landmark. | 名字 |
| name/surname/nocontext | A surname without context. | Inoue 山田 |
| name/surname/nocontext/cjkvt | A surname in CJKVT native script, without context. | 山田 |
| name/surname/nocontext/cjkvt/foreign | A non-Japanese surname in Katakana, without context. | スミス |
| name/surname/nocontext/cjkvt_spaced | A given name in CJKVT native script, separated by spaces, and without context. This entity is primarily to allow you to create patterns that match alternative name formats. | 佐 藤 |
| name/surname/nocontext/cjkvt_spaced/foreign | A non-Japanese surname in spaced Katakana. | ジ ュ ニ ア |
| name/surname/nocontext/latin | A romanized surname, without context. | Inoue |
| name/title_surname | A title and surname. | Dr Inoue 佐藤様 |
| name/title_surname/cjkvt | A title and surname in CJKVT native script. | 佐藤様 |
| name/title_surname/cjkvt/foreign | A non-Japanese title and surname in Katakana. | ラ ド ウ ィ ッ グ 様 |
| name/title_surname/latin | A title and surname in romanized text. | Dr Inoue |
| nationality/adj/context | A nationality adjective with context. | 国籍: 日本語 |
| nationality/adj/landmark | A nationality adjective landmark. | 国籍 |
| nationality/adj/nocontext | A nationality adjective without context. | 日本語 |
| nationality/any/context | Any combination of nationality adjective and noun landmark and value. | 国籍: 日本語 |
| nationality/any/landmark | Any nationality adjective or landmark. | 国籍 |
| nationality/any/nocontext | Any nationality adjective or noun. | 日本語 |
| nationality/noun/context | A nationality noun with context. | 国: 日本 |
| nationality/noun/landmark | A nationality noun landmark. | 国 |
| nationality/noun/nocontext | A nationality noun without context. | 日本 |
| passport/context | A passport number with context. | パスポート: XS1234567 |
| passport/landmark | A passport landmark. For information about cases where the landmark and passport number do not match or have an ambiguous match, see Ambiguous Entities. | パスポート |
| passport/nocontext | A passport number without context. | XS1234567 |
| postcode/context | A postal code with context. | 〒: 123-4567 |
| postcode/landmark | A postal code landmark. | 〒 |
| postcode/nocontext | A postal code without context. | 123-4567 |
| telephone/context | A telephone number with context. For the telephone_cjkvt grammar, numbers can be ASCII or full-width numbers. This entity returns the telephone number in the normalized format +NNNNN, starting with the country code. For example +12012224344.6 | 電話機: +81 (3) 3224 5000 |
| telephone/landmark | A telephone number landmark. For the telephone_cjkvt grammar, landmarks are available only in CJKVT native script. | 電話機 |
| telephone/nocontext | A telephone number without context. For the telephone_cjkvt grammar, numbers can be ASCII or full-width numbers. This entity returns the telephone number in the normalized format +NNNNN, starting with the country code. For example +12042439955.7 | +81 (3) 3224 5000 |
| tin/context | A tax identification number with context. For more examples, see Example Tax Identification Numbers.8 | マイナンバー: 654327654322 法人番号: 8 7001 1000 5901 |
| tin/landmark | A tax identification number landmark. | マイナンバー 法人番号 |
| tin/nocontext | A tax identification number without context. For more examples, see Example Tax Identification Numbers.9 | 654327654322 8 7001 1000 5901 |
In the tables, ✔️ indicates that the item is always used, and ❌ that the item is never used. The ✓ symbol indicates that the item is only used for some entities (that is, for some but not all countries or languages). For details, refer to the relevant entity page.