Word breaker for a new language
hello all,
i understood people go custom word breaker during situations
1.when there no word breaker specific language.
2.when existing word breaker of specific language not addressing project needs(special treatment of special characters).
unforunately in situation messages stored in our tables not follow rules of natural languages , data stored in multiple languages in same column.
logically call "industrial language" similar english,german,spanish , japanese languages.
below questions.
1.it's possible develop custom word breaker only for specific language has lcid .is so?
2. assume if want develop custom word breaker/stemmer above mentioned "industrial language" n't have lcid. possible?
3.if yes, how lcid new languages created/registered?
i hope i conveyed problem properly.
thanks & regards
samba
hi samba,
known, each language has lcid. there languages word breakers registered sql server default, include english, german, spanish , japanese. please pay attention result query below:
select * sys.fulltext_languages
in addition, can manually load licensed third-party word breakers additional languages, such danish, polish, , turkish.
it seems cannot create custom language mixture many languages. column, can specify 1 type of language. notice thread xml data type column, think suggestion on thread appropriate in scenario: xml word breaker , regular text word breaker.
stephanie lv
technet community support
SQL Server > SQL Server Search
Comments
Post a Comment