Word breaker for a new language


hello all,

i understood people go custom word breaker during situations

1.when there no word breaker specific language.

2.when existing word breaker of specific language not addressing project needs(special treatment of special characters).

unforunately in situation messages stored in our tables not follow rules of natural languages , data stored in multiple languages in same column.

logically call "industrial language" similar english,german,spanish , japanese languages.

below questions.

1.it's possible develop custom word breaker only for specific language has lcid .is so?

2. assume if want develop custom word breaker/stemmer above mentioned "industrial language"  n't have lcid. possible?

3.if yes, how lcid new languages created/registered?

i hope i  conveyed problem properly.

thanks & regards

samba

hi samba,

known, each language has lcid. there languages word breakers registered sql server default, include english, german, spanish , japanese. please pay attention result query below:

select * sys.fulltext_languages

in addition, can manually load licensed third-party word breakers additional languages, such danish, polish, , turkish.

it seems cannot create custom language mixture many languages. column, can specify 1 type of language. notice thread xml data type column, think suggestion on thread appropriate in scenario: xml word breaker , regular text word breaker.


stephanie lv

technet community support



SQL Server  >  SQL Server Search



Comments

Popular posts from this blog

Conditional formatting a graph vertical axis in SSRS 2012 charts

Register with Power BI failed

SQL server replication error Cannot find the dbo or user defined function........