Sunday, February 26, 2012

Full-text search with German umlaut

Hello everyone!
I’m using the full text search with the German language and it works fine
in
mapping the German umlaut "ü" into "ue", but unfortunately the other
direction doesn’t work (the search for words including "ue" does not match
words including "ü").
I have tried to solve it by editing the thesaurus file but I didn’t find a
possibility to specify only word sequences. It works only for words.
This works for "Mueller"
<expansion>
<sub>Mueller</sub>
<sub>Müller</sub>
</expansion>
But this does’nt work:
<expansion>
<sub>ue</sub>
<sub>ü</sub>
</expansion>
How can I get the mapping of "ue" to "ü" within the full text search?
Thanks
RobIt works in SQL Server 2005 if you select the accent insensitive catalog
option.
Hilary Cotter
Director of Text Mining and Database Strategy
RelevantNOISE.Com - Dedicated to mining blogs for business intelligence.
This posting is my own and doesn't necessarily represent RelevantNoise's
positions, strategies or opinions.
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html
Looking for a FAQ on Indexing Services/SQL FTS
http://www.indexserverfaq.com
"RobRoma" <RobRoma@.discussions.microsoft.com> wrote in message
news:41D47CF7-3BCB-45D7-A68F-1ED1DA4C629C@.microsoft.com...
> Hello everyone!
> I'm using the full text search with the German language and it works fine
> in
> mapping the German umlaut "" into "ue", but unfortunately the other
> direction doesn't work (the search for words including "ue" does not match
> words including "").
> I have tried to solve it by editing the thesaurus file but I didn't find a
> possibility to specify only word sequences. It works only for words.
> This works for "Mueller"
> <expansion>
> <sub>Mueller</sub>
> <sub>Mller</sub>
> </expansion>
> But this does'nt work:
> <expansion>
> <sub>ue</sub>
> <sub></sub>
> </expansion>
> How can I get the mapping of "ue" to "" within the full text search?
> Thanks
> Rob
>|||Hi Hilary!
Thanks for your answer but unfortunately there is no change of the search
behaviour: Searching for "mueller" does not match "müller".
I tried those configurations (all possible combinations) using SQL Server
Express 9.00.2047:
I used an empty thesaurus file.
Database collations:
German_PhoneBook_CS_AI
German_PhoneBook_CS_AS
Latin1_General_CS_AI
Latin1_General_CS_AS
Fulltext Catalog ACCENT_SENSITIVITY:
ALTER FULLTEXT CATALOG TestFTCatalog REBUILD WITH ACCENT_SENSITIVITY = ON /
OFF
Did I misundertand your advice?
Rob
"Hilary Cotter" wrote:

> It works in SQL Server 2005 if you select the accent insensitive catalog
> option.
> --
> Hilary Cotter
> Director of Text Mining and Database Strategy
> RelevantNOISE.Com - Dedicated to mining blogs for business intelligence.
> This posting is my own and doesn't necessarily represent RelevantNoise's
> positions, strategies or opinions.
> Looking for a SQL Server replication book?
> http://www.nwsu.com/0974973602.html
> Looking for a FAQ on Indexing Services/SQL FTS
> http://www.indexserverfaq.com

No comments:

Post a Comment