Finally I want to present two fixes I did this last week just to show what kind of work needs to be done in order to fix problems in Nepomuk:
1. Bug 281136 – Nepomuk queries containing unicode characters fail
The problem presented itself as follows: whenever the user would execute a query containing extended characters such as german umlauts, french accents, or for example any russian character the query would not return any results.
After some testing I realized that the queries simply failed when being delivered to Virtuoso because of Nepomuk’s automatic search excerpt extraction. It turned out that Virtuoso’s bif:search_excerpt method cannot handle wide characters which is exactly what it got. So I turned to the Virtuoso team for help and got a workaround which essentially means that we convert the wide characters to UTF8. However, this results in stripped search excerpts so the story does not end yet – I am waiting for a better solution from the Virtuoso guys.
http://trueg.wordpress.com/2011/10/05/the-hunt-for-nepomuk-bugs-continues/
tl;dr Поиск всяких умлаутов и русских букв то того. Как вы все с этим там жили?