Flying Sphinx: Discussion - cyrillic symbols in search (2018-10-17T21:35:23Z)
<div><p>Indexed characters are managed through the <code>charset_table</code> and (prior to Sphinx 2.2) <code>charset_type</code>. Here is Sphinx's documentation for v2.1.8 (which I believe you are using):<br>
<a href="http://sphinxsearch.com/docs/archives/2.1.8/conf-charset-type.html">http://sphinxsearch.com/docs/archives/2.1.8/conf-charset-type.html</a><br>
<a href="http://sphinxsearch.com/docs/archives/2.1.8/conf-charset-table.html">http://sphinxsearch.com/docs/archives/2.1.8/conf-charset-table.html</a></p>
<p>In Thinking Sphinx, these settings are managed per environment in <code>config/thinking_sphinx.yml</code>, much like <code>config/database.yml</code>. The default settings should work fine with Cyrillic characters, but I think the issue here is that Thinking Sphinx now expects Sphinx v2.2.x by default (which uses UTF-8 automatically and has deprecated <code>charset_type</code>), so the charset type is presumably not being set for your 2.1.8 setup. To set UTF-8 explicitly, add the following to your <code>config/thinking_sphinx.yml</code> file:</p>
<pre>
<code>production:
  charset_type: "utf-8"</code>
</pre>
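<p>If some Cyrillic characters still aren't matched after that, <code>charset_table</code> can be set in the same file. The following is only a sketch, assuming the defaults turn out not to cover what you need: the ranges are the Cyrillic example from the Sphinx documentation linked above, and you would want to adjust them to the characters in your own data.</p>
<pre>
<code>production:
  charset_type: "utf-8"
  # Illustrative only: digits, Latin letters and underscore, then Cyrillic
  # capitals (U+410..U+42F) folded to lowercase (U+430..U+44F), plus Ё/ё.
  charset_table: "0..9, A..Z-&gt;a..z, _, a..z, U+410..U+42F-&gt;U+430..U+44F, U+430..U+44F, U+401-&gt;U+451, U+451"</code>
</pre>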
<p>I would recommend using Sphinx 2.2.11 instead, though - and this is also managed in <code>config/thinking_sphinx.yml</code>:</p>
<pre>
<code>production:
  version: 2.2.11</code>
</pre>
<p>Once you've made either of these changes, you'll want to run <code>rake ts:rebuild</code> to ensure the data is re-indexed accordingly.</p></div>Pat Allan