lucene - Luke Lucene QueryParser 区分大小写

Question

在 Luke 中，如果我输入搜索表达式docfile:Tomatoes.jpg*，则解析后的查询是docfile:Tomatoes.jpg*. 当搜索表达式为docfile:Tomatoes.jpg,（无星号 *）时，解析后的查询docfile:tomatoes.jpg带有小写的 't'。

为什么？
我怎样才能改变这个？

顺便说一句，使用 org.apache.lucene.analysis.standard.StandardAnalyzer。

score 4 · Accepted Answer

StandardAnalyzer使用LowerCaseFilter这意味着它将您的查询和数据小写。这在 Javadocs http://lucene.apache.org/java/3_0_1/api/core/org/apache/lucene/analysis/standard/StandardAnalyzer.html中有描述。

如果我没记错的话WhitespaceAnalyzer不小写，但验证它是否适合您的需求http://lucene.apache.org/java/3_0_1/api/core/org/apache/lucene/analysis/WhitespaceAnalyzer.html。

score 1 · Accepted Answer

对于 Lucene 5.3.0，使用 SimpleAnalyzer 解决了这个问题。

例子：

Analyzer analyzer = new org.apache.lucene.analysis.core.SimpleAnalyzer();

最后，使用相同的分析器来构建索引和搜索。

lucene - Luke Lucene QueryParser 区分大小写

2 回答 2

Related

Reference