0

所以我有一个定制的分析器,它从本体中添加了额外的术语。此外,我想在它们被编入索引之前对其进行词干提取。以下是index metadata弹性搜索头插件中获取的。

{
    "state": "open",
    "settings": {
        "index": {
            "refresh_interval": "1000s",
            "number_of_shards": "5",
            "creation_date": "1471931611750",
            "analysis": {
                "filter": {
                    "owlfilter": {
                        "type": "owl",
                        "indexName": "ontoowl",
                        "expansionType": "RDFSLABEL",
                        "owlFile": "/home/tannys/elasticsearch-2.3.0/ontologyWorkTrial/myownowl.owl"
                    }
                },
                "analyzer": {
                    "owlanalyzer": {
                        "filter": ["owlfilter","porter_stem"],
                        "type": "custom",
                        "tokenizer": "standard"
                    }
                }
            },
            "number_of_replicas": "1",
            "uuid": "d8Ub8A0eSm65geMK_bpdvw",
            "version": {"created": "2030099"}
        }
    },
    "mappings": {
        "mytype": {
            "properties": {
                "nameortitle": {
                    "search_analyzer": "standard",
                    "analyzer": "owlanalyzer",
                    "store": true,
                    "type": "string"
                },
                "description": {
                    "search_analyzer": "standard",
                    "analyzer": "owlanalyzer",
                    "store": true,
                    "type": "string"
                }
            },
            "aliases": [ ]
        }
    }
}

具有讽刺意味的是,在我使用porter_stem过滤器之前,结果更好。所以我不太确定,出了什么问题。我想查看被索引的术语。我怎样才能看到分析器的执行情况,比如说卢克对 Lucene 做了什么?任何指导。

4

1 回答 1

1

您可以在此处使用术语向量 API 。这将为您提供文档中某个字段的术语,或者您也可以以相同的方式使用多术语 API来查看多个文档中的术语。

于 2016-08-24T09:11:12.290 回答