0

我有六个列表,其中三个对应于给定的句子,另外三个列表对应于前面列表的标签。我想根据句子的顺序将这些列表压缩到 OrderedDict 中。

from collections import OrderedDict

sentence = 'Im fucking excited to learn your damn business'

words = ['excited', 'learn', 'business']
stopwords = ['Im', 'to', 'about', 'your']
swearwords = ['fucking', 'damn']

# Labels
words_labels = ['friendly', 'interested', 'interested']
stopwords_labels = ['stopword'] * len(stopwords)
swearwords_labels = ['sweaword'] * len(swearwords)

d = OrderedDict(zip(keys, values))

# Expected output
{
    'Im': 'stopword',
    'fucking': 'swearword',
    'excited': 'friendly',
    'to': 'stopword',
    'learn': 'interested',
    'your': 'stopword'
    'damn': 'swearword',
    'business': 'interested'
}
4

1 回答 1

1

首先列出重复内容,语法是

stopwords_labels = ['stopword'] * len(stopwords)

然后建立所有内容和所有标签的列表,然后你就很好了

keys = [*words, *stopwords, *swearwords]
values = [*words_labels, *stopwords_labels, *swearwords_labels]
mappings = dict(zip(keys, values))
d = OrderedDict((k, mappings[k]) for k in sentence.split())

您也可以使用该语法,将

keys = words + stopwords + swearwords
values = words_labels + stopwords_labels + swearwords_labels
于 2021-01-24T18:18:51.757 回答