python - 递归检索线程化评论列表

Question

我正在尝试编写一个递归函数，可以从 Reddit 提交中检索嵌套评论。我正在使用 Python + PRAW

def _get_comments(comments, ret = []):
    for comment in comments:
        if len(comment._replies) > 0:
            return _get_comments(tail(comments), ret + [{
                #"body": comment.body,
                "id": comment.id,
                "author": str(comment.author),
                "replies": map(lambda replies: _get_comments(replies, []), [comment._replies])
                }])
        else:
            return ret + [{
                    #"body": comment.body,
                    "id": comment.id,
                    "author": str(comment.author)
                }]
    return ret

def tail(list):
    return list[1:len(list)]

我得到以下输出，它不完整并且有嵌套数组：

pprint(_get_comments(s.comments))
[{'author': 'wheremydirigiblesat',
  'id': u'ctuzo4x',
  'replies': [[{'author': 'rhascal',
                'id': u'ctvd6jw',
                'replies': [[{'author': 'xeltius', 'id': u'ctvx1vq'}]]}]]},
 {'author': 'DemiDualism',
  'id': u'ctv54qs',
  'replies': [[{'author': 'rhascal',
                'id': u'ctv5pm1',
                'replies': [[{'author': 'blakeb43', 'id': u'ctvdb9c'}]]}]]},
 {'author': 'Final7C', 'id': u'ctvao9j'}]

该Submission对象有一个comments属性，它是一个Comment对象列表。每个Comment对象都有一个_replies属性，该属性是更多Comments 的列表。

我错过了什么？我尽力而为——递归很难。

score 2 · Accepted Answer

你得到它几乎是正确的。问题是，当递归很简单时，您正试图使递归变得复杂。您不需要tail()函数以及map()内部的函数，因为您已经在迭代注释。

我在示例中重命名了您的函数，因为它实际上将注释转换为字典。

让我们从简单的案例开始，想一想：“好吧，我想要一个能够将评论列表转换为字典列表的函数”。只是简单的功能：

def comments_to_dicts(comments):
    results = []  # create list for results
    for comment in comments:  # iterate over comments
        item = {
            "id": comment.id,
            "author": comment.author,
        }  # create dict from comment

        results.append(item)  # add converted item to results 
    return results  # return all converted comments

现在您希望 dict 还包括转换为 dicts 的回复列表。而且你已经有了能够进行这种转换的函数，所以让我们使用它并将结果放入item['replies']：

def comments_to_dicts(comments):
    results = []  # create list for results
    for comment in comments:  # iterate over comments
        item = {
            "id": comment.id,
            "author": comment.author,
        }  # create dict from comment

        if len(comment._replies) > 0:
            item["replies"] = comments_to_dicts(comment._replies)  # convert replies using the same function

        results.append(item)  # add converted item to results 
    return results  # return all converted comments

由于您修改了调用的相同函数，因此它将转换所有回复，无论它们有多深。希望更清楚递归是如何工作的。

python - 递归检索线程化评论列表

1 回答 1

Related

Reference