1

我想在 pandas 中使用 URL 读取 json 文件,但它给我带来了一些错误,我认为这是我给出的相关路径,请查看代码和 url。我在堆栈溢出时没有找到它,所以问了它。这个可能是重复的,但请帮助我。

data_df = pd.read_json('https://github.com/jackiekazil/data-wrangling/blob/master/data/chp3/data-text.json')

json_url 如下:

https://github.com/jackiekazil/data-wrangling/blob/master/data/chp3/data-text.json

错误信息:

ValueError: Expected object or value
4

2 回答 2

1

你想要这个网址:https://raw.githubusercontent.com/jackiekazil/data-wrangling/master/data/chp3/data-text.json

raw.githubusercontent.com提供在 Github 存储库中找到的未处理文件。您在问题中发布的链接不会为您提供您感兴趣的原始 JSON 文件;相反,它会加载网页本身。它也在this answer.

使用上面的 URL,您的代码对我来说效果很好。

于 2020-07-02T19:37:36.297 回答
1
  • 您将无法使用 pandas 直接读取文件
  • 你需要下载它
  • 顺便说一句,下面的功能将适用于从正确的 URL 下载任何文件。
    • 该函数需要 URL 和要保存到的目录。
import request
from pathlib import Path
import pandas as pd

def create_dir_save_file(dir_path: Path, url: str):
    """
    Check if the path exists and create it if it does not.
    Check if the file exists and download it if it does not.
    """
    if not dir_path.parents[0].exists():  # if directory doesn't exist
        dir_path.parents[0].mkdir(parents=True)  # create directory
        print(f'Directory Created: {dir_path.parents[0]}')
    else:
        print('Directory Exists')
        
    if not dir_path.exists():  # if file doesn't exist
        r = requests.get(url, allow_redirects=True)  # get file
        open(dir_path, 'wb').write(r.content)  # write file
        print(f'File Created: {dir_path.name}')
    else:
        print('File Exists')
        

data_dir = Path.cwd()  # current working dir or use Path('e:/some_path') to specify a location
url = 'https://raw.githubusercontent.com/jackiekazil/data-wrangling/master/data/chp3/data-text.json'
file_name = url.split('/')[-1]
data_path = data_dir / file_name  # local path to data once downloaded
create_dir_save_file(data_path, url)  # call function to download file

df = pd.json_normalize(data_path)  create dataframe

# display(df.head())

                           Indicator PUBLISH STATES  Year             WHO region World Bank income group               Country         Sex  Display Value  Numeric Low High Comments
0   Life expectancy at birth (years)      Published  1990                 Europe             High-income               Andorra  Both sexes             77     77.0                  
1   Life expectancy at birth (years)      Published  2000                 Europe             High-income               Andorra  Both sexes             80     80.0                  
2  Life expectancy at age 60 (years)      Published  2012                 Europe             High-income               Andorra      Female             28     28.0                  
3  Life expectancy at age 60 (years)      Published  2000                 Europe             High-income               Andorra  Both sexes             23     23.0                  
4   Life expectancy at birth (years)      Published  2012  Eastern Mediterranean             High-income  United Arab Emirates      Female             78     78.0                  
于 2020-07-03T02:09:07.200 回答