r - How to skip blank rows when reading an XLSB file to save read time

Translated from: https://stackoverflow.com/questions/69490585 2021-10-08T04:41:09.370

Viewed 34 times

Goal: Stop reading the file once the Year column is blank, and load the rows read up to that point into a data frame.

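Purpose: My input file has 1 million rows in the companyName column, but once the Year column becomes blank, all the other data are blank as well. If I could read the xlsb only up to the point where the function hits a blank in the Year column, the file would read much faster.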

What I tried


library(readxlsb)
read_excel <- read_xlsb("C:/Tempdata/temp.xlsb", sheet = "tmp", trim_ws = TRUE, na = "", range = "tmp!A1:D10000")

I am using range to limit the read, but that is not the best approach, because the number of rows containing data may change in future files.
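One way to avoid hard-coding the last row might be to probe only the Year column first, find its first blank cell, and build the range string from that position. The following is a minimal sketch, assuming Year is in column B, the header is in row 1, and the data columns are A to D as in the question. `first_blank_row()` and `range_until_blank()` are hypothetical helper names, not part of readxlsb, and whether reading a single column is actually cheaper depends on how `read_xlsb()` handles `range` internally, which has not been verified here.

```r
# Hypothetical helpers (not part of readxlsb).

# Index of the first NA or empty-string element of x; length(x) + 1 if none.
first_blank_row <- function(x) {
  blank <- is.na(x) | trimws(as.character(x)) == ""
  if (any(blank)) which(blank)[1L] else length(x) + 1L
}

# Build a range such as "tmp!A1:D13": the header row plus every data row
# before the first blank Year value. `year` holds the Year values only,
# without the header cell.
range_until_blank <- function(year, sheet = "tmp",
                              first_col = "A", last_col = "D") {
  n_data <- first_blank_row(year) - 1L  # number of data rows to keep
  sprintf("%s!%s1:%s%d", sheet, first_col, last_col, n_data + 1L)
}

# Assumed usage (not run here; path and sheet name are from the question,
# and the 1000000 upper bound is an assumption based on the stated row count):
# library(readxlsb)
# year <- read_xlsb("C:/Tempdata/temp.xlsb", range = "tmp!B1:B1000000")$Year
# df   <- read_xlsb("C:/Tempdata/temp.xlsb", range = range_until_blank(year))
```

The range is computed from the file itself, so it should follow the data if the number of populated rows changes in a later file.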

Can anyone suggest a good approach?

I have an xlsb sheet (Excel binary format):

 companyName   Year   dataname     values
 company1      1980   sheetname1   5
 company1      1981   sheetname1   6
 company1      1982   sheetname1   7
 company1      1980   sheetname1   1
 company1      2016   sheetname1   8
 company2      1980   sheetname1   10
 company2      1981   sheetname1   20
 company2      1982   sheetname1   30
 company2      1980   sheetname1   14
 company2      2016   sheetname1   40
 company1      2000   sheetname2   xxx
 company1      2001   Name         yyy
 company1
 company1
 company1

I am using read_xlsb() from library(readxlsb).
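Another possible approach is to read the sheet in fixed-size row chunks and stop at the first chunk that contains a blank Year. The sketch below assumes a hypothetical callback `read_chunk(first_row, last_row)` that returns a data frame for the given data rows. `read_until_blank()` is an illustrative name, not a readxlsb function. Whether chunked reads save time depends on how `read_xlsb()` processes a `range` request, which has not been measured here.

```r
# Sketch: read data rows in chunks and stop at the first blank key value.
# `read_chunk(first_row, last_row)` is a hypothetical callback returning a
# data frame for data rows first_row..last_row (header row excluded).
read_until_blank <- function(read_chunk, key = "Year",
                             chunk_size = 10000L, max_rows = 1000000L) {
  pieces <- list()
  first <- 1L
  while (first <= max_rows) {
    last <- min(first + chunk_size - 1L, max_rows)
    chunk <- read_chunk(first, last)
    if (is.null(chunk) || nrow(chunk) == 0L) break
    blank <- is.na(chunk[[key]]) | trimws(as.character(chunk[[key]])) == ""
    if (any(blank)) {
      # Keep only the rows before the first blank key, then stop reading.
      keep <- which(blank)[1L] - 1L
      pieces[[length(pieces) + 1L]] <- chunk[seq_len(keep), , drop = FALSE]
      break
    }
    pieces[[length(pieces) + 1L]] <- chunk
    first <- last + 1L
  }
  do.call(rbind, pieces)  # NULL if nothing was read
}

# Assumed wiring for readxlsb (not run here; path, sheet and columns A:D are
# from the question, and data rows are assumed to start in sheet row 2):
# library(readxlsb)
# cols <- c("companyName", "Year", "dataname", "values")
# reader <- function(first, last) {
#   rng <- sprintf("tmp!A%d:D%d", first + 1L, last + 1L)
#   out <- read_xlsb("C:/Tempdata/temp.xlsb", range = rng, col_names = FALSE)
#   names(out) <- cols
#   out
# }
# df <- read_until_blank(reader)
```

Passing the reader as a callback keeps the stopping logic independent of the file format, so it can be checked against an in-memory data frame before pointing it at a large workbook.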
