0

我有 160 个文本文件,它们都具有相同的数据列,但没有标题。没有分隔符,所以我必须使用 read.fwf,我可以使用以下代码一次读取一个:

myfile= "//PATH/AllFiles/10914_1Mile_TextFile.txt"

read_fwf(myfile, fwf_cols(NUM=5,YNUM=20,STREETNUM=8,STREETPRED=2,STREETNAME=20,STREETTYPE=8,STREETPOSTD=2,STREETADD2=40,CITY=29,STATE=2,ZIP=5,ZIP4=4,EFFDATE=8,TYPE=1,
                                          IUM=6,FILL=1,VAMOUNT=9,FILLS=1,OVAMOUNT=9))

但是这样做 160 次并不理想,所以我试图一次将它们全部放入以绑定它们。到目前为止,这是我的代码:

CompleteDataCollection <- do.call( "rbind", lapply(myfile, function(fn) 
  data.frame(Filename=fn, read.fwf(fn,
                                   widths = c(5,20,8,2,20,8,2,40,29,2,5,4,8,1,6,1, 9,1,9),

                                   header = FALSE,
                                   col.names = c("NUM","YNUM","STREETNUM","STREETPRED","STREETNAME","STREETTYPE","STREETPOSTD","STREETADD2","CITY",
                                                  "STATE","ZIP","ZIP4","EFFDATE","TYPE", "PREMIUM","FILL", "VAMOUNT","FILLS","OVAMOUNT"))
  )))

我不断收到错误,并且 fn 没有设置任何内容,应该是吗?任何建议表示赞赏并欢迎提前谢谢您。

4

1 回答 1

0

fn 应该是一个包含目录中所有文件名的列表。当您的工作目录设置为文本文件所在的目录时,您可以保存所有文件名的列表:

fn <- dir()

然后在 fn 上执行 lapply。

CompleteDataCollection <- lapply(fn, function(x) read.fwf(fn,
                               widths = c(5,20,8,2,20,8,2,40,29,2,5,4,8,1,6,1, 9,1,9),

                               header = FALSE,
                               col.names = c("NUM","YNUM","STREETNUM","STREETPRED","STREETNAME","STREETTYPE","STREETPOSTD","STREETADD2","CITY",
                                              "STATE","ZIP","ZIP4","EFFDATE","TYPE", "PREMIUM","FILL", "VAMOUNT","FILLS","OVAMOUNT"))
于 2020-04-29T16:04:59.853 回答