I am trying to use vcftools --diff to extract the variants that differ between two VCF files.
vcftools --vcf marked_IO002_tumor-pe.vcf --diff marked_IO002_normal-pe.vcf --diff-site --out t_v_n
I get this output:
VCFtools - 0.1.16
(C) Adam Auton and Anthony Marcketta 2009
Parameters as interpreted:
--vcf marked_IO002_tumor-pe.vcf
--out out.diff.sites
--diff marked_IO002_normal-pe.vcf
--diff-site
Comparing sites in VCF files...
Found 75584 sites common to both files.
Found 419593 sites only in main file.
Found 84102 sites only in second file.
Found 2908 non-matching overlapping sites.
After filtering, kept 498085 out of a possible 498085 Sites
Run Time = 6.00 seconds
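I would like to extract the 419593 sites that are found only in the main (first) file. Do you know whether there is a way to do this? Can these sites be written to a new VCF file? I would appreciate any help!
Thanks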
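
One possible approach, as a rough sketch rather than a confirmed recipe: --diff-site writes a tab-separated table named <out prefix>.diff.sites_in_files, and its rows can be filtered to build a positions list for a second vcftools run. The sketch assumes the table has the columns CHROM, POS1, POS2, IN_FILE, with IN_FILE set to 1 for sites found only in the main file; please check the header line of your own file first, since the layout may differ between vcftools versions. The helper name keep_main_only and the file names tumor_only.positions and tumor_only are made up for illustration.

```shell
# keep_main_only (hypothetical helper): print CHROM and POS for sites that
# the diff table marks as present only in the main (first) file.
# Assumes a tab-separated table with one header line and the column order
# CHROM, POS1, POS2, IN_FILE, ... (verify against your own file).
keep_main_only() {
  awk -F'\t' 'NR > 1 && $4 == "1" { print $1 "\t" $2 }' "$1"
}

# Usage sketch (needs vcftools on PATH; adjust the prefix to your --out value):
#   keep_main_only t_v_n.diff.sites_in_files > tumor_only.positions
#   vcftools --vcf marked_IO002_tumor-pe.vcf \
#            --positions tumor_only.positions \
#            --recode --recode-INFO-all --out tumor_only
```

If the assumptions hold, the second command should write the selected sites to tumor_only.recode.vcf. The number of lines in tumor_only.positions can be compared with the "sites only in main file" count from the log as a sanity check.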