MCScanX gene id 问题

该gff 的gene行存在Name,导致脚本没有考虑到这种情况,提取的gene id为Name,gene id 过长,导致MCScanX无法运行。
解决方法:将第九列去掉Name既可。

awk -F '[\t;]' '{if($3~/gene/) {print $1"\t"$2"\t"$3"\t"$4"\t"$5"\t"$6"\t"$7"\t"$8"\t"$9} else{print $0}}'  Dod.chr.gff3 
chr07   .   gene    28275981    28288196    .   -   .   ID=evm.TU.scaffold_30.17;
chr07   .   mRNA    28275981    28288196    .   -   .   ID=evm.model.scaffold_30.17;Parent=evm.TU.scaffold_30.17;Name=EVM%20prediction%20scaffold_30.17
chr07   .   five_prime_UTR  28288064    28288196    .   -   .   ID=evm.model.scaffold_30.17.utr5p1;Parent=evm.model.scaffold_30.17
chr07   .   five_prime_UTR  28287389    28287395    .   -   .   ID=evm.model.scaffold_30.17.utr5p2;Parent=evm.model.scaffold_30.17
chr07   .   exon    28288064    28288196    .   -   .   ID=evm.model.scaffold_30.17.exon1;Parent=evm.model.scaffold_30.17
chr07   .   exon    28287253    28287395    .   -   .   ID=evm.model.scaffold_30.17.exon2;Parent=evm.model.scaffold_30.17
chr07   .   CDS 28287253    28287388    .   -   0   ID=cds.evm.model.scaffold_30.17;Parent=evm.model.scaffold_30.17
chr07   .   exon    28281681    28282520    .   -   .   ID=evm.model.scaffold_30.17.exon3;Parent=evm.model.scaffold_30.17

chr07   .   gene    28275981    28288196    .   -   .   ID=evm.TU.scaffold_30.17;Name=EVM%20prediction%20scaffold_30.17
chr07   .   mRNA    28275981    28288196    .   -   .   ID=evm.model.scaffold_30.17;Parent=evm.TU.scaffold_30.17;Name=EVM%20prediction%20scaffold_30.17
chr07   .   five_prime_UTR  28288064    28288196    .   -   .   ID=evm.model.scaffold_30.17.utr5p1;Parent=evm.model.scaffold_30.17
chr07   .   five_prime_UTR  28287389    28287395    .   -   .   ID=evm.model.scaffold_30.17.utr5p2;Parent=evm.model.scaffold_30.17
chr07   .   exon    28288064    28288196    .   -   .   ID=evm.model.scaffold_30.17.exon1;Parent=evm.model.scaffold_30.17
chr07   .   exon    28287253    28287395    .   -   .   ID=evm.model.scaffold_30.17.exon2;Parent=evm.model.scaffold_30.17
chr07   .   CDS 28287253    28287388    .   -   0   ID=cds.evm.model.scaffold_30.17;Parent=evm.model.scaffold_30.17
chr07   .   exon    28281681    28282520    .   -   .   ID=evm.model.scaffold_30.17.exo
LIU_sa07    EVM20prediction20scaffold_30.17 28275981    28288196
LIU_sa07    EVM20prediction20scaffold_30.1  27505683    27508995
LIU_sa07    EVM20prediction20scaffold_30.33 28962370    28967589
LIU_sa07    EVM20prediction20scaffold_30.74 29946477    29988627
LIU_sa07    EVM20prediction20scaffold_30.62 29734064    29735281

你可能感兴趣的:(MCScanX gene id 问题)