validate
Command validate checks gff consistency and performs statistics.
usage
$ ingenannot -v 2 validate genes.gff -s -g genome.fasta
positional arguments:
Gff_genes |
Gene Annotation file in GFF/GTF file format |
optional arguments:
-h, –help |
show this help message and exit |
-s, –statistics |
Statistics of annotations |
-g GENOME, –genome GENOME |
Genome in Fasta file, more statistics metrics if provided |
-a, –addseqs |
Export file in GFF3 with ##sequence-region pragma |
–fixframe |
fix error in CDS frame and log warning |
–gaeval |
Perform required control and transform to run with gaeval |
-o OUTPUT, –output OUTPUT |
Output Annotation file in GFF file format |
inputs
Gff_genes in GFF/GTF format. Common errors in annotation files are missing Parent or missing ID on some features.
outputs
The first goal of validate is to ensure the structure of your gff/gtf file with required info/tags. If you specify the -s
option you get statitistics on your annotations and more with -g
.
## Statistics ##
nb_genes;12635
average_gene_length;1443.0950534230312
median_gene_length;1158
min_gene_length;143
max_gene_length;29360
nb_transcripts;12635
average_transcripts_per_gene;1.0
average_transcript_length;1443.0950534230312
median_transcript_length;1158
min_transcript_length;143
max_transcript_length;29360
nb_exons;35603
average_exons_per_transcript;2.8178076770874556
average_exon_length;445.75858775945846
median_exon_length;214
min_exon_length;3
max_exon_length;20704
nb_transcript_mono_exon;3094
nb_introns;22968
average_introns_per_transcript;1.8178076770874554
average_intron_length;102.88936781609195
median_intron_length;63
min_intron_length;40
max_intron_length;5775
nb_CDS;12635
nb_complete_CDS;12635
nb_partial_CDS;0
average_CDS_length;1256.0417095370003
median_CDS_length;999
min_CDS_length;102
max_CDS_length;29232
nb_transcripts_with_utr;1
average_five_prime_utr_length;90.0
median_five_prime_utr_length;90
min_five_prime_utr_length;90
max_five_prime_utr_length;90
average_three_prime_utr_length;164.0
median_three_prime_utr_length;164
min_three_prime_utr_length;164
max_three_prime_utr_length;164
################