From left to right, the fields of the full_table.txt file correspond to the following: Column 1: Sequential annotation number. Column 2: Sequential number. Column 3: Sequence of TAG. Column 4: Number of ocurrences of TAGs in the genome. Column 5: Frequency of annotation for identical sequence TAGs. Column 6: Classification of this TAG based on the result of the annotation. Column 7: (TAG-Class) Classification of this TAG based on the result of the annotation and the frequency of ocurrence (this is our proposal of annotation) class platinum (C_Pt): TAG is unique in the genome and it matches a transcript. class gold (C_Au): TAG is not unique in the genome and it matches a transcript, but other occurrences of this TAG always match intergenic regions. class silver (C_Ag): TAG is unique in the genome but it matches two or more overlapping transcripts located at the same genomic region. class aluminum (C_Al): TAG is not unique in the genome, it matches a transcript, but other occurrences of this TAG also match another transcript located at a different genomic region. class copper (C_Cu): TAG is unique in the genome and it matches an intergenic region. class iron (C_Fe): TAG is not unique in the genome, it matches an intergenic region, but other occurrences of this TAG always match an intergenic region. class silicon (C_Si): TAG is not unique in the genome, it matches an intergenic region, but other occurrences of this TAG match a transcript. Column 8: Score assigned to each TAG based on TAG-class, transcript position and proximity to internal polyA sequences. Column 9: Score assigned to each TAG based on odds-ratios. Column 10: (TAG-Confidence) Classification of this TAG based on the confidence that it can be observed by experiment. Hi: TAG has a high probability of being correctly assigned. Lo: This TAG has another alternative for the same tag sequence that is at least five times more probable. Un: This TAG has not a single instance in the genome that can be assigned to the class High confidence. Column 11: Chromosome where the TAG maps. Column 12: 5' position of the TAG (in reference to the forward strand). Column 13: 3' position of the TAG (in reference to the forward strand). Column 14: Forward (+) or reverse (-) strand. Column 15: Chromosome name (in string format: i.e. Chr IV) where the ORF is found. Column 16: 5' position (relative to the forward strand) of the ORF that this TAG matches. (the first position of the CDS region) Column 17: 3' position (relative to the forward strand) of the ORF that this TAG matches. (the last position of the CDS region) Column 18: ORF length. Column 19: 5' position (relative to the forward strand) of the ORF that this TAG matches, corrected by UTR-5' (the first position of the annotated transcript). Column 20: 3' position (relative to the forward strand) of the ORF that this TAG matches, corrected by UTR-3' (the last position of the annotated transcript). Column 21: Length of the ORF with the UTRs corrections. Column 22: 'Y' if the tag is contained within an ORF (fully contained); 'N' otherwise. Column 23: 'Y' if it tag is contained within an intron; 'N' otherwise. Column 24: Genomic region type where the tag maps (The position used to assign the TAG to a given type is given by the first and the last base of the TAG. i.e. must be fully contained, otherwise is intergenic). The types defined and used are: 1: ORF, known protein ('Verified') 2: ORF, hypothetical protein ('Dubious' or 'Uncharacterized') 3: rRNA (ribosomal RNA) 4: snoRNA (small nucleolar RNA) 5: snRNA (nucleosome) 6: smRNA 7: tRNA 8: Intergenic, but in the opposite strand there is an annotated transcript (ie. tag matches a region of the mature transcript) 9: Intergenic, but in the opposite strand there is an annotated transcript within an intron (ie. tag matches the intronic region of the transcript in the opposite strand) 10: Intergenic, and in the opposite strand there is not an annotated transcript Column 25: Position from the 3'end within the transcript that this TAG matches (relative to other TAGs). Starting from 1. Column 26: Distance (in nts) from the 3'end of the transcript that this TAG matches (UTR3' end position minus TAG first position). Column 27: 'Y' or 'N' if TAG is located within a transcript and upstream (less than 500 nts) from a poly-adenylation region. Column 28: Distance (in nts) from the closest internal poly(A) site that this TAG matches (internal poly(A) initial position minus TAG first position). Column 29: Length of the internal poly(A) sequence. Column 30: Standard Gene name. Column 31: Feature type. Definition of the genomic region where a tag maps. The definitions used are: ORFv (verified ORF) ORFdu (dubious ORF) int,opORF (intergenic, but in the opposite strand there is an ORF) intron rRNA snoRNA tRNA snRNA ncRNA. Column 32: Feature name. Name of the genomic region where the tag maps. The definitions used are: Verified Uncharacterized Dubious --- (for intergenic regions) n.a. (for any RNA type) Column 33: Systematic name. In the case that the tag maps to an intergenic region, but in the opposite strand an ORF is found, then the systematic name of that ORF is given here. Column 34: Mapping of the tag inside the transcript (UTR5, ORF, UTR3). In the cases where the tag maps into an intergenic region, 'int' is assigned in this field. Column 35: 'P' if the ORF has a predicted 3'UTR; 'N' otherwise. Column 36: Description of ORF annotation.