GENSCAN 1.0 Date run: 8-Nov-116 Time: 09:47:43 Sequence gi568815576f:31939493_32139893 : 200401 bp : 45.00% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 5242 5328 87 0 0 98 89 230 0.999 24.74 1.02 Term + 16647 17300 654 2 0 86 49 1025 0.995 92.40 1.03 PlyA + 17799 17804 6 1.05 2.04 PlyA - 17958 17953 6 1.05 2.03 Term - 23287 23168 120 1 0 -2 48 125 0.141 -2.03 2.02 Intr - 28810 28758 53 2 2 127 81 27 0.169 4.53 2.01 Init - 44528 44387 142 2 1 86 80 60 0.553 5.30 2.00 Prom - 64300 64261 40 -2.46 3.06 PlyA - 65109 65104 6 1.05 3.05 Term - 85788 85636 153 0 0 60 55 127 0.843 4.52 3.04 Intr - 92185 92130 56 2 2 91 69 57 0.840 2.80 3.03 Intr - 95193 95040 154 0 1 65 49 110 0.372 4.45 3.02 Intr - 96248 96073 176 0 2 39 20 103 0.556 -1.74 3.01 Init - 96389 96290 100 2 1 70 78 31 0.620 0.73 3.00 Prom - 97310 97271 40 -6.46 4.00 Prom + 98151 98190 40 -4.66 4.01 Init + 99997 100350 354 0 0 90 43 552 0.953 48.04 4.02 Intr + 101185 101368 184 0 1 94 80 114 0.564 10.56 4.03 Intr + 103746 103924 179 1 2 -51 59 327 0.209 15.54 4.04 Intr + 110451 110522 72 2 0 94 131 30 0.796 7.50 4.05 Intr + 127443 127547 105 2 0 124 82 91 0.987 12.51 4.06 Intr + 128475 128534 60 2 0 117 98 29 0.981 5.73 4.07 Intr + 129004 129108 105 0 0 108 109 134 0.999 17.91 4.08 Intr + 142374 142479 106 2 1 137 109 30 0.993 9.79 4.09 Intr + 143582 143662 81 0 0 56 111 67 0.973 5.41 4.10 Intr + 144866 145167 302 0 2 46 80 253 0.893 16.75 4.11 Intr + 145408 145543 136 0 1 86 90 143 0.997 14.44 4.12 Intr + 146728 146835 108 2 0 105 81 135 0.961 14.76 4.13 Intr + 152120 152270 151 0 1 85 94 274 0.959 26.92 4.14 Intr + 159691 159859 169 1 1 44 50 217 0.985 13.45 4.15 Intr + 162530 162745 216 1 0 94 84 177 0.998 16.60 4.16 Intr + 165294 165399 106 2 1 102 30 213 0.923 16.69 4.17 Term + 170498 170721 224 0 2 60 47 282 0.952 18.38 4.18 PlyA + 171499 171504 6 1.05 5.00 Prom + 177608 177647 40 -4.86 5.01 Init + 179831 179895 65 1 2 54 110 23 0.681 1.72 5.02 Intr + 180059 180132 74 2 2 35 21 87 0.604 -4.25 5.03 Intr + 180206 180456 251 0 2 98 39 131 0.773 6.36 5.04 Intr + 181118 181312 195 1 0 99 77 202 0.904 19.81 5.05 Intr + 183586 183746 161 0 2 39 49 123 0.309 2.29 5.06 Intr + 190129 190312 184 1 1 -22 77 106 0.021 -1.71 5.07 Intr + 190365 190502 138 2 0 -17 63 169 0.739 4.56 5.08 Intr + 192825 193015 191 2 2 85 83 349 0.995 32.48 5.09 Intr + 193155 193189 35 0 2 9 96 -1 0.941 -9.33 5.10 Intr + 194536 194641 106 2 1 90 69 197 0.983 17.27 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 123275 123289 15 1 0 83 97 12 0.824 1.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815576f:31939493_32139893|GENSCAN_predicted_peptide_1|246_aa MGDREQLLQRARLAEQAERYDDMASAMKAVTELNEPLSNEDRNLLSVAYKNVVGARRSSW RVISSIEQKTMADGNEKKLEKVKAYREKIEKELETVCNDVLSLLDKFLIKNCNDFQYESK VFYLKMKGDYYRYLAEVASGEKKNSVVEASEAAYKEAFEISKEQMQPTHPIRLGLALNFS VFYYEIQNAPEQACLLAKQAFDDAIAELDTLNEDSYKDSTLIMQLLRDNLTLWTSDQQDE EAGEGN >gi568815576f:31939493_32139893|GENSCAN_predicted_CDS_1|741_bp atgggggaccgggagcagctgctgcagcgggcgcggctggccgagcaggcggagcgctac gacgacatggcctccgctatgaaggcggtgacagagctgaatgaacctctctccaatgaa gatcgaaatctcctctctgtggcctacaagaatgtggttggtgccaggcgatcttcctgg agggtcattagcagcattgagcagaaaaccatggctgatggaaacgaaaagaaattggag aaagttaaagcttaccgggagaagattgagaaggagctggagacagtttgcaatgatgtc ctgtctctgcttgacaagttcctgatcaagaactgcaatgatttccagtatgagagcaag gtgttttacctgaaaatgaagggtgattactaccgctacttagcagaggtcgcttctggg gagaagaaaaacagtgtggtcgaagcttctgaagctgcctacaaggaagcctttgaaatc agcaaagagcagatgcaacccacgcatcccatccggctgggcctggccctcaacttctcc gtgttctactatgagatccagaatgcacctgagcaagcctgcctcttagccaaacaagcc ttcgatgatgccatagctgagctggacacactaaacgaggattcctataaggactccacg ctgatcatgcagttgctgcgagacaacctcaccctctggacgagcgaccagcaggatgaa gaagcaggagaaggcaactga >gi568815576f:31939493_32139893|GENSCAN_predicted_peptide_2|104_aa MNKFQINNKEFVRAKEKSSQRHNCYPFGNVMASRKGPRPKARSGMTPGVGLEMRATKYAV PGNQLANVKKVIEQRLPGKEERGLIKEDLEWQAFSPCLEDNIAS >gi568815576f:31939493_32139893|GENSCAN_predicted_CDS_2|315_bp atgaacaagttccagataaacaataaggaatttgtcagggcaaaagagaagagctcccaa agacacaactgttacccctttgggaacgttatggccagcaggaaaggcccaagacccaag gcaagatctgggatgacgccaggtgtaggcctggaaatgagagctacaaagtatgctgtc cctgggaaccagctggcgaatgtaaagaaggtgattgagcaaaggctacctggcaaggaa gaaagaggtcttatcaaggaagacctggagtggcaggctttcagcccctgtcttgaagac aacatcgcatcatga >gi568815576f:31939493_32139893|GENSCAN_predicted_peptide_3|212_aa MHVSFGLEPEPRLLELPHPLDLKSQTGFREEPWEPFTGNCLNTVLNLGQQEECVGWKSEA CVQEPVPLALGKSFPSLKLSPHLSNLAEFDQQFDTWTQLYESERHFNESLIITNAGKSQQ CNVGIIVCQTPWFSENSEPRMRAGCQRFRFHPEPESDTAGDVCEESVILGGCRNPKPDVC LTQVLKELALKDPHQQLPVLKKYKQFSHIRHV >gi568815576f:31939493_32139893|GENSCAN_predicted_CDS_3|639_bp atgcatgtcagctttggattggaaccagagcccaggttactggaactgcctcatcctcta gacctcaaatcccagactggcttcagagaggagccatgggagcccttcacagggaactgc ctcaacactgtcctcaatctggggcaacaggaagaatgtgttgggtggaagtcggaggcc tgtgttcaagagccggttcctctggccttaggcaagtcatttccctctctgaaactgtct ccgcacctgtcaaatttggcagaatttgaccagcagtttgacacttggacccagctgtac gaaagtgagaggcatttcaatgagtcattaattataacaaatgcaggcaaaagtcagcag tgtaatgtaggcatcatcgtctgccaaacaccttggttttctgaaaactcagagcccagg atgagagcaggctgtcaacgcttcaggttccatccagaacctgagtctgacacagctgga gatgtgtgtgaagagtctgttattcttggaggttgtagaaatccaaaacctgatgtatgc ctgactcaggtattaaaggaactggccctcaaggaccctcaccagcagctacctgtgctc aagaagtacaagcagttctcccatatcaggcatgtatga >gi568815576f:31939493_32139893|GENSCAN_predicted_peptide_4|885_aa MCRVCTKTVKKAARVIIEKYYTRLGNDFHTNKRVCKEIAIIPSKKLRNKIAGYVTHLMKW IQRGPVRGISIKLQEEERERRDNYVPEVSALDQEIIEVDPDTKEMLKLLDFGSLSNLQPH LSCGLSVWLAFFFLGCSANKGPFSDQILVLIGTNDGKRESNRILPVSSDPAAQASCMGAG ASGPGEREGRNAATMDSSTWSPKTTAVTRPVETHELIRNAADISIIVIYFVVVMAVGLWA MFSTNRGTVGGFFLAGRSMVWWPIGASLFASNIGSGHFVGLAGTGAASGIAIGGFEWNAL VLVVVLGWLFVPIYIKAGVVTMPEYLRKRFGGQRIQVYLSLLSLLLYIFTKISADIFSGA IFINLALGLNLYLAIFLLLAITALYTITGGLAAVIYTDTLQTVIMLVGSLILTGFVSQAS MERSCYDELKVSECSFLYRCFPAFHEVGGYDAFMEKYMKAIPTIVSDGNTTFQEKCYTPR ADSFHIFRDPLTGDLPWPGFIFGMSILTLWYWCTDQVIVQRCLSAKNMSHVKGGCILCGY LKLMPMFIMVMPGMISRILYTEKIACVVPSECEKYCGTKVGCTNIAYPTLVVELMPNGLR GLMLSVMLASLMSSLTSIFNSASTLFTMDIYAKVRKRASEKELMIAGRLFILVLIGISIA WVPIVQSAQSGQLFDYIQSITSYLGPPIAAVFLLAIFWKRVNEPGAFWGLILGLLIGISR MITEFAYGTGSCMEPSNCPTIICGVHYLYFAIILFAISFITIVVISLLTKPIPDVHLYRL CWSLRNSKEERIDLDAEEENIQEGPKETIEIETQVPEKKKGIFRRAYDLFCGLEQHGAPK MTEEEEKAMKMKMTDTSEKPLWRTVLNVNGIILVTVAVFCHAYFA >gi568815576f:31939493_32139893|GENSCAN_predicted_CDS_4|2658_bp atgtgccgcgtttgcaccaaaaccgtgaagaaggcggcccgggtcatcatagaaaagtac tacacacgcctgggcaacgacttccacacgaacaagcgcgtgtgcaaggagatcgccatt atccccagcaagaagctccgcaacaagatagcaggctatgtcacgcatctgatgaaatgg attcagagaggcccagtaagaggtatctccatcaagctgcaggaggaggagagagaaagg agagacaattatgttcctgaggtctcagccttggatcaggagataattgaagtagatcct gacactaaggaaatgctgaagcttttggacttcggcagtctgtccaacctgcagcctcat ctttcctgcggcctgagtgtctggctggcttttttctttctcgggtgttctgctaacaag ggcccctttagtgaccagatcctggttttgattggcaccaatgacggcaagagagagtcc aacaggatcctaccagtgagcagtgacccagcagctcaggccagctgcatgggagcagga gctagcggccctggcgagagggaaggacgcaacgctgccaccatggacagtagcacctgg agccccaagaccaccgcggtcacccggcctgttgagacccacgagctcattcgcaatgca gccgatatctccatcatcgttatctacttcgtggtagtgatggccgtcggactgtgggct atgttttccaccaatcgtgggactgttggaggcttcttcctggcaggccgaagtatggtg tggtggccgattggagcctccctctttgctagtaacattggaagtggccactttgtgggg ctggccgggactggggcagcttcaggcatcgccattggaggctttgaatggaatgccctg gttttggtggttgtgctgggctggctgtttgtccccatctatattaaggctggggtggtg acaatgccagagtacctgaggaagcggtttggaggccagcggatccaggtctacctttcc cttctgtccctgctgctctacattttcaccaagatctcggcagacatcttctcgggggcc atattcatcaatctggccttaggcctgaatctgtatttagccatctttctcttattggca atcactgccctttacacaattacagggggcctggcggcggtgatttacacggacaccttg cagacggtgatcatgctggtggggtctttaatcctgactgggtttgtttctcaggcatct atggaaagaagctgctatgacgagttgaaggtttcagaatgttcatttctgtaccgatgt tttccagcttttcacgaagtgggaggctatgacgccttcatggaaaagtacatgaaagcc attccaaccatagtgtctgatggcaacaccacctttcaggaaaaatgctacactccaagg gccgactccttccacatcttccgagatcccctcacgggagacctcccatggcctgggttc atctttgggatgtccatccttaccttgtggtactggtgcacagatcaggtcattgtgcag cgctgcctctcagccaagaatatgtctcacgtgaagggtggctgcatcctgtgtgggtat ctaaagctgatgcccatgttcatcatggtgatgccaggaatgatcagccgcattctgtac acagaaaaaattgcctgtgtcgtcccttcagaatgtgagaaatattgcggtaccaaggtt ggctgtaccaacatcgcctatccaaccttagtggtggagctcatgcccaatggactgcga ggcctgatgctatcagtcatgctggcctccctcatgagctccctgacctccatcttcaac agcgccagcaccctcttcaccatggacatctacgccaaggtccgcaagagagcatctgag aaagagctcatgattgccggaaggttgtttatcctggtgctgattggcatcagcatcgcc tgggtgcccattgtgcagtcagcacaaagtgggcaactcttcgattacatccagtccatc accagttacttgggaccacccattgcggctgtcttcctgcttgctattttctggaagaga gtcaatgagccaggagccttttggggactgatcctaggacttctgattgggatttcacgt atgattactgagtttgcttatggaaccgggagctgcatggagcccagcaactgtcccacg attatctgtggggtgcactacttgtactttgccattatcctcttcgccatttctttcatc accatcgtggtcatctccctcctcaccaaacccattccggatgtgcatctctaccgtctg tgttggagcctgcgcaacagcaaagaggagcgtattgacctggatgcggaagaggagaac atccaagaaggccctaaggagaccattgaaatagaaacacaagttcctgagaagaaaaaa ggaatcttcaggagagcctatgacctattttgtgggctagagcagcacggtgcacccaag atgactgaggaagaggagaaagccatgaagatgaagatgacggacacctctgagaagcct ttgtggaggacagtgttgaacgtcaatggcatcatcctggtgaccgtggctgtcttttgc catgcatattttgcctga >gi568815576f:31939493_32139893|GENSCAN_predicted_peptide_5|467_aa MKKNKKMIRRRKTRLEKRKKNRPQKEEGALREDSSLPWLHVYCCHRGPRLATAATTSTDA NTTTTAAALNEPAHPTGLPPPGHRCSTALALAAASRPPTSLALCSLHRCHHANHSEASRG VPGSSLQHVQDTAAPDGAHNLPPAIGSAAQVNAPKLLPDVGSAATESAPNQPPAMGSDSR DSAPKQTLALGSDARWGVVWVPYWGVTASGGGLVGAAIHGYTVYGMERIEVATLGCSARG WGQVAGASRTGCFVRSEEDSMEEFTESEGLDIIILHGSEQHAWTFSICKGTMQQRLTVES AVHIKTRQCLRCANGDYGIAMADVRLLAESARHDGNTIGHKSTEVLALTWMLPLKLVVAN AVAALSEIAESHPSNNLLHLNPQFINKLLTALIECTEWGQIFILDCLTNYTPKDDREAQR EACLLPTSPEPGEIFELKAELNSDKKEKEEEAVKKVIASMTVGKDVS >gi568815576f:31939493_32139893|GENSCAN_predicted_CDS_5|1401_bp atgaaaaagaacaagaaaatgataaggagaagaaaaaccagactagaaaaaaggaaaaag aacaggccccagaaggaggagggtgccctcagagaagactcctccttgccctggctgcac gtctactgctgccacagaggccccagacttgctactgctgccaccactagcaccgatgcc aatacaaccaccactgctgccgccctcaatgaaccggcccaccctacagggctcccacca cctggccaccgctgcagcactgccctggccctggcagcagccagccgccctcctacctct ctggcactctgcagtctccatcgctgccaccacgccaaccacagcgaggcaagccgtggt gtccctggctccagcctccagcatgtgcaggatactgcagcacccgatggtgcccacaac ctgccccctgccattggcagtgcagcccaggttaatgcccccaaactgctccctgatgtt ggcagtgcagccacggaaagtgcccccaaccagcctcccgccatgggaagtgactcccgg gatagcgcccccaagcagaccctagccttgggcagtgacgcccggtggggggtagtttgg gtgccatattggggcgtcactgccagtggcggtggtctggttggggccgctattcatggc tacaccgtctatggcatggagcggattgaggttgctaccctgggctgcagtgcccgtggc tgggggcaagttgcaggtgccagcaggactgggtgctttgtgaggagtgaagaagacagt atggaggagttcacggagtccgagggactggacattatcatattgcatggctcagagcag cacgcatggacattcagcatctgtaaggggaccatgcagcagaggctgacagtggagagt gcagtgcacataaagacacgccagtgtttgcggtgtgcaaatggcgactatgggatagca atggccgatgtcaggctgctggcagagagcgcaagacatgatggaaacaccattggccac aagtctacagaggtcttggcgctgacctggatgcttcctttgaagctggtggtggccaac gcagtggcagcgctctcagaaatcgccgagtctcacccgagcaacaacctgctccatctg aacccacagttcatcaacaagctgctgacagccctgattgagtgcaccgagtggggccag atcttcatcctggactgcctcaccaactatacgcccaaggacgaccgcgaggcccagagg gaggcctgcctgcttcccacctctccagagccaggggagatctttgagctgaaggcagag ctcaatagtgacaagaaggagaaggaggaggaggcagtgaagaaagtgattgcatcaatg accgtgggcaaagatgtcagn