GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:18:27 Sequence gi568815585f:31852580_32052978 : 200399 bp : 39.59% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 11184 11223 40 -3.65 1.01 Init + 21388 21439 52 0 1 66 70 35 0.096 0.87 1.02 Intr + 25638 25773 136 1 1 73 72 77 0.268 3.31 1.03 Intr + 35711 35829 119 2 2 59 34 153 0.073 6.19 1.04 Intr + 46033 46325 293 2 2 4 53 213 0.261 5.33 1.05 Intr + 46369 46549 181 0 1 45 -11 117 0.009 -3.98 1.06 Term + 60113 60246 134 0 2 129 48 99 0.493 7.27 1.07 PlyA + 62036 62041 6 1.05 2.05 PlyA - 62367 62362 6 1.05 2.04 Term - 78507 78389 119 1 2 73 41 80 0.800 -0.48 2.03 Intr - 80966 80909 58 2 1 61 86 85 0.848 3.14 2.02 Intr - 84036 83858 179 1 2 3 49 196 0.537 5.92 2.01 Init - 96549 96468 82 0 1 64 70 42 0.080 1.18 2.00 Prom - 97251 97212 40 -5.95 3.00 Prom + 99540 99579 40 -9.75 3.01 Init + 100001 100358 358 1 1 77 -12 330 0.818 19.21 3.02 Intr + 100433 100792 360 0 0 40 12 603 0.009 42.17 3.03 Term + 104859 105190 332 1 2 59 39 170 0.014 3.03 3.04 PlyA + 105732 105737 6 1.05 4.00 Prom + 117011 117050 40 -4.65 4.01 Init + 126111 126243 133 2 1 59 68 114 0.670 6.85 4.02 Intr + 133651 133789 139 2 1 63 82 86 0.725 4.10 4.03 Intr + 141902 142178 277 2 1 83 33 168 0.783 7.40 4.04 Intr + 144035 144129 95 1 2 28 115 112 0.909 5.74 4.05 Intr + 145582 145719 138 1 0 33 50 145 0.451 3.86 4.06 Term + 146005 146131 127 1 1 45 51 102 0.598 -1.03 4.07 PlyA + 148539 148544 6 1.05 5.07 PlyA - 148871 148866 6 1.05 5.06 Term - 165832 165688 145 0 1 114 37 72 0.382 1.10 5.05 Intr - 166832 166610 223 0 1 44 93 106 0.226 2.96 5.04 Intr - 174674 174624 51 0 0 122 98 6 0.581 2.96 5.03 Intr - 178627 178400 228 2 0 68 15 211 0.687 8.72 5.02 Intr - 179003 178872 132 0 0 64 101 135 0.964 12.10 5.01 Intr - 182421 182323 99 1 0 73 87 52 0.450 2.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 100433 100824 392 0 2 40 47 596 0.984 44.86 S.002 Sngl + 104900 105190 291 1 0 69 39 193 0.881 7.90 S.003 Sngl - 115608 115393 216 0 0 96 40 153 0.969 6.22 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:31852580_32052978|GENSCAN_predicted_peptide_1|304_aa MRKVSAKKIHPEFDEDKELGLINLVPSFHLSCYPFHKGCPFVSHDQDGNTSNESFTVSGM SPWAEDRGPVLPLVETQQDYREVAQYGHFNCQCRFTGTSTGPDGVESSRKNVQGRELPWS WGRDSRWHPFCDKAILELLGDRHILTVPKGAACGEEVEIQNPKQSLCTEADGRTHCRSFT LECAAKGLEVEAPAKRGSRKLLDAAGNSHSPHTGSACARAGTLRPTSLHPILLLPATWMV ERAQLPMFLSQWTVRKPLTNFVGHGNINPGVGSQMTQVPRFGALSSVYVDTVTSFASLNP FDLC >gi568815585f:31852580_32052978|GENSCAN_predicted_CDS_1|915_bp atgaggaaagtctcagctaaaaaaatccacccagagtttgatgaggataaagagctaggc ctcattaaccttgtgccatcatttcatctgtcctgttacccgtttcataaaggatgtcct tttgtgtcccatgatcaagatggaaacacctctaatgagagcttcacagttagtggaatg agcccctgggctgaagacagaggtccagtcttgccacttgttgaaactcaacaagactac agagaagtggcacaatacggccacttcaactgccaatgccgcttcactggcactagcact ggaccagatggtgtggagagcagcaggaagaatgtccagggtagagagctgccctggagt tggggccgagatagcagatggcaccctttctgtgacaaggctattctagagttgctgggt gaccgacacatcctgacagtgcccaaaggggctgcttgtggagaggaggttgaaattcaa aatccaaagcagagtctctgcactgaggcagatggaagaactcactgtagaagcttcacg ctggagtgtgcagccaagggcttagaggtcgaggcacctgcgaaacgtggaagccggaag ctccttgatgctgctgggaactcacattctcctcacactggaagtgcttgtgccagggca gggaccctaagacctacctctctccatcccatcctacttctgccagccacatggatggtg gagagagcccagctgcccatgttcctctctcagtggactgtgagaaaaccactcacaaat tttgtgggccatgggaacataaatccaggtgtgggtagccagatgactcaagtccccaga tttggtgctctctcctctgtttatgtggacacagtgactagttttgcctctctcaatcct tttgacctctgctga >gi568815585f:31852580_32052978|GENSCAN_predicted_peptide_2|145_aa MQKGPNPNTGIKKTSWKRGCQVDPQSQGAGLPHTEGLDKTKNLEILPMLSDAADVLQALL SEPSRRKCSKNTFSSFAVESHVALIVENRSSCYVTNAYGPLLYAKEVISVQQLNCMQLIP KHCPPPVVSQSVPEFTLPPPPGILC >gi568815585f:31852580_32052978|GENSCAN_predicted_CDS_2|438_bp atgcagaaaggacccaacccaaacactggcatcaagaaaacttcctggaagaggggatgt caagttgacccccaaagccaaggtgccggcttaccacacaccgaagggctggacaaaacg aaaaacctggagattcttcccatgctttctgatgccgctgacgttcttcaggctcttcta tccgaaccctccagaaggaagtgcagcaagaataccttcagtagctttgcagttgaatct catgttgcactgattgttgagaataggtcttcatgctacgtaaccaacgcatatggacca cttttatatgctaaagaagtcatctctgttcagcaactcaattgcatgcagttaataccc aaacactgcccacccccagttgtttctcagtctgtgcctgagttcacgttgccacctcct cctggaatcctttgctaa >gi568815585f:31852580_32052978|GENSCAN_predicted_peptide_3|349_aa MATNFLVGEKIWFHKFKYGDAERRFYEQMNGPVAGASLQEASMILHDIARARENIPKSLA GSLGPGASSGPSGDHSELVVRIASLEVDNQRDLAERAGEELARPLGHSPADPAHVSHAPT TMRQHSCGITNEERLQQYAEKKAKKPALVAKSSILLDFKPWDDETDTAHLEACVRFIQPD GLVRGASKLVPVGYGIWKLQIQRVVEDKVGTDLLEEEITKFEECMQSVDIAAFNKIWPEC IPTCTNRCVGGREMGKNDFLQEATENENSKSAQIGVVEGEKENSMPEITCLAEREREEEE EGGGGGGEEGGRGKGKGKGEREKEGRRKKRKRKKRKEGRRGRRRKKQQK >gi568815585f:31852580_32052978|GENSCAN_predicted_CDS_3|1050_bp atggctacaaactttctagtaggtgagaagatctggttccacaagttcaaatatggcgat gcagaaaggagattctacgaacagatgaacgggcctgtggccggcgcctccctccaggag gccagcatgatcctccatgatattgccagagccagagagaacatcccgaaatccctggcc ggaagcttaggcccaggggcgtctagcggccccagcggagaccacagcgagctcgtcgtc cggatcgccagtctggaagtggacaaccagagagacctggctgaacgtgctggagaagag cttgcccggccactgggccacagccccgcagacccagcacatgtctcccatgcgccaacg acaatgaggcaacacagctgcggcattaccaatgaggagcggctgcagcagtacgcggag aagaaggccaagaagcccgcgctggtggccaagtcctccatcctgctggacttcaagcct tgggacgatgagacggacacggcccacctggaggcctgtgtgcgcttcatccagccggac gggctggtgcggggggcctccaagctggtgcccgtgggctacggtatctggaagctgcag attcagcgtgtggtggaggacaaggtggggacagacttgctggaggaggagatcaccaag tttgaagagtgtatgcagagtgtcgacatcgcagctttcaacaagatctggcctgagtgc attcccacatgcacaaacaggtgtgtgggaggaagggaaatgggcaagaatgatttcctt caagaggctactgagaatgaaaactcaaaatctgcacaaataggggtggtggaaggagag aaagaaaactccatgccagaaataacatgcttagcagagagagagagagaagaagaagaa gaaggaggaggaggaggaggagaagaggggggaagggggaaggggaaggggaagggagag agagagaaggaaggaaggagaaagaaaagaaagagaaagaaaagaaaagaaggaagaaga ggaaggaggagaaaaaaacagcagaaataa >gi568815585f:31852580_32052978|GENSCAN_predicted_peptide_4|302_aa MGKELKGHPNQATGNLDSNHGSASKCHYDVGQIADPVVQFSSSGEKIIKARWDAYSKRGH LNKQALNLENAYESVLGYVPKTDSEVTCPARVLIGPFYRALIGPFYKPLASHRPLIGAFY NPSYRVLIGAFYKPLASHRVLIGAFNRALIVAFYKPLASHRALIGAFYNPSYRVLIGAFY NPLCPKLLNVGLRDAHTGRADGKWIGMGDLVTVGRTVPENGVWNYVKKWNLAALPRIREK GGETEGGKPDKNADKDNESRKSHKAKMKVLIQMGFYLEATGKNPFPDSFRVLEESISLQL WD >gi568815585f:31852580_32052978|GENSCAN_predicted_CDS_4|909_bp atgggtaaagaattgaaggggcatccaaaccaagcaacaggaaatctggattctaatcat ggttctgcctcaaaatgtcattatgacgtagggcaaatcgcagaccctgtggtccagttt tcctcatctggagaaaagataattaaggccaggtgggatgcatacagtaaaagagggcac ctgaacaaacaagcattgaatttggagaatgcctacgagtctgtgctgggctacgttccc aagacagactcagaagttacctgccctgccagagtgctgattggtccattttacagagca ctgattggtccattttacaaacctctagctagccacagaccgctgattggtgcattttac aatcctagctacagagtgctgattggtgcattttacaaacctctagctagccacagagtg ctgattggtgcatttaacagagcactgattgttgcgttttacaaacctctagctagccac agagcactgattggtgcgttttacaatcctagctacagagtgctgattggcgcattttac aatcctctttgtccaaaactgctaaatgttggattgagggatgctcacactggaagagca gatggaaagtggatagggatgggagacctggtgactgtggggcgcactgttccagagaat ggtgtttggaactatgttaagaaatggaatctggcagcattgcctaggattagagaaaag ggtggggagactgaaggagggaagccagataagaatgcagataaagataatgagagcagg aagtcccacaaggccaaaatgaaggttttgatccagatgggcttttacctggaggctacg gggaagaatccatttccagactctttcagggtgttggaggaatccatatccttacagttg tgggactga >gi568815585f:31852580_32052978|GENSCAN_predicted_peptide_5|292_aa XEETESDYLTVRSHASAGARDLKPIWPCHFYILGLASYLLDADLPAALGGGGRAAFISGL SSPETPAAGRPPARLQPARLPVSGPGVLWLLLPLVSVPEINTTAARCGKQSCSLASGCSD VLKNPIRQPRCSKETQQEDACLGFFFPAFHTHGEDLCLTQLCVPYISVAHNTWYELRNLQ QYASKGAPPTSDCHWKKEKDSGCPEKKWQGPGVIMKRRRQSTSREPRTETQYDTKSSDKT AAGTWDDPASYMAPDRKSHFSAIKSTKPPLSPLLTLFLPPAQDCHSYGSLAS >gi568815585f:31852580_32052978|GENSCAN_predicted_CDS_5|879_bp natgaggaaactgagagtgattacttaactgtccgaagccatgcaagtgcaggagcaaga gatctgaagcccatttggccctgccatttttacattctaggacttgcttcttatttgctg gacgctgatctccctgcagccctcggaggcggcggccgcgctgctttcatttcggggctg tcatcaccagaaacgccggctgccggccgcccaccagctcggctccagccagcaaggctt cccgtgagcggcccgggtgtgctctggctgctgcttccccttgtgtctgtccctgaaata aacacaactgcagcacgctgcggtaagcaaagctgcagccttgcatctggttgcagcgac gttttgaaaaaccccatccgtcagcctcgttgcagtaaagagactcaacaagaggacgct tgtctgggctttttcttccctgccttccacacgcatggtgaagacctgtgtcttactcag ctttgtgttccttacatctctgtagcacacaatacttggtatgaacttaggaacctccaa caatatgcatccaagggggctcccccaacttctgactgtcactggaagaaagaaaaagac tctgggtgtccagaaaaaaagtggcagggacctggagtgattatgaaaaggagaagacag agtacaagcagagaacccaggacagaaactcaatatgatacaaagtcaagcgacaaaaca gcagcagggacctgggacgaccctgcctcctacatggcaccggataggaagtcccatttt tctgccatcaaatccacaaagccacctttgtcccctctcctgaccctgtttctgcctcct gcccaagattgtcactcctatgggtctctggcatcttaa