GENSCAN 1.0 Date run: 4-Nov-116 Time: 01:58:51

Sequence gi568815578f:34458869_34659895 : 201027 bp : 44.34% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3225   3353  129  0  0   79   95    96 0.884   9.21
 1.02 Intr +  11180  11252   73  2  1   87   92    95 0.991   9.21
 1.03 Term +  13683  13778   96  2  0   48   53   120 0.959   2.47
 1.04 PlyA +  14237  14242    6                               1.05

 2.02 PlyA -  14466  14461    6                               1.05
 2.01 Sngl -  17606  17142  465  2  0   84   48   397 0.759  29.45
 2.00 Prom -  19134  19095   40                              -9.75

 3.00 Prom +  20622  20661   40                              -5.66
 3.01 Init +  20802  20921  120  2  0   22  115    61 0.787   2.39
 3.02 Intr +  21731  21864  134  1  2   87   97    60 0.994   6.24
 3.03 Intr +  22198  22338  141  1  0   52   94   106 0.876   7.07
 3.04 Intr +  30398  30518  121  2  1   42   80    54 0.200   0.50
 3.05 Intr +  30954  31058  105  2  0   63   72    44 0.564   0.71
 3.06 Intr +  33633  33729   97  2  1   32   71   112 0.839   3.48
 3.07 Intr +  45463  45535   73  2  1   73  119    34 0.901   3.46
 3.08 Intr +  48827  48922   96  2  0   58   -6   139 0.012   0.62
 3.09 Intr +  53281  53332   52  1  1   45   76    30 0.008  -3.59
 3.10 Intr +  57516  57593   78  2  0   57   89    92 0.025   5.95
 3.11 Intr +  67400  67475   76  1  1  103  107   113 0.995  13.79
 3.12 Intr +  75760  75927  168  2  0  117  110   268 0.995  31.82
 3.13 Intr +  79875  80113  239  1  2   64   40    57 0.105  -4.17
 3.14 Intr +  86059  86208  150  0  0   76   86   111 0.747  10.06
 3.15 Intr +  99473  99550   78  1  0   91   85    77 0.828   7.45
 3.16 Intr +  99827 100040  214  1  1   32   85    71 0.650  -0.51
 3.17 Intr + 100340 100395   56  0  2   86  117    95 0.886  10.90
 3.18 Intr + 100479 100585  107  2  2  112   90   255 0.622  27.11
 3.19 Term + 100868 101030  163  2  1   67   52   360 0.623  27.71
 3.20 PlyA + 101451 101456    6                               1.05

 4.14 PlyA - 101700 101695    6                               1.05
 4.13 Term - 102111 101998  114  0  0  126   47   194 0.999  17.67
 4.12 Intr - 105272 105246   27  2  0  100   75    24 0.407   0.61
 4.11 Intr - 107597 107487  111  2  0    9   54   121 0.108   1.38
 4.10 Intr - 116378 116236  143  0  2  108  110   138 0.997  18.07
 4.09 Intr - 122804 122680  125  1  2  113   76   180 0.990  19.53
 4.08 Intr - 126712 126569  144  0  0   84  106   113 0.964  12.10
 4.07 Intr - 129739 129585  155  1  2   96   92    46 0.458   4.67
 4.06 Intr - 157271 157174   98  0  2  103  115   -11 0.045   2.83
 4.05 Intr - 175847 175720  128  1  2   98   86   100 0.783  11.12
 4.04 Intr - 179117 179008  110  2  2  108   81   199 0.999  20.28
 4.03 Intr - 185358 185296   63  0  0   90  103    37 0.920   4.31
 4.02 Intr - 186466 186407   60  1  0  112  115   -26 0.696   1.43
 4.01 Init - 198568 198488   81  1  0   73   88    70 0.929   6.47

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  48827  48926  100  2  1   58   32   146 0.889   3.60
S.002 Init -  56825  56765   61  2  1   86   94    45 0.842   4.93
S.003 Init +  64716  64763   48  2  0   93   85     2 0.892   0.04

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:34458869_34659895|GENSCAN_predicted_peptide_1|99_aa
XQLNEKPLPEGWEMRFTVDGIPYFVDHNRRTTTYIDPRTGKSALDNGPQIAYVRDFKAKV
QYFRFWCQPDVEVIIFKHQWYCSNYSFKKYGIECKGKRG

>gi568815578f:34458869_34659895|GENSCAN_predicted_CDS_1|300_bp
nntcaattaaatgaaaagcccttacctgaaggttgggaaatgagattcacagtggatgga
attccatattttgtggaccacaatagaagaactaccacctatatagatccccgcacagga
aaatctgccctagacaatggacctcagatagcctatgttcgggacttcaaagcaaaggtt
cagtatttccggttctggtgtcagccagatgtggaagtgattattttcaagcaccagtgg
tattgtagcaactacagcttcaagaagtacggaattgaatgcaaagggaaaagaggttga

>gi568815578f:34458869_34659895|GENSCAN_predicted_peptide_2|154_aa
MAAAGGARLLRAASAVLGDPAGRWLHHAGSRAGASGLLRSRGPGRSAEASRPLSVSAGAR
SSSEDKVTVHFINCDGETLTTKGKVGDSLLDVVVENNPDIDGFGACEGTLACLTCHLIFE
DHIYEKLDAITDEENHMLDLAYGLTDHSWAAKSV

>gi568815578f:34458869_34659895|GENSCAN_predicted_CDS_2|465_bp
atggctgccgctgggggcgcccggctgctgcgcgccgcttctgcggtcctcggcgacccg
gccggccggtggctgcaccacgccgggtcccgcgctggagccagcggcctgctgaggagc
cggggaccgggccggagcgcggaggcgagccggccgctgagcgtgtcggcaggggcgcgg
agcagctcagaagataaagtgacagtccactttataaactgtgatggtgaaacattaaca
accaaaggaaaagttggtgattctctgctagacgttgtggttgaaaataatccagatatt
gatggctttggtgcatgtgagggaactctggcttgtttaacctgtcatctcatctttgaa
gatcacatatatgagaagttagatgcaatcactgatgaggagaatcacatgctcgatctg
gcatatggactaacagatcacagttgggctgccaaatctgtttga

>gi568815578f:34458869_34659895|GENSCAN_predicted_peptide_3|755_aa
MYCLFEYAGKDNYCLQINPASYINPDHLKYFRFIGRFIAMALFHGKFIDTGFSLPFYKRI
LNKPVGLKDLESIDPEFYNSLIWVKENNIEECDLEMYFSVDKEILGEIKSHDLKPNGGNI
LVTEENKEEYIRMVAEWRLSRGVEEQTQAFFEGFNEILPQQYLQYFDAKELEVLLCGMQE
IDLNDWQRHAIYRHYARTSKQIMWFWQFVKEIDNEKRMRLLQFVTGTCRLPVGGFADLMG
SNGPQKFCIEKVGKENWLPRSHTCFNRLDLPPYKSYEQLKEKLLFAIEETEGFGQDAETV
VMEACVIEIWPATKPLRRRRKAQDSLSVRYAGLPDRSEMAEVEETLKRLQSQKGVQGIIV
VNTEGIPIKSTMDNPTTTQYASLMHSFILKARSTVRDIDPQNDLTFLRIRSKKNEIMVAP
AHLRGKNQALGLALNSHKVTWLRSCHFAHAQQGAADQGSSEYGVKPRSLSPLHLWIYPGL
GKPFNSLRLRVLRVTRGFTELLYRPHSQQKCELAWDHLVSQSRLLLVLLLYVAIGPENMP
WGVSAETKMKPREGAGPLAPAVSHGAQNRVSPDTDSECCDLTSPGELPPAAAAAVLSASP
GALEREARSPRSPQTADTSPRPRAPACAPSRARAMPSDRPFKQRRSFADRCKEVQQIRDQ
HPSKIPVIIERYKGEKQLPVLDKTKFLVPDHVNMSELVKIIRRRLQLNPTQAFFLLVNQH
SMVSVSTPIADIYEQEKDEDGFLYMVYASQETFGF

>gi568815578f:34458869_34659895|GENSCAN_predicted_CDS_3|2268_bp
atgtattgcctgtttgaatatgcagggaaggataactactgcttgcagataaaccccgct
tcttacatcaatccagatcacctgaaatattttcgttttattggcagatttattgccatg
gctctgttccatgggaaattcatagacacgggtttttctttaccattctataagcgtatc
ttgaacaaaccagttggactcaaggatttagaatctattgatccagaattttacaattct
ctcatctgggttaaggaaaacaatattgaggaatgtgatttggaaatgtacttctccgtt
gacaaagaaattctaggtgaaattaagagtcatgatctgaaacctaatggtggcaatatt
cttgtaacagaagaaaataaagaggaatacatcagaatggtagctgagtggaggttgtct
cgaggtgttgaagaacagacacaagctttctttgaaggctttaatgaaattcttccccag
caatatttgcaatactttgatgcaaaggaattagaggtccttttatgtggaatgcaagag
attgatttgaatgactggcaaagacatgccatctaccgtcattatgcaaggaccagcaaa
caaatcatgtggttttggcagtttgttaaagaaattgataatgagaagagaatgagactt
ctgcagtttgttactggaacctgccgattgccagtaggaggatttgctgatctcatgggg
agcaatggaccacagaaattctgcattgaaaaagttgggaaagaaaattggctacccaga
agtcatacctgttttaatcgcctggacctgccaccatacaagagctatgagcaactgaag
gaaaagctgttgtttgccatagaagaaacagaaggatttggacaagatgctgaaacagtt
gttatggaggcctgcgttattgagatctggcctgccacgaaacctttgcgcaggcgcaga
aaggcacaggactcgctaagtgttcgctacgcggggctaccggatcggtcggaaatggca
gaggtggaggagacactgaagcgactgcagagccagaagggagtgcagggaatcatcgtc
gtgaacacagaaggcattcccatcaagagcaccatggacaaccccaccaccacccagtat
gccagcctcatgcacagcttcatcctgaaggcacggagcaccgtgcgtgacatcgacccc
cagaacgatctcaccttccttcgaattcgctccaagaaaaatgaaattatggttgcacca
gcccatcttcgtgggaagaaccaagctttaggcttggctctgaacagccacaaagtgact
tggctgaggtcctgccatttcgctcatgctcagcagggggcagcagaccagggcagttca
gagtatggggtcaaacccaggtccctgtccccactccacctgtggatttaccctggattg
ggcaagccctttaactctctgaggctccgtgttctcagagtgactcgaggattcactgag
ctcctttatagaccacattcccagcagaagtgtgagctcgcttgggaccacctggtgtca
cagtccaggctgctgctggtcctgctcctctacgtggcaatcggcccagagaatatgcct
tggggggtctcagcagaaactaagatgaagccccgtgaaggcgccgggcctttggctccg
gctgtttcccacggcgcccagaaccgcgtcagtcctgacacagactcggaatgttgtgac
ctgacgtcaccgggcgagttacctcccgcagccgcagccgccgtgctcagcgcgagcccc
ggagcccttgagcgcgaggcgcggagcccccggagcccccaaaccgcagacacatccccg
cgccccagagccccggcctgcgcgcccagccgggcccgcgcgatgccctcagaccggcct
ttcaagcagcggcggagcttcgccgaccgctgtaaggaggtacagcagatccgcgaccag
caccccagcaaaatcccggtgatcatcgagcgctacaagggtgagaagcagctgcccgtc
ctggacaagaccaagtttttggtcccggaccatgtcaacatgagcgagttggtcaagatc
atccggcgccgcctgcagctgaaccccacgcaggccttcttcctgctggtgaaccagcac
agcatggtgagtgtgtccacgcccatcgcggacatctacgagcaggagaaagacgaggac
ggcttcctctatatggtctacgcctcccaggaaaccttcggcttctga

>gi568815578f:34458869_34659895|GENSCAN_predicted_peptide_4|452_aa
MKKDAVHEREFPVIRNVQDLKPLAGDETPLIIYLFHFLIDYAELVFMITDALTAIALYFA
IQDFNKVVFKKQKLLLELDQYAPDVAELIRTPMEMRYIPLKVALFYLLNPYTILSCVAKS
TCAINNTLIAFFILTTIKGKALKGEYKGSAFLSAIFLALATYQSLYPLTLFVPGLLYLLQ
RQYIPVKMKSKAFWIFSWEYAMMYVGSLVVIICLSFFLLSSWDFIPAVYGFILSVPDLTP
NIGLFWYFFAEMFEHFSLFFVCVFQINVFFYTIPLAIKLKEHPIFFMFIQIAVIAIFKSY
PTVGDVALYMAFFPVWNHLYRFLRNIFVLTCIIIVCSLLFPVLWHLWIYAGSANSNFFYA
ITLTFNVGQLPHELFIMVPEGGGPGALMAAIERNVDRAVPGGSAERDPIERSTLHILLIS
DYFYAFLRREYYLTHGLYLTAKDGTEAMLVLK

>gi568815578f:34458869_34659895|GENSCAN_predicted_CDS_4|1359_bp
atgaaaaaggatgctgtccatgaaagagagtttcccgtcatcagaaatgttcaagactta
aagccacttgctggggatgagactccattaataatatacctctttcatttcctaattgac
tatgctgaattggtgtttatgataactgatgcactcactgctattgccctgtattttgca
atccaggacttcaataaagttgtgtttaaaaagcagaaactcctcctagaactggaccag
tatgccccagatgtggccgaactcatccggacccctatggaaatgcgttacatccctttg
aaagtggccctgttctatctcttaaatccttacacgattttgtcttgtgttgccaagtct
acctgtgccatcaacaacaccctcattgctttcttcattttgactacgataaaaggtaag
gccctgaaaggagagtacaaaggcagtgctttcctcagtgctatttttcttgccttagcg
acataccagtctctgtacccactcaccttgtttgtcccaggactcctctatctcctccag
cggcagtacatacctgtgaaaatgaagagcaaagccttctggatcttttcttgggagtat
gccatgatgtatgtgggaagcctagtggtaatcatttgcctctccttcttccttctcagc
tcttgggatttcatccccgcagtctatggctttatactttctgttccagatctcactcca
aacattggtcttttctggtacttctttgcagagatgtttgagcacttcagcctcttcttt
gtatgtgtgtttcagatcaacgtcttcttctacaccatccccttagccataaagctaaag
gagcaccccatcttcttcatgtttatccagatcgctgtcatcgccatctttaagtcctac
ccgacagtgggggacgtggcgctctacatggccttcttccccgtgtggaaccatctctac
agattcctgagaaacatctttgtcctcacctgcatcatcatcgtctgttccctgctcttc
cctgtcctgtggcacctctggatttatgcaggaagtgccaactctaatttcttttatgcc
atcacactgaccttcaacgttgggcagctgccccatgagctcttcatcatggtcccagag
ggaggcggccctggagccctcatggctgccatagagaggaatgtggaccgagccgtccct
gggggcagcgctgagagggaccccatcgagaggtccacattacatatcctgctcatctct
gattacttctatgccttcctgcggcgggagtactacctcacacatggcctctacttgacc
gccaaggatggcacagaggccatgctcgtgctcaagtag