GENSCAN 1.0 Date run: 3-Nov-116 Time: 12:05:31 Sequence gi568815581r:67240846_67466519 : 225674 bp : 45.17% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 966 961 6 1.05 1.02 Term - 35503 35270 234 1 0 39 46 335 0.865 20.72 1.01 Init - 38672 38553 120 2 0 48 72 29 0.214 -2.51 1.00 Prom - 45710 45671 40 -5.06 2.00 Prom + 46520 46559 40 -4.56 2.01 Init + 49366 49414 49 0 1 75 58 21 0.255 -3.09 2.02 Intr + 52744 52887 144 2 0 110 9 169 0.142 11.45 2.03 Term + 56440 56570 131 2 2 103 54 106 0.971 7.04 2.04 PlyA + 57270 57275 6 1.05 3.13 PlyA - 57316 57311 6 1.05 3.12 Term - 58091 58000 92 0 2 38 48 61 0.087 -5.02 3.11 Intr - 59862 59829 34 0 1 125 108 35 0.220 7.00 3.10 Intr - 62556 62406 151 2 1 86 28 57 0.026 -0.34 3.09 Intr - 83506 83430 77 1 2 88 53 101 0.247 4.91 3.08 Intr - 91198 90989 210 1 0 92 22 110 0.136 3.91 3.07 Intr - 101418 101341 78 0 0 97 95 -2 0.542 1.15 3.06 Intr - 103935 103761 175 2 1 14 66 77 0.674 -1.96 3.05 Intr - 105012 104900 113 0 2 65 108 102 0.794 9.08 3.04 Intr - 106405 106271 135 1 0 80 110 -14 0.602 0.76 3.03 Intr - 106640 106491 150 2 0 22 83 93 0.853 2.56 3.02 Intr - 107809 107705 105 1 0 65 110 19 0.821 2.21 3.01 Init - 109479 109384 96 0 0 89 94 79 0.635 8.91 3.00 Prom - 114567 114528 40 -1.16 4.04 PlyA - 114761 114756 6 1.05 4.03 Term - 116586 116528 59 1 2 113 54 91 0.786 6.05 4.02 Intr - 119301 119265 37 0 1 83 100 10 0.690 -0.46 4.01 Init - 125674 125567 108 1 0 115 100 234 0.998 27.52 4.00 Prom - 126904 126865 40 -6.86 5.00 Prom + 134949 134988 40 -8.46 5.01 Init + 136376 136497 122 1 2 63 24 157 0.742 4.46 5.02 Intr + 137246 137357 112 2 1 39 99 138 0.700 10.38 5.03 Intr + 137769 138349 581 2 2 94 38 177 0.090 4.80 5.04 Term + 157639 157831 193 1 1 51 47 231 0.728 12.19 5.05 PlyA + 158726 158731 6 1.05 6.06 PlyA - 158812 158807 6 1.05 6.05 Term - 159175 159155 21 1 0 106 50 16 0.006 -2.09 6.04 Intr - 192407 192313 95 0 2 108 113 2 0.175 4.38 6.03 Intr - 195830 195699 132 0 0 37 104 31 0.075 0.22 6.02 Intr - 200771 200521 251 1 2 81 96 12 0.003 -1.62 6.01 Init - 215411 215338 74 2 2 70 93 48 0.182 4.04 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 56202 56211 10 2 1 66 113 2 0.861 1.08 S.002 Term + 137769 138398 630 2 0 94 35 171 0.901 6.72 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:67240846_67466519|GENSCAN_predicted_peptide_1|117_aa MTICKLGSPQNHSRFRDSSVATWWKKVYRQKMGNDVQNLEAEGQRQDQENGSLLNIKVNQ EAEIATYRHLLDDDEDFNLCDALDSSNSMQTIHKTTTPQIVDGKEVCETNDTKVLRH >gi568815581r:67240846_67466519|GENSCAN_predicted_CDS_1|354_bp atgacaatttgcaaattgggtagcccccagaatcacagcagattcagagattccagcgta gccacgtggtggaagaaggtttatagacaaaaaatgggaaatgacgtacagaatttggaa gcagaggggcagcgccaggaccaggagaacgggtccttgctaaacatcaaggtcaatcag gaggctgagatcgccacctatcgccacctgctagacgacgacgaggacttcaatctttgt gatgccctggacagcagcaactccatgcaaaccatccataagaccaccaccccccagata gtggatggtaaagaggtgtgtgagaccaacgacaccaaagttctgagacattaa >gi568815581r:67240846_67466519|GENSCAN_predicted_peptide_2|107_aa MGFHHVAQAGLKLLSSGLRPFLGLKLLGLPQGPERLTTHGSTGPLVTTERRLQLFLATLR EIKERLRPFLGLKLLGLPQGPERLTTYGSTGPLVTTERRLQLFLATL >gi568815581r:67240846_67466519|GENSCAN_predicted_CDS_2|324_bp atggggtttcaccatgttgcccaggctggtctcaaactcctgagctcaggcctgcgtccc ttcctggggctgaagcttctgggcctgccccagggtcctgaacggctgacaacacacgga tcaactggaccactggtgaccacagaaaggaggctgcagctgtttcttgctacgcttcga gagatcaaagagcgcctgcgtcccttcctggggctgaagcttctgggcctgccccagggt cctgaacggctaacaacatacgggtcaactggaccactggtgaccacagaaaggaggctg cagctgtttcttgctaccctttga >gi568815581r:67240846_67466519|GENSCAN_predicted_peptide_3|471_aa MVQQCCTYVEEITDLPIKLRLIDTLRMVTEGKIYVEIERARLTKTLATIKEQNGDVKEAA SILQELQVETYGSMEKKERVEFILEQMRLCLAVKDYIRTQIISKKINTKFFQEENTEKLK LKYYNLMIQLDQHEGSYLSICKHYRAIYDTPCIQAESEKWQQALKSVVLYVILAPFDNEQ SDLVHRISGDKKLEEIPKYKDLLKLFTTMELMRWSTLVEDYGMELRKGSLESPATDVFGS TEEGEKRWKDLKNRVVEHNIRIMAKYYTRITMKRMAQLLDLSVDVTLMQEVASHGLGKLC PCGFAGYSPPPGCFHGLALSVAFPGARCKLSVDLPFWDLEDSGPLLTAPLGHAPETSQKQ LDHLDILQSIPLGHNIDDRRMTVPIIVQMIHQYLSCTTSGLFSPICYFSNYLFRVDFKQL KACTATPCLAVIGVQVMFGYTILSHWKVFRGNNMHGAVISYDNHYCFCNAS >gi568815581r:67240846_67466519|GENSCAN_predicted_CDS_3|1416_bp atggttcaacagtgctgtacttatgttgaggaaatcacagaccttcctatcaaacttcga ttaattgatactctacgaatggttaccgaaggcaagatttatgttgaaattgagcgtgcg cgactgactaaaacattagcaactataaaagaacaaaatggtgatgtgaaagaggcagcc tccattttacaggagttacaggtggaaacctacgggtcaatggaaaagaaagagcgagtg gaatttattttggagcaaatgaggctctgcctagctgtgaaggattacattcgaacacaa atcatcagcaagaaaattaacaccaaatttttccaggaagaaaatacagagaaattaaag ttgaagtactataatttaatgattcagctggatcaacatgagggatcctatttgtctatt tgtaagcactacagagcaatatatgatactccctgtatacaggcagaaagtgaaaaatgg cagcaggctctgaagagtgttgtactctatgttatcctggctccttttgacaatgaacag tcagatttggttcaccgaataagtggtgacaagaagttagaagaaattcccaaatacaag gatcttttaaagctttttaccacaatggagttgatgcgttggtccacacttgttgaggac tatggaatggaattaagaaaaggttcccttgagagtcctgcaacggatgtttttggttct acagaggaaggtgaaaaaaggtggaaagacttgaagaacagagttgttgaacataatatt agaataatggccaagtattatactcggataacaatgaaaaggatggcacagcttctggat ctatctgttgatgtcacgctgatgcaagaggtggcctcccatggccttgggaagctctgc ccttgcggctttgcagggtatagcccccctcctggctgctttcatgggctggcattgtct gtggcttttccaggtgcacggtgcaagctgtcagtggatttaccattctgggatctggag gacagtggcccacttctcacagccccactaggccatgccccagagacatcccaaaagcag ctggaccatctggacatcctgcagagcatcccactaggccacaatattgatgacaggaga atgacagttccaattattgttcagatgatacatcaatacctctcatgcaccacctcaggc cttttcagccccatctgttacttttccaactacctcttccgcgtagacttcaaacaactt aaagcttgcacggccacaccgtgtcttgccgttattggggtgcaggtgatgtttggttac acaatcttgtcccactggaaggtcttcaggggcaataacatgcatggagctgtcatctcc tatgataaccattactgcttctgtaatgcctcctga >gi568815581r:67240846_67466519|GENSCAN_predicted_peptide_4|67_aa MADGGSERADGRIVKMEVDYSATVDQRLPECAKLAKIVTIEMTLIKAKGFRYGIDIPYLS CSSEDVL >gi568815581r:67240846_67466519|GENSCAN_predicted_CDS_4|204_bp atggcggacggcggctcggagcgggctgacgggcgcatcgtcaagatggaggtggactac agcgccacggtggatcagcgcctacccgagtgtgcgaagctagccaagatagtaaccata gagatgactttgatcaaagcgaaaggcttccgatatggtatcgacatcccgtatcttagt tgcagtagtgaagatgtgctatga >gi568815581r:67240846_67466519|GENSCAN_predicted_peptide_5|335_aa MGLPSPPRPLLMSASLALAPPAPTSRPQAHCIGADAPAPASSLAGLGGAPRFPPRGSAAG RTMLLKEYRICMPLTVDEVGVRGGALRPRDLHAPSPAGGTSEPLGEVPAGPDFSPSPRAW AQLPARPPFEARDPAVSFRKPPGSETFCVSCFFRRGERRASRERRGLRRGHRRALQHVVP GPRLPAGGAGRGAGARPPSAALLSFSRSCEAASSSIQEVVTQIGLFRPRLPFLPLLPPHL CGDSTTAERLCPGTAREPKAKQGNVKLTSVVWHWLCDVQLGVVEKNWALPVDQCRLQELQ FLVHLIDLLSILLSCNGFPRIQKAVMDQTSSSDHQ >gi568815581r:67240846_67466519|GENSCAN_predicted_CDS_5|1008_bp atggggctaccctccccgccgcggccgctgctgatgtcagcctcgctcgcgctcgctcct cccgcacccacctcccggccccaggcacactgcatcggcgcggacgctccggccccggcg agcagccttgctggtcttgggggcgccccccgcttcccgccccgggggtccgcggccggc aggaccatgctgctgaaagagtaccggatctgcatgccgctcaccgtagacgaggtaggg gtgcgaggaggagccctgcgccctcgggatctgcacgccccgagccccgcgggaggaacc tctgagcccttaggggaggtccctgcggggcccgacttctcgccgtcgccgcgggcttgg gcgcagctcccggcacgtccgcccttcgaggctcgggacccggctgtgtcctttcgcaaa ccgcctggctcggaaactttctgcgtctcttgtttcttccgccgcggggagcggcgcgcg agccgggagcggcgggggctgcgacgcggccacaggagggcgctccagcacgtggtgccg gggccgcggctgccggctgggggcgccgggcgcggggcgggggctcgtcctccaagcgcg gctctgctgtccttctcccgatcctgcgaagccgcgagctccagtattcaggaagtggtg actcagataggattattccggccccggcttccctttcttccccttcttcctccccacctt tgtggtgattccacgactgctgagcgtctctgtccagggacagcgagggagcccaaagcc aagcaaggcaacgtgaaactcactagtgtggtttggcattggttgtgcgacgtgcagttg ggtgttgtggagaagaattgggcccttcctgttgaccagtgccggctgcaggagttgcag tttttggtgcatctcatcgatttgctgagcatacttctcagctgtaatggtttccccagg attcagaaagccgtaatggatcagaccagcagcagcgaccaccagtga >gi568815581r:67240846_67466519|GENSCAN_predicted_peptide_6|190_aa MYQTETDQKGEEDWKGYINISKSLSCDRIRRKADKSVVSIRLLQSERMNELLSSFLFQVL TGFPLHFASVISPLLMLASVSHDMYIAPSLPSKSEAELWAVDSVSDNKALHTAKKHESDY VTSGFRTLSGSHCKKDISQTPSQAPNAFHALSGLASLGLAQLNSFLALTARRQENYRKQK RENHSRLRIA >gi568815581r:67240846_67466519|GENSCAN_predicted_CDS_6|573_bp atgtaccagacagaaactgaccaaaaaggggaagaagactggaagggctatattaatatc agtaaaagcctaagctgtgatcggattagaagaaaggctgacaaaagtgttgtttccatc cgccttttacaatcggagagaatgaatgagctgctttccagttttctgttccaagtatta actggattcccacttcactttgcttccgtgatttctcctctgcttatgttagcttccgtc tcacacgatatgtacatagccccatctctgccgagcaaatcagaggctgagctctgggct gtggacagtgtcagtgacaacaaggctctgcacacagccaagaaacatgaatcagattat gtcacttctggttttagaactctcagtggctcccattgcaaaaaggacatatcccaaact cccagccaggccccgaacgctttccacgccctgtccggtctggcttcccttggtctggcc caactaaattctttcctggcactgactgccaggagacaggaaaactatagaaagcagaag agggagaaccacagtaggctccgtattgcatga