GENSCAN 1.0 Date run: 5-Nov-116 Time: 09:08:28 Sequence gi568815578f:37284154_37503376 : 219223 bp : 50.51% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init - 2517 2364 154 0 1 101 45 113 0.141 8.44 1.00 Prom - 7630 7591 40 -4.26 2.00 Prom + 9339 9378 40 -3.26 2.01 Init + 17111 17260 150 1 0 92 101 233 0.996 25.04 2.02 Intr + 29683 29776 94 0 1 19 37 134 0.019 0.94 2.03 Intr + 32352 32517 166 1 1 77 58 45 0.048 -0.58 2.04 Intr + 44064 44200 137 0 2 85 94 38 0.369 4.31 2.05 Intr + 49656 49736 81 1 0 72 96 45 0.828 3.31 2.06 Term + 49854 49996 143 1 2 52 47 117 0.839 2.09 2.07 PlyA + 51080 51085 6 1.05 3.09 PlyA - 51169 51164 6 1.05 3.08 Term - 52170 52126 45 0 0 91 53 37 0.137 -2.29 3.07 Intr - 59175 59086 90 0 0 49 101 80 0.316 5.59 3.06 Intr - 69916 69812 105 1 0 125 14 35 0.193 0.21 3.05 Intr - 72306 72027 280 2 1 99 131 70 0.662 9.88 3.04 Intr - 73656 73519 138 2 0 126 61 0 0.557 0.68 3.03 Intr - 79441 79229 213 0 0 72 80 128 0.720 8.23 3.02 Intr - 80386 80181 206 1 2 137 40 57 0.871 3.80 3.01 Init - 93495 93460 36 0 0 82 66 79 0.273 3.36 3.00 Prom - 93723 93684 40 -6.26 4.00 Prom + 93755 93794 40 -7.76 4.01 Init + 100001 100250 250 1 1 88 77 427 0.667 39.23 4.02 Intr + 101922 102021 100 1 1 130 94 190 0.985 22.97 4.03 Intr + 109742 109840 99 2 0 94 64 154 0.314 12.83 4.04 Intr + 110021 110124 104 2 2 109 102 117 0.782 15.02 4.05 Intr + 112009 112158 150 2 0 111 55 434 0.988 42.53 4.06 Intr + 112397 112462 66 0 0 114 63 16 0.529 0.58 4.07 Intr + 113546 113701 156 0 0 113 82 207 0.999 22.58 4.08 Intr + 115962 116141 180 1 0 111 96 273 0.999 30.24 4.09 Intr + 117449 117525 77 0 2 42 109 131 0.999 9.83 4.10 Intr + 118282 118435 154 0 1 82 86 301 0.945 28.95 4.11 Intr + 118596 118727 132 1 0 124 99 238 0.999 29.12 4.12 Term + 119018 119226 209 0 2 106 48 478 0.999 43.10 4.13 PlyA + 124060 124065 6 -0.45 5.11 PlyA - 129667 129662 6 1.05 5.10 Term - 132433 132317 117 1 0 79 48 83 0.126 1.94 5.09 Intr - 139564 139486 79 0 1 35 55 110 0.061 1.95 5.08 Intr - 142271 142153 119 2 2 49 87 77 0.083 2.96 5.07 Intr - 160762 160616 147 1 0 71 92 114 0.444 10.53 5.06 Intr - 161381 160966 416 0 2 43 -73 416 0.034 14.82 5.05 Intr - 185493 185203 291 1 0 7 81 160 0.164 4.31 5.04 Intr - 187561 187426 136 1 1 68 65 90 0.146 4.84 5.03 Intr - 189663 189499 165 0 0 86 47 54 0.134 1.26 5.02 Intr - 202095 202011 85 2 1 111 30 23 0.019 -1.38 5.01 Init - 219221 219181 41 2 2 78 100 49 0.847 4.77 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 216459 216327 133 2 1 62 38 114 0.816 1.36 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815578f:37284154_37503376|GENSCAN_predicted_peptide_1|52_aa MDTDAHIERTSCEDEGRYRDDASTAKEHHRLPAGHQKLVKRHGTGSSQPSEX >gi568815578f:37284154_37503376|GENSCAN_predicted_CDS_1|156_bp atggacacagatgcacacatagagagaacgtcatgtgaagatgaaggcagatatcgggac gatgcctctacagccaaggaacaccacagattgccagcaggccaccagaagctagtcaag aggcatgggacaggctcctcacagccctcagaagnn >gi568815578f:37284154_37503376|GENSCAN_predicted_peptide_2|256_aa MASDLDFSPPEVPEPTFLENLLRYGLFLGAIFQLICVLAIIVPIPKSHEAGQTMGAVCRV AKETGIPVDEGDQRKLWGLTQCSGSEEAVTPYDRRDLDSRNSPQAPAGQSTTSSSFCFCD GLESRGLKHTVSIDCIRDPESLLLCSHLVETPNLKCGTLLLKPEKDPGLWPLAKAQAVQF SSAQAQCKGASLPHYPLLPYPGNVCGRRKGAKCTTDVTEHPQQVPPEGPAQSGCGKAKCM TSRLEDFGPQEPWAVD >gi568815578f:37284154_37503376|GENSCAN_predicted_CDS_2|771_bp atggcctctgacctagacttctcacctccggaggtgcccgagcccactttcctggagaac ctgctacggtacggactcttcctgggagccatcttccagctcatctgtgtgctggccatc atcgtacccattcccaagtcccacgaggcgggccaaaccatgggagcagtttgccgggtg gccaaggaaacaggtattcctgttgatgaaggtgaccagcggaagctctggggcctgact cagtgctcagggtctgaggaggctgtgacgccctatgaccgcagagatctagacagtcgt aacagtccccaggctccagctgggcaatccaccacttcctcttccttctgcttctgtgac ggtttagagtcaagggggctgaaacacactgtgagcatagactgtattagggatcctgag tctttgctcctatgtagtcacttggtagaaacgccgaacctgaaatgtggcactttgctt ctcaagccagagaaggatccaggcttatggcccttagcaaaagcccaagcggttcagttc agctcagcccaggcccagtgcaaaggagcctcccttcctcattacccgcttctgccctac cccgggaacgtgtgtggacgtagaaagggtgcaaaatgcacaacggatgtcacagagcat ccccagcaggtgcccccagagggaccagcccagagcgggtgcggaaaggccaagtgcatg accagccgccttgaggactttggcccccaggaaccttgggctgtggactag >gi568815578f:37284154_37503376|GENSCAN_predicted_peptide_3|370_aa MALQLVVLPSPMTWRRESPEGSSCHALITGWTQRPNKPGAPQGGGTVGPQRITTATASPS RCSWVAGTMLTSSVNHPNPLCSPVASCGHTRFTYKATGAPGGCPSQKADSQDTSQIRGTP EPILPHLPCEEHSKGDPGGSLGAEAQADQGARSSGLSHLLRQALLDHLFLDTPSQLCSPS LSSKATSSQKASLKPQSKPICSQCGHQGHLTEATLTLTQTLPRMWDTLLSSSSPSSLPEE QAPRPSTHWHQPTDPSQDSMAPQTSKRSCSQGTFRPFVKLYLPSEVPPILPVNTNCNKIT RTEHLLCAGIAEGVGHKTSSLQQAHGASLRTPTGTTSQPKSTQQMIDAIDVNIAEMACMS APVLELGTDK >gi568815578f:37284154_37503376|GENSCAN_predicted_CDS_3|1113_bp atggccctgcagctggtggtgctgccgtctcccatgacctggagaagagaaagtccagag gggtcctcctgccacgccctcatcacggggtggacacagaggccaaataaaccaggggcc ccacaaggaggagggacagttggtcctcagaggataacaacagctacggcgtcaccatca cggtgctcctgggtggcgggcaccatgcttacgagctcagtgaaccaccccaaccctctg tgcagccctgtagcaagctgtggccacactcgttttacatacaaggccacaggggcccca ggaggctgccccagccagaaggcagacagccaggataccagccagatccgagggacacca gagcccatcctaccccacctcccctgtgaggagcacagcaagggcgacccaggagggagc ctgggtgcagaggcgcaggcggaccagggtgccaggtcttcaggactcagccatctgctc agacaggccttgctcgaccacctgtttcttgatacacccagccaactctgctctccaagt ctcagctccaaagccacttcctcacagaaagcctccctcaaaccccagtccaagcccatc tgttctcagtgcggccaccagggccacctcactgaagccacactgaccctgacccagaca ctacccagaatgtgggatacattactctcatccagtagcccttcctcactgcccgaggag caggcccccaggccctctacccattggcaccagccaactgacccaagtcaagactccatg gctcctcagacctccaagaggagctgctcacagggcaccttcaggccctttgtcaagctc tatctcccctcagaggtccctcccatccttccggttaatacaaattgcaataaaataaca cgcaccgagcacttactgtgtgctggcattgctgagggcgttggacacaagacttcatcc ttgcaacaagcccatggagccagtcttagaacaccaacaggaaccacttcccaacccaaa agcacccagcagatgatagatgctattgacgtcaacattgctgagatggcctgcatgtca gctcctgtgctggagctggggacagacaagtga >gi568815578f:37284154_37503376|GENSCAN_predicted_peptide_4|558_aa MGSNKSKPKDASQRRRSLEPAENVHGAGGGAFPASQTPSKPASADGHRGPSAAFAPAAAE PKLFGGFNSSDTVTSPQRAGPLAGGVTTFVALYDYESRTETDLSFKKGERLQIVNNTEGD WWLAHSLSTGQTGYIPSNYVAPSDSIQAEEWYFGKITRRESERLLLNAENPRGTFLVRES ETTKGAYCLSVSDFDNAKGLNVKHYKIRKLDSGGFYITSRTQFNSLQQLVAYYSMSHCPP FSAAGWYGGGCPEEPQKHADGLCHRLTTVCPTSKPQTQGLAKDAWEIPRESLRLEVKLGQ GCFGEVWMGTWNGTTRVAIKTLKPGTMSPEAFLQEAQVMKKLRHEKLVQLYAVVSEEPIY IVTEYMSKGSLLDFLKGETGKYLRLPQLVDMAAQIASGMAYVERMNYVHRDLRAANILVG ENLVCKVADFGLARLIEDNEYTARQGAKFPIKWTAPEAALYGRFTIKSDVWSFGILLTEL TTKGRVPYPGMVNREVLDQVERGYRMPCPPECPESLHDLMCQCWRKEPEERPTFEYLQAF LEDYFTSTEPQYQPGENL >gi568815578f:37284154_37503376|GENSCAN_predicted_CDS_4|1677_bp atgggtagcaacaagagcaagcccaaggatgccagccagcggcgccgcagcctggagccc gccgagaacgtgcacggcgctggcgggggcgctttccccgcctcgcagacccccagcaag ccagcctcggccgacggccaccgcggccccagcgcggccttcgcccccgcggccgccgag cccaagctgttcggaggcttcaactcctcggacaccgtcacctccccgcagagggcgggc ccgctggccggtggagtgaccacctttgtggccctctatgactatgagtctaggacggag acagacctgtccttcaagaaaggcgagcggctccagattgtcaacaacacagagggagac tggtggctggcccactcgctcagcacaggacagacaggctacatccccagcaactacgtg gcgccctccgactccatccaggctgaggagtggtattttggcaagatcaccagacgggag tcagagcggttactgctcaatgcagagaacccgagagggaccttcctcgtgcgagaaagt gagaccacgaaaggtgcctactgcctctcagtgtctgacttcgacaacgccaagggcctc aacgtgaagcactacaagatccgcaagctggacagcggcggcttctacatcacctcccgc acccagttcaacagcctgcagcagctggtggcctactactccatgtcccactgcccgcct ttctctgcagctggctggtatggagggggctgccctgaggagccccagaaacacgccgat ggcctgtgccaccgcctcaccaccgtgtgccccacgtccaagccgcagactcagggcctg gccaaggatgcctgggagatccctcgggagtcgctgcggctggaggtcaagctgggccag ggctgctttggcgaggtgtggatggggacctggaacggtaccaccagggtggccatcaaa accctgaagcctggcacgatgtctccagaggccttcctgcaggaggcccaggtcatgaag aagctgaggcatgagaagctggtgcagttgtatgctgtggtttcagaggagcccatttac atcgtcacggagtacatgagcaaggggagtttgctggactttctcaagggggagacaggc aagtacctgcggctgcctcagctggtggacatggctgctcagatcgcctcaggcatggcg tacgtggagcggatgaactacgtccaccgggaccttcgtgcagccaacatcctggtggga gagaacctggtgtgcaaagtggcggactttgggctggctcggctcattgaagacaatgag tacacggcgcggcaaggtgccaaattccccatcaagtggacggctccagaagctgccctc tatggccgcttcaccatcaagtcggacgtgtggtccttcgggatcctgctgactgagctc accacaaagggacgggtgccctaccctgggatggtgaaccgcgaggtgctggaccaggtg gagcggggctaccggatgccctgcccgccggagtgtcccgagtccctgcacgacctcatg tgccagtgctggcggaaggagcctgaggagcggcccaccttcgagtacctgcaggccttc ctggaggactacttcacgtccaccgagccccagtaccagcccggggagaacctctag >gi568815578f:37284154_37503376|GENSCAN_predicted_peptide_5|531_aa MEKKQLKAQLQSTSQTQFIPPTPTGASAPCPKVAVTKVPKGQLETPALAFTLSTWRLAFI LAAAEFTFSDMKCKCLKCSSIFKKKMMKTNWGVKEDIVVMRVRANMQDEGDRLEDDLSLR LQWVREGAKPFRRDIQGRPLGKDIGTGCRKTSPLRMLMKAHKGVLGEQEKVPRRLMKVLK GVPGEQEKVLWEAREEGVSSRWTGSPMGCEQRVWMGDEQAMEPLGLADGKLWRTFMGAAD AERKEGRGMKVAPAPAVVKKQEAKKVVNPLFEKRPKNFGVGQDIQPKRDLTRFVKWPRYI KLQHQRAFLYKRLKVPPAINQFTQALDRQTATQLLELAHKYRPETKQEKKQRLLAQAEEK TAGKGDISTKRLPVLQAGVNWEDKSALAKLVEGIRTNYNERYNEIHHHWGGNVLGPESVA RIAKLKKLRIANLALPQQLFGICLILEILHEEHVVTSFVNGHGDLGRRGAGGAIASDKND CSAIGENVPGTGQEFVGAGLRYTLKVPDKLTQQQEGHQRGEGGSSTAQTEA >gi568815578f:37284154_37503376|GENSCAN_predicted_CDS_5|1596_bp atggagaagaaacagctgaaggcacagctccagagcacaagccagacccagttcattccc ccaacaccaacaggggcttctgccccttgccccaaagttgctgtgacaaaagtccccaaa ggccagctggagacaccggccctcgccttcaccctcagcacctggcgactggcattcatc ctggcggctgctgaattcacattttctgacatgaaatgcaaatgtctgaaatgcagttca atctttaaaaaaaaaatgatgaaaaccaattggggtgtcaaggaagacatagtggtcatg cgtgtgagggcgaacatgcaggatgagggcgacaggctggaggatgacctgagcctgcgc ttgcaatgggtgagggaaggggccaagccatttagacgggatattcagggaaggcctctt ggaaaagacataggcactggatgcagaaagacaagcccgctaaggatgttgatgaaggcc cacaagggggtcctaggggagcaggagaaggtaccaaggagactgatgaaggtcctcaag ggggtcccaggagaacaggagaaggtgctctgggaagcaagggaggagggcgtttccagc aggtggacagggtcacccatgggctgtgagcagagagtttggatgggggatgagcaagcc atggagcccctgggcttggccgatgggaagctttggaggaccttcatgggagcagctgat gccgaaaggaaagaaggccgggggatgaaggtggctccagcccctgctgtcgtgaagaag caggaggccaagaaagtggtgaatcccctgtttgagaaaaggcctaagaattttggtgtt ggacaggacatccagcccaaaagagacctcacccgttttgtgaaatggccccgctatatc aagttgcagcatcagagagccttcctctataaacggctgaaagtgcctcctgcaattaac cagttcacccaggccctggaccgccaaactgctactcagctgcttgagctggcccacaag tacagaccagagacaaagcaagagaagaagcagaggctcttggcccaggccgaggagaaa actgctggcaaaggggacatctccactaagagactacctgtccttcaagcaggagttaac tgggaagacaaaagtgctttggctaagctggtggaaggtataaggaccaattacaatgag agatacaatgagatccaccatcactggggaggcaatgtcctgggtcccgagtctgtggct cgcatcgccaagctcaaaaagctacggattgcaaacctggccctgccgcaacagcttttc ggtatttgcctaatcctcgagatactccatgaggaacatgtcgtcaccagcttcgttaac ggacacggggacttaggaaggagaggggctgggggcgccatcgccagtgataagaatgac tgtagtgccatcggagaaaacgttcctggcactggccaggagtttgtgggtgcagggctc aggtacaccttgaaagttccagacaaactcactcagcagcaggaaggccaccagaggggc gagggtgggagctccacggcacagacagaggcctga