GENSCAN 1.0 Date run: 5-Nov-116 Time: 06:15:14 Sequence gi568815587f:75466350_75672080 : 205731 bp : 49.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 265 260 6 1.05 1.02 Term - 3029 2988 42 2 0 139 45 58 0.061 3.76 1.01 Init - 11386 11270 117 1 0 56 105 220 0.978 20.70 1.00 Prom - 13558 13519 40 -6.96 2.09 PlyA - 13671 13666 6 -0.45 2.08 Term - 20990 20856 135 2 0 121 54 36 0.070 1.52 2.07 Intr - 34690 34610 81 1 0 75 72 35 0.069 0.43 2.06 Intr - 35316 35187 130 2 1 79 62 34 0.046 0.60 2.05 Intr - 42671 42607 65 2 2 63 80 64 0.273 0.62 2.04 Intr - 49287 49162 126 0 0 57 110 57 0.616 5.68 2.03 Intr - 59329 58941 389 2 2 83 94 182 0.864 12.81 2.02 Intr - 66160 65992 169 1 1 60 44 79 0.156 0.22 2.01 Init - 72767 72765 3 2 0 108 81 0 0.415 1.30 2.00 Prom - 77372 77333 40 -0.76 3.00 Prom + 88043 88082 40 -2.76 3.01 Init + 100001 100622 622 1 1 74 75 1444 0.998 135.21 3.02 Intr + 102382 102480 99 2 0 85 96 144 0.994 14.98 3.03 Intr + 102590 102822 233 0 2 107 110 441 0.999 45.69 3.04 Term + 105432 105734 303 2 0 119 49 511 0.976 45.47 3.05 PlyA + 106414 106419 6 1.05 4.07 PlyA - 111143 111138 6 1.05 4.06 Term - 121835 120710 1126 1 1 136 47 641 0.758 56.28 4.05 Intr - 139655 139459 197 2 2 100 0 159 0.007 6.51 4.04 Intr - 141973 141760 214 0 1 80 100 222 0.872 21.32 4.03 Intr - 157798 157721 78 0 0 78 96 33 0.002 1.77 4.02 Intr - 184715 184644 72 1 0 39 68 86 0.004 0.22 4.01 Init - 202020 201116 905 0 2 101 94 991 0.445 94.70 4.00 Prom - 202722 202683 40 -3.16 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 139655 139455 201 2 0 100 32 161 0.992 9.09 S.002 Init + 162087 162157 71 2 2 115 100 71 0.984 9.54 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:75466350_75672080|GENSCAN_predicted_peptide_1|52_aa MVRHQPLQYYEPQLCLSCLTGIYGCRWKRYQRSHDDTTPNCGDDEMEQVKGS >gi568815587f:75466350_75672080|GENSCAN_predicted_CDS_1|159_bp atggtgagacaccagcccctgcagtactacgagccacagctgtgcctctcctgcctcacg ggcatctacggctgccgttggaagcgctaccagcgctcccatgatgataccacaccgaac tgtggggatgatgagatggagcaagtaaagggctcttag >gi568815587f:75466350_75672080|GENSCAN_predicted_peptide_2|365_aa MLKVLSAYWVWPHPVLRHRNQSGKFMVGVGGKQSPATDNGGDKADSSSSEHPKALGKAPR FGACGRDARSPGPDPLRRDGTLLTPPYRSVVAGSLRAFPSVRVGTPAPGLAQCGPRPSPP VHGTPDPPSLWLHETPTFSWNSLEQRERPGSPGRVVRSVRGPRRAGQSDRTAGLVPRAQA VTFVPPQSWAVIATVLGIALDPEHVPHVQGVKSHATLSSEGLPGAYECQELSVGMTKIFN VDTALFQQGLSLRVQMEQARLFRWLLPRLPEPLLWSPEHQPGALAQPKASPLEQSQPRQS AMQLDTSVAAEWMAVQGTRKQVWLPSLQIPLESRAFSEAQRGGTTSPRLYSELVAAGDLS IQAIK >gi568815587f:75466350_75672080|GENSCAN_predicted_CDS_2|1098_bp atgttaaaggtattgagtgcctactgggtgtggccacaccctgtgctgcgacacagaaat caatctggcaagttcatggtgggggttggagggaagcagtccccggcaactgacaatggt ggggacaaagcagatagttcttctagtgagcatccaaaggctctggggaaggccccgcgg ttcggggcgtgcggccgggacgcgcggagccccggccctgacccgctgcgccgggacggg acgctcctaacgccgccgtaccggtccgtggttgcagggtcgctccgcgcctttccgtca gtgagggtcggaaccccagctccagggctagcgcagtgtggccccaggccctcgcctccg gtgcacgggaccccggacccccccagcctctggcttcacgagaccccaaccttcagttgg aacagcttggaacagcgcgagcgcccgggaagccctggccgggtcgtgcgctcggtgagg ggtcctaggcgggcaggacagtcggaccgaaccgccgggctggtgccgcgagcccaggct gtcaccttcgtcccgcctcagagctgggcagtcattgccacagttctggggatagccctg gacccagaacatgttcctcatgtccagggtgttaagagccatgccactctgagttcagag ggtctcccaggagcctatgaatgtcaggagctgtcagtcgggatgaccaagatcttcaat gtggacactgctttatttcagcagggcctcagcctcagagtccaaatggaacaggccaga ctgttcaggtggctgttgccacgtctccctgagcccctgctctggagccccgagcaccag cctggagccctggcccagccaaaggcttcacccctggagcagagccagcccaggcagtca gctatgcagttggacactagcgtggctgctgaatggatggctgtacagggcacaagaaaa caggtctggttaccttctctgcagattccattagaatctagggctttctctgaggcccag agaggtggaacgacttccccaaggctgtacagtgagttggtggcagccggggatctctcc attcaagccatcaaatga >gi568815587f:75466350_75672080|GENSCAN_predicted_peptide_3|418_aa MRSLLLLSAFCLLEAALAAEVKKPAAAAAPGTAEKLSPKAATLAERSAGLAFSLYQAMAK DQAVENILVSPVVVASSLGLVSLGGKATTASQAKAVLSAEQLRDEEVHAGLGELLRSLSN STARNVTWKLGSRLYGPSSVSFADDFVRSSKQHYNCEHSKINFRDKRSALQSINEWAAQT TDGKLPEVTKDVERTDGALLVNAMFFKPHWDEKFHHKMVDNRGFMVTRSYTVGVMMMHRT GLYNYYDDEKEKLQIVEMPLAHKLSSLIILMPHHVEPLERLEKLLTKEQLKIWMGKMQKK AVAISLPKGVVEVTHDLQKHLAGLGLTEAIDKNKADLSRMSGKKDLYLASVFHATAFELD TDGNPFDQDIYGREELRSPKLFYADHPFIFLVRDTQSGSLLFIGRLVRPKGDKMRDEL >gi568815587f:75466350_75672080|GENSCAN_predicted_CDS_3|1257_bp atgcgctccctcctgcttctcagcgccttctgcctcctggaggcggccctggccgccgag gtgaagaaacctgcagccgcagcagctcctggcactgcggagaagttgagccccaaggcg gccacgcttgccgagcgcagcgccggcctggccttcagcttgtaccaggccatggccaag gaccaggcagtggagaacatcctggtgtcacccgtggtggtggcctcgtcgctagggctc gtgtcgctgggcggcaaggcgaccacggcgtcgcaggccaaggcagtgctgagcgccgag cagctgcgcgacgaggaggtgcacgccggcctgggcgagctgctgcgctcactcagcaac tccacggcgcgcaacgtgacctggaagctgggcagccgactgtacggacccagctcagtg agcttcgctgatgacttcgtgcgcagcagcaagcagcactacaactgcgagcactccaag atcaacttccgcgacaagcgcagcgcgctgcagtccatcaacgagtgggccgcgcagacc accgacggcaagctgcccgaggtcaccaaggacgtggagcgcacggacggcgccctgcta gtcaacgccatgttcttcaagccacactgggatgagaaattccaccacaagatggtggac aaccgtggcttcatggtgactcggtcctataccgtgggtgtcatgatgatgcaccggaca ggcctctacaactactacgacgacgagaaggaaaagctgcaaatcgtggagatgcccctg gcccacaagctctccagcctcatcatcctcatgccccatcacgtggagcctctcgagcgc cttgaaaagctgctaaccaaagagcagctgaagatctggatggggaagatgcagaagaag gctgttgccatctccttgcccaagggtgtggtggaggtgacccatgacctgcagaaacac ctggctgggctgggcctgactgaggccattgacaagaacaaggccgacttgtcacgcatg tcaggcaagaaggacctgtacctggccagcgtgttccacgccaccgcctttgagttggac acagatggcaacccctttgaccaggacatctacgggcgcgaggagctgcgcagccccaag ctgttctacgccgaccaccccttcatcttcctagtgcgggacacccaaagcggctccctg ctattcattgggcgcctggtccggcctaagggtgacaagatgcgagacgagttatag >gi568815587f:75466350_75672080|GENSCAN_predicted_peptide_4|863_aa MAWPCITRACCIARFWNQLDKADIAVPLVFTKYSEATEHPGAPPQPPPPQQQAQPALAPP SARAVAIETQPAQGELDAVARATGPAPGPTGEREPAAGPGRSGPGPGLGSGSTSGPADSV MRQDYRAWKVQRPEPSCRPRSEYQPSDAPFERETQYQKDFRAWPLPRRGDHPWIPKPVQI SAASQASAPILGAPKRRPQSQERWPVQAAAEAREQEAAPGGAGGLAAGKASGADERDTRR KAGPAWIVRRAEGLGHEQTPLPAAQAQVQATGPEAGRGRAAADALNRQIREEVASAVSSS YSSYFNSREKCDDEDDDNDGDCGADGVSRLLKSLPKVKLQCEKKERYRSQRRNEFRAWTD IKPVKPIKAKPQYKPPDDKMVHETSYSAQFKGEASKPTTADNKVIDRRRIRSLYSEPFKE PPKVEKPSVQSSKPKKTSASHKPTRKAKDKQAVSGQAAKKKSAEGPSTTKPDDKEQSKEM NNKLAEAKESLAQPVSDSSKTQGPVATEPDKDQGSVVPGLLKGQGPMVQEPLKKQGSVVP GPPKDLGPMIPLPVKDQDHTVPEPLKNESPVISAPVKDQGPSVPVPPKNQSPMVPAKVKD QGSVVPESLKDQGPRIPEPVKNQAPMVPAPVKDEGPMVSASVKDQGPMVSAPVKDQGPIV PAPVKGEGPIVPAPVKDEGPMVSAPIKDQDPMVPEHPKDESAMATAPIKNQGSMVSEPVK NQGLVVSGPVKDQDVVVPEHAKVHDSAVVAPVKNQGPVVPESVKNQDPILPVLVKDQGPT VLQPPKNQGRIVPEPLKNQVPIVPVPLKDQDPLVPVPAKDQGPAVPEPLKTQGPRDPQLP TVSPLPRVMIPTAPHTEYIESSP >gi568815587f:75466350_75672080|GENSCAN_predicted_CDS_4|2592_bp atggcgtggccgtgcatcacgagggcctgctgcatcgcccgcttctggaaccagttggac aaagcggacatcgctgtgccgctggttttcaccaagtactcggaggccaccgagcacccg ggcgccccgccgcagccaccgccgccgcagcagcaggcgcagccggcgctcgcgcccccc tcggcgcgcgcggttgccatagagacgcagccagcccagggcgagttggatgcagttgcc cgggcaacggggccagcgcctgggcctaccggcgagcgcgagccggcggcgggccccggc cggagcgggccgggcccgggcctgggctccggctccacctccggccccgcggactcggtg atgcggcaggattaccgagcctggaaggtgcagcggcccgagcccagctgccggccgcgc agcgaataccagccctccgacgctcccttcgagcgcgagacccagtaccagaaggacttc cgcgcctggccgctgccgcgccgcggggaccacccgtggatccccaagcccgtgcagatc tctgcggcctcccaggcgtcggcgcccattctcggggcgcccaagcgccggccgcagagc caggagcgctggccagtgcaggccgccgctgaggcccgggagcaggaggcggcccccggc ggagcgggtggcctggcggccggaaaggcgtccggggcggacgagcgcgacacgcgcagg aaggccgggcctgcctggattgtgcgccgcgccgagggcctggggcacgagcagacgccg ctgcccgcggcccaggcccaggtccaggccaccggccccgaggctggcagggggcgcgcc gcggcggacgccctcaaccggcaaatccgcgaggaggtggcgagtgcagtgagcagctcc tacagctcctacttcaatagtagggagaaatgtgatgatgaagatgatgataatgatggg gattgtggtgctgatggagttagcaggttgctgaaaagcttgccgaaagtaaaattacaa tgtgaaaagaaagagagatatcgaagccagagaaggaatgaattcagggcatggacggac atcaagcctgtgaaaccaataaaggccaagccccagtacaagcccccagatgataagatg gttcatgagaccagctacagtgctcagttcaaaggagaggccagcaagccaacaacagct gacaataaggtcattgatcgcagaagaatacgcagcctctacagcgaacccttcaaggaa cccccaaaggtggaaaaacctagtgttcagagttccaaaccaaaaaagacctcagcgagc cataagcccacgaggaaggccaaagacaagcaggcggtgtcaggccaggctgccaagaaa aagagcgcggagggcccgagtaccaccaagccagacgacaaggagcaaagcaaagagatg aacaataaactggctgaggcgaaagagagcctggctcaacccgtcagtgattcaagtaag actcaaggtcctgtagccacagagccagacaaggatcaaggttctgtggtcccaggcctt ctgaaaggtcaaggtcctatggtgcaagagcctctgaagaagcaaggttctgtggtccca gggcctccaaaggatctaggtcccatgatcccattaccagtcaaggatcaagatcacacg gtccctgagcctttaaagaatgaaagccctgttatctcagcaccagtcaaggaccaaggt ccctcggtcccagttcctccaaagaatcaaagtcctatggttccagcaaaagttaaggat caaggctctgtggtaccagagtctctaaaggatcaaggtcctaggattcctgagcctgtg aagaatcaagctcctatggtcccagcacctgtcaaggatgaaggtcccatggtctcagca tctgtcaaggatcaaggtcccatggtctcagcacctgtcaaggatcaaggtcccatagtc ccagcacctgtcaagggtgaaggtcccatagtcccagcacctgtcaaggatgaaggtccc atggtctcagcacctatcaaggatcaagatcccatggtcccagagcatccgaaggatgaa agtgccatggccacagcacccataaagaatcaaggttccatggtctctgagcctgtaaag aatcaaggtttagtggtctcagggccagtcaaggatcaagatgttgtagtcccagagcat gcaaaggttcacgattctgcagttgtggcacctgtaaagaatcaaggtcctgtggtcccc gagtccgtgaagaatcaagaccccattctcccagtactagttaaggatcaaggccccaca gtcctacagcctccaaagaatcaaggtcgtatagtccctgaacctctgaagaatcaagtt cctatagtcccagtgcctctgaaggatcaagatcctctggtgccagtaccagcaaaggac caaggtcctgcagtccctgaacctctgaagactcaaggtcccagggaccctcagctacct actgtctcacctctaccccgagtcatgatcccaactgccccccatacggaatacattgag agctccccttga