GENSCAN 1.0 Date run: 6-Nov-116 Time: 15:12:20 Sequence gi568815587r:32289061_32535344 : 246284 bp : 44.02% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 1696 1691 6 1.05 1.05 Term - 12526 12512 15 1 0 128 42 2 0.312 -2.16 1.04 Intr - 18866 18705 162 2 0 61 68 94 0.462 4.87 1.03 Intr - 22793 22687 107 0 2 68 60 73 0.486 2.43 1.02 Intr - 44911 44771 141 2 0 72 98 118 0.702 11.52 1.01 Init - 80638 80503 136 1 1 53 57 127 0.703 6.40 1.00 Prom - 83173 83134 40 -1.66 2.00 Prom + 86778 86817 40 -6.36 2.01 Init + 90260 90290 31 1 1 83 78 23 0.658 0.71 2.02 Term + 91065 91309 245 1 2 55 36 193 0.192 6.86 2.03 PlyA + 91513 91518 6 1.05 3.19 PlyA - 91719 91714 6 -0.45 3.18 Term - 94292 94180 113 0 2 78 55 78 0.239 2.22 3.17 Intr - 97487 97455 33 0 0 116 76 8 0.029 0.59 3.16 Intr - 97876 97736 141 2 0 74 95 62 0.035 5.82 3.15 Intr - 103004 102921 84 0 0 57 98 83 0.886 5.99 3.14 Intr - 103272 103135 138 1 0 73 72 32 0.566 0.54 3.13 Intr - 103695 103606 90 1 0 55 72 96 0.952 4.67 3.12 Intr - 107347 107197 151 1 1 117 119 12 0.823 6.94 3.11 Intr - 110984 110888 97 1 1 112 99 63 0.942 9.81 3.10 Intr - 126210 126155 56 0 2 132 55 18 0.047 0.68 3.09 Intr - 127480 127430 51 1 0 125 89 1 0.385 3.00 3.08 Intr - 132879 132827 53 1 2 52 45 67 0.039 -2.57 3.07 Intr - 138483 138296 188 2 2 41 50 120 0.671 2.83 3.06 Intr - 138998 138896 103 0 1 139 91 189 0.997 23.53 3.05 Intr - 139559 139437 123 0 0 127 94 190 0.925 24.06 3.04 Intr - 146279 145640 640 2 1 93 110 613 0.925 55.63 3.03 Intr - 148131 148083 49 2 1 71 91 98 0.751 7.08 3.02 Intr - 149265 149039 227 0 2 2 45 161 0.594 -0.02 3.01 Init - 152366 152358 9 2 0 74 105 10 0.431 1.24 3.00 Prom - 152830 152791 40 -0.76 4.00 Prom + 153397 153436 40 0.64 4.01 Init + 155211 155266 56 2 2 56 71 25 0.377 -1.64 4.02 Intr + 162961 163071 111 1 0 135 81 94 0.973 12.89 4.03 Term + 168855 168879 25 0 1 128 38 12 0.313 -2.10 4.04 PlyA + 169701 169706 6 1.05 5.03 PlyA - 170208 170203 6 1.05 5.02 Term - 173509 173449 61 0 1 107 43 57 0.097 0.38 5.01 Init - 203301 202358 944 0 2 86 53 376 0.783 27.02 5.00 Prom - 205251 205212 40 -5.76 6.06 PlyA - 207278 207273 6 1.05 6.05 Term - 208632 208527 106 2 1 11 32 109 0.322 -4.52 6.04 Intr - 208803 208706 98 0 2 48 115 130 0.088 10.51 6.03 Intr - 219900 219837 64 2 1 38 58 80 0.087 -1.28 6.02 Intr - 222925 222821 105 0 0 92 92 36 0.076 3.73 6.01 Init - 235624 234696 929 1 2 75 2 368 0.010 20.26 6.00 Prom - 239982 239943 40 -1.76 7.02 PlyA - 240020 240015 6 1.05 7.01 Term - 243063 242830 234 0 0 79 49 157 0.959 7.22 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr - 98650 98616 35 1 2 62 49 122 0.837 2.72 S.002 Init - 208758 208706 53 0 2 57 115 115 0.843 11.73 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587r:32289061_32535344|GENSCAN_predicted_peptide_1|186_aa MWESLELPGDLLNGFDQNADNDVDNVIQAEVVSDGNEELVGNWRKGCSAGPLGSKVAGGE AHSACLGRRVNRDLYPPDPPLPLEAPATPAAEEAIKLHVEKPVRQKEDPLTSSPKPGGRG NSAFGIDRTFESLDNFDRHDFHHSKENFPTTDNVLFLKAFNSLVLLWVGSDFYRRSRNDV GIICLF >gi568815587r:32289061_32535344|GENSCAN_predicted_CDS_1|561_bp atgtgggaaagtttggaactccctggagacttgctgaatggctttgaccaaaatgctgat aatgatgtggacaatgtaatccaggctgaagtagtctcagatggaaatgaggaacttgtt gggaactggagaaaaggctgcagcgcagggcctctagggtcaaaagttgcgggcggtgaa gcccacagtgcctgcctgggccgtcgcgtcaatcgcgacttgtacccgccggacccgccc ctgcccttggaggcgccagcgacgcctgctgcagaggaagctatcaagttgcatgttgaa aagccagtgaggcagaaagaagatccactgacttcatctcccaagccaggagggagggga aattcagcatttggcatcgacaggacgtttgaatccttggacaactttgaccgacatgat ttccaccactccaaggagaattttcctaccacagacaatgtgctgttcctgaaagccttt aattctttggtgcttttgtgggttggttctgacttctataggagatccaggaatgatgta ggtattatttgcctgttctag >gi568815587r:32289061_32535344|GENSCAN_predicted_peptide_2|91_aa MAGVNLFGNQRVHPGVYSMVRNSPGVCIPPHGVVKECLQQERECQPYQGIGMPCLNWAIL PVSWATLIDFQVRGAQEKPRGSPGHVKPDNH >gi568815587r:32289061_32535344|GENSCAN_predicted_CDS_2|276_bp atggctggcgttaacctttttggcaatcaaagagtgcatcctggcgtctacagcatggta cggaactcgcctggtgtttgcattccgcctcatggagttgtcaaggaatgcttgcagcag gaaagggagtgccagccttaccagggcattggtatgccctgcctcaactgggccatcttg ccagtatcctgggccaccctcattgacttccaggtgaggggtgcccaggaaaaaccaaga ggctccccaggacatgtcaagccagacaaccattag >gi568815587r:32289061_32535344|GENSCAN_predicted_peptide_3|781_aa MGKGGPRELKEVQESAYRPRRVKVLPAFVLGCESWCLAQGAKASVAGMSPLENLRSPAEN VTHNKEEDREAWSALRLQEVSCAYGMRIPKKCALRDPASTCVPEPASQHTLRSGPGCLQQ PEQQGVRDPGGIWAKLGAAEASAERLQGRRSRGASGSEPQQMGSDVRDLNALLPAVPSLG GGGGCALPVSGAAQWAPVLDFAPPGASAYGSLGGPAPPPAPPPPPPPPPHSFIKQEPSWG GAEPHEEQCLSAFTVHFSGQFTGTAGACRYGPFGPPPPSQASSGQARMFPNAPYLPSCLE SQPAIRNQGYSTVTFDGTPSYGHTPSHHAAQFPNHSFKHEDPMGQQGSLGEQQYSVPPPV YGCHTPTDSCTGSQALLLRTPYSSLGHSLLPTSTLQGFDPWQRFVDVVPSAPATLWGGPP AGVQDGDRVATGPCGEAALARVGPSSSLYFFQDVTWPASSYVAESCCWELQLSEMDRRAE QNLITFPKLNSKDRQNMIVNHSTGYESDNHTTPILCGAQYRIHTHGVFRGIQDVRRVPGV APTLVRSASETSEKRPFMCAYPGCNKRYFKLSHLQMHSRKHTGEKPYQCDFKDCERRFSR SDQLKRHQRRHTAEIFQLVRSWTSVVKEWLFPRQKQVLEDYCAESVILVGPGGLGKSKGV KPFQCKTCQRKFSRSDHLKTHTRTHTEDSISLEKLRTSMANVSESERARSYNKSTATLIV STAQGRTEILRRPCILLQYLAILTEALVMLTLGHEGLPDAVGTWRAMEEAVGLQEKEGGV E >gi568815587r:32289061_32535344|GENSCAN_predicted_CDS_3|2346_bp atggggaagggcgggcctcgcgagctaaaggaggtacaggagagcgcctatcgtccgcgg cgggtgaaggtgctacctgccttcgtgctaggctgtgagtcctggtgcttagctcagggc gccaaggccagtgtagctggcatgtcccccttggaaaacctcaggtctcccgcagagaac gttacccacaacaaagaagaggacagagaggcatggagcgccctgcgactgcaggaggtc tcctgcgcctacgggatgcgcattcccaagaagtgcgcccttcgagacccggcttccacg tgtgtcccggagccggcgtctcagcacacgctccgctccgggcctgggtgcctacagcag ccagagcagcagggagtccgggacccgggcggcatctgggccaagttaggcgccgccgag gccagcgctgaacgtctccagggccggaggagccgcggggcgtccgggtctgagccgcag caaatgggctccgacgtgcgggacctgaacgcgctgctgcccgccgtcccctccctgggt ggcggcggcggctgtgccctgcctgtgagcggcgcggcgcagtgggcgccggtgctggac tttgcgcccccgggcgcttcggcttacgggtcgttgggcggccccgcgccgccaccggct ccgccgccacccccgccgccgccgcctcactccttcatcaaacaggagccgagctggggc ggcgcggagccgcacgaggagcagtgcctgagcgccttcactgtccacttttccggccag ttcactggcacagccggagcctgtcgctacgggcccttcggtcctcctccgcccagccag gcgtcatccggccaggccaggatgtttcctaacgcgccctacctgcccagctgcctcgag agccagcccgctattcgcaatcagggttacagcacggtcaccttcgacgggacgcccagc tacggtcacacgccctcgcaccatgcggcgcagttccccaaccactcattcaagcatgag gatcccatgggccagcagggctcgctgggtgagcagcagtactcggtgccgcccccggtc tatggctgccacacccccaccgacagctgcaccggcagccaggctttgctgctgaggacg ccctacagcagccttggccacagccttctccccacctcaaccctccagggtttcgacccg tggcagcgctttgtggacgtcgttcccagcgcccctgctacgctctggggtgggcccccg gccggggtgcaggacggagatcgggtcgcaacagggccctgcggggaggctgctctggcc cgcgtgggaccttcttcctctctttacttcttccaagatgttacctggccagccagcagc tacgtggcagagagttgctgctgggagctccagctcagtgaaatggacagaagggcagag caaaacttgataaccttccccaagctaaatagcaaggatagacagaacatgattgtgaac cacagcacagggtacgagagcgataaccacacaacgcccatcctctgcggagcccaatac agaatacacacgcacggtgtcttcagaggcattcaggatgtgcgacgtgtgcctggagta gccccgactcttgtacggtcggcatctgagaccagtgagaaacgccccttcatgtgtgct tacccaggctgcaataagagatattttaagctgtcccacttacagatgcacagcaggaag cacactggtgagaaaccataccagtgtgacttcaaggactgtgaacgaaggttttctcgt tcagaccagctcaaaagacaccaaaggagacatacagctgagatcttccagctggtgaga agctggacctctgtggttaaggaatggttgttcccaagacagaagcaggtccttgaagat tactgtgcagaatcagtgattcttgttgggcctgggggactggggaaatctaagggtgtg aaaccattccagtgtaaaacttgtcagcgaaagttctcccggtccgaccacctgaagacc cacaccaggactcatacagaagacagcatttctctggagaagctcaggacaagcatggca aacgtcagcgagtcggaaagagccaggtcttacaacaaaagtacagccacattgattgtt tcaactgcacagggaagaacagagattctcagacgaccctgcatcctcttgcagtatcta gccatcctcacagaagcattggtgatgctgacacttggccatgaaggattgcctgatgca gtggggacttggagggccatggaggaggcagttgggctccaggagaaagaaggaggcgtg gagtga >gi568815587r:32289061_32535344|GENSCAN_predicted_peptide_4|63_aa MQIVTIIEHNAILNFLTAKLCHYFPLAKRYWKWKAKKPDGAIHTRQPLGPEWDEKRIKKH QKY >gi568815587r:32289061_32535344|GENSCAN_predicted_CDS_4|192_bp atgcagatagttactataattgagcataatgcaattctgaacttcctgacagccaagctc tgtcactatttcccattggccaaacgctactggaagtggaaggctaagaagcccgatggt gcaatccatacacgtcagcctctggggccagagtgggatgaaaaaagaataaagaaacat cagaaatattaa >gi568815587r:32289061_32535344|GENSCAN_predicted_peptide_5|334_aa MAILPKVIYRFNAILIKLPTTFFTELEKTTLKFIWNQKRACIAKSILSQKNKGGGITLPD FKLYYNATVTKTAWYWYQNRDMDQWNRTEPSEIIPHIYNYLIFDKPDKNKKWGKDSLFNK WCWENWLAICRKLKLDLFLTPYSKINSRWIKDLNVRAKTIKTLEENLGNTIQDIGMGKDF MSKTPKAMATEVKIDKWDLIKLKSFCTAKETTIRVNRQPTEWENIFAIYSSDKGLISRIY KEHKQIYKEKTNNPINMWAKDMNRHFSKEDIYAAKRHMKKCSSSLAIREMQIKTTRRYYL TSVRMVIIKKSENNRAQTQDTEKQMFSDSMTSAD >gi568815587r:32289061_32535344|GENSCAN_predicted_CDS_5|1005_bp atggccatactgcccaaggtaatttatagattcaatgccatcctcatcaagctaccaacg actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc tgcattgccaagtcaatcctaagccaaaagaacaaaggtggaggcatcacgctacctgac ttcaaactatactacaacgctacagtaaccaaaacagcatggtactggtaccaaaacaga gatatggaccaatggaacagaacagagccctcagaaataataccacacatctacaactat ctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctgtttaataaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatctcttccttaca ccttattcaaaaattaattcaagatggattaaagacttaaatgttagagctaaaaccata aaaaccctagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc atgtctaaaacaccaaaagcaatggcaacagaagtcaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca gaatgggaaaacatttttgcaatctactcatctgacaaagggctaatatcgagaatctac aaagaacacaaacaaatttacaaggaaaaaacaaacaaccccatcaacatgtgggcgaag gatatgaacagacacttctcaaaagaagacatttatgcagccaaaagacacatgaaaaaa tgctcatcatcactggccatcagagaaatgcaaatcaaaaccacaaggagatactatctc acatcagttagaatggtgatcattaaaaagtcagaaaacaacagagctcagacacaagac acagagaagcaaatgttttcggactcgatgacttcagcggattag >gi568815587r:32289061_32535344|GENSCAN_predicted_peptide_6|433_aa MAILPKVIYRFNAILIKLPMTFFTELEKTTLKFIWNQKRARIAKSILSQKNKAGGITLPD FKLYYKATVTKTAWYWYQNRAIDQWNRTEPSKIIPHIYKYLIFDKPDKNKKWGKDSLFNK WCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDF VSKTPNAMATKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWENIFAIYSSDKGLIPRIY NELKQIYKEKTNNPINKQAKDINRHFSKEDIYAAKRHMKKCSSSLAVREMQIKTTRRYHL TPVRMVIIKKLHDARESHMKCSHFPLAYRKHSPFLRLSDNNKQGGLIYEVNLPSTAGEID SSIEYMKNYDINLTGLEGEWEMNTFYRLLTDNQFKGSTRKKFKTVESFDAKVIVFFAIES NGKNPNYFCTNLM >gi568815587r:32289061_32535344|GENSCAN_predicted_CDS_6|1302_bp atggccatactgcccaaggtaatttacagattcaatgccatcctcatcaagctaccaatg accttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc cgcattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacctgac ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga gctatagaccaatggaacagaacagagccctcaaaaataataccacacatctacaagtat ctgatctttgacaaacctgacaaaaacaagaaatggggaaaggattccctatttaacaaa tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca ccttatacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata aaaaccctagaagaaaacctgggcaataccattcaggacataggcatgggcaaggacttc gtgtctaaaacaccgaacgcaatggcgacaaaagccaaaattgacaaatgggatctaatt aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcaacctaca gaatgggaaaacatttttgcaatctactcatctgacaaagggctaatacccagaatctac aatgaactcaaacaaatttacaaggaaaaaacaaacaaccccatcaacaagcaggcaaag gatattaacagacacttctcgaaagaagacatttatgcagccaaaagacacatgaaaaaa tgctcatcatcactggccgtcagagaaatgcaaatcaaaaccacaaggagataccatctc acaccagttagaatggtgatcattaaaaagctccacgatgctcgtgaatcccacatgaag tgttctcatttccctttagcctaccgaaaacacagcccctttctcaggctctcagacaat aataaacaaggtggattgatttatgaggtgaatttgccatcaactgctggagaaattgat agcagcatagagtacatgaaaaattatgatataaacctgactggactagaaggggaatgg gaaatgaacaccttctatcggctgctgacggacaaccagtttaaagggagcacaaggaag aagtttaaaactgtagaaagttttgatgcaaaagtaattgtgttttttgccattgaaagt aatggcaaaaaccccaattacttttgcaccaacctaatgtaa >gi568815587r:32289061_32535344|GENSCAN_predicted_peptide_7|77_aa NAFDILLQKKRKVLKCPDSKVLVHGFSFHQDNLFQCGGRGPDNSPDTDVGCPLFADPVPK FLHGEELIIILSDHSSG >gi568815587r:32289061_32535344|GENSCAN_predicted_CDS_7|234_bp aatgcttttgacatcctgctgcaaaagaagaggaaagttctgaagtgtcctgattcaaag gtcctggtccatggtttctccttccaccaggacaacctgttccagtgtggaggcaggggt ccggacaacagccctgacactgacgtgggatgcccattgtttgcagatcctgttcccaag tttcttcatggggaagagctgatcataattttatctgatcatagttcaggctga