GENSCAN 1.0 Date run: 8-Nov-116 Time: 16:09:02 Sequence gi568815591r:138606627_138871247 : 264621 bp : 44.65% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.15 PlyA - 2531 2526 6 1.05 1.14 Term - 5374 5280 95 2 2 87 49 85 0.177 2.49 1.13 Intr - 14509 14374 136 1 1 114 78 13 0.165 3.04 1.12 Intr - 19509 19343 167 1 2 54 105 133 0.514 11.28 1.11 Intr - 21737 21532 206 1 2 126 89 239 0.994 26.64 1.10 Intr - 23496 23423 74 0 2 90 95 38 0.940 2.90 1.09 Intr - 38219 38091 129 2 0 62 105 207 0.994 20.69 1.08 Intr - 42511 42386 126 1 0 120 81 204 0.994 23.78 1.07 Intr - 49885 49822 64 0 1 86 110 22 0.910 2.92 1.06 Intr - 53362 53238 125 1 2 77 61 90 0.912 4.58 1.05 Intr - 56519 56448 72 2 0 71 100 46 0.819 3.70 1.04 Intr - 65491 65393 99 1 0 67 127 82 0.113 10.31 1.03 Intr - 67861 67679 183 1 0 76 -10 141 0.101 3.08 1.02 Intr - 71899 71808 92 2 2 62 84 189 0.925 15.61 1.01 Init - 72419 72338 82 2 1 61 100 26 0.876 2.55 1.00 Prom - 82233 82194 40 -7.96 2.00 Prom + 82468 82507 40 -5.86 2.01 Sngl + 82633 82899 267 0 0 47 32 311 0.989 16.63 2.02 PlyA + 82912 82917 6 1.05 3.00 Prom + 86119 86158 40 -3.06 3.01 Sngl + 95147 95575 429 1 0 72 49 331 0.798 23.89 3.02 PlyA + 96625 96630 6 1.05 4.23 PlyA - 96729 96724 6 1.05 4.22 Term - 100091 99998 94 1 1 58 44 119 0.496 1.80 4.21 Intr - 103169 102998 172 0 1 74 95 63 0.086 4.60 4.20 Intr - 109255 109138 118 1 1 44 94 122 0.786 8.44 4.19 Intr - 115399 115271 129 1 0 54 55 153 0.565 9.49 4.18 Intr - 122236 122135 102 1 0 72 110 28 0.923 3.77 4.17 Intr - 126467 126251 217 1 1 83 68 120 0.996 8.01 4.16 Intr - 127628 127510 119 2 2 83 103 79 0.967 8.16 4.15 Intr - 128461 128341 121 0 1 52 47 74 0.093 0.20 4.14 Intr - 138654 138497 158 0 2 20 81 172 0.614 8.51 4.13 Intr - 140938 140799 140 2 2 106 105 107 0.999 14.38 4.12 Intr - 142691 142541 151 2 1 117 103 158 0.999 19.94 4.11 Intr - 146211 145999 213 0 0 94 74 372 0.996 35.31 4.10 Intr - 148978 148847 132 1 0 75 36 61 0.570 0.44 4.09 Intr - 149222 149063 160 1 1 20 84 76 0.491 0.39 4.08 Intr - 149914 149832 83 1 2 89 86 76 0.986 5.94 4.07 Intr - 153252 153126 127 2 1 66 56 100 0.577 5.28 4.06 Intr - 156399 156266 134 0 2 94 16 136 0.586 6.44 4.05 Intr - 162248 162154 95 0 2 74 107 67 0.943 6.88 4.04 Intr - 164638 164501 138 2 0 123 52 70 0.423 7.34 4.03 Intr - 175701 175597 105 1 0 40 71 70 0.037 0.69 4.02 Intr - 179622 179532 91 0 1 68 107 58 0.039 5.27 4.01 Init - 182775 182773 3 0 0 87 52 0 0.078 -3.70 4.00 Prom - 183989 183950 40 -3.66 5.00 Prom + 186703 186742 40 -8.36 5.01 Init + 191479 191560 82 0 1 82 74 187 0.999 15.83 5.02 Intr + 194701 194772 72 2 0 104 63 45 0.880 2.98 5.03 Term + 196274 196443 170 0 2 87 53 283 0.826 22.74 5.04 PlyA + 197326 197331 6 1.05 6.04 PlyA - 198892 198887 6 1.05 6.03 Term - 225915 225887 29 1 2 118 43 12 0.668 -2.06 6.02 Intr - 226943 226868 76 2 1 42 102 72 0.799 3.09 6.01 Init - 227239 227138 102 1 0 89 26 131 0.915 7.24 6.00 Prom - 227870 227831 40 -5.26 7.06 PlyA - 228732 228727 6 1.05 7.05 Term - 231705 231280 426 0 0 122 43 133 0.922 7.50 7.04 Intr - 233652 233507 146 1 2 107 78 33 0.924 4.20 7.03 Intr - 237848 237691 158 1 2 88 76 155 0.928 13.95 7.02 Intr - 239110 239037 74 1 2 39 77 90 0.902 1.20 7.01 Init - 243760 243692 69 1 0 81 77 15 0.656 0.75 7.00 Prom - 244205 244166 40 -0.46 8.07 PlyA - 244222 244217 6 1.05 8.06 Term - 245247 245137 111 0 0 87 44 110 0.886 5.06 8.05 Intr - 245643 245597 47 1 2 102 116 62 0.900 8.53 8.04 Intr - 254743 254635 109 1 1 61 49 161 0.357 9.36 8.03 Intr - 261502 261349 154 0 1 68 43 263 0.532 19.87 8.02 Intr - 263135 262912 224 2 2 107 64 490 0.980 45.43 8.01 Init - 264611 264531 81 2 0 88 53 64 0.721 2.26 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 67861 67673 189 1 0 76 49 152 0.857 7.55 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:138606627_138871247|GENSCAN_predicted_peptide_1|549_aa MATKPTEPVTILSLRKLSLGTAEPQVKEPKTFTVEDAVETIGFGRFHIALFLIMGSTGED RTAYLGAGTKSFDWSYDGMLYEQDSSISIDMLTVMGRGPSTCSGPYSEAIEAAVGFPGLV VEAMEIMLIAVVSPVIRCEWQLENWQVALVTTMVFFGYMVFSILFGLLADRYGRWKILLI SFLWGAYFSLLTSFAPSYIWFVFLRTMVGCGVSGHSQGLIIKTEFLPTKYRGYMLPLSQV FWLAGSLLIIGLASVIIPTIGWRWLIRVASIPGIILIVAFKFIPESARFNVSTGNTRAAL ATLERVAKMNRSVMPEGKLVEPVLEKRGRFADLLDAKYLRTTLQIWVIWLGISFAYYGVI LASAELLERDLVCGSKSDSAVVVTGGDSGESQSPCYCHMFAPSDYRTMIISTIGEIASST VHSKLGHWRWSSMLPFIISLRFLFSSAGLIGFLFMLRALVAANFNTVYIYTAEVYPTTMR ALGMGTSGSLCRIGAMVAPFISQVQPRTLQGERESERGDPATIRFPNPFPAFPAFLFHKA AIVILARSQ >gi568815591r:138606627_138871247|GENSCAN_predicted_CDS_1|1650_bp atggcaaccaagccaacagagcctgtcacgatcctcagccttcggaaattgagcctgggg accgcagagccacaggttaaagagccaaagacgttcaccgtggaagatgcagtggagact atcggcttcgggcgtttccacattgccctctttctgatcatgggcagtactggggaagac agaactgcttatttgggtgcagggaccaaatctttcgactggtcttacgatgggatgctt tatgagcaggattcttccatcagcattgatatgttgacagtgatgggccgtggcccctcc acctgttcaggcccttattcagaggccatagaagcagcagttgggtttccgggcctcgtg gttgaggccatggagatcatgttgatagctgttgtgtctcctgtcatccgctgtgaatgg caactggagaattggcaggtggcattagtaaccacgatggtgttttttggctacatggtt ttcagtatcctctttggcctcctggctgacagatatggccgctggaagattctgctcatc tcgttcctgtggggagcctatttctccttgctgacctcgtttgctccttcgtacatctgg tttgtcttcctgcggacgatggtgggctgtggtgtgtccggccactcgcaagggttaatc ataaagactgaatttttgcccacgaaataccgaggctatatgttacccttgtctcaggtg ttctggcttgcgggctccctgctcatcattggcttggcctctgtgatcatccccaccatc gggtggcgctggctcattcgcgtcgcctccatcccgggcatcatcctcatcgtggccttc aagtttattcctgaatctgcccggttcaatgtctccactgggaacactcgggctgccctg gccactctggagcgcgttgccaagatgaaccgctcggtcatgccggaggggaagctggtg gagcccgtcctggaaaaaagaggaagatttgcagacctattggatgctaaatatttacgg accacattacagatctgggtcatatggcttggaatctcttttgcctactatggggttatc ctggccagtgctgagctgctggagcgggacttggtctgtggttcaaagtcagactctgcg gtggtggtgactgggggggactcaggggagagccagagcccctgctactgccacatgttt gcaccctctgactatcggaccatgatcatcagcaccatcggtgaaattgcttcatccaca gtgcactcgaagctgggccactggaggtggtcctccatgctgccttttataatttctctc cgcttcctgttttctagtgccggcctgattggcttcctcttcatgctgagggctctggta gctgcaaacttcaacaccgtctacatttacacagctgaggtctaccccaccacgatgcgc gctttggggatgggaaccagcggctccctgtgtcgcattggtgcaatggtggcaccattt atatcccaggtacagccaaggaccctgcagggagagagagagtctgagagaggggacccg gcaaccatccgatttcccaatcctttccccgcctttcccgcctttctattccacaaagcc gccattgtcatcctggcccgttctcaatga >gi568815591r:138606627_138871247|GENSCAN_predicted_peptide_2|88_aa MVKNAESNAELKGLDVDSLVIEHIQVNKAPKMRRRIDRAYGRIYPCMSSPCHIEMILTEK EQIVPKPEDKVALKKKISQKKQKLMTRE >gi568815591r:138606627_138871247|GENSCAN_predicted_CDS_2|267_bp atggttaaaaatgcagagagcaatgcagaacttaagggtttagatgtagattctctggtc attgagcatatccaagtgaacaaagcacctaagatgcgccgccggatcgacagagcttat ggtcggatttacccatgcatgagctctccctgccacatcgagatgatccttactgaaaag gaacagattgttcctaaaccagaagacaaggttgccctgaagaaaaagatatcccagaag aaacaaaaacttatgacacgagagtaa >gi568815591r:138606627_138871247|GENSCAN_predicted_peptide_3|142_aa MSLSWVNSVGLNVPASVRYSHTDINVPDFPDYCCIEIFDGTKSSKEDGEARKGFSYLVTA TTAVGVTYAAKNVISQFVSSISASADESAVSKIAIKLFDIPEGKNMAFKWRGKSLFVCHR TKKEIDQEPAVEESQLRAHSMI >gi568815591r:138606627_138871247|GENSCAN_predicted_CDS_3|429_bp atgagcctgtcttgggtcaactctgtgggcctcaatgtccctgcttctgttcgttattcc catacagacatcaatgtgcctgacttccctgattactgttgcattgagatttttgatggg acaaagtcttcaaaagaggatggcgaggctaggaaagggttctcctatttggtaactgca acaactgctgtgggtgtcacatatgctgccaagaatgtcatctcccagtttgtttccagc atcagtgcttctgctgatgagtcggccgtgtcgaaaattgcaatcaagttatttgatatt ccagaaggcaagaacatggctttcaaatggagaggcaaatccctgtttgtatgccacaga accaagaaggaaattgaccaagaacctgcagttgaagagtcccagctgagggcccatagc atgatttaa >gi568815591r:138606627_138871247|GENSCAN_predicted_peptide_4|933_aa MFLHDVASEDRNERNVKQLFSEEKKRERHSQGIWTQTHTCECEGIDWSDASPSQGMPKIA SNHQQPNRGWAKMVSVFRSEEMCLSQLFLQVEAAYCCVAELGELGLVQFKDVGFLEDEMQ NEIVVQLLEKSPLTPLPREMITLETVLEKLEGELQEANQNQQALKQSFLELTELKYLLKK TQDFFEVVTFIAGVINRERMASFERLLWRICRGNVYLKFSEMDAPLEDPVTKEEIQKNIF IIFYQGEQLRQKIKKICDGKGDAECFAARTHLFPLEYIKHRFRATVYPCPEPAVERREML ESVNVRLEDLITGPCWEAGWNAGYAFLVVAVEDFTIGKLFDKLEGTKLCLKNALPRVITQ TESHRQRLLQEAAANWHSWLIKVQKMKAVYHILNMCNIDVTQQCVIAEIWFPVADATRIK RALEQGMELSGSSMAPIMTTVQSKTAPPTFNRTNKFTAGFQNIVDAYGVGSYREINPAPY TIITFPFLFAVMFGDCGHGTVMLLAALWMILNERRLLSQKTDNEIWNTFFHGRYLILLMG IFSIYTGLIYNDCFSKSLNIFGSSWSVQPMFRNGTWKTGTHDPHLAVLFKMGHVEGRDCV LRIVLAKIGYRNGFLRKIWNLASNKLTFLNSYKMKMSVILGIVQMVFGVILSLFNHIYFR RTLNIILQFIPEMIFILCLFGYLVFMIIFKWCCFDVHVSQHAPSILIHFINMFLFNYSDS SNAPLYKHQQEVQSFFVVMALISVPWMLLIKPFILRASHRKSQLQASRIQEDATENIEGD SSSPSSRSGQRTSADTHGALDDHGEEFNFGDVFVHQAIHTIEYCLGCISNTASYLRLWAL SLAHAQLSEVLWTMVMNSGLQTRGWGGIVGVFIIFAVFAVLTVAILLIMEGLSAFLHALR LHWVEFQNKFYVGDGYKFSPFSFKHILDGTAEE >gi568815591r:138606627_138871247|GENSCAN_predicted_CDS_4|2802_bp atgttcctccatgacgtggcaagtgaagacaggaatgaaaggaatgtaaagcagcttttc tctgaagagaagaagagagagagacacagccaagggatttggacacagacacacacatgt gaatgtgaaggcatagattggagtgatgcttctccaagccaaggaatgccaaagattgcc agcaaccaccagcagccaaaccgaggctgggccaagatggtgtctgtgtttcgaagcgag gagatgtgtttgtcacaactgtttctccaggtggaagctgcatattgctgtgtggctgag ctcggagagctcggattggttcagttcaaagatgtaggttttctggaagacgagatgcaa aatgagattgtagttcagttgctcgagaaaagcccactgaccccgctcccacgggaaatg attaccctggagactgttctagaaaaactggaaggagagttacaggaagccaaccagaac cagcaggccttgaaacaaagcttcctagaactgacagaactgaaatacctcctgaagaaa acccaagacttctttgaggtggtcacgttcatagccggtgtgatcaacagggagaggatg gcttcctttgagcggttactgtggcgaatctgccgaggaaacgtgtacttgaagttcagt gagatggacgcccctctggaggatcctgtgacgaaagaagaaattcagaagaacatattc atcatattttaccaaggagagcagctcaggcagaaaatcaagaagatctgtgatggaaaa ggggatgcagaatgctttgcagcaagaactcacctgtttcctttggaatatattaaacac aggtttcgagccactgtctacccttgcccagagcctgcggtggagcgcagagagatgttg gagagcgtcaatgtgaggctggaagatttaatcaccggcccttgctgggaggctggctgg aatgctggctatgcctttctggttgtggctgtggaagatttcacaattgggaaactgttc gataagctagagggaacgaagctgtgtcttaagaatgctttgcccagggtcataacacaa acagagtctcaccgccagcgcctgctgcaggaagccgctgccaactggcactcctggctc atcaaggtgcagaagatgaaagctgtctaccacatcctgaacatgtgcaacatcgacgtc acccagcagtgtgtcatcgccgagatctggttcccggtggcagatgccacacgtatcaag agggcactggagcaaggcatggaactaagtggctcctccatggcccccatcatgaccaca gtgcaatctaaaacagcccctcccacatttaacaggaccaataaattcacagctggcttc cagaatattgttgatgcctatggtgtcggcagctaccgggagataaacccagccccctac accatcatcactttccccttcctgttcgctgtgatgtttggagactgtggtcatggaacc gtgatgctcctggctgcactttggatgattctgaatgagagacgcttgctctcccagaag acagacaatgagatttggaacaccttcttccacgggcgctatctgatcctacttatgggc atcttctccatctacacgggtttgatctacaatgactgcttctccaagtccttgaacatc tttggctcttcttggagtgtccaacccatgttcagaaacggcacatggaagacaggtaca catgatccccacctggcagtgctgtttaaaatgggccatgtagaaggcagggactgtgtc ttacgcatcgtgcttgcaaagattggctacaggaacggcttcttgagaaagatttggaac ttggcttcaaacaaactcacatttctgaactcgtataaaatgaagatgtcggtgatcctg ggaattgtccagatggttttcggtgtcatcctcagccttttcaatcacatatacttcaga agaactctcaacatcattctgcaatttatccctgagatgatttttatcctgtgtctgttt ggatacctggttttcatgatcattttcaaatggtgctgctttgacgtccatgtatctcag cacgcccccagcatcctcatccacttcatcaacatgtttctgtttaactacagtgactct tccaacgcacccctctacaaacatcagcaagaagtccaaagtttctttgtggttatggct ttgatttctgtgccgtggatgcttctgattaagccgtttattcttagagccagtcatcgg aaatcccagctgcaggcatccaggatccaagaagatgccactgagaacattgaaggtgat agctccagcccttctagccgttctggccagaggacttctgcagatacccacggggctctg gacgaccatggagaagagttcaactttggagacgtctttgtccaccaagccatccacacc atcgagtactgcctgggctgcatttcaaacacagcctcctacctgcggctctgggccctc agcctggctcatgcacaactgtctgaagtgctctggactatggtgatgaacagcggcctt cagacgcgaggctggggaggaatcgtcggggtttttattatttttgccgtatttgctgtc ctgacagtagccatccttctgatcatggagggcctctctgctttcctgcacgccctgcga ctgcactgggttgagttccagaacaagttctatgtcggggatggttacaagttttctcca ttctcctttaaacacatcctggatggcacagccgaggagtag >gi568815591r:138606627_138871247|GENSCAN_predicted_peptide_5|107_aa MQRLPAATRATLILSLAFASLHSACSAEASSSNSSSLTAHHPDPGTLEQCLNVDFCPQAA RCCRTGVDEYGWIAAAVGWSLWFLTLILLCVDKLMKLTPDEPKDLQA >gi568815591r:138606627_138871247|GENSCAN_predicted_CDS_5|324_bp atgcagcgcctccccgctgccacccgggccaccctgatcctcagcctggcctttgcctcc ctccactcggcttgctcggcagaagcaagcagcagcaacagctcaagcttgaccgctcac cacccagaccctgggaccctggagcagtgcctcaacgtggacttctgcccacaagcagcc cggtgctgccgcacaggagtggacgagtacggctggatcgcggcagctgttggctggagc ctctggttcctcaccctcatcctgctctgtgtggacaaactgatgaagctgactccagat gagcccaaggacttgcaagcgtga >gi568815591r:138606627_138871247|GENSCAN_predicted_peptide_6|68_aa MQQGNDQGCEDLQRKYSSVGEAREEDENDENDTEMTFKQWVFAGQIFPGMRTVRQSTGEG FSGVTSHF >gi568815591r:138606627_138871247|GENSCAN_predicted_CDS_6|207_bp atgcagcaaggaaacgaccagggctgtgaagatctacagaggaagtactccagtgtgggg gaggcaagggaagaggacgaaaatgatgagaatgacactgagatgacttttaagcagtgg gtgtttgccggccagatctttcctggcatgcggactgtgaggcaaagcacgggtgaaggt ttttcaggggtcacttcacatttctaa >gi568815591r:138606627_138871247|GENSCAN_predicted_peptide_7|290_aa MAIIKRSKITDAGEVVEQRECLLHRETFLDEFDDEINEWDFSILQNERPGFGPGLLQSTE LVPPDPQQPQASAEAPFAARGIYSEEMPSVARPRPVGGTTGSQIQHLTQVGIASRIGAQP VEIPPSRGSQYGGPGWPSYGEDEAGRREAPPGVFTPHLQHRQQGPVRGLSWWMDVGSPGN LPKASLHLMFRLILRRKYSDVCHVFQTHMLGHQEYSSSPLFQVPRTSGREPSAPSGNLPH RGLQGPGLGYPTSSTEDLQPGHSSASLIKAIREELLRLSQKQSTVQNFHS >gi568815591r:138606627_138871247|GENSCAN_predicted_CDS_7|873_bp atggctattattaaaaggtcgaaaataacagatgctggtgaggttgtggagcaaagggaa tgcttgttgcaccgagagacttttctggatgagtttgatgatgagattaatgaatgggat ttttcgatactgcaaaatgagagaccaggttttggccccggtttgctgcagtctacagag ctggtgccccctgaccctcagcagccacaggcctccgccgaagccccatttgctgccaga gggatctactcggaggagatgccgtcggtggcccggcctcggcctgtcgggggtaccaca ggctcccagatccagcacctgacacaggtggggattgccagcagaattggagctcagcca gtggaaatcccgccaagcagaggcagccagtatggggggccaggctggccttcgtacggg gaggacgaagcggggcgaagagaggccccccctggtgtcttcacgcctcacctgcagcat aggcagcagggccctgttagagggctgagttggtggatggacgtgggcagtcctgggaac cttcctaaagccagtttgcatttgatgtttcggcttattcttcgtaggaagtacagtgac gtttgccatgtttttcagacacacatgctcggacatcaagagtattcttcttcaccgcta tttcaggtgccaaggacttcaggcagggagccctcagctccttccgggaacctcccccac cggggactgcagggccctgggctgggttaccccaccagctccacggaagacctccagcct ggccactcctcggcctctctcatcaaagcaatccgcgaggagctcctccggctctcccag aaacagagcaccgtgcagaacttccacagctga >gi568815591r:138606627_138871247|GENSCAN_predicted_peptide_8|241_aa MQPIPAPPVQRPSPADRVAESNKINKEIQTALRHKSEIEHHRNKIRLRAKRRGHYEFPVV DDLSSGDTKERHRVYRRAQMQIDKILDPTASVPSVFIEPRKSSRIKRSPKPRRKHQVNGC PADAEKDRLITTDSDGTYRRPPGVHNSAYIGCPYIPPQPSIEEARQTMHSLLDDAFALVA PSSQPASTAEIRRLWNDSPDGSIAKGHLRIPEEAPRKPGKAFDRLMGLGARGETLVNIQR F >gi568815591r:138606627_138871247|GENSCAN_predicted_CDS_8|726_bp atgcagccgatcccggcacctcccgtccagcgcccctccccagccgaccgagtggcggaa agcaataaaatcaacaaagagattcagaccgcgctgcggcacaagtctgagatcgagcac catcgcaacaagatccgcctgcgcgccaagcgccgcgggcactacgagttcccggtggta gacgacctgtcctcgggcgacactaaggagcgacaccgggtgtaccgcagggcacagatg cagatcgacaagatcctggaccccacggccagcgtgccctccgtgttcatagagcccagg aagagctcacggataaaacgttctcccaagcctcgccggaaacaccaggtcaacggctgt cctgccgacgctgagaaggaccggctcatcaccacagacagcgatggcacctacaggagg ccccccggcgtccacaactcagcctacatcggatgcccatacatcccaccccagccgtcc atcgaggaggcacgccagaccatgcactccctcctggacgacgcctttgccctcgtggcc cccagcagccagcctgccagcaccgcagagatacgaagactatggaatgactcccccgac gggtccattgccaagggacacctgcggattccagaagaagcacctaggaagccaggcaaa gcctttgacagactcatggggctgggagcacgaggggagaccctggtaaacatccagagg ttttga