GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:38:29 Sequence gi568815575r:48087634_48295358 : 207725 bp : 44.58% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 13307 13346 40 -1.46 1.01 Init + 14735 14805 71 1 2 53 48 87 0.391 1.72 1.02 Intr + 22295 22415 121 2 1 115 100 123 0.934 16.70 1.03 Intr + 22850 22964 115 1 1 93 60 152 0.999 12.92 1.04 Intr + 23627 23722 96 0 0 98 96 121 0.933 13.88 1.05 Intr + 25560 25609 50 1 2 37 113 36 0.546 -0.60 1.06 Intr + 29379 29514 136 2 1 46 99 130 0.329 10.04 1.07 Intr + 31901 31997 97 0 1 90 32 112 0.729 4.87 1.08 Term + 32467 32593 127 1 1 95 48 70 0.377 1.46 1.09 PlyA + 33034 33039 6 1.05 2.00 Prom + 37296 37335 40 -7.76 2.01 Init + 43055 43199 145 1 1 86 37 334 0.989 28.38 2.02 Intr + 43628 43786 159 0 0 115 115 211 0.997 26.46 2.03 Intr + 44617 44695 79 2 1 127 119 84 0.811 14.01 2.04 Intr + 70246 70381 136 1 1 117 100 71 0.971 11.77 2.05 Term + 70815 70889 75 2 0 114 43 108 0.993 6.74 2.06 PlyA + 71614 71619 6 1.05 3.03 PlyA - 73593 73588 6 1.05 3.02 Term - 75593 75487 107 0 2 70 48 115 0.911 4.27 3.01 Init - 78195 78105 91 0 1 103 106 40 0.940 8.19 3.00 Prom - 85509 85470 40 -2.46 4.09 PlyA - 86131 86126 6 1.05 4.08 Term - 100098 99998 101 1 2 82 35 103 0.107 2.69 4.07 Intr - 102635 102500 136 2 1 36 99 114 0.965 7.44 4.06 Intr - 104729 104599 131 0 2 47 113 -3 0.578 -1.49 4.05 Intr - 106591 106496 96 2 0 94 96 74 0.986 8.78 4.04 Intr - 107221 107107 115 1 1 108 60 178 0.999 17.02 4.03 Intr - 107745 107657 89 1 2 73 100 83 0.796 7.69 4.02 Intr - 111212 111044 169 2 1 84 110 87 0.873 10.02 4.01 Init - 112577 112464 114 2 0 91 93 67 0.923 7.71 4.00 Prom - 124720 124681 40 -1.86 5.06 PlyA - 125441 125436 6 1.05 5.05 Term - 128019 127741 279 0 0 -12 48 248 0.144 6.25 5.04 Intr - 129260 129192 69 2 0 94 69 42 0.164 2.28 5.03 Intr - 129446 129352 95 0 2 63 84 40 0.197 0.78 5.02 Intr - 129713 129585 129 0 0 109 70 1 0.098 1.07 5.01 Init - 129863 129764 100 2 1 103 73 49 0.206 5.63 5.00 Prom - 145176 145137 40 -2.46 6.05 PlyA - 145944 145939 6 1.05 6.04 Term - 150282 150178 105 0 0 68 41 92 0.777 0.91 6.03 Intr - 156042 155954 89 1 2 92 100 93 0.911 10.59 6.02 Intr - 157508 157218 291 0 0 90 110 148 0.526 14.31 6.01 Init - 159584 159437 148 2 1 90 -4 98 0.271 1.05 6.00 Prom - 162719 162680 40 -4.76 7.00 Prom + 165124 165163 40 -4.26 7.01 Init + 169453 169460 8 0 2 56 115 4 0.303 0.11 7.02 Intr + 169542 169677 136 0 1 109 61 106 0.898 10.57 7.03 Intr + 170113 170227 115 0 1 96 60 120 0.987 10.02 7.04 Intr + 170903 170998 96 0 0 98 96 61 0.750 7.88 7.05 Intr + 174133 174182 50 2 2 49 113 22 0.218 -0.80 7.06 Term + 176149 176298 150 0 0 28 34 142 0.253 0.71 7.07 PlyA + 176539 176544 6 1.05 8.07 PlyA - 176865 176860 6 1.05 8.06 Term - 177002 176896 107 0 2 57 49 145 0.606 6.07 8.05 Intr - 179136 179008 129 1 0 100 70 23 0.509 2.37 8.04 Intr - 180814 180722 93 2 0 52 66 77 0.010 1.84 8.03 Intr - 189170 189075 96 0 0 94 96 23 0.411 3.68 8.02 Intr - 189443 189329 115 2 1 103 60 153 0.690 14.02 8.01 Intr - 190012 189877 136 0 1 117 100 55 0.845 10.17 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 66979 67124 146 0 2 79 41 92 0.827 3.19 S.002 Init - 93548 93507 42 2 0 95 116 27 0.823 6.63 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575r:48087634_48295358|GENSCAN_predicted_peptide_1|270_aa MIGSFLKPSLEADAGAMLAQSAELTGKTQAVSLAGQSAPGAMNGDDAFAKRPRDDAKASE KRSKAFDDIAKYFSKEEWEKMKFSEKISCVHMKRKYEAMTKLGFNVTLSLFMRNKRATDS QRNDSDNDRNRGNEVERPQMTFGRLQRIIPKIMPEKPAEEGSDSKGVPEASGPQNDGKKL CPPGKASSSEKIHERSGPKRGKHAWTHRLRERKQLVIYEEISDPEEDDNLRDTTHAHDEK QNVVTFHERGHGCGPLVIRCIASESKCSQQ >gi568815575r:48087634_48295358|GENSCAN_predicted_CDS_1|813_bp atgattggaagcttcctgaagccctcactagaagcagatgctggtgctatgcttgcacag tctgcagaactgactggaaagactcaggctgtttctcttgcaggtcagagtgctcctggt gccatgaacggagacgacgcctttgcaaagagacccagggatgatgctaaagcatcagag aagagaagcaaggccttcgatgatattgccaaatacttctctaaggaagagtgggaaaag atgaaattctcggagaaaatcagctgtgtgcatatgaagagaaagtatgaggccatgact aaactaggtttcaacgtcaccctctcacttttcatgcgtaataaacgggccacagactct cagaggaatgattctgataatgaccgtaaccgtgggaatgaggttgaacgtcctcagatg acttttggcaggctccagagaatcatcccgaagatcatgcccgagaagccagcagaggaa ggaagtgattcgaagggagtgccagaagcatctggcccacagaacgatgggaaaaagctg tgcccgccgggaaaagcaagtagctctgagaagattcacgagagatctggacccaaaagg gggaaacatgcctggacccacagactgcgtgagagaaagcagctggtgatttatgaagag atcagcgaccctgaggaagatgacaacctcagggatacgacacatgcccatgatgagaag cagaacgtggtgacctttcacgaacgtgggcatggctgcggacccctcgtcatcaggtgt atagcaagtgaaagcaagtgttcacaacagtga >gi568815575r:48087634_48295358|GENSCAN_predicted_peptide_2|197_aa MKAWGTVVVTLATLMVVTVDAKIYERCELAARLERAGLNGYKGYGVGDWLCMAHYESGFD TAFVDHNPDGSSEYGIFQLNSAWWCDNGITPTKNLCHMDCHDLLNRHILDDIRCAKQIVS SQNGLSAWTKGPAAGKTQAVSLAGQIAPSATNGDDAFARRPRVGSQIPENMQKVFDDIAK YFSKKECEKTKAWEKII >gi568815575r:48087634_48295358|GENSCAN_predicted_CDS_2|594_bp atgaaggcctggggcactgtggtagtgaccttggccacgctgatggttgtcactgtggat gccaagatctatgaacgctgcgagctggcggcaagactggagagagcagggctgaacggc tacaagggctacggcgttggagactggctgtgcatggctcattatgagagtggctttgac accgccttcgtggaccacaatcctgatggcagcagtgaatatggcattttccaactgaat tctgcctggtggtgtgacaatggcattacacccaccaagaacctctgccacatggattgt catgacctgctcaatcgccatattctggatgacatcaggtgtgccaagcagattgtgtcc tcacagaatgggctttctgcctggaccaaaggtcctgcagctggaaagactcaggctgtt tctcttgcaggtcagattgctcccagtgccacgaatggagacgacgcctttgcaaggaga cccagggttggttctcaaataccagagaacatgcaaaaggtcttcgatgatattgccaaa tatttctctaagaaagaatgcgaaaagacgaaagcctgggagaaaatcatctag >gi568815575r:48087634_48295358|GENSCAN_predicted_peptide_3|65_aa MPMFVKGHHILLLIMGMCHIPEAEARGEKERTATATPPFSNHHLDQPAAINTEARPSTSK KSVTY >gi568815575r:48087634_48295358|GENSCAN_predicted_CDS_3|198_bp atgcccatgttcgtgaaaggtcaccacattctgcttctcatcatgggcatgtgtcatatc cctgaggctgaggcaagaggagagaaggaaagaactgccacagccactccacccttcagc aaccaccaccttgatcagccagcagccatcaacactgaggcaagaccctccaccagtaaa aagagtgtgacttactga >gi568815575r:48087634_48295358|GENSCAN_predicted_peptide_4|316_aa MRYHYTLITIAKIKDIVTTPNGDKDAEKMDTSLSAAGKRHRHRVSAPRKGLSKEDYIRSD IFMINSQPSTSSGQSYVYFISNEQNQAGSTRWDEGQSAPGAMNGDDAFVRRPRVGSQIPE KMQKAFDDIAKYFSEKEWEKMKASEKIIYVYMKRKYEAMTKLGFKATLPPFMRNKRVADF QGNDFDNDPNRGNQECPHATEVSLEFGNLYQQRKILMYSLSVEHPQMTFGRLQGIFPKIT PEKPAEEGNDSKGVPEASGPQNNGKQLRPSGKLNTSEKVNKTSGPKRGKHAWTHRVRERK QLVIYEEISDPQEDDE >gi568815575r:48087634_48295358|GENSCAN_predicted_CDS_4|951_bp atgaggtatcactacacacttattacaatagctaaaataaaagacatagtgacaacacca aatggtgacaaggatgcagagaaaatggacacctcattaagtgctgctgggaagcgccac cgacacagagtgtcagcccccagaaagggcctttccaaggaggactatatcaggtctgac attttcatgatcaacagccagccatctaccagttctggccaatcctatgtgtatttcatc agcaatgaacagaaccaagctgggagcacgagatgggatgagggacagagtgctcccggt gccatgaacggagacgatgcctttgtacggagacctagggttggttctcaaataccagag aagatgcaaaaggccttcgatgatattgccaaatacttctctgagaaagagtgggaaaag atgaaagcctcggagaaaatcatctatgtgtatatgaagagaaagtatgaggccatgact aaactaggtttcaaggccaccctcccacctttcatgcgtaataaacgggtcgcagacttc caggggaatgattttgataatgaccctaaccgtgggaatcaggaatgccctcatgcaact gaagtctctctagagtttggaaatctttaccaacaaagaaaaattctgatgtattctctt tcagttgaacatcctcagatgactttcggcaggctccagggaatcttcccgaagatcacg cccgagaagccagcagaggaaggaaatgattcgaagggagtgccagaagcatctggccca cagaacaatgggaaacagctgcgcccctcaggaaaactaaatacctctgagaaggttaac aagacatctggacccaaaagggggaaacatgcctggacccacagagtgcgtgagagaaag caactggtgatttatgaagagatcagcgaccctcaggaagatgacgagtaa >gi568815575r:48087634_48295358|GENSCAN_predicted_peptide_5|223_aa MPMFVKGHHILLLIMGMCHIPEAEARREKEISGNPAHSDSVLPQDLHMASTVPRSEPSAH ATVTSLPGLSIPRLEEENYEKGMIWGISDKIGYQYHNRIASTSAGMWRSVDPGMFPSFGS YDGEELEDKSWSKTRTLFKSMKAERGEEAAGETCEASRGWFMRFKERSCLHNIKVQGEAA NPDGEAAASYPEDLAKVTDEGGYTKEQIFNTDKIAFYIEEDAI >gi568815575r:48087634_48295358|GENSCAN_predicted_CDS_5|672_bp atgcccatgttcgtgaaaggtcaccacattctgcttctcatcatgggcatgtgtcatatc cccgaggctgaggcaagaagagagaaggaaataagtggcaaccctgcacactctgattct gtcctaccccaggacctgcacatggcttccacggttcctcgaagtgaaccatctgctcat gccacagtgacttccttgcctgggttatctattcctaggctagaggaagaaaattatgag aagggaatgatttggggaataagtgacaagattggataccagtaccataacagaatagct agcacatctgcagggatgtggaggtctgtggatccaggcatgtttccctcttttggttcc tatgatggagaagagttggaagataagagttggagcaagacccgaactctcttcaagtcc atgaaagctgagagaggtgaagaagctgcaggagaaacgtgtgaagctagcagaggttgg ttcatgaggtttaaggaaagaagctgtctccataacataaaagttcaaggtgaagcagca aaccctgatggagaagctgcagcaagttatccagaagatctagctaaggtcactgatgaa ggtggctacactaaagaacagattttcaatacggataaaatagccttctatattgaagaa gatgccatctag >gi568815575r:48087634_48295358|GENSCAN_predicted_peptide_6|210_aa MRYHYTLITIDEIKDIVTTPNGDKDAEKLDTSLSAAGKVKSYSHSGKQFARFSSSKTSRA VMVTQQPSGLPPYVDPRSPGLMVSIMSSSHSIADRVSAPRKGLSKEDYIRTDIFMINSQP STSSGQSYVYFISNEQNQAGSMRWDEGQTTPGAMNGDNNCAKRAGDDAQIPEKIQKAKWA EFLDERQSPRLGPSQKQDSEEPDSSLVKGE >gi568815575r:48087634_48295358|GENSCAN_predicted_CDS_6|633_bp atgaggtatcactacacacttattacaatagatgaaataaaagacatagtgacaacacca aatggtgacaaggatgcagagaaactggacacctcattaagtgctgctgggaaggtaaaa tcttacagccactctggaaagcagtttgcaaggttttcttcttctaaaacctcaagggct gtgatggtcactcagcagcctagtggattgccaccgtatgtggacccacgttcccctggc ttaatggtcagcattatgtcatcgtcccacagcatcgcagacagagtatcagcccccaga aagggcctttccaaggaggactatatcaggactgacattttcatgatcaacagccagccc tctaccagttctggccaatcctatgtgtatttcatcagcaatgaacagaaccaagctggg agcatgagatgggacgagggtcagactactcccggtgccatgaacggagacaacaactgt gcaaagagagctggggatgatgctcaaataccagagaagatacaaaaggctaaatgggca gagttcctggatgaaagacagagcccgaggttgggacctagtcagaaacaggactcagag gagcccgactcaagtttggtcaaaggagagtga >gi568815575r:48087634_48295358|GENSCAN_predicted_peptide_7|184_aa MDRTKGPVARKSQAVSLAGETAPGAMNGDDTFAKRPRDDAKASEKRSKAFDDIATYFSKK EWKKMKYSEKISYVYMKRNYKAMTKLGFKVTLPPFMCNKQATDFQGNDFDNDHNRRIQVE HPQMTFGRLHRIIPKIMPKKPAEDENDSKGVSEASGPQNDGKQLHPPGKANISEKINKRS GKRK >gi568815575r:48087634_48295358|GENSCAN_predicted_CDS_7|555_bp atggacaggaccaagggtcctgtagctagaaagtctcaggctgtttctcttgcaggtgag actgctcctggtgccatgaacggagacgacacctttgcaaagagacccagggatgatgct aaagcatcagagaagagaagcaaggcctttgatgatattgccacatacttctctaagaaa gagtggaaaaagatgaaatactcggagaaaatcagctatgtgtatatgaagagaaactat aaggccatgactaaactaggtttcaaagtcaccctcccacctttcatgtgtaataaacag gccacagacttccaggggaatgattttgataatgaccataaccgcaggattcaggttgaa catcctcagatgactttcggcaggctccacagaatcatcccgaagatcatgcccaagaag ccagcagaggacgaaaatgattcgaagggagtgtcagaagcatctggcccacaaaacgat gggaaacaactgcaccccccaggaaaagcaaatatttctgagaagattaataagagatct ggtaagaggaaatga >gi568815575r:48087634_48295358|GENSCAN_predicted_peptide_8|225_aa XTKGLEPGKSQAVSLAGQTAPGAMNGDDAFARRPRVDAQIPEKIQKAFDDIAKYFSKEEW EKMKASEKILYVYMKRKYEAMTKLGFKATLPPFMCNKRTADFQGNDFDNDYNHGHQGWEN QEAKVMGYGRSKQLQSCLHLGLVFQVHDPAHSDSALPQDLDTPSMVPRSEPSAHATVTSS PGLCIPRLEEETATATPPFSNHHLDQPAAINTEARPSTNKKSVTH >gi568815575r:48087634_48295358|GENSCAN_predicted_CDS_8|678_bp nngaccaaaggtcttgagcctggaaagtctcaggcagtttctcttgcaggtcagactgct cccggtgccatgaatggagacgacgcctttgcaaggagacccagggttgatgctcaaata ccagagaagatacaaaaggccttcgatgatattgccaaatacttctctaaggaagagtgg gaaaagatgaaagcctcagagaaaatcctctatgtgtatatgaagagaaagtatgaggcc atgactaaactaggtttcaaggcaaccctcccacctttcatgtgtaataaacggaccgca gacttccaggggaatgattttgataatgactataaccatgggcatcagggctgggagaac caggaggccaaggtgatgggctatggacggtcaaaacagctccaatcctgcctccacctg gggctggtgtttcaagtccatgaccctgcacactctgattctgccctaccccaggacctg gacacgccttccatggttcctcgaagtgaaccatctgctcatgccacagtgacttcctcg cctggtttatgcattcctaggctagaagaagaaactgccacagccactccacccttcagc aaccaccaccttgatcagccagcagccatcaacaccgaggcaagaccctccaccaacaag aagagtgtaactcactga