GENSCAN 1.0	Date run:  6-Nov-116	Time: 00:09:26

Sequence gi568815581f:60500415_60763549 : 263135 bp : 42.06% C+G : Isochore 1 ( 0 - 43 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.07 PlyA -   1286   1281    6                               1.05
 1.06 Term -  12762  12622  141  0  0   63   43   147 0.336   4.65
 1.05 Intr -  25804  25380  425  2  2   72   91   378 0.147  29.16
 1.04 Intr -  28762  28629  134  0  2   77   71    84 0.462   4.97
 1.03 Intr -  56775  56696   80  0  2  115   27    67 0.010   0.73
 1.02 Intr -  57790  57683  108  1  0  105   92    -8 0.014   0.86
 1.01 Init -  60361  60251  111  1  0   40  105    99 0.297   6.93
 1.00 Prom -  62502  62463   40                              -5.25

 2.04 PlyA -  62597  62592    6                               1.05
 2.03 Term -  65212  65018  195  1  0   42   40   186 0.628   5.63
 2.02 Intr -  68187  68091   97  2  1   69   84    37 0.165   0.49
 2.01 Init -  84730  84696   35  1  2  113   89     8 0.342   2.92
 2.00 Prom -  88788  88749   40                              -5.65

 3.00 Prom +  97582  97621   40                              -7.55
 3.01 Init + 100001 100472  472  1  1  107  106   481 0.710  46.03
 3.02 Intr + 113249 113323   75  0  0   29   45   121 0.060   0.47
 3.03 Term + 114255 114529  275  1  2  -32   45   313 0.221   9.65
 3.04 PlyA + 115105 115110    6                               1.05

 4.00 Prom + 116958 116997   40                              -5.95
 4.01 Init + 123124 123335  212  0  2   59   77   297 0.921  24.00
 4.02 Intr + 133439 133563  125  2  2   64  111   148 0.943  14.01
 4.03 Intr + 147478 147668  191  2  2  109   81   109 0.923  10.68
 4.04 Intr + 156185 156427  243  1  0   69   71   182 0.716  11.37
 4.05 Term + 162581 163138  558  1  0   76   44   385 0.998  26.26
 4.06 PlyA + 164041 164046    6                               1.05

 5.00 Prom + 173161 173200   40                              -6.15
 5.01 Init + 174245 174434  190  1  1   42   70   116 0.302   4.42
 5.02 Intr + 179039 179126   88  0  1   68   87   107 0.823   6.71
 5.03 Intr + 183568 183622   55  1  1   89  111    41 0.895   4.56
 5.04 Intr + 189272 189347   76  1  1   78   56    47 0.094  -1.33
 5.05 Intr + 208805 208911  107  0  2   68   95    92 0.923   6.91
 5.06 Term + 212625 212765  141  2  0   29   54   102 0.611  -2.15
 5.07 PlyA + 215864 215869    6                               1.05

 6.02 PlyA - 216445 216440    6                               1.05
 6.01 Sngl - 227374 226751  624  1  0   68   37   450 0.814  33.84
 6.00 Prom - 236092 236053   40                              -3.55

 7.04 PlyA - 236316 236311    6                               1.05
 7.03 Term - 237960 237756  205  2  1   15   41   186 0.606   2.46
 7.02 Intr - 239404 239310   95  1  2  -14   87    82 0.526  -4.26
 7.01 Init - 240868 240674  195  1  0   47   97   115 0.732   7.28
 7.00 Prom - 261276 261237   40                              -3.95

 8.03 PlyA - 261821 261816    6                               1.05
 8.02 Term - 262813 262581  233  2  2   67   41   150 0.492   3.85
 8.01 Init - 263042 262964   79  2  1   42   63    98 0.576   3.97

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 206901 206952   52  2  1   86   68    64 0.874   3.57

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815581f:60500415_60763549|GENSCAN_predicted_peptide_1|332_aa
MAGHPVFFLLIHLLPLDFSMGWTQTPGSNNWRRGWKERQAVAMFPGLTSNSRAQGILLCK
EHGCNAARQAWAKGLSLSASSMATHHTQGETGQQNAAALGPQGVLPDFHGCFLKAHGLSS
QCVVNAVWAETQPSGQKALLWTRAEISAGTEPGFLSNRDCDAFGLAGRCFLSHFLLGPAS
AAAVPQFAPGGGRAAVPSAVRPRGCHRPSESSGAAEGFATEGGGGLREEEAEEAEEEGRK
MAAVELEWIPETLYNTAISAVVDNYIRSRRDIRSLPENIQFDVYYKADEVTVANTEILLD
TFRARWVSGAAQHQGWDINTQVLLEPVIAPTE

>gi568815581f:60500415_60763549|GENSCAN_predicted_CDS_1|999_bp
atggccggccacccagtgttctttctgctcatccacctactgcccttagacttcagcatg
ggctggacccagaccccaggatctaacaactggcgacgaggatggaaggagagacaggct
gttgctatgttccctgggctgacctcaaactcccgggctcaagggatcctcctgtgtaag
gaacatggctgcaatgcagcgaggcaggcatgggccaagggtcttagcttgtcagccagt
tccatggccactcatcatactcagggtgaaactggccagcagaatgctgcagccttgggg
ccacagggagtactgccagacttccacggatgtttccttaaggcccacggcctctcaagt
cagtgtgtggtgaatgctgtctgggcagagactcaaccgtcagggcagaaggctctactc
tggaccagggcagagatcagcgctgggacggaacccgggttcctctcgaaccgggattgt
gacgcttttggcctggctggccgctgttttctgtcccactttttactcgggcctgcgtcc
gctgccgccgtccctcagtttgcccccggaggaggcagggcggccgtgccttctgccgtg
cgcccgcgtggctgccaccgcccctccgaatcctccggggccgcagaggggttcgctacg
gagggaggtgggggccttcgggaggaggaggcggaggaggcggaggaggagggaaggaag
atggcggccgtggaactagagtggatcccagagactctctataacaccgccatctccgct
gtcgtggacaactacatccgctcccgccgagacatccgctccttgcccgagaacatccag
tttgatgtttactacaaggctgatgaggttactgtagcaaatactgaaattcttctggac
accttccgagctcgctgggtttcaggcgctgcacagcatcaggggtgggacatcaacacc
caagtactgctggaacctgttatagcacccacggaatag

>gi568815581f:60500415_60763549|GENSCAN_predicted_peptide_2|108_aa
MANIVKCCREVKLYSGMIKVHCILNLLGSSDPPTSACQAAGGPLIQETFWEPQLLVVIDS
NNPTTTTGLSQRQLMLILSTIALCNTDSPLRYGDIAIPCNNKGAYLRV

>gi568815581f:60500415_60763549|GENSCAN_predicted_CDS_2|327_bp
atggccaatatcgtcaaatgctgtagagaggtcaagttgtacagtggcatgatcaaagtt
cactgcatccttaacctcctgggctcaagtgatcctcccacatcagcgtgccaagcagct
gggggaccactgatccaggaaaccttctgggagccacagcttctggtggttattgattcc
aacaatccaacaacgaccaccggcctgtcacagaggcaacttatgttaatcttatctacc
atcgctctgtgcaacacagattctcctttgcgctatggggacattgccatcccatgcaac
aacaaaggagcttacttacgggtttga

>gi568815581f:60500415_60763549|GENSCAN_predicted_peptide_3|273_aa
MAGLYSLGVSVFSDQGGRKYMEDVTQIVVEPEPTAEEKPSPRRSLSQPLPPRPSPAALPG
GEVSGKGPAVAAREARDPLPDAGASPAPSRCCRRRSSVAFFAVCDGHGGREAAQFAREHL
WGFIKKQKGFTSSEPAKVCAAIRKGFLACHLAMWKKLARVPGGHGLRRPCTRSGQLALLA
AGTVTLTAKVCSFTPEARETTNPPEGTNNSRRAALKAVTLTGKVCSFIPEASETTNPPEG
RNSEHIRTSEGTNSGHTAFKNCNTRGKGLRFHS

>gi568815581f:60500415_60763549|GENSCAN_predicted_CDS_3|822_bp
atggcggggctgtactcgctgggagtgagcgtcttctccgaccagggcgggaggaagtac
atggaggacgttactcaaatcgttgtggagcccgaaccgacggctgaagaaaagccctcg
ccgcggcggtcgctgtctcagccgttgcctccgcggccgtcgccggccgcccttcccggc
ggcgaagtctcggggaaaggcccagcggtggcagcccgagaggctcgcgaccctctcccg
gacgccggggcctcgccggcacctagccgctgctgccgccgccgttcctccgtggccttt
ttcgccgtgtgcgacgggcacggcgggcgggaggcggcacagtttgcccgggagcacttg
tggggtttcatcaagaagcagaagggtttcacctcgtccgagccggctaaggtttgcgct
gccatccgcaaaggctttctcgcttgtcaccttgccatgtggaagaaactggcgcgagtt
ccaggtgggcatgggctccgcaggccctgcactcggagcggccagctggcgctgctggcc
gcaggcactgtaacactcactgcgaaggtctgcagcttcactcctgaagccagggagacc
accaacccaccagaaggaacgaacaactccagacgcgcagcattaaaagctgtaacactc
accgggaaagtctgcagcttcattcctgaagccagcgagaccacgaacccaccagaagga
agaaactccgaacacatccgaacatcagaaggaacaaattctggacacactgcctttaag
aactgtaacactcgtggcaagggtctgcggtttcattcttga

>gi568815581f:60500415_60763549|GENSCAN_predicted_peptide_4|442_aa
MTGLPSTSGTTASVVIIRGMKMYVAHVGDSGVVLGIQDDPKDDFVRAVEVTQDHKPELPK
ERERIEGLGGSVMNKSGVNRVVWKRPRLTHNGPVRRSTVIDQIPFLAVARALGDLWSYDF
FSGEFVVSPEPDTSVHTLDPQKHKYIILGSDGLWNMIPPQDAISMCQDQEEKKYLMGEHG
QSCAKMLVNRALGRWRQRMLRADNTSAIVICISPEVDNQGNFTNEDELYLNLTDSPSYNS
QETCVMTPSPCSTPPVKSLEEDPWPRVNSKDHIPALVRSNAFSENFLEVSAEIARENVQG
VVIPSKDPEPLEENCAKALTLRIHDSLNNSLPIGLVPTNSTNTVMDQKNLKMSTPGQMKA
QEIERTPPTNFKRTLEESNSGPLMKKHRRNGLSRSSGAQPASLPTTSQRKNSVKLTMRRR
LRGQKKIGNPLLHQHRKTVCVC

>gi568815581f:60500415_60763549|GENSCAN_predicted_CDS_4|1329_bp
atgacgggtcttcctagcacatcagggacaactgccagtgtggtcatcattcggggcatg
aagatgtatgtagctcacgtaggtgactcaggggtggttcttggaattcaggatgacccg
aaggatgactttgtcagagctgtggaggtgacacaggaccataagccagaacttcccaag
gaaagagaacgaatcgaaggacttggtgggagtgtaatgaacaagtctggggtgaatcgt
gtagtttggaaacgacctcgactcactcacaatggacctgttagaaggagcacagttatt
gaccagattccttttctggcagtagcaagagcacttggtgatttgtggagctatgatttc
ttcagtggtgaatttgtggtgtcacctgaaccagacacaagtgtccacactcttgaccct
cagaagcacaagtatattatattggggagtgatggactttggaatatgattccaccacaa
gatgccatctcaatgtgccaggaccaagaggagaaaaaatacctgatgggtgagcatgga
caatcttgtgccaaaatgcttgtgaatcgagcattgggccgctggaggcagcgtatgctc
cgagcagataacactagtgccatagtaatctgcatctctccagaagtggacaatcaggga
aactttaccaatgaagatgagttatacctgaacctgactgacagcccttcctataatagt
caagaaacctgtgtgatgactccttccccatgttctacaccaccagtcaagtcactggag
gaggatccatggccaagggtgaattctaaggaccatatacctgccctggttcgtagcaat
gccttctcagagaattttttagaggtttcagctgagatagctcgagagaatgtccaaggt
gtagtcataccctcaaaagatccagaaccacttgaagaaaattgcgctaaagccctgact
ttaaggatacatgattctttgaataatagccttccaattggccttgtgcctactaattca
acaaacactgtcatggaccaaaaaaatttgaagatgtcaactcctggccaaatgaaagcc
caagaaattgaaagaacccctccaacaaactttaaaaggacattagaagagtccaattct
ggccccctgatgaagaagcatagacgaaatggcttaagtcgaagtagtggtgctcagcct
gcaagtctccccacaacctcacagcgaaagaactctgttaaactcaccatgcgacgcaga
cttaggggccagaagaaaattggaaatcctttacttcatcaacacaggaaaactgtttgt
gtttgctga
>gi568815581f:60500415_60763549|GENSCAN_predicted_peptide_5|218_aa
MFRFCPSTVPFLQSSPRLAFRLLLIGTFYRALIGAFYRALIGAFYRVLIGAFYNPLASYR
ALIGFMNEAMATDSPRRPSRCTGGVVVRPQAVTEQSYMESVVTFLQDVVPQAYSGTPLTE
EKEKIVWVRFENADLNDTSRNLEFHEIHSTGNEPPLLIMIGYSDGMQVWSIPKFQKGERE
AIVFLLEELQERELERVPPCACEGEAGEHQKVLGSEDF

>gi568815581f:60500415_60763549|GENSCAN_predicted_CDS_5|657_bp
atgttccgtttctgtcctagcacagtgccctttcttcaatcttccccacgattggctttt
agactcctgctgattggtacgttttacagagcactgattggtgcattttacagagcactg
attggtgcattttacagagtgctgattggtgcattttacaatcctcttgctagctacaga
gcgttgattggttttatgaatgaagctatggctacagattccccaagaagacccagtcgt
tgtactggtggagttgtggttcgcccccaggctgtcacagagcagtcctacatggaaagt
gttgtgacttttctgcaggatgttgtgccacaggcttacagtggaacacctctaacagaa
gaaaaggagaaaatagtctgggtcagatttgaaaatgcagatttaaatgatacatcaaga
aatctggaatttcatgaaatacatagtactgggaatgaaccgcctttgttgattatgatt
ggctacagtgatggaatgcaggtctggagcatccctaaatttcagaaaggggagagggag
gcaattgtgttccttctagaagagcttcaggaaagggaacttgagagagtccctccttgt
gcttgtgagggagaagcaggagaacaccagaaagtccttggttctgaggacttctga

>gi568815581f:60500415_60763549|GENSCAN_predicted_peptide_6|207_aa
MNPTVSYGLWVTIIYQCMFMDCNNFTTVVLGVLTVGEVVHDVEAGGIRDLRILSTQFCRE
PKTALKNKVCFFLSCQARALRGESLRCHFSFSSHRRNSPAAKMVNVPKTRRTFCKKCGKH
QPHKVTQYKSKDSLYAQGKRRYDRKQSGYGGQRRPVFLKKAKTTKKTVLRPECVEPNCRS
KRMVVIQRCKHFEPGGDKKRKGQEIQF

>gi568815581f:60500415_60763549|GENSCAN_predicted_CDS_6|624_bp
atgaatcctactgtaagctatggactgtgggtaacaataatatatcaatgtatgttcatg
gattgtaacaattttaccactgtggtgctgggggtgttgacagttggggaggttgtgcac
gatgtggaggcaggaggtatacgggacctccgtatactttctactcagttttgccgtgaa
cctaaaactgcccttaaaaataaagtctgttttttcctttcctgtcaggcgagagctttg
cgaggcgagagtctcagatgccatttctcgttttcatcacatagacggaacagccctgct
gcaaagatggtcaacgtacctaaaacccgaagaaccttctgtaagaagtgtggcaagcat
cagcctcacaaagtgacacagtataagagcaaggattccttgtatgcccagggaaagagg
cgctatgatcggaagcagagtggctatggtgggcagagaaggccagttttcctgaagaag
gctaagaccacaaagaagactgtgctaaggccagaatgtgttgagcctaactgcagatcc
aagaggatggtggtcattcagagatgtaagcattttgaaccgggaggagataagaagaga
aagggccaagagatccagttctaa
>gi568815581f:60500415_60763549|GENSCAN_predicted_peptide_7|164_aa
MQLRRLHLDLTTGKKLNSSSYIRQKNDVIEQTAAPKIGETNRQIQRITIYQGRNPQARTS
GLSTKTKTFLDKQEPREFVAGRPALQEMLKEVLQGKSTKIHEAKIDKEKEMNPLLELETL
TPLYLKRTDPAARKAQKDIVELNSTISQLDITDIYKLLLTTVAQ

>gi568815581f:60500415_60763549|GENSCAN_predicted_CDS_7|495_bp
atgcagctcagaagacttcacttagatctgaccacaggtaaaaagctgaacagctcttct
tacatccgtcagaaaaatgacgtcatagagcaaactgctgccccaaaaattggagagaca
aacaggcagatacagagaatcacaatttaccagggcagaaatccacaagcgagaacctca
ggactcagtaccaagacaaaaactttcttggacaaacaggaaccgagggaatttgttgct
ggtagacctgccttgcaagaaatgttaaaagaagttctacagggaaaaagcaccaaaata
catgaagcaaaaattgacaaagagaaagagatgaatccactattagagttagagactcta
acacccctctatctgaaacgaacagatccagcagccagaaaagcacaaaaggacatagtt
gaactcaacagcaccatcagtcaactggatataactgacatctataaactacttcttaca
acagtagcacaataa

>gi568815581f:60500415_60763549|GENSCAN_predicted_peptide_8|103_aa
MIISIDAEKAFDKIRQHFMLKTLNKLVLEVLARAIRQEKEIKGIQLGKEEVKLSLLADDM
IAYLENPIVSAQNLLKLISNFSKVSGYKINVQKSQAFLYTSNT

>gi568815581f:60500415_60763549|GENSCAN_predicted_CDS_8|312_bp
atgattatctcaatagatgcagaaaaggccttcgacaaaattcgacagcacttcatgcta
aaaactctcaataaactagtgttggaagttctggccagggcaatcaggcaagagaaagaa
ataaagggtattcaattaggaaaagaggaagtcaaattgtccctgcttgcagatgacatg
attgcatatttagaaaaccccatcgtctcagcccaaaatctccttaagctgataagcaac
ttcagcaaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacacc
agtaacacataa