GENSCAN 1.0	Date run: 4-Nov-116	Time: 04:30:01

Sequence gi568815577r:34349248_34549634 : 200387 bp : 44.57% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +  19431  19470   40                              -2.66
 1.01 Sngl +  21232  21603  372  0  0   71   44   431 0.952  33.13
 1.02 PlyA +  21751  21756    6                               1.05

 2.09 PlyA -  22150  22145    6                               1.05
 2.08 Term -  26308  26109  200  2  2   78   52   164 0.923   9.36
 2.07 Intr -  26621  26493  129  0  0   74   75   119 0.940   9.87
 2.06 Intr -  33527  33443   85  2  1   71   65    56 0.027   0.99
 2.05 Intr -  39418  39304  115  0  1  123   26   -20 0.064  -4.25
 2.04 Intr -  42465  42339  127  1  1   59  113    96 0.829   8.94
 2.03 Intr -  42879  42781   99  1  0  128   28    50 0.439   3.08
 2.02 Intr -  44709  44597  113  2  2   65   59    79 0.824   2.72
 2.01 Init -  51825  51473  353  0  2  110   34   130 0.760   6.44
 2.00 Prom -  54286  54247   40                              -4.16

 3.04 PlyA -  54516  54511    6                               1.05
 3.03 Term -  58484  58389   96  2  0  105   37    42 0.385  -1.23
 3.02 Intr -  62061  61873  189  0  0    3   89   135 0.412   4.88
 3.01 Init -  62322  62161  162  0  0   74   37   103 0.273   3.53
 3.00 Prom -  63336  63297   40                              -6.56

 4.00 Prom +  63474  63513   40                              -4.26
 4.01 Init +  64803  64865   63  2  0   91   80    26 0.881   3.25
 4.02 Intr +  67763  67796   34  1  1   93   50   106 0.121   5.10
 4.03 Term +  68341  68480  140  2  2  -16   39   155 0.129  -1.67
 4.04 PlyA +  68767  68772    6                              -0.45

 5.04 PlyA -  68779  68774    6                              -0.45
 5.03 Term -  70383  69971  413  1  2   73   54   365 0.952  27.00
 5.02 Intr -  77082  76902  181  0  1   88   36    73 0.218   1.54
 5.01 Init -  86720  86580  141  2  0   71    2   136 0.010   3.43
 5.00 Prom -  93605  93566   40                              -6.16

 6.02 PlyA -  94656  94651    6                               1.05
 6.01 Sngl - 100387  99998  390  1  0   70   46   588 0.816  48.82
 6.00 Prom - 105471 105432   40                              -3.96

 7.00 Prom + 108006 108045   40                              -1.76
 7.01 Init + 108715 108722    8  0  2   58   87     0 0.073  -2.58
 7.02 Intr + 115007 115114  108  2  0  117   86    52 0.308   7.30
 7.03 Intr + 119710 119844  135  1  0  -52  105   141 0.333   1.48
 7.04 Intr + 120646 120761  116  1  2   50   66    63 0.666   0.39
 7.05 Term + 123557 123795  239  0  2   29   48   184 0.682   4.83
 7.06 PlyA + 123929 123934    6                               1.05

 8.07 PlyA - 124431 124426    6                               1.05
 8.06 Term - 129463 129428   36  1  0  130   48    11 0.062  -1.36
 8.05 Intr - 138849 138727  123  0  0   47   84    57 0.156   1.98
 8.04 Intr - 145818 145733   86  1  2   53   85    51 0.104   0.84
 8.03 Intr - 156511 156449   63  2  0   71   83    53 0.190   1.79
 8.02 Intr - 157345 157282   64  1  1   84   63    24 0.308  -2.21
 8.01 Init - 160041 159952   90  0  0   72  105    59 0.677   6.49
 8.00 Prom - 166611 166572   40                              -8.26

 9.05 PlyA - 167217 167212    6                               1.05
 9.04 Term - 169009 168837  173  2  2  115   43   151 0.998  11.29
 9.03 Intr - 172411 172252  160  1  1  131   99    21 0.993   7.06
 9.02 Intr - 174463 174290  174  1  0  118  109   123 0.999  17.54
 9.01 Init - 177502 177416   87  1  0   74   86    47 0.932   3.74

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init -  29439  29385   55  0  1   35  119    62 0.825   5.45

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815577r:34349248_34549634|GENSCAN_predicted_peptide_1|123_aa
MSTLSNFTQTLEDVFRRIFITYMDNWRQNTTAEQEALQAKVDAENFYYVILYLMVMIGMF
SFIIVAILVSTVKSKRREHSNDPYHQYIVEDWQEKYKSQILNLEESKATIHENIGAAGFK
MSP
>gi568815577r:34349248_34549634|GENSCAN_predicted_CDS_1|372_bp
atgtctactttatccaatttcacacagacgctggaagacgtcttccgaaggatttttatt
acttatatggacaattggcgccagaacacaacagctgagcaagaggccctccaagccaaa
gttgatgctgagaacttctactatgtcatcctgtacctcatggtgatgattggaatgttc
tctttcatcatcgtggccatcctggtgagcactgtgaaatccaagagacgggaacactcc
aatgacccctaccaccagtacattgtagaggactggcaggaaaagtacaagagccaaatc
ttgaatctagaagaatcgaaggccaccatccatgagaacattggtgcggctgggttcaaa
atgtccccctga
>gi568815577r:34349248_34549634|GENSCAN_predicted_peptide_2|406_aa
MPRFASPLLRNVIIRSQFDGIKRKQCLQYLKTLRTLQYDGFKTVYFGETNIPESLVTGED
ISDGYFIQTPTWCIVHAAGSQGWVPWKYRVFLRDELCIKQEDSLFSEFCDVVRKAYGNHS
LIKYWSTFCVPGWVRQGQEYQGKQAWTLKVQVWCGDSPPSILWLHSDTDEICMGELPHGV
MVRVFPAVDCVKRLNLFFTPEVITMNITSSRVASHVGRCGCRSVWRMKSTRWSPLKLTFT
WQNLEGNTKGMSRKCLIRVYPMFTAQHKDAVETHGKRQKCGLRRTQHKGSRGQEHSQDSP
GTNTVGLNQSWAPPGPRASRNTLLNGLTAPDGQVQQGRPAAEAPAPGRAQTTTSTVLPGK
TTGPEMHRYVLRRSGPVPPTRVHYSVRRRSGPGQKTWGALLTAQAQ
>gi568815577r:34349248_34549634|GENSCAN_predicted_CDS_2|1221_bp
atgcctcgctttgcaagccctcttttaagaaatgtcattatcagaagtcaatttgatggc
atcaagaggaagcaatgcctccaatatctgaaaaccctgagaacactgcaatatgatgga
tttaagaccgtatattttggggaaaccaatatcccagaaagtctcgtaactggggaagat
attagtgatggatatttcatacaaaccccaacttggtgtattgtgcatgctgcgggtagt
caaggatgggtgccttggaaatatcgggtgttcctaagagacgagctgtgtatcaaacaa
gaagacagcctcttctctgagttctgtgatgtggtgaggaaggcctatggaaatcattca
ctcatcaaatactggagcaccttctgtgtgcctggctgggtgcgtcagggccaggagtac
caaggaaaacaagcctggaccctcaaggtccaggtctggtgtggagactctccaccgtcc
atcctctggcttcactctgacacagatgagatctgcatgggggagctccctcatggtgtg
atggtgcgtgtgtttccggcagtggactgtgtgaaacggctgaacctcttcttcacgcct
gaggtcatcaccatgaacataacatcatcacgtgtggcatcccatgtgggacgctgtggc
tgccgcagtgtctggcgcatgaagagcacacgctggagtccactcaagctgacatttaca
tggcagaatctagaagggaataccaaagggatgagtaggaaatgtttgataagagtatac
ccgatgtttactgctcagcataaagacgcagtggagacgcacggcaagaggcagaagtgt
ggactgagaaggacgcagcacaagggcagcagaggtcaagaacattcacaggactctccg
gggaccaacaccgtaggcttgaaccaatcctgggccccgcccgggccacgtgccagccgg
aacacgctgctcaacgggttgacggcgcccgacggccaggtccagcaggggcgccccgca
gcggaggctccagcgcctggccgcgcacaaaccacgacttctaccgtcctgccggggaaa
actacaggtcccgaaatgcaccgctacgtgctcaggcgcagtgggccggtgccgccgacg
agagtgcactactccgtgcgcaggcgcagtgggcccgggcagaagacctggggcgcgctg
ctcactgcgcaggcgcagtga
>gi568815577r:34349248_34549634|GENSCAN_predicted_peptide_3|148_aa
MGSVPLQEETPDPPLSLHLVRTQQEVFYPRTRKQVLTGYRICQHLDLRLPASRTAARLIP
SDSCSSGKRFQYRPSSTPTQKTVTEVFKERTEREQKGTEGEMKTERGGHVEMESSRTVRF
QEKPEMGVCLNKPGAEEGRREFGDYPDS
>gi568815577r:34349248_34549634|GENSCAN_predicted_CDS_3|447_bp
atgggatcagtgcccttacaggaagagacaccagacccccctctctctctgcaccttgtg
aggacacagcaagaagtgttctacccgcgaaccaggaagcaggtcctcaccgggtaccga
atctgccaacaccttgatctcagacttccagcttccagaactgcggcaagactgattcct
tcagactcctgcagcagcggaaagagattccagtaccgaccaagctcaaccccaactcag
aaaacggtgactgaggtcttcaaagagagaacagaaagggaacaaaaagggactgaagga
gaaatgaaaacagaacgtgggggccacgtggaaatggaaagttccagaacggtgaggttc
caggagaagccagagatgggggtttgccttaacaaaccaggagctgaggaaggtcgaaga
gaattcggtgattatcccgactcatga
>gi568815577r:34349248_34549634|GENSCAN_predicted_peptide_4|78_aa
MGAELSFGRILVRRFSNWGRQDDDNDDDDNDDASSPPPLMPTSTTTTYATTVSIITTTNN
ITNSNTAININHHHHQNH
>gi568815577r:34349248_34549634|GENSCAN_predicted_CDS_4|237_bp
atgggagctgagctgtcttttgggagaatcctagtgagaaggttctccaactggggccgc
caagatgatgacaatgatgatgacgacaatgatgatgcatcatcaccaccaccattaatg
ccaacatcaaccaccaccacctacgccaccaccgttagcatcataaccaccaccaataac
atcaccaacagcaacactgccatcaacataaaccatcaccaccaccaaaaccattag
>gi568815577r:34349248_34549634|GENSCAN_predicted_peptide_5|244_aa
MGKDFTSKTPKAMAAKAKIDKWDLIKLKSFCTAKETTIRVNRQPTEWDQGHKILLILHLL
HHLHLTDEETEAHKDEGAGPGSCSKDVAGARALTPSPYHCHEPLIITGAKWTPHEASNQT
QASTLLGLLLGDHTEGRNDTNSTRALKVPDGTSAAWYILTIIGIYAVIFVFRLASNILRK
NDKSLEDVYYSNLTSELKMTGLQGKVAKCSTLSISNRAVLQPCQAHLGAKGGSSGPQTAT
PETP
>gi568815577r:34349248_34549634|GENSCAN_predicted_CDS_5|735_bp
atgggcaaggacttcacgtctaaaacaccaaaagcaatggcagcaaaagccaaaattgac
aaatgggatctaattaaactaaagagcttctgcacagcaaaagaaactaccatcagagtg
aacaggcaacctacagaatgggaccaaggacataagatcctgctcatccttcacctcctt
caccaccttcacctcacagatgaggaaactgaggctcacaaagatgagggggctggccca
gggtcatgcagcaaggatgtggctggggccagagccctgactcccagcccttatcactgc
cacgagccccttattatcacgggggccaagtggactccccatgaagcctccaaccagacc
caggccagcaccctcctggggctcctgctgggtgaccacacagaggggaggaatgacacc
aactccaccagggctctgaaggtgccagacggaaccagcgctgcctggtatatactcacc
atcatcggcatctacgcggtgattttcgtcttccggctggccagcaacatcctcagaaag
aatgacaagtccttagaagatgtttattactcaaatctgacctctgaactcaaaatgaca
gggctgcagggcaaggtcgccaagtgctccaccctgtctatcagcaacagagctgtgctg
cagccctgccaggcccacctgggggcaaagggcggaagcagcgggccccaaaccgcaacc
ccagagaccccctga
>gi568815577r:34349248_34549634|GENSCAN_predicted_peptide_6|129_aa
MILSNTTAVTPFLTKLWQETVQQGGNMSGLARRSPRSSDGKLEALYVLMVLGFFGFFTLG
IMLSYIRSKKLEHSNDPFNVYIESDAWQEKDKAYVQARVLESYRSCYVVENHLAIEQPNT
HLPETKPSP
>gi568815577r:34349248_34549634|GENSCAN_predicted_CDS_6|390_bp
atgatcctgtctaacaccacagcggtgacgccctttctgaccaagctgtggcaggagaca
gttcagcagggtggcaacatgtcgggcctggcccgcaggtccccccgcagcagtgacggc
aagctggaggccctctacgtcctcatggtactgggattcttcggcttcttcaccctgggc
atcatgctgagctacatccgctccaagaagctggagcactcgaacgacccattcaacgtc
tacatcgagtccgatgcctggcaagagaaggacaaggcctatgtccaggcccgggtcctg
gagagctacaggtcgtgctatgtcgttgaaaaccatctggccatagaacaacccaacaca
caccttcctgagacgaagccttccccatga
>gi568815577r:34349248_34549634|GENSCAN_predicted_peptide_7|201_aa
MGKHFFTGLASVSSFTQHRLMPATERDPGLFQISDPQQRQLGSGADPEQTTAALRKSDLT
DKRKTNKQTESTTDNINQENPIQRDTDGAGSRYPQQTNAGTENQTLHDLIYKWELNDKNT
WPQTDRYSMLKISANIVEFSSTINQLDLIHLKRHPTTAEYTFFSSSYGTLTKIDHILRHK
IHLANLEESKSSYVCPQTTME
>gi568815577r:34349248_34549634|GENSCAN_predicted_CDS_7|606_bp
atgggaaaacacttcttcacgggtctagcctctgtttccagctttacacagcacaggctg
atgcctgccactgaacgggatccaggactttttcaaatctcagatccccagcaaaggcaa
ctagggtctggagcagaccctgagcaaaccacagcagccctgcggaagagtgacctgact
gataaaagaaaaacaaacaaacaaacagaaagcaccaccgacaacatcaaccaggaaaac
cccattcaaagggacaccgatggagctggaagccgttatcctcagcaaactaatgcagga
acagaaaaccaaacactgcatgatctcatttataagtgggagctgaatgacaagaacaca
tggccacaaacagacagatacagcatgctaaaaatcagtgcgaacatagttgaattcagc
agcaccatcaatcaactggatctgattcatctaaaacgtcatccaacaacagcagaatac
acattcttctcaagctcatatggaacactcaccaagatagaccacattctgcgtcataaa
atacacttggcaaatttagaagaatcaaaatcatcctatgtctgccctcagaccacaatg
gaatga
>gi568815577r:34349248_34549634|GENSCAN_predicted_peptide_8|153_aa
MAPGQQPARKRTPPSSNHKEMDSTNTTCALLQISPGSAKPGTVEQHQVCPLARGSSVEIC
VGHSGHIHMATGDAGKAFDKIQHLFMIKTLSKIGIEGTHLKSGCQHLELLIPLQQLMHLT
VHSNWTLQTCHSTPGSPVVAGRNPTLDLVVMSL
>gi568815577r:34349248_34549634|GENSCAN_predicted_CDS_8|462_bp
atggcccccggccagcagccagcaaggaagaggacacctccgtcctccaaccacaaggaa
atggattctaccaacactacatgtgctctgctgcagatctcccctggcagtgctaagcct
ggcactgttgagcaacaccaggtctgtcccttagcgaggggatcaagtgtagaaatctgc
gtgggccactcaggacacatccacatggcaacaggagatgcaggaaaagcatttgacaaa
atccagcatctttttatgattaaaaccctcagcaaaattggcatagaagggacacacctt
aagtctggatgccagcacctggagctgctcattccattgcagcagctgatgcacctgact
gtgcacagtaactggaccctgcaaacttgccactccacacctggctcacctgtggtggct
ggtaggaatcccacattagatttagttgtcatgtctctttag
>gi568815577r:34349248_34549634|GENSCAN_predicted_peptide_9|197_aa
MHFRNFNYSFSSLIACVANSDIFSESETRAKFESLFRTYDKDITFQYFKSFKRVRINFSN
PFSAADARLQLHKTEFLGKEMKLYFAQTLHIGSSHLAPPNPDKQFLISPPASPPVGWKQV
EDATPVINYDLLYAISKLGPGEKYELHAATDTTPSVVVHVCESDQEKEEEEEMERMRRPK
PKIIQTRRPEYTPIHLS
>gi568815577r:34349248_34549634|GENSCAN_predicted_CDS_9|594_bp
atgcattttagaaactttaactacagttttagctccctgattgcctgtgtggcaaacagt
gatatcttcagcgaaagtgaaaccagggccaaatttgagtccctctttaggacgtatgac
aaggacatcacctttcagtattttaagagcttcaaacgagtcagaataaacttcagcaac
cccttctccgcagcagatgccaggctccagctgcataagactgagtttctgggaaaggaa
atgaagttatattttgctcagaccttacacataggaagctcacacctggctccgccaaat
ccagacaagcagtttctgatctcccctcccgcctctccgccagtgggatggaaacaagtg
gaagatgcgaccccagtcataaactatgatctcttatatgccatctccaagctggggcca
ggggaaaagtatgaattgcacgcagcgactgacaccactcccagcgtggtggtccatgta
tgtgagagtgatcaagagaaggaggaagaagaggaaatggaaagaatgaggagacctaag
ccaaaaattatccagaccaggaggccggagtacacgccgatccacctcagctga