GENSCAN 1.0 Date run: 7-Nov-116 Time: 00:33:59

Sequence gi568815582f:66479608_66688116 : 208509 bp : 47.29% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +    979   1318  340  1  1   53   55   214 0.417   8.61
 1.02 PlyA +   2674   2679    6                              -0.45

 2.03 PlyA -   2793   2788    6                               1.05
 2.02 Term -   5451   5242  210  0  0   55   53   168 0.565   7.29
 2.01 Init -   6121   5801  321  1  0   78   80   189 0.949  12.44
 2.00 Prom -  10165  10126   40                              -6.16

 3.00 Prom +  10846  10885   40                              -8.46
 3.01 Init +  11434  11775  342  0  0  108  102    66 0.762   5.34
 3.02 Intr +  13355  13522  168  1  0  111   81   170 0.958  18.74
 3.03 Term +  13931  14155  225  1  0   62   37   117 0.351   0.78
 3.04 PlyA +  15009  15014    6                               1.05

 4.00 Prom +  18797  18836   40                              -4.96
 4.01 Sngl +  20917  21828  912  0  0   75   39   284 0.811  18.50
 4.02 PlyA +  22666  22671    6                               1.05

 5.10 PlyA -  22886  22881    6                               1.05
 5.09 Term -  32459  32361   99  2  0   97   49    48 0.765  -0.07
 5.08 Intr -  34204  34124   81  1  0   97  119    42 0.867   8.03
 5.07 Intr -  37608  37529   80  1  2   45   99    51 0.908   1.17
 5.06 Intr -  38270  38182   89  1  2   63   66    98 0.198   4.71
 5.05 Intr -  49460  49387   74  2  2  131   79    18 0.789   3.40
 5.04 Intr -  51862  51773   90  1  0   92   96    54 0.914   6.79
 5.03 Intr -  57410  57357   54  2  0  120  105    48 0.993   8.88
 5.02 Intr -  62346  62272   75  0  0   67   77    87 0.923   5.21
 5.01 Init -  62887  62885    3  1  0   70   66     0 0.035  -4.00
 5.00 Prom -  63200  63161   40                              -8.26

 6.00 Prom +  64401  64440   40                              -0.46
 6.01 Init +  73109  73186   78  1  0   88   81   102 0.957  10.56
 6.02 Intr +  78583  78741  159  0  0   97   89    15 0.806   2.68
 6.03 Intr +  83515  83610   96  0  0   92   98   103 0.958  11.91
 6.04 Term +  83644  83742   99  0  0   67   40     7 0.201  -8.07
 6.05 PlyA +  83824  83829    6                               1.05

 7.00 Prom +  85326  85365   40                              -5.16
 7.01 Init +  86907  87353  447  2  0  105   85   284 0.603  25.77
 7.02 Intr +  92179  92271   93  0  0   13  116    46 0.319   0.06
 7.03 Intr +  97497  97595   99  2  0  105   98    39 0.920   6.91
 7.04 Intr +  99485  99718  234  1  0   71   59   126 0.320   5.89
 7.05 Intr + 100013 100285  273  1  0  -33   85   281 0.315  13.43
 7.06 Intr + 100419 100577  159  2  0   97   84    71 0.993   7.78
 7.07 Intr + 107390 107491  102  1  0  101   89   119 0.999  13.67
 7.08 Term + 108312 108512  201  2  0   76   38   146 0.897   5.79
 7.09 PlyA + 111646 111651    6                               1.05

 8.00 Prom + 119231 119270   40                              -1.96
 8.01 Init + 125199 125345  147  2  0   95  109   141 0.572  16.99
 8.02 Intr + 128702 128857  156  1  0  128   77   135 0.996  16.61
 8.03 Intr + 129828 129923   96  2  0   93   72   210 0.983  20.11
 8.04 Intr + 130276 130396  121  0  1  115  121   196 0.998  25.67
 8.05 Intr + 133286 133363   78  0  0  125   35    50 0.481   2.92
 8.06 Term + 138420 138529  110  1  2  116   49    59 0.344   3.47
 8.07 PlyA + 139653 139658    6                               1.05

 9.05 PlyA - 140999 140994    6                               1.05
 9.04 Term - 142615 142451  165  1  0   69   55   273 0.672  20.02
 9.03 Intr - 143895 143797   99  0  0  102   91    20 0.935   4.01
 9.02 Intr - 156974 156798  177  2  0   81   79   132 0.813  11.72
 9.01 Intr - 196157 196134   24  2  0   85  103    53 0.355   4.72

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  69402  69371   32  1  2  106  103    22 0.832   3.35

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582f:66479608_66688116|GENSCAN_predicted_peptide_1|113_aa
XYEECVGPGATQLYVPTDAPPPYSLTDSCPTLDGTSDSGSGHSPGRHQQEQRTPAQGGLH
TVSMDTLPPYEAVCGAGPPSGLLPLPGPDPGPRGSQGSPTPTRAPASGPERIV

>gi568815582f:66479608_66688116|GENSCAN_predicted_CDS_1|342_bp
nnctacgaggagtgtgtggggccaggggccactcagctgtatgtccccacggacgcacca
ccaccctactcgctgactgattcctgccccacgctggatggcacctccgactcaggcagc
ggccacagccctggccgacaccagcaggagcagaggaccccggcccaaggtggccttcac
acggtctccatggacacccttcccccctacgaggctgtgtgcggggctggccccccatca
ggcctgctgccactgccgggcccagacccagggccaaggggctcccagggctcacccacc
ccaacccgggccccagcctctggcccagagaggattgtgtga

>gi568815582f:66479608_66688116|GENSCAN_predicted_peptide_2|176_aa
MLVRWSVVAWLRWGRQGLRTCNLGFLGDMLGWHNWAATLGVKICPAPKPRDTRPDLGNQS
WAPESSSGPEDVQCGVKQSRDQDRRFTSSDDEAYCLSTAHAKSLDMQQLLEPFAETWDLT
TPIHGDPMMALVDYHDGVIASTVTPLKTFFWPEEPIWPPQGAGMNILQRAASKNVC

>gi568815582f:66479608_66688116|GENSCAN_predicted_CDS_2|531_bp
atgctggtgagatggtcagtggtggcatggctgaggtggggacggcagggcctgaggacc
tgcaaccttggatttcttggagacatgttgggctggcacaactgggctgccaccctgggg
gtaaagatctgcccagcccccaagcccagagatacccggcctgatctaggcaaccagtcc
tgggctccagagagttcctctggccccgaggacgtgcaatgtggggtgaagcaaagcaga
gaccaggacagaagattcacttcctcagacgatgaagcttactgtctctccaccgctcat
gccaagagcctggacatgcagcagctgctggagccctttgcagagacctgggacctcacc
acacccatacacggggacccaatgatggctcttgtggactatcacgatggcgtcatcgcc
agcacggtgactcccctgaagacttttttctggccagaggagcccatttggcctccacaa
ggggcaggaatgaacattctccaaagggcagcctccaaaaatgtctgctga

>gi568815582f:66479608_66688116|GENSCAN_predicted_peptide_3|244_aa
MSRSQSPGPGEAALGETLLGNIISYYQERAGEGRLDVCRQAALTHRARWLLGRRAPPWLC
LPEEVALALGGCCCGDPSPVPKVSPELVHALEFLELISVNLLLFPWRKEIRSLKMYTGSF
TYWVRSMLSRHKLDAILGRLGYTATLESEFSLVQAISKENVKQMVFKIFLDIFLRSLKGN
QLPGPALAPGPSSGEVVTASGCSRGQPLVSDAVPESRKVTILSQLCPTPGPQSPGQPLDR
KPEA

>gi568815582f:66479608_66688116|GENSCAN_predicted_CDS_3|735_bp
atgagcaggtcccaatccccaggcccaggcgaagctgctctgggtgagaccctcctgggc
aacatcatcagctactaccaggagagagcaggggagggccggctcgatgtctgcaggcaa
gcagccctcacccacagggccaggtggctcctgggcaggagggcccctccatggctctgt
ctgccagaggaagtggcccttgctctgggtggctgctgttgtggggatcccagccctgtc
ccaaaagtcagccctgagctggtgcatgcactagagttcctggagctgatctctgtcaac
ctgcttctgtttccctggaggaaggaaatcaggtccctaaagatgtacactgggagcttt
acctactgggtgcgatctatgctttccagacacaaactggacgccatcctgggcaggctg
ggctacacggccaccttggagtctgagttctcactggtccaggccatcagcaaagagaat
gtcaagcagatggtgttcaagatcttcctggacattttcctgagaagcctgaagggaaac
caactcccaggcccagccctggcccctggcccttcatcgggtgaggtggtcaccgcctca
gggtgcagcagggggcagcccctggtctcggatgcagttcctgagagcagaaaagtcacc
attctcagccagctctgcccgacaccaggtccccaatctccagggcagcccttagaccgg
aagccagaagcatag

>gi568815582f:66479608_66688116|GENSCAN_predicted_peptide_4|303_aa
MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGGITLPD
FKLYYKATVTKTAWYWYQNRDIDQRNRAEPSEILPHIYNHLIFDKPDKNRKWGKDSLFNK
WCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDF
MSKTPKAMATKAKIDKRDLIKLKSFCTAKETTIRVNRQPTEWEKIFAIYLSDKGLISRIY
NELKQIYKKKNKQPHQKVGKGYNRHFSKEDIYAAKRHMKKMLIITGHQRDANQNHNEIPS
HTS

>gi568815582f:66479608_66688116|GENSCAN_predicted_CDS_4|912_bp
atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc
cacattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacctgac
ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga
gatatagaccaacggaacagagcagagccctcagaaatattgccgcatatctataaccat
ctgatctttgacaaacctgataaaaacaggaaatggggaaaggattccctatttaataaa
tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca
ccttatacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata
aaaacactagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc
atgtctaaaacaccaaaagcaatggcaaccaaagccaaaattgacaaacgggatctcatt
aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacagacaacctaca
gaatgggagaaaatttttgcaatctacttatctgacaaagggctaatatccagaatctac
aatgaactcaaacaaatttacaagaaaaaaaacaaacaaccccatcaaaaagtgggcaaa
ggatataatagacacttctcgaaagaagacatttatgcagccaaaagacacatgaaaaaa
atgctcatcatcactggccatcagagagatgcaaatcaaaaccacaatgagataccatct
cacaccagttag

>gi568815582f:66479608_66688116|GENSCAN_predicted_peptide_5|214_aa
MICVEGNIASGKTTCLEFFSNATDVEVLTEPVSKWRNVRGHNPLGLMYHDASRWGLTLQT
YVQLTMLDRHTRPQVSSVRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRN
MDVSVDLIVYLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSLFPMAAP
VLVIEADHHMERMLELFEQNRDRILTPENRKHCP

>gi568815582f:66479608_66688116|GENSCAN_predicted_CDS_5|645_bp
atgatctgtgtcgagggcaatattgcaagtgggaagacgacatgcctggaattcttctcc
aacgcgacagacgtcgaggtgttaacggagcctgtgtccaagtggagaaatgtccgtggc
cacaatcctctgggcctgatgtaccacgatgcctctcgctggggtcttacgctacagact
tatgtgcagctcaccatgctggacaggcatactcgtcctcaggtgtcatctgtacggttg
atggagaggtcgattcacagcgcaagatacatttttgtagaaaacctgtatagaagtggg
aagatgccagaagtggactatgtagttctgtcggaatggtttgactggatcttgaggaac
atggacgtgtctgttgatttgatagtttaccttcggaccaatcctgagacttgttaccag
aggttaaagaagagatgcagggaagaggagaaggtcattccgctggaatacctggaagca
attcaccatctccatgaggagtggctcatcaaaggcagccttttccccatggcagcccct
gttctggtgattgaggctgaccaccacatggagaggatgttagaactctttgaacaaaat
cgggatcgaatattaactccagagaatcggaagcattgcccatag

>gi568815582f:66479608_66688116|GENSCAN_predicted_peptide_6|143_aa
MDNVQPKIKHRPFCFSVKGHVKMLRLALTVTSMTFFIIAQAPEPYIVITGFEVTVILFFI
LLYVLRLDRLMKWLFWPLLDIINSLVTTVFMLIVSVLALIPETTTLTVGGGVLSECKFTW
KTDVRIESYTIFILEIPRPIYNA

>gi568815582f:66479608_66688116|GENSCAN_predicted_CDS_6|432_bp
atggataacgtgcagccgaaaataaaacatcgccccttctgcttcagtgtgaaaggccac
gtgaagatgctgcggctggcactaactgtgacatctatgaccttttttatcatcgcacaa
gcccctgaaccatatattgttatcactggatttgaagtcaccgttatcttatttttcata
cttttatatgtactcagacttgatcgattaatgaagtggttattttggcctttgcttgat
attatcaactcactggtaacaacagtattcatgctcatcgtatctgtgttggcactgata
ccagaaaccacaacattgacagttggtggaggggttttatcagaatgcaaatttacttgg
aaaacagatgtaagaatagaaagctatactatattcatccttgagattcctaggccaata
tacaatgcctag

>gi568815582f:66479608_66688116|GENSCAN_predicted_peptide_7|535_aa
MDPEHAKPESSEAPSGNLKQPETAAALASSGSVVSSVPKAQRNISAKTAPRKHPAVSIRS
AQSAAAARPQGSEGTAPSRKATTRPPPKPTLPPPTPSAHTESKLLNEMAIKERVEGRAKV
PYKFRDSLKRFSFSPTGMLKILRLVSGELAGSTHKAGEGQETSYFAVGFSGNAQEVSQTK
DLTNSIITAVFLSVVAILAMQEKKRRHLLYVGGPKCDHKIRAAAFIENTCFLSLPCVKVA
LYFPALGAAPAELRAVTRCSQPPDPAAERRLTDLGPPGDAGMGPRGQLRAEAAKGAKPEP
APAPPPPGAKPEEDKKDGKEPSDKPQKAVQDHKEPSDKPQKAVQPKHEVGTRRGCRRYRW
ELKDSNKEFWLLGHAEIKIRSLGCLIAAMILLSSLTVHPILRLIITMEISFFSFFILLYS
FAIHRYIPFILWPISDLFNDLIACAFLVGAVVFAVRSRRSMNLHYLLAVILIGAAGVFAF
IDVCLQRNHFRGKKAKKHMLVPPPGKEKGPQQGKGPEPAKPPEPGKPPGPAKGKK

>gi568815582f:66479608_66688116|GENSCAN_predicted_CDS_7|1608_bp
atggatcctgaacacgccaaacctgagtcatccgaggcaccttcagggaacttgaaacaa
ccggagactgccgcagccctggcaagtagcggcagcgtagtgagttctgtacccaaggca
cagcgcaacatctcagcgaagaccgcaccccggaagcaccccgcagtctcaattcgcagt
gcgcagtccgcagccgccgcacgtccccaaggcagtgagggcaccgcaccctcaaggaaa
gccaccacacgcccacccccaaagcccacactcccaccccccacgccctctgcacacact
gaatccaaactcttaaatgagatggcgatcaaagagcgcgtggagggccgagccaaagtc
ccgtacaaattcagggacagcctcaaacgtttctccttctcgcccactggaatgttgaag
atcctgagactggtgagcggagagctggctgggtccacacataaagctggagagggccag
gagacatcatactttgctgtagggttcagtggtaatgcccaagaagtatcccagacaaag
gatcttaccaacagtatcattacagctgtgttcctttcagtagttgccatcttggccatg
caagaaaagaaaagaaggcatttactctatgtcggggggcctaaatgtgaccataaaatt
agggctgctgcttttatcgagaacacctgcttcctctcgttgccttgtgtgaaagtcgcg
ttgtattttcctgcgcttggcgctgcgcccgcggagctcagggccgtgacccggtgctcg
cagccccccgaccccgcagcggagcggcggcttacagacctggggccacctggagacgcc
gggatgggaccgcgagggcagctccgcgcggaggcggcaaagggggccaagccagagcca
gcaccagctccacctccacccggggccaaacccgaggaagacaagaaggacggtaaggag
ccatcggacaaacctcaaaaggcggtgcaggaccataaggagccatcggacaaacctcaa
aaggcggtgcagcccaagcacgaagtgggcacgaggagggggtgtcgccgctaccggtgg
gaattaaaagacagcaataaagagttctggctcttggggcacgctgagatcaagattcgg
agtttgggctgcctaatagctgcaatgatactgttgtcctcactcaccgtgcaccccatc
ttgaggcttatcatcaccatggagatatccttcttcagcttcttcatcttactgtacagc
tttgccattcatagatacatacccttcatcctgtggcccatttctgacctcttcaacgac
ctgattgcttgtgcgttccttgtgggagccgtggtctttgctgtgagaagtcggcgatcc
atgaatctccactacttacttgctgtgatccttattggtgcggctggagtttttgctttt
atcgatgtgtgtcttcaaagaaaccacttcagaggcaagaaggccaaaaagcatatgctg
gttcctcctccaggaaaggaaaaaggaccccagcagggcaagggaccagaacccgccaag
ccaccagaacctggcaagccaccagggccagcaaagggaaagaaatga

>gi568815582f:66479608_66688116|GENSCAN_predicted_peptide_8|235_aa
MWPPDPDPDPDPEPAGGSRPGPAVPGLRALLPARAFLCSLKGRLLLAESGLSFITFICYV
ASSASAFLTAPLLEFLLALYFLFADAMQLNDKWQGLCWPMMDFLRCVTAALIYFAISITA
IAKYSDGASKAAGVFGFFATIVFATDFYLIFNDVAKFLKQGDSADETTAHKTEGLQPFNV
SAKNQHKETKQSLVCIWAADLAVCGPHQPTKLLGLDVGADKIRQTWKKPWILQLH

>gi568815582f:66479608_66688116|GENSCAN_predicted_CDS_8|708_bp
atgtggcccccagaccccgaccccgacccggaccccgagcctgccggcggctcccgtccc
ggccccgcggtccccgggctccgcgccctgctgccggcgcgggctttcctctgctctctc
aaaggccgcctcctgctggccgagtcgggtctctcattcatcacttttatctgctatgtg
gcgtcctcagcatctgccttcctcacagcgcctctgctggagttcctgctggccttgtac
ttcctctttgctgatgccatgcagctgaatgacaagtggcagggcttgtgctggcccatg
atggacttcctgcgctgtgtcaccgcggccctcatctactttgctatctccatcacggcc
atcgccaagtactcggatggggcttccaaagccgctggggtgtttggcttctttgctacc
atcgtgtttgcaactgatttctacctgatctttaacgacgtggccaaattcctcaaacaa
ggggactctgcagatgagaccacagcccacaagacagaaggccttcagccctttaacgtc
tctgccaaaaaccagcacaaggagacaaagcagagccttgtctgtatctgggcagcagac
ctggccgtgtgtggacctcaccagcctaccaagctgttgggcctggacgtgggtgctgat
aaaataagacaaacctggaaaaaaccctggattctgcaacttcactag

>gi568815582f:66479608_66688116|GENSCAN_predicted_peptide_9|154_aa
CVALVEVVILALIAFICIETIMACSPCEGLYFFEFVSCSAFVVTGVLLIMFSLNLHMRIP
QINWNLTDLVNTGLSAFLFFIASIVLAALNHRAGAEIAAVIFGFLATAAYAVNTFLAVQK
WRVSVRQQSTNDYIRARTESRDVDSRPEIQRLDT

>gi568815582f:66479608_66688116|GENSCAN_predicted_CDS_9|465_bp
tgtgtggcgctggtggaggtggtgatcttggccctgattgcattcatctgcatagagacc
atcatggcatgctccccgtgtgaaggcctctacttttttgagtttgtgagctgcagtgcg
tttgtggtgactggcgtcttgctgattatgttcagtctcaacctgcacatgaggatcccc
cagatcaactggaatctgacagatttggtcaacactggactcagcgctttccttttcttt
attgcttcaatcgtactggctgctttaaaccatagagccggagcagaaattgctgccgtg
atatttggcttcttggcgactgcggcatatgcagtgaacacattcctggcagtgcagaaa
tggagagtcagcgtccgccagcagagcaccaatgactacatccgagcccgcacggagtcc
agggatgtggacagtcgccctgagatccagcgcctggacacgtga