GENSCAN 1.0    Date run: 7-Nov-116    Time: 02:27:37

Sequence gi568815582f:66452716_66665991 : 213276 bp : 47.75% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   2404   2453   50  1  2   97  102    32 0.629   3.82
 1.02 Intr +  16887  17150  264  1  0  114  101   375 0.707  38.88
 1.03 Intr +  24845  24995  151  0  1   81   86   272 0.998  25.52
 1.04 Term +  27871  28210  340  1  1   53   55   214 0.419   8.61
 1.05 PlyA +  29566  29571    6                              -0.45

 2.03 PlyA -  29685  29680    6                               1.05
 2.02 Term -  32343  32134  210  0  0   55   53   168 0.565   7.29
 2.01 Init -  33013  32693  321  1  0   78   80   189 0.949  12.44
 2.00 Prom -  37057  37018   40                              -6.16

 3.00 Prom +  37738  37777   40                              -8.46
 3.01 Init +  38326  38667  342  0  0  108  102    66 0.762   5.34
 3.02 Intr +  40247  40414  168  1  0  111   81   170 0.958  18.74
 3.03 Term +  40823  41047  225  1  0   62   37   117 0.351   0.78
 3.04 PlyA +  41901  41906    6                               1.05

 4.00 Prom +  45689  45728   40                              -4.96
 4.01 Sngl +  47809  48720  912  0  0   75   39   284 0.811  18.50
 4.02 PlyA +  49558  49563    6                               1.05

 5.10 PlyA -  49778  49773    6                               1.05
 5.09 Term -  59351  59253   99  2  0   97   49    48 0.765  -0.07
 5.08 Intr -  61096  61016   81  1  0   97  119    42 0.867   8.03
 5.07 Intr -  64500  64421   80  1  2   45   99    51 0.908   1.17
 5.06 Intr -  65162  65074   89  1  2   63   66    98 0.198   4.71
 5.05 Intr -  76352  76279   74  2  2  131   79    18 0.789   3.40
 5.04 Intr -  78754  78665   90  1  0   92   96    54 0.914   6.79
 5.03 Intr -  84302  84249   54  2  0  120  105    48 0.993   8.88
 5.02 Intr -  89238  89164   75  0  0   67   77    87 0.923   5.21
 5.01 Init -  89779  89777    3  1  0   70   66     0 0.035  -4.00
 5.00 Prom -  90092  90053   40                              -8.26

 6.00 Prom +  91293  91332   40                              -0.46
 6.01 Init + 100001 100078   78  1  0   88   81   102 0.957  10.56
 6.02 Intr + 105475 105633  159  0  0   97   89    15 0.806   2.68
 6.03 Intr + 110407 110502   96  0  0   92   98   103 0.958  11.91
 6.04 Term + 110536 110634   99  0  0   67   40     7 0.201  -8.07
 6.05 PlyA + 110716 110721    6                               1.05

 7.00 Prom + 112218 112257   40                              -5.16
 7.01 Init + 113799 114245  447  2  0  105   85   284 0.603  25.77
 7.02 Intr + 119071 119163   93  0  0   13  116    46 0.319   0.06
 7.03 Intr + 124389 124487   99  2  0  105   98    39 0.920   6.91
 7.04 Intr + 126377 126610  234  1  0   71   59   126 0.320   5.89
 7.05 Intr + 126905 127177  273  1  0  -33   85   281 0.315  13.43
 7.06 Intr + 127311 127469  159  2  0   97   84    71 0.993   7.78
 7.07 Intr + 134282 134383  102  1  0  101   89   119 0.999  13.67
 7.08 Term + 135204 135404  201  2  0   76   38   146 0.897   5.79
 7.09 PlyA + 138538 138543    6                               1.05

 8.00 Prom + 146123 146162   40                              -1.96
 8.01 Init + 152091 152237  147  2  0   95  109   141 0.572  16.99
 8.02 Intr + 155594 155749  156  1  0  128   77   135 0.996  16.61
 8.03 Intr + 156720 156815   96  2  0   93   72   210 0.983  20.11
 8.04 Intr + 157168 157288  121  0  1  115  121   196 0.998  25.67
 8.05 Intr + 160178 160255   78  0  0  125   35    50 0.481   2.92
 8.06 Term + 165312 165421  110  1  2  116   49    59 0.344   3.47
 8.07 PlyA + 166545 166550    6                               1.05

 9.04 PlyA - 167891 167886    6                               1.05
 9.03 Term - 169507 169343  165  1  0   69   55   273 0.672  20.02
 9.02 Intr - 170787 170689   99  0  0  102   91    20 0.935   4.01
 9.01 Intr - 183866 183690  177  2  0   81   79   132 0.816  11.72

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  96294  96263   32  1  2  106  103    22 0.832   3.35

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582f:66452716_66665991|GENSCAN_predicted_peptide_1|268_aa
XNTLSGHYSFTDEEAKVLARYNRTSYFYPTFSESSEHSHLLVSPVLVASAVIGVVIILSC
ITIIVGSIRRDRQARLQRHRHRHHRHHHHHHHHRRRRHREYEHGYVSDEHTYSRSSRRMR
YACSSSEDWPPPLDISSDGDVDATVLRELYPDSPPGYEECVGPGATQLYVPTDAPPPYSL
TDSCPTLDGTSDSGSGHSPGRHQQEQRTPAQGGLHTVSMDTLPPYEAVCGAGPPSGLLPL
PGPDPGPRGSQGSPTPTRAPASGPERIV

>gi568815582f:66452716_66665991|GENSCAN_predicted_CDS_1|807_bp
nnaaacactctgagtggtcattactcttttacagatgaggaagctaaggtactagcacga
tacaaccgcaccagctacttctaccccacattctcagagagctcggagcacagccatctg
ctcgtgtcccccgtgctggtggcgagtgccgtcataggtgtggtcatcatcctctcctgc
atcaccatcattgtgggcagcatccgcagggacaggcaggcccggcttcagcggcaccgc
caccgccaccaccgccaccaccaccaccatcatcaccaccgccggcgtcgacaccgagag
tacgagcacggctacgtgtcggacgagcacacatacagccgctcaagccgcaggatgcgc
tatgcctgcagctcctcagaggactggcccccacccttggacatcagctctgacggggac
gtggatgccacggtgctcagggagctgtacccagattctccaccaggctacgaggagtgt
gtggggccaggggccactcagctgtatgtccccacggacgcaccaccaccctactcgctg
actgattcctgccccacgctggatggcacctccgactcaggcagcggccacagccctggc
cgacaccagcaggagcagaggaccccggcccaaggtggccttcacacggtctccatggac
acccttcccccctacgaggctgtgtgcggggctggccccccatcaggcctgctgccactg
ccgggcccagacccagggccaaggggctcccagggctcacccaccccaacccgggcccca
gcctctggcccagagaggattgtgtga

>gi568815582f:66452716_66665991|GENSCAN_predicted_peptide_2|176_aa
MLVRWSVVAWLRWGRQGLRTCNLGFLGDMLGWHNWAATLGVKICPAPKPRDTRPDLGNQS
WAPESSSGPEDVQCGVKQSRDQDRRFTSSDDEAYCLSTAHAKSLDMQQLLEPFAETWDLT
TPIHGDPMMALVDYHDGVIASTVTPLKTFFWPEEPIWPPQGAGMNILQRAASKNVC

>gi568815582f:66452716_66665991|GENSCAN_predicted_CDS_2|531_bp
atgctggtgagatggtcagtggtggcatggctgaggtggggacggcagggcctgaggacc
tgcaaccttggatttcttggagacatgttgggctggcacaactgggctgccaccctgggg
gtaaagatctgcccagcccccaagcccagagatacccggcctgatctaggcaaccagtcc
tgggctccagagagttcctctggccccgaggacgtgcaatgtggggtgaagcaaagcaga
gaccaggacagaagattcacttcctcagacgatgaagcttactgtctctccaccgctcat
gccaagagcctggacatgcagcagctgctggagccctttgcagagacctgggacctcacc
acacccatacacggggacccaatgatggctcttgtggactatcacgatggcgtcatcgcc
agcacggtgactcccctgaagacttttttctggccagaggagcccatttggcctccacaa
ggggcaggaatgaacattctccaaagggcagcctccaaaaatgtctgctga

>gi568815582f:66452716_66665991|GENSCAN_predicted_peptide_3|244_aa
MSRSQSPGPGEAALGETLLGNIISYYQERAGEGRLDVCRQAALTHRARWLLGRRAPPWLC
LPEEVALALGGCCCGDPSPVPKVSPELVHALEFLELISVNLLLFPWRKEIRSLKMYTGSF
TYWVRSMLSRHKLDAILGRLGYTATLESEFSLVQAISKENVKQMVFKIFLDIFLRSLKGN
QLPGPALAPGPSSGEVVTASGCSRGQPLVSDAVPESRKVTILSQLCPTPGPQSPGQPLDR
KPEA

>gi568815582f:66452716_66665991|GENSCAN_predicted_CDS_3|735_bp
atgagcaggtcccaatccccaggcccaggcgaagctgctctgggtgagaccctcctgggc
aacatcatcagctactaccaggagagagcaggggagggccggctcgatgtctgcaggcaa
gcagccctcacccacagggccaggtggctcctgggcaggagggcccctccatggctctgt
ctgccagaggaagtggcccttgctctgggtggctgctgttgtggggatcccagccctgtc
ccaaaagtcagccctgagctggtgcatgcactagagttcctggagctgatctctgtcaac
ctgcttctgtttccctggaggaaggaaatcaggtccctaaagatgtacactgggagcttt
acctactgggtgcgatctatgctttccagacacaaactggacgccatcctgggcaggctg
ggctacacggccaccttggagtctgagttctcactggtccaggccatcagcaaagagaat
gtcaagcagatggtgttcaagatcttcctggacattttcctgagaagcctgaagggaaac
caactcccaggcccagccctggcccctggcccttcatcgggtgaggtggtcaccgcctca
gggtgcagcagggggcagcccctggtctcggatgcagttcctgagagcagaaaagtcacc
attctcagccagctctgcccgacaccaggtccccaatctccagggcagcccttagaccgg
aagccagaagcatag

>gi568815582f:66452716_66665991|GENSCAN_predicted_peptide_4|303_aa
MAILPKVIYRFNAIPIKLPMTFFTELEKTTLKFIWNQKRAHIAKSILSQKNKAGGITLPD
FKLYYKATVTKTAWYWYQNRDIDQRNRAEPSEILPHIYNHLIFDKPDKNRKWGKDSLFNK
WCWENWLAICRKLKLDPFLTPYTKINSRWIKDLNVRPKTIKTLEENLGNTIQDIGMGKDF
MSKTPKAMATKAKIDKRDLIKLKSFCTAKETTIRVNRQPTEWEKIFAIYLSDKGLISRIY
NELKQIYKKKNKQPHQKVGKGYNRHFSKEDIYAAKRHMKKMLIITGHQRDANQNHNEIPS
HTS

>gi568815582f:66452716_66665991|GENSCAN_predicted_CDS_4|912_bp
atggccatactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatg
actttcttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcc
cacattgccaagtcaatcctaagccaaaagaacaaagctggaggcatcacgctacctgac
ttcaaactatactacaaggctacagtaaccaaaacagcatggtactggtaccaaaacaga
gatatagaccaacggaacagagcagagccctcagaaatattgccgcatatctataaccat
ctgatctttgacaaacctgataaaaacaggaaatggggaaaggattccctatttaataaa
tggtgctgggaaaactggctagccatatgtagaaagctgaaactggatcccttccttaca
ccttatacaaaaattaattcaagatggattaaagacttaaatgttagacctaaaaccata
aaaacactagaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttc
atgtctaaaacaccaaaagcaatggcaaccaaagccaaaattgacaaacgggatctcatt
aaactaaagagcttctgcacagcaaaagaaactaccatcagagtgaacagacaacctaca
gaatgggagaaaatttttgcaatctacttatctgacaaagggctaatatccagaatctac
aatgaactcaaacaaatttacaagaaaaaaaacaaacaaccccatcaaaaagtgggcaaa
ggatataatagacacttctcgaaagaagacatttatgcagccaaaagacacatgaaaaaa
atgctcatcatcactggccatcagagagatgcaaatcaaaaccacaatgagataccatct
cacaccagttag

>gi568815582f:66452716_66665991|GENSCAN_predicted_peptide_5|214_aa
MICVEGNIASGKTTCLEFFSNATDVEVLTEPVSKWRNVRGHNPLGLMYHDASRWGLTLQT
YVQLTMLDRHTRPQVSSVRLMERSIHSARYIFVENLYRSGKMPEVDYVVLSEWFDWILRN
MDVSVDLIVYLRTNPETCYQRLKKRCREEEKVIPLEYLEAIHHLHEEWLIKGSLFPMAAP
VLVIEADHHMERMLELFEQNRDRILTPENRKHCP

>gi568815582f:66452716_66665991|GENSCAN_predicted_CDS_5|645_bp
atgatctgtgtcgagggcaatattgcaagtgggaagacgacatgcctggaattcttctcc
aacgcgacagacgtcgaggtgttaacggagcctgtgtccaagtggagaaatgtccgtggc
cacaatcctctgggcctgatgtaccacgatgcctctcgctggggtcttacgctacagact
tatgtgcagctcaccatgctggacaggcatactcgtcctcaggtgtcatctgtacggttg
atggagaggtcgattcacagcgcaagatacatttttgtagaaaacctgtatagaagtggg
aagatgccagaagtggactatgtagttctgtcggaatggtttgactggatcttgaggaac
atggacgtgtctgttgatttgatagtttaccttcggaccaatcctgagacttgttaccag
aggttaaagaagagatgcagggaagaggagaaggtcattccgctggaatacctggaagca
attcaccatctccatgaggagtggctcatcaaaggcagccttttccccatggcagcccct
gttctggtgattgaggctgaccaccacatggagaggatgttagaactctttgaacaaaat
cgggatcgaatattaactccagagaatcggaagcattgcccatag

>gi568815582f:66452716_66665991|GENSCAN_predicted_peptide_6|143_aa
MDNVQPKIKHRPFCFSVKGHVKMLRLALTVTSMTFFIIAQAPEPYIVITGFEVTVILFFI
LLYVLRLDRLMKWLFWPLLDIINSLVTTVFMLIVSVLALIPETTTLTVGGGVLSECKFTW
KTDVRIESYTIFILEIPRPIYNA

>gi568815582f:66452716_66665991|GENSCAN_predicted_CDS_6|432_bp
atggataacgtgcagccgaaaataaaacatcgccccttctgcttcagtgtgaaaggccac
gtgaagatgctgcggctggcactaactgtgacatctatgaccttttttatcatcgcacaa
gcccctgaaccatatattgttatcactggatttgaagtcaccgttatcttatttttcata
cttttatatgtactcagacttgatcgattaatgaagtggttattttggcctttgcttgat
attatcaactcactggtaacaacagtattcatgctcatcgtatctgtgttggcactgata
ccagaaaccacaacattgacagttggtggaggggttttatcagaatgcaaatttacttgg
aaaacagatgtaagaatagaaagctatactatattcatccttgagattcctaggccaata
tacaatgcctag

>gi568815582f:66452716_66665991|GENSCAN_predicted_peptide_7|535_aa
MDPEHAKPESSEAPSGNLKQPETAAALASSGSVVSSVPKAQRNISAKTAPRKHPAVSIRS
AQSAAAARPQGSEGTAPSRKATTRPPPKPTLPPPTPSAHTESKLLNEMAIKERVEGRAKV
PYKFRDSLKRFSFSPTGMLKILRLVSGELAGSTHKAGEGQETSYFAVGFSGNAQEVSQTK
DLTNSIITAVFLSVVAILAMQEKKRRHLLYVGGPKCDHKIRAAAFIENTCFLSLPCVKVA
LYFPALGAAPAELRAVTRCSQPPDPAAERRLTDLGPPGDAGMGPRGQLRAEAAKGAKPEP
APAPPPPGAKPEEDKKDGKEPSDKPQKAVQDHKEPSDKPQKAVQPKHEVGTRRGCRRYRW
ELKDSNKEFWLLGHAEIKIRSLGCLIAAMILLSSLTVHPILRLIITMEISFFSFFILLYS
FAIHRYIPFILWPISDLFNDLIACAFLVGAVVFAVRSRRSMNLHYLLAVILIGAAGVFAF
IDVCLQRNHFRGKKAKKHMLVPPPGKEKGPQQGKGPEPAKPPEPGKPPGPAKGKK

>gi568815582f:66452716_66665991|GENSCAN_predicted_CDS_7|1608_bp
atggatcctgaacacgccaaacctgagtcatccgaggcaccttcagggaacttgaaacaa
ccggagactgccgcagccctggcaagtagcggcagcgtagtgagttctgtacccaaggca
cagcgcaacatctcagcgaagaccgcaccccggaagcaccccgcagtctcaattcgcagt
gcgcagtccgcagccgccgcacgtccccaaggcagtgagggcaccgcaccctcaaggaaa
gccaccacacgcccacccccaaagcccacactcccaccccccacgccctctgcacacact
gaatccaaactcttaaatgagatggcgatcaaagagcgcgtggagggccgagccaaagtc
ccgtacaaattcagggacagcctcaaacgtttctccttctcgcccactggaatgttgaag
atcctgagactggtgagcggagagctggctgggtccacacataaagctggagagggccag
gagacatcatactttgctgtagggttcagtggtaatgcccaagaagtatcccagacaaag
gatcttaccaacagtatcattacagctgtgttcctttcagtagttgccatcttggccatg
caagaaaagaaaagaaggcatttactctatgtcggggggcctaaatgtgaccataaaatt
agggctgctgcttttatcgagaacacctgcttcctctcgttgccttgtgtgaaagtcgcg
ttgtattttcctgcgcttggcgctgcgcccgcggagctcagggccgtgacccggtgctcg
cagccccccgaccccgcagcggagcggcggcttacagacctggggccacctggagacgcc
gggatgggaccgcgagggcagctccgcgcggaggcggcaaagggggccaagccagagcca
gcaccagctccacctccacccggggccaaacccgaggaagacaagaaggacggtaaggag
ccatcggacaaacctcaaaaggcggtgcaggaccataaggagccatcggacaaacctcaa
aaggcggtgcagcccaagcacgaagtgggcacgaggagggggtgtcgccgctaccggtgg
gaattaaaagacagcaataaagagttctggctcttggggcacgctgagatcaagattcgg
agtttgggctgcctaatagctgcaatgatactgttgtcctcactcaccgtgcaccccatc
ttgaggcttatcatcaccatggagatatccttcttcagcttcttcatcttactgtacagc
tttgccattcatagatacatacccttcatcctgtggcccatttctgacctcttcaacgac
ctgattgcttgtgcgttccttgtgggagccgtggtctttgctgtgagaagtcggcgatcc
atgaatctccactacttacttgctgtgatccttattggtgcggctggagtttttgctttt
atcgatgtgtgtcttcaaagaaaccacttcagaggcaagaaggccaaaaagcatatgctg
gttcctcctccaggaaaggaaaaaggaccccagcagggcaagggaccagaacccgccaag
ccaccagaacctggcaagccaccagggccagcaaagggaaagaaatga

>gi568815582f:66452716_66665991|GENSCAN_predicted_peptide_8|235_aa
MWPPDPDPDPDPEPAGGSRPGPAVPGLRALLPARAFLCSLKGRLLLAESGLSFITFICYV
ASSASAFLTAPLLEFLLALYFLFADAMQLNDKWQGLCWPMMDFLRCVTAALIYFAISITA
IAKYSDGASKAAGVFGFFATIVFATDFYLIFNDVAKFLKQGDSADETTAHKTEGLQPFNV
SAKNQHKETKQSLVCIWAADLAVCGPHQPTKLLGLDVGADKIRQTWKKPWILQLH

>gi568815582f:66452716_66665991|GENSCAN_predicted_CDS_8|708_bp
atgtggcccccagaccccgaccccgacccggaccccgagcctgccggcggctcccgtccc
ggccccgcggtccccgggctccgcgccctgctgccggcgcgggctttcctctgctctctc
aaaggccgcctcctgctggccgagtcgggtctctcattcatcacttttatctgctatgtg
gcgtcctcagcatctgccttcctcacagcgcctctgctggagttcctgctggccttgtac
ttcctctttgctgatgccatgcagctgaatgacaagtggcagggcttgtgctggcccatg
atggacttcctgcgctgtgtcaccgcggccctcatctactttgctatctccatcacggcc
atcgccaagtactcggatggggcttccaaagccgctggggtgtttggcttctttgctacc
atcgtgtttgcaactgatttctacctgatctttaacgacgtggccaaattcctcaaacaa
ggggactctgcagatgagaccacagcccacaagacagaaggccttcagccctttaacgtc
tctgccaaaaaccagcacaaggagacaaagcagagccttgtctgtatctgggcagcagac
ctggccgtgtgtggacctcaccagcctaccaagctgttgggcctggacgtgggtgctgat
aaaataagacaaacctggaaaaaaccctggattctgcaacttcactag

>gi568815582f:66452716_66665991|GENSCAN_predicted_peptide_9|146_aa
ILALIAFICIETIMACSPCEGLYFFEFVSCSAFVVTGVLLIMFSLNLHMRIPQINWNLTD
LVNTGLSAFLFFIASIVLAALNHRAGAEIAAVIFGFLATAAYAVNTFLAVQKWRVSVRQQ
STNDYIRARTESRDVDSRPEIQRLDT

>gi568815582f:66452716_66665991|GENSCAN_predicted_CDS_9|441_bp
atcttggccctgattgcattcatctgcatagagaccatcatggcatgctccccgtgtgaa
ggcctctacttttttgagtttgtgagctgcagtgcgtttgtggtgactggcgtcttgctg
attatgttcagtctcaacctgcacatgaggatcccccagatcaactggaatctgacagat
ttggtcaacactggactcagcgctttccttttctttattgcttcaatcgtactggctgct
ttaaaccatagagccggagcagaaattgctgccgtgatatttggcttcttggcgactgcg
gcatatgcagtgaacacattcctggcagtgcagaaatggagagtcagcgtccgccagcag
agcaccaatgactacatccgagcccgcacggagtccagggatgtggacagtcgccctgag
atccagcgcctggacacgtga