GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:46:09 Sequence gi568815595f:115576024_115820879 : 244856 bp : 39.34% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 11980 12039 60 0 0 81 121 32 0.938 7.10 1.02 Term + 15299 15409 111 1 0 75 40 88 0.119 0.28 1.03 PlyA + 15980 15985 6 1.05 2.06 PlyA - 21350 21345 6 1.05 2.05 Term - 23005 22780 226 0 1 81 39 111 0.077 0.77 2.04 Intr - 48537 48455 83 0 2 59 90 64 0.060 1.22 2.03 Intr - 52823 52710 114 2 0 23 90 121 0.608 5.52 2.02 Intr - 59485 59433 53 2 2 113 82 -6 0.347 -0.99 2.01 Init - 61265 61208 58 2 1 64 88 31 0.546 2.22 2.00 Prom - 63274 63235 40 -5.65 3.00 Prom + 70786 70825 40 -5.45 3.01 Init + 72633 72828 196 2 1 65 61 91 0.277 3.24 3.02 Term + 83087 83214 128 0 2 109 49 110 0.083 6.66 3.03 PlyA + 83585 83590 6 1.05 4.00 Prom + 84284 84323 40 -4.45 4.01 Init + 90971 91015 45 1 0 84 55 25 0.118 -0.47 4.02 Intr + 99990 100587 598 2 1 80 71 946 0.555 83.28 4.03 Term + 144771 144859 89 1 2 101 43 128 0.976 6.44 4.04 PlyA + 145365 145370 6 1.05 5.00 Prom + 146539 146578 40 -4.55 5.01 Init + 153554 153634 81 1 0 78 100 75 0.210 8.72 5.02 Term + 156305 156373 69 1 0 78 54 27 0.071 -4.74 5.03 PlyA + 156592 156597 6 1.05 6.02 PlyA - 157376 157371 6 1.05 6.01 Sngl - 158879 157908 972 2 0 70 36 339 0.993 23.38 6.00 Prom - 159832 159793 40 -6.15 7.02 PlyA - 160010 160005 6 1.05 7.01 Sngl - 160578 160180 399 0 0 60 38 189 0.530 7.11 7.00 Prom - 161594 161555 40 -4.35 8.00 Prom + 167473 167512 40 -6.75 8.01 Init + 169995 170064 70 2 1 63 56 59 0.015 1.46 8.02 Term + 172622 173166 545 0 2 48 41 223 0.329 7.04 8.03 PlyA + 173273 173278 6 1.05 9.00 Prom + 174036 174075 40 -3.15 9.01 Sngl + 176310 176969 660 2 0 36 38 238 0.896 9.52 9.02 PlyA + 177142 177147 6 1.05 10.00 Prom + 180523 180562 40 -3.75 10.01 Init + 182734 182756 23 0 2 63 100 29 0.408 0.95 10.02 Term + 190787 190895 109 2 1 102 34 143 0.957 7.30 10.03 PlyA + 192194 192199 6 1.05 11.12 PlyA - 193083 193078 6 1.05 11.11 Term - 193461 193245 217 2 1 -20 48 171 0.008 -2.07 11.10 Intr - 201688 201547 142 2 1 20 65 101 0.021 -0.51 11.09 Intr - 206241 206005 237 1 0 -2 71 188 0.239 4.76 11.08 Intr - 216066 215732 335 2 2 -23 68 348 0.514 15.99 11.07 Intr - 217358 217275 84 1 0 98 89 73 0.549 6.42 11.06 Intr - 218709 218578 132 2 0 50 87 66 0.360 1.54 11.05 Intr - 220748 220681 68 2 2 64 93 43 0.431 -0.82 11.04 Intr - 221519 221346 174 2 0 101 33 67 0.441 1.71 11.03 Intr - 222415 222210 206 2 2 83 19 169 0.803 7.50 11.02 Intr - 233301 233207 95 2 2 44 86 65 0.087 0.59 11.01 Intr - 235227 235083 145 1 1 99 75 46 0.814 2.82 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 82931 82482 450 2 0 63 49 218 0.838 9.60 S.002 Init + 144367 144481 115 0 1 41 59 96 0.892 2.42 S.003 Term + 170453 170609 157 2 1 79 46 192 0.801 10.52 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815595f:115576024_115820879|GENSCAN_predicted_peptide_1|56_aa MGLKTKHLAGDGGGQKWVEKLLVFNKCSLTSLAKWRDDTVAHRIGFSEAQDPVPTS >gi568815595f:115576024_115820879|GENSCAN_predicted_CDS_1|171_bp atggggttaaaaaccaagcaccttgctggggatggaggggggcaaaagtgggtggagaag ctcctggtcttcaacaaatgctccttaacctcacttgcaaaatggagggatgacactgtt gctcatcgcataggatttagtgaagcccaagacccagtgcctacctcatag >gi568815595f:115576024_115820879|GENSCAN_predicted_peptide_2|177_aa MHCTGYLTLKEFANYLAESVIEILISSSILVSLKRCKVAKDLDVDDLMRSPRISESNEKS GMNEDYDPGDHRFGEDVSLSLVTDNPRKEAREGDKRKDLVSNSCVLVHFHAADKDILETG KKKRSNWTYSSTWLGRPQNHGKGTSYMAAARKSEGEAKMETPDKLIRSLETYSLSQE >gi568815595f:115576024_115820879|GENSCAN_predicted_CDS_2|534_bp atgcactgtactggatatcttacccttaaagagtttgcaaactacttggcagaatcagtt atagaaatcctcatttcctcctccatactggtttcactcaagaggtgtaaggtggctaaa gatttggatgtggatgacctgatgagatcacctaggattagtgagtcaaacgagaagagt ggaatgaatgaagattatgatccaggagatcatagatttggggaggatgttagcctgtct ctggtcactgacaatcccaggaaagaagcaagggaaggggataagagaaaagacctggtt tccaacagttgtgttttagtccattttcatgctgctgataaagacatactagagactggg aagaaaaagaggtctaattggacttacagttccacatggctggggaggccccagaatcat ggcaaaggcacttcttacatggcggcggcaagaaaaagtgaaggagaagcaaaaatggaa acccctgataaactcatcagatctcttgagacttattcactatcacaagaatag >gi568815595f:115576024_115820879|GENSCAN_predicted_peptide_3|107_aa MTQRLIESLFIFVALVEEGLERKELREGLTHPSVAEDTSFREKERAVSKSIWDQGSQQKE SSKINGPGEWKSGRQEGFFVSKSGPLETKQQCSVFRRQHRKEQTAEE >gi568815595f:115576024_115820879|GENSCAN_predicted_CDS_3|324_bp atgactcaaaggctgattgagagcttattcatcttcgtggcattagttgaggaagggtta gaaagaaaagagctaagagaagggcttactcacccttcagtggcagaagatacttcattc agagaaaaagaaagagcagttagtaaaagcatatgggatcagggaagccaacaaaaagaa agctctaaaataaatgggccaggggagtggaagagcggccggcaggaaggtttctttgtc agcaaaagcggacctttggaaactaaacagcaatgctctgtctttagaagacagcacagg aaggaacaaacagctgaggagtga >gi568815595f:115576024_115820879|GENSCAN_predicted_peptide_4|243_aa MKAPALVTWSENTQPVEKNDDDQKIEQDGIKPEDKAHKAATKIQASFRGHITRKKLKGEK KDDVQAAEAEANKKDEAPVADGVEKKGEGTTTAEAAPATGSKPDEPGKAGETPSEEKKGE GDAATEQAAPQAPASSEEKAGSAETESATKASTDNSPSSKAEDAPAKEEPKQADVPAAVT AAAATTPAAEDAAAKATAQPPTETGESSQAEENIEAVDETKPKESARQDEGKEEEPEADQ EHA >gi568815595f:115576024_115820879|GENSCAN_predicted_CDS_4|732_bp atgaaggctccagcattagtgacctggagcgagaacacccagccagttgaaaaaaatgat gacgaccaaaagattgaacaagatggtatcaaaccagaagataaagctcataaggccgca accaaaattcaggctagcttccgtggacacataacaaggaaaaagctcaaaggagagaag aaggatgatgtccaagctgctgaggctgaagctaataagaaggatgaagcccctgttgcc gatggggtggagaagaagggagaaggcaccactactgccgaagcagccccagccactggc tccaagcctgatgagcccggcaaagcaggagaaactccttccgaggagaagaagggggag ggtgatgctgccacagagcaggcagccccccaggctcctgcatcctcagaggagaaggcc ggctcagctgagacagaaagtgccactaaagcttccactgataactcgccgtcctccaag gctgaagatgccccagccaaggaggagcctaaacaagccgatgtgcctgctgctgtcact gctgctgctgccaccacccctgccgcagaggatgctgctgccaaggcaacagcccagcct ccaacggagactggggagagcagccaagctgaagagaacatagaagctgtagatgaaacc aaacctaaggaaagtgcccggcaggacgagggtaaagaagaggaacctgaggctgaccaa gaacatgcctga >gi568815595f:115576024_115820879|GENSCAN_predicted_peptide_5|49_aa MWRREPFQEKERDSPGNICPLDVTMPEALGCDVPHPVSKCSHRSIPTYE >gi568815595f:115576024_115820879|GENSCAN_predicted_CDS_5|150_bp atgtggaggagagagcctttccaagaaaaggaaagagacagtcctggtaacatttgtcct ctagatgtaacaatgcctgaagccctggggtgtgatgttccccaccctgtgtccaagtgt tctcatcgttcaattccaacctatgagtga >gi568815595f:115576024_115820879|GENSCAN_predicted_peptide_6|323_aa MDKFLDTYTPPRLNQEEVECLNKPITGSEIEAIINSLTTKKRPGPDGFTAEFYQSYKEEL VPFVLKLFQSIEKEGILLNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA NRIQQHIKNLIHHNQIGVITGMQGWFTICKSMNIIHHINRTNDKSHVIISIDAEKAFDKI QQPFMLKTLNKPGIDGTYLKIIGAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLF NIVLEVLARAIRQEKEIKHIQLGKEEVRLSLFADDIIVYLENPIISAQNLLKLINNFSKI SGYKINVQKSHTFLYTNNMQTES >gi568815595f:115576024_115820879|GENSCAN_predicted_CDS_6|972_bp atggataaattcctggacacatacacccccccgagactaaaccaggaagaagttgaatgt ctgaataaaccaataacaggctctgaaattgaggcaataattaatagcctaacaaccaaa aaaaggccaggaccagacggattcacagccgaattctatcagagttacaaagaggagctg gtaccattcgttctgaaactattccaatcaatagaaaaagagggaatccttcttaactca ttttatgaggccagcatcatcctgataccaaagcctggcagagacacaacaaaaaaagag aattttagaccaatatccctgatgaacatcgatgcaaaaatcctcaataaaatactggca aaccgaatccagcagcacatcaaaaaccttatccaccataatcaaatcggtgttatcact gggatgcaaggctggttcaccatatgcaaatcaatgaacataatccatcacataaacaga accaatgacaaaagccacgtgattatctcaatagatgcagaaaaggccttcgacaaaatt caacaacccttcatgctaaaaactctcaataaaccaggtattgatggaacatatctcaaa ataatcggagctatatatgacaaacctacagccaatatcattctgaatgggcaaaaactg gaagcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattc aacatagtgttggaagttctggccagggcaatcaggcaagagaaagaaataaagcatatt caattaggaaaagaggaagtcagactgtccctgtttgcagatgacataattgtatattta gaaaaccccatcatctcagcccaaaatctccttaagctgataaacaacttcagcaaaatc tcaggatacaaaatcaatgtgcaaaaatcacacacattcctgtataccaataacatgcaa acagagagctag >gi568815595f:115576024_115820879|GENSCAN_predicted_peptide_7|132_aa MKEKMLRAARKKGRVTQKEKPIRLTADLLAETLQARRKWEPIFNILKEKNFQPRISYPAK QSFTSEGEIKSFTDKQTLRHFVTTRPALQEILKEALNMERKNGYQPLQKTYQIVKTIDAR KKLHQLIGKITS >gi568815595f:115576024_115820879|GENSCAN_predicted_CDS_7|399_bp atgaaggaaaaaatgttaagggcagccagaaagaaaggtcgggttacccagaaagagaag cccatcagactaacagcggatctcttggcagaaactctgcaagccagaagaaagtgggag ccaatatttaacattctcaaagaaaagaattttcagcccagaatttcatatccagccaaa caaagcttcacaagtgaaggagaaataaaatcctttacagataagcaaacgctgagacat tttgtcaccaccaggcctgccttacaagagatcctgaaggaagcactaaatatggaaagg aaaaacgggtaccagccactgcaaaaaacataccaaattgtaaagaccattgatgctagg aagaaactgcatcaactaattggcaaaataaccagctaa >gi568815595f:115576024_115820879|GENSCAN_predicted_peptide_8|204_aa MQETMLAIPANLYRICACSSNDNPQNLLKLIGNFSKVSGYKINVQKSQAFLYTNNRQTES QIMSELPFTIASKRIKYLGIQLTRDVKDLFKENYKPLLNEIKEDTNKWKNIPCSWVGRIN IVKMAMVTKVIYRFNAIPVKLPMTFFTELEKTTLKFIWNQKRACIDKSILSQKNKAGGIM LPDFKVYYKATVTKKHGTATKTEI >gi568815595f:115576024_115820879|GENSCAN_predicted_CDS_8|615_bp atgcaggagacaatgctagctattcctgcgaacctctacaggatctgtgcctgctcttca aatgacaacccccaaaatctccttaagctgataggcaacttcagcaaagtctcaggatac aaaatcaatgtgcaaaaatcacaagcattcttatacaccaataacagacaaacagagagc caaatcatgagtgaactcccattcacaattgcttcaaagagaataaaatacctaggaatc caacttacaagggatgtgaaggacctcttcaaggagaactacaaaccactgctcaatgaa ataaaagaggatacaaacaaatggaagaacattccatgctcatgggtaggaaggatcaat atcgtgaaaatggccatggtgaccaaggtaatttatagattcaatgccatccccgtcaag ctaccaatgactttcttcacagaattggaaaaaactactttaaagttcatatggaaccaa aaaagagcctgcattgacaagtcaatcctaagccaaaagaacaaagctggaggcatcatg ctacctgacttcaaagtatactacaaggctacagtaaccaaaaagcacggtactgctacc aaaacagagatatag >gi568815595f:115576024_115820879|GENSCAN_predicted_peptide_9|219_aa MKRNEKSLQEIWDYVKRPNICLIGVPESDWENGIKLENTLQDIIQENFPNLVRQANIQIQ EIQRTSQKYSSRRATPRHIIVRFTKVEMKEKMLRAAREKGRVTHKEKPIRLPADLSAETL QARREWGPIFNILKEKNFQPRISYPTKLSFISKGEIKSFTDKQMLRHFVTTRPALQELLK EALNMERKNGYQPLQKTYQIVKTIDARKKLHQLTGKINS >gi568815595f:115576024_115820879|GENSCAN_predicted_CDS_9|660_bp atgaaaaggaatgaaaaaagcctccaagaaatatgggactatgtgaaaagaccaaacata tgtttaattggtgtacctgaaagtgactgggagaatggaatcaagttggaaaacactctt caggatattatccaggagaacttccccaatctagtaagacaggccaacattcaaattcag gaaatacagagaacatcacaaaaatactcctcgagaagagcaaccccaagacacataatt gtcagattcaccaaggttgaaatgaaggaaaaaatgttaagggcagctagagagaaaggt cgggttacccacaaggagaagcccatcagactaccagcagatctctcagcagaaacccta caagccagaagagagtgggggccaatatttaacattctcaaagaaaagaattttcagccc agaatttcatatccaaccaaactaagcttcataagcaaaggagaaataaaatcctttaca gacaagcaaatgctgagacattttgtcaccaccaggcctgccttacaagagctcctgaag gaagcactaaatatggaaaggaaaaatgggtaccagccactgcaaaaaacataccaaatt gtaaagaccattgatgctaggaagaaactgcatcaactaacgggcaaaataaacagctaa >gi568815595f:115576024_115820879|GENSCAN_predicted_peptide_10|43_aa MKRAQRMKAQKQSDASKLEGPEKENLQRKAFEKKFGKGPDLQS >gi568815595f:115576024_115820879|GENSCAN_predicted_CDS_10|132_bp atgaaaagagcacagcgcatgaaggcacaaaaacagtcagatgcttcaaagttagaagga ccagaaaaggaaaatctgcaaaggaaagcctttgaaaagaaatttggcaaagggcctgat ctacaaagctga >gi568815595f:115576024_115820879|GENSCAN_predicted_peptide_11|611_aa XPHPHPPNCVHIGKTKVAAGNTDGHSRLIWVSSNLSSAHGLDLPLNLSRAGPHIPPSPII RTEWGVTQNITVNESAYYRWETKETCFIRGPKTPVPVTDWEGSLPLVFNHCRDTSVIIHP RFKGVRPRRDACLGPSPLAASPAFLGKGQAELGPNSSSASAPPPYNLFIASPPHTWSGLQ FRSVTSPPPPAQQFTLKKVAGAKGIVKRLKTDTARSPWKTPRPSRTPSFRKAERTKGLLK IHLTKLSHQLKKDWTILLPLSLLRIQACPRNATRLATGQLGYPFISQSYVLVNGFQTVED LCEAADLRVSVADLRVSVTALKVARLELFVPPGGLVVSLASAVKLQTFAVLQLIKAKRWD WGTLEQGAALIGEARDAQEPTEGVGGSGMAGCRSRDLPRGKAAKARREIERSAGLTIKKE RCIRNGYSKEKMKLIVVSHGLHVNDLQHKLTLFTKETYTYLARDSEKQKQGYLAGLEGAH ANRVNQQISDNLAIVFGQKIQGHLSTTGQLLMAGQEGKLGQLLFLGISGWTIQLNDVYCL RILNNNDSESDLEVMVGRKFSGSRQSEVANLGCLTQEEREHPGKLCLGKCAPLQTFSATK MDSCDYITEIT >gi568815595f:115576024_115820879|GENSCAN_predicted_CDS_11|1836_bp nacccacacccacatcccccaaattgtgtccacattgggaaaactaaggtggcagcagga aacacggatggccattccaggttaatctgggtgagttcaaatctcagttctgctcatggt ttggacctgcctctcaacctctctagggcaggaccccatattcctcctagtcccatcatc agaaccgagtggggagtcactcagaatatcactgtaaatgaaagtgcctactatcgatgg gagacaaaggagacatgttttatccgtggacccaaaactccggtgccggtcacggactgg gaaggcagccttcccttggtgtttaatcactgcagggacacctctgtgattattcaccca cgtttcaaaggtgtcagaccacgcagggacgcctgccttggtccttcacccttagcggca agtcctgcttttctggggaaggggcaagccgagctaggtcccaattcttcctcagcctct gctcctccaccctataatctttttatcgcctcccctcctcacacctggtccggcttacag tttcgttcggtgactagccctcccccacctgcccagcaatttactcttaaaaaggtggct ggagccaaaggcatagtcaagcggctgaagactgacactgcccgatcgccttggaagacc cctagaccatcacggacgccgagcttcagaaaagcagaacggactaaaggtcttttaaaa atacacctcaccaagctcagccaccaacttaaaaaggactggacaatacttttaccactt tcccttctcagaattcaggcctgtcctcggaatgctacaagactggctactggccaattg ggatatccttttatcagtcaaagctatgtcctggtcaatggatttcaaactgttgaggat ctatgtgaagctgcagaccttcgagtgagtgttgcagaccttcgagtgagtgttacagct cttaaggtggcgcgtctggagttgttcgttcctcccggtgggctcgtggtctcgctggct tcagcagtgaagctgcagaccttcgcggtgttacagctcataaaagcaaaaagatgggac tggggcaccttggagcagggggcggcgctcatcggggaggctcgggacgcacaggagccc acggagggggtgggaggctcaggcatggcgggctgcaggtccagagacctgccccgcggg aaggcagctaaggcccggcgagaaatcgagcgcagcgccgggctgacgatcaagaaggaa agatgcatacgaaatggatacagcaaagaaaaaatgaaactcatagttgtttctcatggc ctccatgtaaatgacttgcagcataaactgacgcttttcaccaaggaaacatacacatac ctagccagggattcagagaagcagaagcaggggtatctagcaggacttgaaggagcccat gccaacagggtgaaccagcagatcagtgacaacttagccatagtctttggacaaaagatt caaggccatctctccactactggacagctgcttatggcaggtcaagaaggaaagctgggc cagcttcttttcctgggaatttcgggttggacaattcaactgaatgatgtttattgtctg agaatactgaacaataatgatagtgaaagcgaccttgaggtgatggtgggccgcaaattc agtgggagccggcagtcagaagtggccaatttgggatgcttgacacaggaggagagggag cacccagggaagttgtgtctgggaaagtgtgctcccctccagacctttagtgccactaaa atggactcatgcgattacatcactgaaataacttga