GENSCAN 1.0 Date run: 7-Nov-116 Time: 02:28:52 Sequence gi568815585f:48671179_48871826 : 200648 bp : 40.20% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.03 PlyA - 290 285 6 1.05 1.02 Term - 3380 2602 779 0 2 -5 36 407 0.332 18.74 1.01 Init - 4920 4866 55 0 1 102 37 71 0.238 4.90 1.00 Prom - 17564 17525 40 -4.85 2.00 Prom + 17859 17898 40 -7.65 2.01 Sngl + 26764 27402 639 0 0 54 43 283 0.770 16.33 2.02 PlyA + 27630 27635 6 1.05 3.00 Prom + 28110 28149 40 -3.35 3.01 Sngl + 28746 29573 828 2 0 70 43 187 0.547 7.88 3.02 PlyA + 29742 29747 6 1.05 4.00 Prom + 31685 31724 40 -4.25 4.01 Sngl + 35640 36680 1041 2 0 99 49 567 0.940 50.77 4.02 PlyA + 37524 37529 6 1.05 5.00 Prom + 45650 45689 40 -5.45 5.01 Init + 51735 51762 28 2 1 83 95 12 0.254 0.66 5.02 Intr + 56875 57002 128 2 2 103 80 61 0.373 6.28 5.03 Term + 57886 57984 99 0 0 97 36 52 0.284 -1.95 5.04 PlyA + 58207 58212 6 1.05 6.09 PlyA - 58548 58543 6 1.05 6.08 Term - 60047 59867 181 1 1 94 49 120 0.214 4.80 6.07 Intr - 60680 60581 100 0 1 40 100 26 0.268 -2.75 6.06 Intr - 61490 61410 81 0 0 75 94 72 0.603 5.19 6.05 Intr - 62461 62426 36 2 0 122 61 59 0.164 3.92 6.04 Intr - 62965 62801 165 2 0 62 42 137 0.660 5.51 6.03 Intr - 67447 67092 356 0 2 44 35 177 0.200 2.01 6.02 Intr - 68034 67969 66 2 0 91 80 53 0.203 1.90 6.01 Init - 72510 72449 62 0 2 78 61 46 0.258 1.67 6.00 Prom - 73965 73926 40 -3.05 7.00 Prom + 75618 75657 40 -5.35 7.01 Sngl + 80013 80459 447 2 0 71 54 158 0.276 6.68 7.02 PlyA + 80759 80764 6 1.05 8.00 Prom + 83046 83085 40 -6.35 8.01 Init + 88586 88646 61 1 1 83 70 59 0.178 5.06 8.02 Term + 100440 100651 212 1 2 5 46 256 0.369 9.57 8.03 PlyA + 100670 100675 6 1.05 9.00 Prom + 104501 104540 40 -4.55 9.01 Init + 106979 107031 53 1 2 74 52 87 0.833 4.38 9.02 Intr + 111931 112110 180 1 0 113 95 166 0.977 17.86 9.03 Term + 115349 115463 115 2 1 78 55 58 0.501 -1.44 9.04 PlyA + 116342 116347 6 1.05 10.00 Prom + 125994 126033 40 -6.75 10.01 Init + 127187 127364 178 1 1 60 108 108 0.195 9.47 10.02 Term + 147462 147724 263 1 2 12 52 228 0.159 6.30 10.03 PlyA + 148267 148272 6 1.05 11.00 Prom + 160986 161025 40 -4.05 11.01 Init + 161940 162037 98 2 2 56 42 76 0.193 -0.37 11.02 Intr + 172208 172301 94 2 1 110 91 27 0.702 4.25 11.03 Intr + 172861 172934 74 0 2 81 70 73 0.819 2.09 11.04 Intr + 173277 173408 132 0 0 66 110 55 0.901 4.34 11.05 Intr + 175121 175251 131 2 2 24 68 138 0.790 4.82 11.06 Intr + 177114 177231 118 1 1 78 60 54 0.244 0.20 11.07 Intr + 184924 184961 38 1 2 86 111 42 0.264 3.29 11.08 Term + 193720 193832 113 2 2 56 47 110 0.223 1.44 11.09 PlyA + 193951 193956 6 1.05 12.02 PlyA - 195491 195486 6 -0.45 12.01 Term - 197359 197093 267 1 0 85 49 161 0.793 6.41 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815585f:48671179_48871826|GENSCAN_predicted_peptide_1|277_aa MAEQEQLRFAAPSKTNAETQCKKSKNIDNRLKKLLTRITSLEQNIKDLMELKNTARELYE AYTSINSRIDQMEERISEIEDLLNEIKHEDKIKEKIMKRNKQSLKICDYMKRPNLHLIGV PESDGENGSKLGNTLPDIIQENFPTLARQANIQIQEIQRTPLRYSSIRATPRHIIIRFTR AEVKEKMLRTVRQKGWVTHKGKPIRLTADLSAETLQARKEWGPIFNILKEKNFQPGISRP AKISFTGEGEIKSFTDKQMVRDFVTTRPALQELQRKH >gi568815585f:48671179_48871826|GENSCAN_predicted_CDS_1|834_bp atggctgaacaggaacagctccggtttgcagctcccagtaagaccaatgcagaaacccaa tgcaagaaatctaagaacattgataacaggttaaagaaactgctaactagaataaccagt ttagagcagaacataaaagacctgatggagctgaagaacacagcacgagaactttatgaa gcatatacaagtatcaatagcagaatcgatcaaatggaagaaaggatatcagagattgaa gatctacttaatgaaataaagcatgaagacaagattaaagaaaaaataatgaaaaggaac aaacaaagcctcaaaatatgtgactatatgaaaagaccaaacctacatctgattggtgta cctgaaagtgacggggagaatggatctaagttgggaaacacacttccagatattatccaa gagaacttccctaccctagcaagacaggccaacattcaaattcaggaaatacagagaaca ccactaagatactcctcaataagagcaaccccaagacacataatcatcagattcaccagg gctgaagtgaaggaaaaaatgttaaggacagtcagacagaaaggttgggttacccacaaa gggaagcccatcagactaaccgcagatctctccgcagaaactctacaagccagaaaagag tgggggccaatattcaacattcttaaagaaaagaattttcaacccggaatttcacgtcct gctaaaataagcttcacaggtgaaggagaaataaaatcgtttacagacaagcaaatggtg agggattttgtcaccaccaggcctgccttacaagagctccaaaggaagcactaa >gi568815585f:48671179_48871826|GENSCAN_predicted_peptide_2|212_aa MKQEEKHREKKVKRNEKSLQEIWEDYVKRPKLRLIGVPASDGENGTKLENTLQDIIQENF PNLARQANIEIQEIQRTPLRYSSRRATPRHIIVRFTKVEMKEKMLKAPREKGRVTHKGKP IRLTADPLAETLQARREWGPIFNILKEKNFQPRISYPAKLSFISEGEIKSFTDKQMLRDF VTTKPALQELLKEALNMERNNQFQPLQKHAKL >gi568815585f:48671179_48871826|GENSCAN_predicted_CDS_2|639_bp atgaagcaagaagagaagcatagagagaaaaaagtaaaaagaaatgaaaaaagcctccaa gaaatatgggaggactatgtgaaaagaccaaaactacgtctgattggtgtacctgcaagt gacggggagaatggaaccaagttggaaaacactcttcaggatattatccaggagaacttc cccaacctagcgaggcaggccaacattgaaattcaggaaatacagagaacaccactaaga tactcctcgagaagagcaactccaagacacataattgtcagattcaccaaagttgaaatg aaggaaaaaatgttaaaggcacccagagagaaaggtagggttacccacaaagggaagccc atcagactaacagcagatcccttggcagaaactctacaagccagaagagagtgggggcca attttcaacattcttaaagaaaagaattttcaacccagaatttcatatccagccaaacta agcttcataagtgaaggagaaataaaatcctttacagacaagcaaatgctgagagatttt gtcaccaccaagcctgccttacaagagctcctgaaggaagcactaaacatggaaaggaac aaccagttccagccactgcaaaaacatgccaaattgtaa >gi568815585f:48671179_48871826|GENSCAN_predicted_peptide_3|275_aa MDKFLDTYTLPRLNQKEVESLNRPITGSEIEGIINILPTKKSPGPDGFTAEFYQRYKEEL VPFLLKLFQSIEKEGILPNSFYEANIILIPKPGRDTTTKENFRLMSFMNIDAKILNKILA NQIQQHIKKLIHHDQVGFIPGMQGWFNICKSINVIHHVNRTKDKNHMIISIDAEKAFNKI QQPFMLKSVNKLGIWTYLKIIRAIYDKPTANIILNGQKLEAFPLKTGTRQGCPLSPLLFN IVLDVLARAIRQEKEIKGIKLGKEEVKLSLFADDI >gi568815585f:48671179_48871826|GENSCAN_predicted_CDS_3|828_bp atggataaattcctggacacatataccctcccaagactaaatcagaaagaagttgaatcc ctgaatagaccaataacaggttctgaaattgagggaataattaatatcctaccaaccaaa aaaagtccaggaccagatggattcacagccgaattctaccagaggtacaaagaggagctg gtaccattccttctgaaactattccaatcaatagaaaaagagggaatcctccctaattca ttttatgaggccaacatcatcctgataccaaagcctggcagagacacaacaacaaaagag aattttagactgatgtccttcatgaacattgatgcaaaaatcctaaataaaatactggca aaccaaatccagcagcacatcaaaaagcttatccaccacgatcaagttggcttcatccct gggatgcaaggctggttcaacatatgcaaatcaataaacgtaatccatcatgtaaacaga accaaagacaaaaaccacatgattatctccatagatgcagaaaaggccttcaacaaaatt caacaacccttcatgctaaaaagtgtcaataaactaggtatttggacgtatctcaaaata ataagagctatttatgacaaacccacagccaatatcatactgaatgggcaaaaattggaa gcattccctttgaaaactggcacaagacagggatgccctctctcaccactcctattcaac atagtgttggacgttctggccagggcaatcaggcaagagaaagaaataaagggtattaaa ttaggaaaagaggaagtcaaattgtcactgtttgcagatgacatttag >gi568815585f:48671179_48871826|GENSCAN_predicted_peptide_4|346_aa MERKFMSLQPSISVSEMEPNGTFSNNNSRNCTIENFKREFFPIVYLIIFFWGVLGNGLSI YVFLQPYKKSTSVNVFMLNLAISDLLFISTLPFRADYYLRGSNWIFGDLACRIMSYSLYV NMYSSIYFLTVLSVVRFLAMVHPFRLLHVTSIRSAWILCGIIWILIMASSIMLLDSGSEQ NGSVTSCLELNLYKIAKLQTMNYIALVVGCLLPFFTLSICYLLIIRVLLKVEVPESGLRV SHRKALTTIIITLIIFFLCFLPYHTLRTVHLTTWKVGLCKDRLHKALVITLALAAANACF NPLLYYFAGENFKDRLKSALRKGHPQKAKTKCVFPVSVWLRKETRV >gi568815585f:48671179_48871826|GENSCAN_predicted_CDS_4|1041_bp atggagagaaaatttatgtccttgcaaccatccatctccgtatcagaaatggaaccaaat ggcaccttcagcaataacaacagcaggaactgcacaattgaaaacttcaagagagaattt ttcccaattgtatatctgataatatttttctggggagtcttgggaaatgggttgtccata tatgttttcctgcagccttataagaagtccacatctgtgaacgttttcatgctaaatctg gccatttcagatctcctgttcataagcacgcttcccttcagggctgactattatcttaga ggctccaattggatatttggagacctggcctgcaggattatgtcttattccttgtatgtc aacatgtacagcagtatttatttcctgaccgtgctgagtgttgtgcgtttcctggcaatg gttcacccctttcggcttctgcatgtcaccagcatcaggagtgcctggatcctctgtggg atcatatggatccttatcatggcttcctcaataatgctcctggacagtggctctgagcag aacggcagtgtcacatcatgcttagagctgaatctctataaaattgctaagctgcagacc atgaactatattgccttggtggtgggctgcctgctgccatttttcacactcagcatctgt tatctgctgatcattcgggttctgttaaaagtggaggtcccagaatcggggctgcgggtt tctcacaggaaggcactgaccaccatcatcatcaccttgatcatcttcttcttgtgtttc ctgccctatcacacactgaggaccgtccacttgacgacatggaaagtgggtttatgcaaa gacagactgcataaagctttggttatcacactggccttggcagcagccaatgcctgcttc aatcctctgctctattactttgctggggagaattttaaggacagactaaagtctgcactc agaaaaggccatccacagaaggcaaagacaaagtgtgttttccctgttagtgtgtggttg agaaaggaaacaagagtataa >gi568815585f:48671179_48871826|GENSCAN_predicted_peptide_5|84_aa MATIDSPSTYEKLRHKQFKASWIPNQPLPSTAADQTTPLRSQRESPLQFDAEQCWLPAST RNQQACSHPRDLAKAITFARNAFL >gi568815585f:48671179_48871826|GENSCAN_predicted_CDS_5|255_bp atggctaccatcgattccccctctacctatgaaaagctcaggcataaacaatttaaggca tcttggattccaaaccaacccctacccagtactgctgcagaccagactacccctcttaga agtcagagggaatcccctctccaatttgatgctgagcaatgctggcttcctgcctctact cgaaaccagcaagcatgctcccatcccagggacttggcaaaggctattacctttgcccgg aatgctttcctttaa >gi568815585f:48671179_48871826|GENSCAN_predicted_peptide_6|348_aa MTSATGQLLGRFQEAYNHGRRFSHFYQVALSSSSVPELKNMCGQLLHEEPWKRLQEMTGA SLLQQKAFRTAGQHGKAACCVRAPAVITVFLGAMGLRKGKQRSLDSLEENTKVYLLFPGL AVHTHITSFNPCFSRKIAESLRGSGSCQGHTAPKEQGSSDSANQTGSSHESPHSPSDHAK APNNPHRAQPTSAAAALKAPASLFWVCSHIPANQECYPSQTRLYPSVTSHSRHGKLETNT VHSCAKCFRRPITLQEVRLYSEPPFCFPYAGNLGSYRSALKAKENEQKSRCPSLMNSRAK KLLPPVETCVSPTGLNCPVSAQHSSGHPGGSSLGLSSSTRWVVALHLC >gi568815585f:48671179_48871826|GENSCAN_predicted_CDS_6|1047_bp atgacatcggcaactggtcagcttctggggaggtttcaggaagcttacaatcatggcaga aggttcagccatttttatcaagtcgctttgtcgtcttcatcagttcctgagctgaagaac atgtgcgggcagcttctccatgaagagccttggaagcgcctgcaggaaatgacaggtgca tctcttttgcagcaaaaggccttcagaacagcagggcagcacggcaaagctgcttgctgt gtgcgtgccccggcagtcattactgtgttccttggagctatggggctgagaaaggggaag cagaggagccttgactcgctggaagagaacacaaaagtgtaccttctttttcctggcctt gcagtccacacacacattacctcatttaatccctgtttttcaaggaagatagcagagtcc ctaagagggtcaggaagttgtcaaggtcacacagctcctaaagagcagggctcctcagac tcagctaatcaaactggcagtagccatgagtcacctcacagcccaagtgaccatgctaag gctcccaacaaccctcacagagctcagccaacgtctgctgctgctgccctgaaggcccca gcttctctcttctgggtttgctctcacattccagccaaccaggaatgctacccttctcag actcgcctctatcctagtgtcacttcccacagcagacatgggaaactggaaaccaacacc gttcattcctgtgccaagtgctttaggcgcccaataaccctacaagaggtgcggctttac tctgagcctcctttttgctttccatacgctggaaatctaggctcttacaggtctgctctg aaggctaaagagaatgaacagaaaagcaggtgcccaagcctgatgaattcccgtgctaaa aaactgctgcctcctgtggagacctgtgtgtctcccacagggctgaattgccctgtgtct gcccagcacagctctgggcacccaggtggctcctccctgggcctcagcagcagcacaaga tgggttgtagccctgcatctctgttaa >gi568815585f:48671179_48871826|GENSCAN_predicted_peptide_7|148_aa MADMPFKLSSDIASFAEPHRPHSEFHVLSSPSEDSVKASTEHFSYCCSPICQTSALDYEL PEGRTHSCSPLAAQSSPVPGMKERSISCLPNEWILKYPVCSLVTQCNTLGHKISSCLKAG AGSMHSQLESESEQLCSKARQESGTHYY >gi568815585f:48671179_48871826|GENSCAN_predicted_CDS_7|447_bp atggcagatatgccttttaaacttagctcagacattgcttcctttgcagaaccacaccgg ccacactcagagttccatgtgctctcttcaccctcagaggactctgtaaaagccagcacg gagcacttctcttactgttgttcacctatttgtcagacttcagcactagactatgagctc cctgagggccggactcactcttgttcacctttagctgctcagtccagcccagtgcctgga atgaaagagaggtccatttcatgtttaccaaatgaatggattctgaaataccctgtctgc tctctggttacgcagtgcaacaccttaggccataaaattagttcctgcttaaaagcagga gcaggctctatgcactcccagttagagtcagaatctgagcagctatgttccaaggctaga caagagtctgggacacactactactga >gi568815585f:48671179_48871826|GENSCAN_predicted_peptide_8|90_aa MAEGKEEAGTSYKARAGGKKKRGDAAAKASEETHVMDYRALVHERDEAAYGELRAMVLDL RAFYAELYHIINSNLEKIVNPKGEKKPSMY >gi568815585f:48671179_48871826|GENSCAN_predicted_CDS_8|273_bp atggcagaaggcaaagaggaggcaggcacttcttacaaggctagagcaggaggaaaaaag aaacgtggggatgctgcggccaaggcctccgaggagactcatgtaatggattaccgggcc ttggtgcatgagcgagatgaggcagcctatggggagctcagggccatggtgctggacctg agggccttctatgctgagctttatcatatcatcaacagcaacctggagaaaattgtcaac ccaaagggtgaaaagaagccatctatgtactga >gi568815585f:48671179_48871826|GENSCAN_predicted_peptide_9|115_aa MPQDWCGEDAVLQPLDVRNGSPSGPDYCESCCFSGSSYPLELPDSMLVLENVCKGSSDVT CPLVFQRWVPAAADESDRFQGSSLSQTMPEFSYKKPGKTVTSTHVIIQKPFGVKF >gi568815585f:48671179_48871826|GENSCAN_predicted_CDS_9|348_bp atgcctcaggactggtgtggtgaagacgctgtgcttcagccgctggacgttaggaatggc agtccctcagggccagactactgtgaatcctgctgcttttctggatctagttacccactg gagctgccagactccatgctggtgctggagaatgtttgcaagggatctagtgatgtgacc tgtcctctagtcttccagcggtgggtaccagcagcagctgatgagagtgatagatttcag ggttccagtctttcccaaacaatgcctgaattctcatacaagaaacctggcaaaactgtg acttcaactcatgttataattcaaaaaccctttggtgtcaaattttga >gi568815585f:48671179_48871826|GENSCAN_predicted_peptide_10|146_aa MKYRKQKANTEGDVNPQCNFERRSQDKSYESDLESNRPRLQWITRFWNIFLQDEIDSLTT VLSQRNLLEKHKGSENQQEAGVTGIKQLKPAGTNGTGGPGSKNTATATQQNGLVKHCPIP DTNGLFLPSFQHLLKTQVPEMVNPIG >gi568815585f:48671179_48871826|GENSCAN_predicted_CDS_10|441_bp atgaaatatagaaaacagaaagcaaacacagaaggtgacgtgaatcctcagtgtaatttt gaaaggaggtctcaggataagagctatgaatcagatctagagagcaaccgtcccagacta cagtggatcacaagattctggaatatatttcttcaagatgaaattgatagtttaacaact gtcttaagccaaaggaatttgttggagaaacataagggctcagaaaatcaacaggaggct ggagtaacaggcattaagcaactaaaaccagctggaactaatggcactggaggaccaggc agtaagaacacagctacagccacacagcagaatggactggtcaaacactgccccatccct gatacaaatggcctgtttcttccatcctttcagcacttgctcaagactcaagttcctgag atggtgaatccaattggttga >gi568815585f:48671179_48871826|GENSCAN_predicted_peptide_11|265_aa MASTSGGDTVNIVEMITKNLEYYINLADEAVASSVCPSVVRCLHYLHWPLVDNPLHVEMS MTGQPRRLAFGTANTVDHSHAPIDLEEIWLKVAFQENGLCDLLRKEWSKINYSMTKGTIS GSHLPTLSRFIPRYSFTRTPRCPGDPLTYLGLICTDFPVLTGEKIGIMYPINMLAAAADC KLCLILPKSAFKWHHVCARKLAMGQSLHHKDAINQGAMELINSSYNVAGSFSRCIENKLE GTRMEAESPFRRLMQQIRVRKKKVA >gi568815585f:48671179_48871826|GENSCAN_predicted_CDS_11|798_bp atggcatctacttctggtggagatactgtgaacattgttgaaatgataacaaagaattta gaatattacataaacttagctgatgaagcagtggcaagctctgtctgcccctctgtggtg agatgccttcactatttgcactggcctcttgtggataatccattgcatgtagaaatgtca atgacaggacagcccagaaggctggcttttggcacagctaacacagttgaccattctcat gctcctattgacctagaagaaatctggttgaaagtggccttccaagaaaatggtctttgt gatctgcttaggaaagagtggtcaaaaattaactattctatgaccaagggcaccatttct ggcagccaccttccaactttgtccaggtttatacccagatatagcttcacgaggacccct cgctgcccaggggacccactcacatatctaggactcatctgcacagatttcccggtgctg actggagagaaaatcggcatcatgtatcccattaatatgctagctgctgctgctgattgc aaattatgcctgattcttcccaagtctgcattcaaatggcatcacgtttgtgcccggaaa ttggccatggggcagtccttacaccacaaagatgctataaatcagggtgccatggaactc atcaacagctcctacaatgtagcaggatcgttcagccgctgcattgagaacaaactggag ggcacaaggatggaagcagagagtccttttaggaggctgatgcaacaaatccgggtaaga aagaagaaagtggcctga >gi568815585f:48671179_48871826|GENSCAN_predicted_peptide_12|88_aa QAKLIALTRALTLAKELHVNIYADCKYAFHILHQHDVIWAERGFLTVQGSSSIINASLIK TLLKATLLPKEAGVIHYKGHQKASAPIA >gi568815585f:48671179_48871826|GENSCAN_predicted_CDS_12|267_bp caagccaaactcattgccttaactcgagccctcactcttgcaaaggaattgcatgtcaat atttatgctgactgtaaatatgccttccatatcctgcaccagcatgatgttatatgggct gaaagaggtttcctcactgtgcaagggtcctcctccatcattaatgcctctttaataaaa actcttctcaaggccactttacttccaaaggaagctggagtcatacactacaagggccat caaaaggcatcagctcccatcgcttag