GENSCAN 1.0 Date run: 2-Nov-116 Time: 20:37:50 Sequence gi568815586r:94471438_94682523 : 211086 bp : 42.93% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 3165 3497 333 2 0 95 115 119 0.202 10.14 1.02 Term + 3707 4057 351 1 0 17 41 196 0.194 1.30 1.03 PlyA + 4581 4586 6 1.05 2.02 PlyA - 5047 5042 6 1.05 2.01 Sngl - 13031 12474 558 2 0 70 42 279 0.901 17.38 2.00 Prom - 18479 18440 40 -5.95 3.00 Prom + 20975 21014 40 -5.35 3.01 Init + 26540 26783 244 1 1 69 64 220 0.525 15.66 3.02 Intr + 27347 27550 204 0 0 57 25 138 0.134 2.65 3.03 Intr + 32815 33069 255 2 0 18 65 207 0.153 7.89 3.04 Term + 38789 38880 92 0 2 73 44 90 0.525 0.00 3.05 PlyA + 40370 40375 6 1.05 4.05 PlyA - 43399 43394 6 1.05 4.04 Term - 44207 44072 136 1 1 73 37 105 0.642 0.41 4.03 Intr - 45787 45674 114 0 0 91 99 52 0.690 5.24 4.02 Intr - 46216 46021 196 2 1 39 34 202 0.588 7.45 4.01 Init - 50237 49787 451 2 1 69 79 379 0.392 31.22 4.00 Prom - 53842 53803 40 -5.25 5.00 Prom + 61320 61359 40 -3.05 5.01 Init + 69360 69408 49 2 1 86 89 36 0.742 2.66 5.02 Intr + 72517 72598 82 2 1 93 64 91 0.888 4.98 5.03 Intr + 75627 75825 199 0 1 112 75 137 0.572 13.23 5.04 Term + 76804 78099 1296 0 0 13 41 1353 0.116 113.69 5.05 PlyA + 78766 78771 6 1.05 6.02 PlyA - 85402 85397 6 1.05 6.01 Sngl - 91044 90712 333 0 0 67 42 141 0.214 3.17 6.00 Prom - 94435 94396 40 -2.05 7.07 PlyA - 94734 94729 6 1.05 7.06 Term - 100300 99998 303 1 0 124 48 222 0.875 15.99 7.05 Intr - 101620 101432 189 1 0 9 67 150 0.272 3.86 7.04 Intr - 107092 106957 136 0 1 68 82 275 0.982 24.55 7.03 Intr - 111101 110185 917 2 2 149 68 1093 0.963 102.52 7.02 Intr - 116798 116700 99 2 0 -6 117 127 0.345 5.59 7.01 Init - 117334 117311 24 1 0 70 58 17 0.231 -3.44 7.00 Prom - 118359 118320 40 -8.25 8.00 Prom + 118651 118690 40 -5.65 8.01 Init + 119480 119703 224 1 2 58 14 208 0.363 8.48 8.02 Intr + 121902 122059 158 0 2 -5 63 203 0.531 7.23 8.03 Intr + 127441 127527 87 2 0 49 70 91 0.052 2.42 8.04 Term + 149810 149976 167 0 2 13 48 215 0.573 7.10 8.05 PlyA + 150414 150419 6 1.05 9.08 PlyA - 154875 154870 6 1.05 9.07 Term - 167709 167578 132 0 0 86 44 149 0.970 7.41 9.06 Intr - 171083 170955 129 2 0 93 102 105 0.922 12.37 9.05 Intr - 173112 172957 156 0 0 81 77 80 0.751 5.49 9.04 Intr - 179031 178916 116 1 2 0 88 193 0.001 9.75 9.03 Intr - 192608 192506 103 2 1 68 88 52 0.694 2.03 9.02 Intr - 199662 199583 80 1 2 98 76 43 0.124 2.45 9.01 Init - 204655 204613 43 1 1 56 76 79 0.218 4.13 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 76816 78099 1284 0 0 69 41 1407 0.855 127.79 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586r:94471438_94682523|GENSCAN_predicted_peptide_1|227_aa LQVSLKRLPSGSPRGYIFFSAHCVKEALLLPVDLGQLPLPRWGFLFLPAGTLTQRVQNAQ TVAIACSSVGPSLHFLARIHPTLGTRTSKPSKPRVAGTESTKFPDESFGWMMVQYVIKPV DPLVISSYSYHLYYEMDPLFGSCVELDTIPVDQAFFKPLDSGMGCGSEAKKGKPIPRINM YPCKDKLLATPEWKGHSVVDLPPSGWFIFSSNNAISGTQGWSLLLAN >gi568815586r:94471438_94682523|GENSCAN_predicted_CDS_1|684_bp ttacaagtaagtctcaaaagactcccaagtggatcacctaggggctacatatttttctct gcccattgtgtaaaagaagcccttctgcttcctgttgatctaggtcagttacccctgcca agatggggattcctcttcctgcctgctggcaccttgacacaaagggtccaaaatgcccaa acagttgccatagcttgtagttcagtgggaccttccttgcatttcctggcaagaattcac cccactttggggaccaggacttctaaaccttcaaaacccagagttgcaggaacagaaagc acaaagttccccgatgagtcgtttgggtggatgatggtgcagtatgtgataaaaccagta gatcccttagtcattagttcatactcataccacctttactatgaaatggatcccctgttt ggatcctgtgttgagttggatactatacctgtggatcaggccttctttaagcctctagat agtggtatgggctgtggctctgaagccaagaaaggcaaacccatacccagaataaatatg tatccctgtaaggataaactgctggccactccagaatggaaagggcacagtgttgttgac ttgccaccaagtggctggtttatcttctcaagtaataatgccatatcagggactcagggt tggtctctgttgctggcaaattag >gi568815586r:94471438_94682523|GENSCAN_predicted_peptide_2|185_aa MDKFLDTYTLPRLNQEEVESLNRPITGAEIVAIINSLPSKKSPGPDGFTAEFYQRYMEEL VPFLLKLFQSVEKEGIFPNSFYEASIILIPKPGRDTTKKENFRPISLMNIDAKILNKILA NRIQQHIKKLIHHDQVGFIPGMQGWFNIHKSINVIQHINRTKNKNHMIISIDAEKAFDKI QQPSC >gi568815586r:94471438_94682523|GENSCAN_predicted_CDS_2|558_bp atggataaattcctcgacacatacactctcccaagactaaaccaggaagaagttgaatct ctgaatagaccaataacaggagctgaaattgtggcaataatcaatagcttaccaagcaaa aagagtccaggaccagatggattcacagccgaattctaccagaggtacatggaggaactg gtaccattccttctgaaactattccaatcagtagaaaaagagggaatcttccctaactca ttttatgaggccagcatcatcctgataccaaagcctggaagagacacaaccaaaaaagag aattttagaccaatatccttgatgaacatcgatgcaaaaatcctcaataaaatactggca aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct gggatgcaaggctggttcaatatacacaaatcaataaatgtaatccagcatataaacaga accaaaaacaaaaaccacatgattatctcaatagatgcagaaaaggcctttgacaaaatt caacaaccttcatgctaa >gi568815586r:94471438_94682523|GENSCAN_predicted_peptide_3|264_aa MALFSRLLHQPLSQLPALATVPLSASSMAPAAPTPAAMLPSVGTTSWLPPVHLQGSQLYQ ARSHSVSDTSMHVTDNSDSEPEKKRRMGLERSTQGTLKVMSEESENDLETIEQGPETKTP YLRNLEGSKEHLVTIKQAIQRQKLSRKFKIKSLDTKARGASMLGWRYSMHLPLIIADKSK LCLRDSTERERLEAHAWELAWTLSSVPFPTADFNLYSFTVINHHREYNSFLSEFSISRED KVAKRCSLLYGSVSCPMTSDVFAK >gi568815586r:94471438_94682523|GENSCAN_predicted_CDS_3|795_bp atggctttattctctcgccttctccatcagcctttgtctcagctgcctgctctggccaca gtccctctcagtgccagctccatggctcctgctgcccccacgcctgcagctatgctccct agtgtgggcaccacttcctggctccctcctgttcacctgcaaggcagccagctttatcaa gcacgctctcacagtgtcagtgatacatctatgcacgtcacagataacagtgactcagaa ccagagaagaaaaggaggatggggttggagagaagcacacagggtactttaaaggtaatg agtgaggaaagtgaaaatgacttggagaccattgaacagggcccagagacaaaaactcct tatctgaggaatttagaaggcagcaaagaacacctggtgaccatcaaacaggccatccaa aggcaaaagttatctaggaaatttaaaataaaatccctggacaccaaggctaggggagcc tccatgctaggttggcgttactccatgcatctgccactcatcattgccgacaaaagtaaa ctctgtctgcgggactccactgaaagagaacgactggaagctcatgcctgggaacttgcc tggactctgtcctctgtgccttttcccactgctgattttaatctgtattctttcactgta ataaaccatcatcgtgaatataacagctttctttctgagttctccatcagcagggaagac aaagtggccaagagatgctcacttctctatggaagtgtcagctgccccatgaccagtgat gtctttgccaagtga >gi568815586r:94471438_94682523|GENSCAN_predicted_peptide_4|298_aa MHEKRNIKQKNDEKTPQGAVPAYLLDREEQSRAKVLSNRIKQKRKEKVGKWEVPLPKVRA QGETEVLKVIRTGKRKKKAWKRMVTKVCFVGDGFTRKPPKYERFIRPMGLHFKKAHVTHP EMKATFCLRILGVKKSPLSPLYTTFDVITKGIIHEVTCVRLLSEDVKVNLECRLDWIKEY PEKWQNIILGCVFEDVSRADWHVSLSGLSREDLPSISKAFRLGLSHTAGIPGPPVCRPSI VELSLYNNMNQFSSQPAGLLLLGGHYIDAKVNADTRSAQRTTAQNSWAQVVLLPQPPK >gi568815586r:94471438_94682523|GENSCAN_predicted_CDS_4|897_bp atgcatgaaaagagaaacatcaaacaaaagaatgatgaaaagactccacagggagcagta cctgcctatctgctggacagagaggaacaatctcgagctaaagtactttccaataggatt aaacagaaacgaaaagagaaagtgggaaaatgggaagtccctctgcctaaagtacgtgcc cagggagaaacagaagtattaaaagttattcgaacaggaaagagaaagaagaaggcatgg aagaggatggttactaaagtctgctttgttggagatgggtttacaagaaaaccacctaaa tatgaaagattcatcaggccaatgggcttgcatttcaagaaagcccatgtaacacatcct gaaatgaaagccaccttttgcctacgaatacttggtgtaaagaagagccccttgtcccca ctgtatacaactttcgatgttattaccaaaggtatcatccatgaagtaacatgtgtcaga cttttatcggaggacgtgaaggttaatttagagtgtcgacttgactggattaaggaatac ccagagaagtggcaaaacattattttggggtgtgtcttcgaggatgtttctagagcagat tggcatgtgagtctcagtggactaagtcgggaagatctgccctccatttctaaggctttc agacttggactgagtcacactgctggcatcccagggcctccagtttgcagaccgtctata gtggaacttagcctctataataacatgaaccaattctcctcgcaacctgctggtcttctg ctcctgggaggtcactatatcgatgccaaagttaatgcagacaccagatcggcacagcgc actactgcccagaactcctgggctcaagtggtcctcctgcctcagcctcccaagtag >gi568815586r:94471438_94682523|GENSCAN_predicted_peptide_5|541_aa MGFHHVGQAGLELLTSVLPHFIKDEYKENPLFLTVQWGKVAVLGWHHGDCLVTPLSNQHQ TLETSSQVSWSLIPTRRPHLHPRMDFRPQQQRQPYFLCDTGATTFLTGPQIASPMAAQAW KLLQALALLPRFLVARSRAVQLTSRRWLSLQECQSKKLMSDNRVTVQRFFVADTANEALE AAKRLNAKEIVLKAQILAGGRGKGVFNSGLKRGVHLTKDPNVVGQLAKQMIGYNLATKQT PKEGVKVNKVMVAEALDISRETYLAILMDQSCNGPMLVGSPQEGIDIEEVAASNPELIFK EQIDIFEGIKDSQAQRMAENLGFVGPLKSQAADQITKLYNLFLKIDATQVEVNPFGETPE GQVVCFDAKINFDDNAEFQQKDIFAMDDKSENEPIENEAAKYDLKYIGLDGNIACFVNGA GLAMATCDIIFLNGRKPASFLDLGGGVKEAQVYQAFTLPTADPKVEAILVNIFGGIVNCA IIANGITKACQELELKVPLVVWLEGTNVQEAQEILNSRLPITSAVDLEDAAKKAVASVAK K >gi568815586r:94471438_94682523|GENSCAN_predicted_CDS_5|1626_bp atggggtttcaccatgttggccaggctggtctcgaactcctgacctcagttttgccacat tttatcaaggatgaatacaaggaaaatcctctcttcctgaccgttcagtggggaaaagtg gcagtgctggggtggcatcatggggactgtctggtgactcccctaagcaaccagcatcag acactagaaacttccagtcaagtctcctggtccctgattccaaccagacgaccacatctt caccccaggatggactttagacctcagcagcagagacagccatacttcctgtgtgacact ggggcaaccacgtttctcacaggcccccagatagcgtcccccatggcagcgcaggcctgg aagcttctgcaagccctagcgctgctgccccgcttcctggtggccaggtcccgggcagtt caattaacctccagaagatggctgagcctgcaggaatgccagagcaagaaactgatgtct gataacagagtgacagttcaaagattctttgtagcagacactgcgaatgaagctcttgag gccgctaagagactaaatgcaaaagaaattgttttaaaagcccagatcttagctggagga agaggaaaaggtgtcttcaatagtggtttgaaaagaggtgttcatttaacaaaagaccct aatgttgtgggacagctggctaaacagatgattgggtacaatctagcgacaaaacaaact ccaaaagaaggtgtgaaagttaacaaggtgatggttgctgaagccttggacatttccaga gaaacctacctggcaattctgatggaccagtcctgcaatggccccatgctggtgggcagc ccccaggagggcatcgacattgaagaggtggctgcttcaaacccagagctcatttttaag gagcaaattgacatttttgaaggaataaaggacagccaagctcagcggatggcagaaaat ctaggcttcgttgggcctttgaaaagccaggctgcagatcaaattacgaagctgtataat ctcttcctgaaaattgatgctactcaggtggaagtgaatccctttggtgaaactccagaa ggacaagttgtctgttttgatgccaagataaactttgatgacaatgcagaattccaacaa aaagacatatttgctatggacgacaaatcagagaatgagcccattgaaaatgaagctgcc aaatatgatctaaaatacataggactagatgggaacattgcctgctttgtgaatggtgct gggctcgccatggctacttgtgatatcattttccttaatggtaggaagccagccagcttc ttggatcttggaggtggtgtaaaggaagctcaagtatatcaagcattcacattgcccaca gctgatcctaaggttgaagccatccttgtcaatatttttggtggtatcgtcaactgtgcc atcattgccaatgggatcaccaaagcctgccaggagctagaactcaaggtgccccttgtg gtttggcttgaaggaaccaacgtccaagaggcccaggaaatactcaacagcagactcccc attacttcagccgttgacctggaggatgcagccaagaaggctgtggccagtgtggccaag aagtga >gi568815586r:94471438_94682523|GENSCAN_predicted_peptide_6|110_aa MEVTHSVNKQSFSKEKVEGCRGPVAGSLGELESGSCVPRAGGDPAWPAALGLLGFWFTEL VHGLPEAIDSVQAPSQAGDCLAASVNSFYWGQTELSDRAERFSSLAAVYA >gi568815586r:94471438_94682523|GENSCAN_predicted_CDS_6|333_bp atggaggtgacccattcagtcaataagcaatcattctcaaaagagaaggtagagggttgc agggggccggtggctggctctctgggtgagctggagagtggcagctgtgtccctagagca ggcggggatcctgcctggccagctgcactcggtcttctgggtttctggttcacagagctt gttcatggcctccctgaggctatcgactctgtgcaggctccaagccaagctggcgactgc ttggcagcctcagtgaactctttctactggggacagactgagctgtccgacagggcagaa cgtttcagctccttggcagctgtatatgcttga >gi568815586r:94471438_94682523|GENSCAN_predicted_peptide_7|555_aa MLGRGLDKRVNTECPGRQPSVNPAQCVQAALVVTELLCPCPVERHDMNTLSLPLNIRRGG SDTNLNFDVPDGILDFHKVKLTADSLKQKILKVTEQIKIEQTSRDGNVAEYLKLVNNADK QQAGRIKQVFEKKNQKSAHSIAQLQKKLEQYHRKLREIEQNGASRSSKDISKDHLKDIHR SLKDAHVKSRTAPHCMESSKSGMPGVSLTPPVFVFNKSREFANLIRNKFGSADNIAHLKN SLEEFRPEASARAYGGSATIVNKPKYGSDDECSSGTSGSADSNGNQSFGAGGASTLDSQG KLAVILEELREIKDTQAQLAEDIEALKVQFKREYGFISQTLQEERYRYERLEDQLHDLTD LHQHETANLKQELASIEEKVAYQAYERSRDIQVLSGSSDGDVKEAWRSVSQSQKKNWPRD TLVAICVLVIEDLAWTRSREIASDGVLSNTAFREQEALESCQTRISKLELHQQEQQALQT DTVNAKVLLGRCINVILAFMTVILVCVSTIAKFVSPMMKSRCHILGTFFAVTLLAIFCKN WDHILCAIERMIIPR >gi568815586r:94471438_94682523|GENSCAN_predicted_CDS_7|1668_bp atgctgggccgtggattggacaagcgggttaatacagagtgtcctgggagacagccttcc gtgaatcctgcacagtgtgtccaagccgcactggtggtgactgagttgctgtgcccgtgt ccggtagaacgtcatgacatgaataccttaagcctgcccctgaacatacgccgagggggg tcagacaccaacctcaactttgatgtcccggatggcatcctggacttccacaaggtcaaa ctcactgcagacagcctgaagcaaaaaattctaaaggtaacagagcagataaaaattgag caaacatcgcgcgatgggaatgttgcggagtatctgaaactagtgaacaacgcagacaag cagcaggcgggacgtatcaagcaagtctttgagaagaagaatcagaaatcagctcactcc atcgcccagctgcagaagaagttagagcagtatcatcgaaagctcagagagatcgagcag aatggagcctctaggagctcaaaggacatttccaaagaccacctgaaggatatacatcgc tctttgaaagatgcccacgtgaaatctcgaactgccccccattgcatggagagcagcaaa tcgggcatgccaggggtctcacttactccacctgtgttcgttttcaataagtccagagag tttgccaacctgatccggaataagtttggcagcgccgacaacattgctcacttgaaaaat tccttagaggagtttaggccagaggcgagtgccagggcctacgggggcagcgctaccatc gtgaacaaacccaagtatggcagtgatgatgaatgttcgagtggcacgtcaggctcggcc gacagtaacggaaaccagtcgtttggggctggtggagccagcacactggacagccagggc aagctcgccgtgatcctggaggaactgagggagatcaaggatacccaagctcagctggct gaggacatcgaggcactgaaggtgcagtttaagagagaatatggttttatttctcagacc ctgcaagaggaaagatacaggtatgagcgactggaggaccagctgcatgacctgacggac ctgcatcagcatgagacagccaacctgaagcaggagctggccagcattgaggagaaggtg gcctaccaggcctacgagcgctcgcgggacatccaggtgctttcaggaagctctgatgga gatgtcaaagaggcatggagatctgtgagtcaatctcagaagaagaactggcctagagat acacttgtagccatctgtgtcttagtaattgaagacttggcatggacgagatccagggag atagcctcggatggtgtcctgagtaacaccgcgttcagagagcaggaagccttggaatcc tgccagactcgcatttctaagctggagctccaccagcaagagcagcaagctctgcagaca gacaccgtgaatgctaaagttctcctggggaggtgcatcaacgtgatcctggccttcatg actgtcatcttagtgtgtgtgtccaccatcgcgaagttcgtctcacccatgatgaagagt cgctgccacattcttggcaccttctttgccgtgactcttcttgctatattttgtaaaaac tgggaccatatcctgtgtgccatagaaaggatgataataccaagatga >gi568815586r:94471438_94682523|GENSCAN_predicted_peptide_8|211_aa MRNDRELQGRSHAYDQHLNMILGDEEETATTIEIDEETYEEIYKSRKQNIPMLFVLGDGR AGCPSIDSWMKQFVLLDNRLHLKKKKEKKRKKKEERRRRKKKKEEEGEGEGEGEEEEEEE EFYFLRAEPKSVFGLAVSPKHRQQTSRTHSKHSQQPELTLRVSKPTRLSQLTQLFWLLRG SEDQYVEEPLGLVFLKLQLQAISGTEPGLCI >gi568815586r:94471438_94682523|GENSCAN_predicted_CDS_8|636_bp atgagaaatgaccgagagcttcaaggcagatcacatgcttatgatcaacatttaaatatg atcttgggagatgaggaagaaaccgcgactactatagaaattgatgaagaaacatacgaa gagatatataaatcaaggaaacagaatattccaatgctctttgtcctgggagatggtcgt gctggttgcccctccattgacagctggatgaaacaatttgtcctcctggacaacaggctc catctaaaaaaaaaaaaagaaaagaaaagaaagaagaaagaagaaagaagaagaaggaag aagaagaaggaagaagaaggagaaggagaaggagaaggagaagaagaagaagaagaagaa gaattctatttcctgagagctgaacctaagtctgtatttgggttggctgtatctccaaaa cacaggcaacagacttccaggacccactcaaaacactcgcagcaaccagagttaacactg cgggtaagcaaacccaccaggctctctcagttgacacagctcttctggctccttcgagga agtgaggaccagtatgtggaagagcctttggggctggtgtttctcaagctacagcttcag gcaatctcagggacagaacctggactgtgcatctga >gi568815586r:94471438_94682523|GENSCAN_predicted_peptide_9|252_aa MKGNEMRLRQQDSGCTAASIHLPSSSSHEWCQGLNCAPVTLHSFLEVEYFGQSVPAKRLC VFYMGATALGMKGAGAPEDGKVRGRSAEMPGSDTALTVDRTYSYPGRHHRCKSRAFPLEE KELPATTSSPACHIRCRDHQDSCYQVGSQGVFCFRPSAGISRTAAKVLKAFEETVPTKNI TKGSLKQLENIARCPHQNSLWNPGLTELKSSDDHDLGETEVMDDDRPKSGTSRKYEFFGN FRQGQIVGGEEI >gi568815586r:94471438_94682523|GENSCAN_predicted_CDS_9|759_bp atgaaaggaaatgaaatgcgtctgcggcaacaggactctggatgcacagctgcttccata catttgccaagttcatcatcgcatgagtggtgtcagggactgaactgtgcccctgtaaca ttgcatagcttcctagaagtagaatattttggtcaaagtgtaccagcaaaaagactctgt gtattctacatgggagctacggccttgggaatgaaaggagctggagccccagaagacggg aaagttcgcggccggagcgcggagatgccgggcagcgacacggcgctcaccgtggaccgg acctactcgtaccccggccggcaccaccgctgcaagagccgggcttttcctttagaggag aaggaactgccggccaccaccagctcacccgcctgtcacatccgctgtagagatcaccaa gacagctgctaccaagttggatctcagggtgtattttgcttcagaccctctgccggcatc tccaggactgctgctaaggtcttaaaggcattcgaggagactgtgccaactaaaaacatt acaaagggcagtttaaagcaactagaaaacatagcccggtgtccacaccagaactccctg tggaaccctggcttgactgaattaaagtcttctgacgaccatgacctgggagaaactgag gtaatggatgacgataggccgaagagtggaacatccagaaaatatgaattctttgggaat ttcaggcaaggacagatagtgggtggagaagaaatttga