GENSCAN 1.0 Date run: 7-Nov-116 Time: 22:14:46 Sequence gi568815588f:92116332_92316790 : 200459 bp : 40.89% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Sngl + 2324 2722 399 1 0 97 47 259 0.950 18.75 1.02 PlyA + 5202 5207 6 1.05 2.00 Prom + 6889 6928 40 -4.05 2.01 Init + 11351 11471 121 1 1 70 82 24 0.285 0.40 2.02 Intr + 17025 17252 228 1 0 46 72 169 0.521 8.02 2.03 Term + 17817 18064 248 1 2 59 48 112 0.683 -0.73 2.04 PlyA + 18096 18101 6 1.05 3.00 Prom + 19358 19397 40 -3.65 3.01 Init + 20881 21373 493 0 1 71 40 436 0.674 32.46 3.02 Term + 21830 21975 146 0 2 28 48 116 0.809 -1.31 3.03 PlyA + 22156 22161 6 1.05 4.08 PlyA - 24861 24856 6 1.05 4.07 Term - 26533 26403 131 2 2 55 55 19 0.351 -7.34 4.06 Intr - 26787 26698 90 1 0 109 90 53 0.807 6.65 4.05 Intr - 28754 28614 141 0 0 70 74 186 0.506 14.80 4.04 Intr - 50208 50126 83 2 2 52 70 123 0.016 5.26 4.03 Intr - 50662 50491 172 2 1 -35 26 187 0.087 -1.72 4.02 Intr - 76305 76215 91 0 1 96 92 54 0.281 5.25 4.01 Init - 85697 85665 33 2 0 71 99 5 0.160 -0.18 4.00 Prom - 93729 93690 40 -3.95 5.00 Prom + 94635 94674 40 -4.15 5.01 Sngl + 100001 100462 462 1 0 82 55 473 0.965 39.31 5.02 PlyA + 100958 100963 6 1.05 6.06 PlyA - 103472 103467 6 1.05 6.05 Term - 106203 106157 47 1 2 123 47 18 0.643 -2.41 6.04 Intr - 110299 110213 87 2 0 97 54 48 0.532 1.32 6.03 Intr - 110527 110449 79 1 1 144 76 14 0.797 4.01 6.02 Intr - 124030 123015 1016 2 2 119 90 1169 0.179 109.66 6.01 Init - 157952 157838 115 2 1 69 77 54 0.155 2.82 6.00 Prom - 160651 160612 40 -2.95 7.00 Prom + 161108 161147 40 -1.65 7.01 Init + 172728 172730 3 2 0 103 81 0 0.351 0.85 7.02 Intr + 174166 175276 1111 0 1 -16 85 575 0.314 35.21 7.03 Term + 175819 176030 212 2 2 -18 43 208 0.737 2.17 7.04 PlyA + 180302 180307 6 1.05 8.02 PlyA - 180664 180659 6 1.05 8.01 Term - 189901 189777 125 2 2 30 42 163 0.415 3.47 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 39774 39875 102 2 0 55 86 110 0.823 7.79 S.002 Term - 88022 87840 183 2 0 74 43 145 0.846 5.16 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:92116332_92316790|GENSCAN_predicted_peptide_1|132_aa MAALLLRHVGRHCLRAHFSPQLCIRNAVPLGTTGKQEMERFWNKNTGLNRPVSPHVTIYS WSLPMAMAICHHDTGTALRAGASLFGMSALLLPGNFESYLELVKSLCLGPALIHTAKFAL VFPLMYHTWNGV >gi568815588f:92116332_92316790|GENSCAN_predicted_CDS_1|399_bp atggctgcactcttgctgagacatgttggtcgacattgcctccgagcccactttagccct cagctctgtatcagaaatgctgttcctttgggaaccacaggcaaacaagagatggagcgg ttctggaataagaacacaggtttaaaccgtcctgtatctccccacgtcactatctacagt tggtctcttcccatggcaatggccatctgccaccatgacactggtactgctttgagggca ggggcctctctttttggcatgtcagccctgttactccctgggaactttgagtcttatttg gagctggtgaagtccctgtgtctggggccagcactgatccacacagctaagtttgcactc gtcttccctctcatgtatcatacctggaatggggtctga >gi568815588f:92116332_92316790|GENSCAN_predicted_peptide_2|198_aa MNKQKRNGGRGLQKKGSTKAPQRPQLSCPKNSKVVSVAIVEIQTTIKEYYKHLYTNKLEN VEEMGKFLDTYTLPRLNQEEAESLNTPITGSEIEAIINSLPKKVQDQMDSQPNSTRVLEV LARAIRQEKEIKGIQLGKEEVKLSLFADDIIVYLENPIISAQNLLKLISNFSKVSGYKIN MQKSQAFLYTNIRQTAKS >gi568815588f:92116332_92316790|GENSCAN_predicted_CDS_2|597_bp atgaacaaacaaaaaagaaatgggggaagagggcttcagaaaaaaggaagtactaaggcc ccccagaggcctcagttgtcttgtccaaagaatagcaaagtagtcagtgtggccatagtg gaaatacaaactaccatcaaagaatactataaacacctctacacaaataaactagaaaat gtagaagaaatgggtaaattcctggacacatacaccctcccaagactaaaccaggaagaa gctgaatctctgaatacaccaataacaggctctgaaattgaggcaataattaatagccta ccaaaaaaagtccaggaccagatggattcacagccaaattctaccagagtgttagaagtt ctggccagggcaatcaggcaagagaaagaaataaagggtattcaattaggaaaagaggaa gtcaaattgtccctgtttgcagatgacatcattgtatatttagaaaaccccatcatctca gcccaaaatctccttaagctgataagcaacttcagcaaagtctcaggatacaaaatcaat atgcaaaaatcacaagcgttcctatataccaatatcagacaaacagccaaatcatga >gi568815588f:92116332_92316790|GENSCAN_predicted_peptide_3|212_aa MTLGDYMGASCHGCIGGTNVHAEVQKLQMEVPHIIVGTPGHISDMLNQRYLSPKYIKMFV LDEADEMLSHGFKDSIYDIFQKLNSNTQVVLLSATRPSDVLEVTKKFIRDPIWIPVKKEE LTLEGVHQFYINVEHEEWKLSTLCGLYETLIVTLGSHLHQYWKEARGIHVQQVSSVINCD LPTCIHRIGRGGWFGHKGVAINMVTEEDKRTL >gi568815588f:92116332_92316790|GENSCAN_predicted_CDS_3|639_bp atgacactaggagactacatgggtgcctcctgtcatggctgtattgggggcaccaacgtg catgctgaggtgcagaaactacagatggaagttccccatatcatcgtgggtacccctggc catatatctgatatgcttaaccagagatacctgtctcccaaatacatcaagatgtttgta ctggacgaagctgatgaaatgttaagccatggattcaaggactcaatctatgacatattc caaaagctcaacagcaacacccaggtggttttgctatcagccacaaggccttctgatgtg cttgaggtgaccaagaagttcataagggaccccatttggattcctgtcaagaaggaagag ttgaccctggagggggtccaccaattctacatcaatgtggaacatgaggagtggaaactg agcacactgtgtggcttgtatgaaaccctgattgtcaccctaggcagtcatcttcatcaa tactggaaagaagccagaggcattcatgtgcagcaggtttcttcagttatcaactgtgac cttcccacctgtatccacagaattggtcgaggtggatggtttggccataagggtgtggct attaacatggtaacagaagaagacaagaggactctttaa >gi568815588f:92116332_92316790|GENSCAN_predicted_peptide_4|246_aa MNSSKEQFKCEDRSRPYDTFNLHSLENSLMDMIRTDHEPLKVQGEATSTDGELQQVTYPD LVKIVGEGDYPKQQIFSVGETAIYWKKMPSRTFITREENALDGDEPDSNVVFMTANTSSI LQPMDQDAFLDDSHGDQALSSGLSSPTRCQNGERVERYSRKVFVGGLPPDIDEDEITASF RRFGPLVVDWPHKAESKSYFPPKETLPLKHTDCFMAPILLTVRPWKSVVDEPPANFHYFE RKDLGF >gi568815588f:92116332_92316790|GENSCAN_predicted_CDS_4|741_bp atgaattcatcaaaagaacaatttaaatgtgaggaccggagtaggccctatgatactttt aacttgcactcgttggagaactccttaatggatatgataaggactgatcatgaacctctg aaagtgcaaggtgaagcaacaagtactgatggagaactacagcaagttacttacccagat ttagttaagatagttggtgaaggtgactaccctaaacagcagattttcagtgtaggtgaa acagccatctattggaagaagatgccatctaggactttcataacgagagaggagaatgca cttgatggagatgaaccggacagtaatgttgttttcatgactgctaatacgtcatccatt ctacagcccatggatcaagatgccttcctggatgatagccatggtgatcaagccttgtca tctggcttaagttctcccactcgctgtcaaaatggggaacgagtagaacgctactctaga aaggtgtttgttggaggacttcctcctgatattgatgaagatgagatcactgccagcttt cgcaggtttggacctctcgtagtagactggcctcacaaagctgaaagcaagtcttatttt cctcctaaagagactcttcccctgaagcacactgattgtttcatggcaccgattttattg acagtgagaccctggaagtctgttgtggatgagcctcctgctaattttcattattttgaa aggaaagatctagggttttga >gi568815588f:92116332_92316790|GENSCAN_predicted_peptide_5|153_aa MTKIKADPEGPEAQAEVCSGERTYQELLVNQNPIAQLLASRRLKRKLYKCIKKAVKQKQI RRGVKEVQKFVNKGEKGIMVLAGDTLPIEVYCHLPVRCEDRNLPYVYIPSKTDLGAATGS KRPTCVIMVKPHEEYQEAYDECLEEVQSLPLPL >gi568815588f:92116332_92316790|GENSCAN_predicted_CDS_5|462_bp atgaccaaaataaaggcagatcccgaagggcccgaggctcaggcggaggtgtgttccggg gagcgcacctaccaggagctgctggtcaaccagaaccccatcgcgcagctcctggcttct cgccgcctcaagcggaagctctacaaatgcattaagaaagcggtgaagcagaagcagatt cggcgcggggtgaaagaggttcagaaatttgtcaacaaaggagaaaaagggatcatggtt ttggcaggagacacactgcccattgaggtatactgccatctcccagttaggtgtgaggac cgaaatctgccctatgtctatatcccctctaagacggacctgggtgcagccacaggctcc aagcgccccacctgtgtgataatggtcaagccccacgaggagtaccaggaggcttacgac gagtgcctggaggaggtgcagtccctgcccctacccctatga >gi568815588f:92116332_92316790|GENSCAN_predicted_peptide_6|447_aa MTAALREHLFGIYLLNICNGHVLGPSHALSQIVTTTLSAAQTMQDDLLMDKSKTQPQPQQ QQRQQQQPQPESSVSEAPSTPLSSETPKPEENSAVPALSPAAAPPAPNGPDKMQMESPLL PGLSFHQPPQQPPPPQEPAAPGASLSPSFGSTWSTGTTNAVEDSFFQGITPVNGTMLFQN FPHHVNPVFGGTFSPQIGLAQTQHHQQPPPPAPAPQPAQPAQPPQAQPPQQRRSPASPSQ APYAQRSAAAAYGHQPIMTSKPSSSSAVAAAAAAAAASSASSSWNTHQSVNAAWSAPSNP WGGLQAGRDPRRAVGVGVGVGVGVPSPLNPISPLKKPFSSNVIAPPKFPRAAPLTSKSWM EDNAFRTDNGNNLLPFQAPTPTGPFSQLLAPPAGLLPEPPLCPVECRLHKGKDLFTEVSP GAVIVPGIFWAHELREFLLNFDNVLNK >gi568815588f:92116332_92316790|GENSCAN_predicted_CDS_6|1344_bp atgacagctgcactcagggagcacttgtttggcatctacttactgaacatctgcaatggg catgtgttgggcccttcacatgcattatctcagatcgtgacgacaaccctttcagctgcg caaaccatgcaggatgatttactgatggacaaaagcaaaacccagccccagccccagcag cagcagcggcagcagcagcagccccaacctgagtccagcgtatccgaagccccgtccacg cccctctcctcagagacccccaagccggaggaaaacagcgcagtgccggccctcagccca gccgctgcccccccggcccccaacggcccggacaagatgcagatggaatcaccgctcctg ccaggcttgagtttccatcagcctcctcagcagccgccgccgcctcaggagcccgcggca ccgggcgcgtcgctgtcgccgtccttcggcagcacctggtccacgggcaccaccaacgcg gtagaggacagcttcttccaggggatcaccccagtcaacgggaccatgctcttccagaac ttcccgcaccatgtcaacccagtcttcggaggcactttctccccgcagatcggcctggcg cagacccagcaccaccagcagccgccgccgcctgcgcccgcgccgcagccggcacagcca gcgcagccaccacaggcgcagcccccgcagcagcgccgctcacccgccagccccagccag gcgccctacgcgcagaggagcgccgccgcggcgtacggccaccagcccatcatgaccagc aagccgtcctcgtcttcggcggttgcagccgctgctgccgcagccgccgcctcgtcggcc tcgtccagctggaacacgcaccaaagcgtgaatgcagcctggagcgcaccgtccaacccc tggggcggcctgcaggcgggccgggaccctcgccgggcggtcggtgtgggcgtgggtgtg ggtgtcggggtgccttccccgctcaaccccatctcgccgctcaaaaagcccttctccagc aacgtgatcgcgccgcccaagttccctcgcgcggcccctctcacttccaagtcctggatg gaggataacgctttccggaccgataatggtaacaatctgttgccatttcaggcccccacc cccacaggccccttctcccagctcctggcccctcctgcaggtctcctccctgagccccca ctctgcccagtagaatgtaggctccacaagggcaaggaccttttcactgaagtgtctcca ggggctgtgatagtaccagggatattttgggcacatgagctgagagaattcctgctgaac tttgataacgttttgaataaatga >gi568815588f:92116332_92316790|GENSCAN_predicted_peptide_7|441_aa MTVGEPWKAQAHLRRFLERERIIFPSPNSPAPNTQRDSGGSCLVQAVPICPPPQEEPHAP NQPTPRGGGVIPPKWALPQERLAPAPGASEGVSATPTAATDRPDATAGAGDPDKKWPFSP LPSPGSVPPEGKSGGKNKQAGRADLVATDSGAAHPAASWKAARKTFFPPGGRKREGAGQT AAGAEALVRAGCAASSGDGPKTTSPRLLWAALRRNPGAHGGGDSYLTKVAPPPAATRRPR SMDRNLGPTDGNPGRDRRLPASGSSSSLSAASAGLPQALHRRRQASGAAPGSWVTGSRLP PDCGLLRRLSVLLSLGLALSGSTGEGRVRRLRGRCRTKPYSRCWTGTGSCGGDREPPTLA LALPAPALEALVYLTIVEFIEDVSARTGVQLPILGGPQSVKLFVKSGAPPSAVEILVRGS YHEFLETLDPHFPRAKSYFLA >gi568815588f:92116332_92316790|GENSCAN_predicted_CDS_7|1326_bp atgactgtcggcgagccctggaaagcccaggcacatttgagaaggttcctggagagggag agaatcatctttccttcccccaactccccagccccaaacacacagagagactcgggtgga agctgtctggtccaggcagtccctatttgcccgccccctcaagaagaacctcacgcacca aaccaaccgacccctcgcggaggtggtgtaattcccccaaaatgggctctgccgcaggag aggctggctccagcgccgggggcttcggaaggagtttctgccacccccactgccgccact gaccgccccgacgccacggccggggccggggaccctgataagaaatggcccttcagcccc ctcccctcacctggctcggtcccacctgagggcaagagcggaggcaaaaacaaacaggca gggagggctgaccttgttgctaccgacagtggagcggcgcatcctgctgcttcctggaaa gcggcccgaaagacattttttccccctggaggaaggaaacgggagggcgccggccagacg gcggcaggcgcggaggcgttggtccgggcgggctgtgcagcctctagtggagacggtccg aagactacatctcccaggctgctctgggccgccctgcgtcgtaaccctggcgcgcacgga ggcggcgactcttacctcacaaaggtagctcctccgccggcagcaactcggcgcccgcgg tccatggaccggaacctcgggccgacggacgggaacccgggccgcgatcgccgcctcccc gcctcaggctcctcctcctcgctctccgccgcctccgccggactcccgcaggccctgcac cgccgccgccaggctagcggagctgccccgggaagctgggtgacgggttcgcggctgccg ccggactgcggcctactccgccgcctctcagtgctattgtccctgggcctggccttgagc gggtccactggggaaggccgtgtgcgccggctccgcggaagatgccggaccaagccctac agcagatgctggacaggtacgggcagctgtgggggggaccgggagccgccgaccctcgct ctggccctgcccgcgcccgccctcgaggctcttgtgtacttgacgatcgttgagttcatt gaagacgtttcagctcgtacaggagtccagttgcccattcttggggggccccagtcggta aaactctttgtgaagtcgggtgcaccgccctctgctgttgaaattctggtccgtgggtcc taccatgagtttttggagactctagatccacatttcccccgagccaagtcttacttcctg gcatag >gi568815588f:92116332_92316790|GENSCAN_predicted_peptide_8|41_aa XAENVRLLLTGEPPTKWSLKNPEEVKVETDGIETWIKMCCA >gi568815588f:92116332_92316790|GENSCAN_predicted_CDS_8|126_bp naagcagagaatgtgaggctattgctgacaggagagccaccaactaagtggagccttaag aatccagaagaagtgaaggtggaaactgacggcattgagacatggatcaagatgtgctgt gcctaa