GENSCAN 1.0 Date run: 4-Nov-116 Time: 09:01:25 Sequence gi568815593r:134928782_135133881 : 205100 bp : 47.54% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.00 Prom + 4085 4124 40 -2.46 1.01 Init + 14888 14932 45 1 0 68 74 74 0.284 2.69 1.02 Intr + 20904 20942 39 2 0 92 77 40 0.663 1.72 1.03 Intr + 30259 30339 81 0 0 93 88 48 0.923 5.13 1.04 Term + 31805 31900 96 1 0 110 34 96 0.948 4.37 1.05 PlyA + 39188 39193 6 1.05 2.00 Prom + 40980 41019 40 -3.96 2.01 Init + 43031 43141 111 1 0 68 50 71 0.107 1.51 2.02 Intr + 67492 67731 240 0 0 73 100 489 0.979 46.35 2.03 Term + 68973 69011 39 2 0 85 41 4 0.186 -7.41 2.04 PlyA + 69590 69595 6 1.05 3.02 PlyA - 70380 70375 6 1.05 3.01 Sngl - 71897 71430 468 2 0 70 41 292 0.998 18.84 3.00 Prom - 72889 72850 40 -4.96 4.02 PlyA - 73058 73053 6 1.05 4.01 Sngl - 74292 73894 399 0 0 88 44 360 0.932 27.86 4.00 Prom - 74723 74684 40 -10.74 5.00 Prom + 74975 75014 40 -12.68 5.01 Init + 75131 75204 74 1 2 74 44 127 0.813 5.45 5.02 Intr + 79160 79358 199 2 1 53 36 266 0.415 17.25 5.03 Intr + 80060 80200 141 1 0 71 78 198 0.980 17.65 5.04 Intr + 80590 80709 120 0 0 102 109 169 0.997 21.09 5.05 Intr + 81592 81749 158 0 2 118 34 178 0.998 14.21 5.06 Term + 82740 82842 103 0 1 107 37 157 0.901 10.25 5.07 PlyA + 88682 88687 6 1.05 6.12 PlyA - 89422 89417 6 1.05 6.11 Term - 100540 99998 543 1 0 162 47 1128 0.999 110.37 6.10 Intr - 101952 101902 51 0 0 73 94 42 0.840 2.40 6.09 Intr - 102727 102495 233 2 2 89 88 398 0.996 37.39 6.08 Intr - 105168 104932 237 1 0 111 115 292 0.913 31.79 6.07 Intr - 106830 106774 57 1 0 104 94 29 0.841 3.96 6.06 Intr - 107545 107398 148 1 1 54 52 40 0.166 -3.19 6.05 Intr - 110254 110131 124 0 1 61 52 169 0.647 11.19 6.04 Intr - 110633 110467 167 2 2 21 28 155 0.672 1.56 6.03 Intr - 111047 110815 233 0 2 79 78 103 0.577 5.89 6.02 Intr - 117111 116971 141 1 0 145 45 25 0.058 4.22 6.01 Init - 121863 121710 154 0 1 104 41 139 0.279 9.07 6.00 Prom - 125442 125403 40 -5.66 7.04 PlyA - 126137 126132 6 1.05 7.03 Term - 136952 136764 189 2 0 121 52 134 0.348 10.55 7.02 Intr - 137473 137287 187 0 1 45 63 111 0.733 3.99 7.01 Init - 140669 140512 158 2 2 93 63 106 0.833 8.02 7.00 Prom - 154575 154536 40 -2.46 8.00 Prom + 161030 161069 40 -6.06 8.01 Init + 162690 162782 93 2 0 108 50 81 0.601 6.92 8.02 Intr + 182043 182210 168 2 0 94 26 78 0.013 2.34 8.03 Intr + 186189 186209 21 2 0 89 109 11 0.005 1.04 8.04 Intr + 199217 199328 112 1 1 43 53 91 0.034 1.05 8.05 Intr + 202421 202552 132 0 0 115 79 22 0.086 4.62 8.06 Term + 204446 204528 83 0 2 108 45 51 0.032 0.66 8.07 PlyA + 204661 204666 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 35785 35682 104 2 2 72 48 52 0.921 -1.96 S.002 Init + 116795 116882 88 1 1 90 64 155 0.916 14.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815593r:134928782_135133881|GENSCAN_predicted_peptide_1|86_aa MVLVKLSWVLCLLGPRLEQLTVFHVKVLAFGFMSRVALQAEKMNHHPEWFNVYNKVQITL TSHDCGELTKKDVKLAKFIEKAAASV >gi568815593r:134928782_135133881|GENSCAN_predicted_CDS_1|261_bp atggtgttggtcaagttgtcttgggttctgtgccttctgggaccgcggctagaacagtta acagtgttccacgtgaaagtcttggcatttggctttatgtcccgagttgccctacaagca gagaagatgaatcatcacccagaatggttcaatgtatacaacaaggtccagataactctc acctcacatgactgtggtgaactgaccaaaaaagatgtgaagctggccaagtttattgaa aaagcagctgcttctgtgtga >gi568815593r:134928782_135133881|GENSCAN_predicted_peptide_2|129_aa MKEEKVQIKLEEEPGYLKNQATAFLNTTQKQKNELSEFSEIFFVSICTSELSMKVYVDPI NYWKNGYNLLDVIIIIVMFLPYALRQLMGKQFTYLYIADGMQSLRILKLIGYSQGIRDPP PAYPSLFDL >gi568815593r:134928782_135133881|GENSCAN_predicted_CDS_2|390_bp atgaaggaagagaaggttcagataaaattggaggaggaaccaggatacctcaaaaatcaa gctactgcgtttttaaacacaacacaaaagcagaagaatgagctcagtgagttctcggag atcttctttgtgtccatctgcacatctgagttgtccatgaaggtctatgtggaccccatc aactactggaagaacggctacaacctgctggatgtgatcattatcatcgttatgttttta ccctatgccctccgccagctcatgggcaaacagttcacttacctgtatatcgctgatggc atgcagtccctgcgcatcctcaagcttatcggctatagccagggcatccgggaccctcca cctgcctacccctctctctttgacctttag >gi568815593r:134928782_135133881|GENSCAN_predicted_peptide_3|155_aa MDKFLDTYTLPRLNQEEVESLNRPITGSEIEAVINSLPTKKTPGPDGFTAKFYQRYKEEL VPFLLKLFQLIEKEGILPNSFYEANIILIPKPGKDTTKKENFRPISLMNINAKILNKILA NRIQQHIKKLIHHDQVGFIPGMQGCILWDATYANQ >gi568815593r:134928782_135133881|GENSCAN_predicted_CDS_3|468_bp atggataaattcctggacacctacaccctcccaagactaaaccaggaagaagttgaatcc ctgaatagaccaataacaggctctgaaattgaggcagtaattaatagcctaccaaccaaa aaaactccaggaccagatggattcacagccaaattctaccagaggtacaaggaggagctg gtaccattccttctgaaactattccaattgatagaaaaagagggaatcctccctaactca ttttatgaggccaacatcatcctgataccaaagcctggcaaagacacaacaaaaaaagag aattttagaccaatatccctgatgaacatcaatgcaaaaatcctcaataaaatactggca aaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct gggatgcaaggctgcatcctatgggacgcaacatatgcaaatcaataa >gi568815593r:134928782_135133881|GENSCAN_predicted_peptide_4|132_aa MGRNQSRKTENSKNQSTCPPPKERSSSPAMEQSWMENDFDELREEGFRRSNFSKLKEDVR THHKEAKNLEKRLDKWLTRITSVEKALNDLTELKTMVRELRDACTSFSSRFNQLEERVSV IEDQMNEMKREV >gi568815593r:134928782_135133881|GENSCAN_predicted_CDS_4|399_bp atggggagaaaccagagcagaaaaactgaaaattctaaaaatcagagcacctgtccccct ccaaaggaacgcagctcctcaccagcaatggaacaaagctggatggagaatgactttgat gagttgagagaagaaggcttcagacgatcaaacttctccaagctaaaggaggatgttcga acccatcacaaagaagctaaaaaccttgaaaaaagattagacaaatggctaactagaata accagtgtagagaaggccttaaatgacctgacggagctgaaaaccatggtacgagaacta cgcgatgcatgcacaagcttcagtagccgattcaatcaactggaagaaagggtatcagtg attgaagatcaaatgaatgaaatgaagcgagaagtttag >gi568815593r:134928782_135133881|GENSCAN_predicted_peptide_5|264_aa MPRPASALTQWAPPTVLHPLSDKPHVPALQTLITAVGQTVYTVASVLLLLFLLMYIFAIL GFCLFGSPDNGDHDNWGNLAAAFFTLFSLATVDGWTDLQKQLDNREFALSRAFTIIFILL ASFIFLNMFVGVMIMHTEDSIRKFERELMLEQQEMLMGEKQVILQRQQEEISRLMHIQKN ADCTSFSELVENFKKTLSHTDPMVLDDFGTSLPFIDIYFSTLDYQDTTVHKLQELYYEIV HVLSLMLEDLPQEKPQSLEKVDEK >gi568815593r:134928782_135133881|GENSCAN_predicted_CDS_5|795_bp atgcctcgccctgcttcggctctcactcagtgggctccacccactgtcctgcacccgctg tccgacaagccccatgtccctgccttgcagacgctgatcaccgccgtggggcagacagtc tacaccgtggcctctgtgctcctcctgctcttcctcctcatgtacatcttcgctatcttg ggcttctgcctgtttggatctccagacaatggtgaccatgataactgggggaacctggct gcagcttttttcaccctcttcagcttggccacggttgatggctggacagacctgcagaag cagttggacaatcgggaatttgctttgagccgggcattcaccatcatcttcatcttgctc gcctctttcatcttcctcaacatgttcgtgggtgtgatgatcatgcacacagaggactcc atcagaaagtttgagcgagagctgatgttggagcagcaggagatgctcatgggagagaag caggtgattctgcagcggcagcaggaggagatcagcaggctgatgcacatacagaaaaat gctgactgcacaagtttcagtgagctggtggagaactttaagaagaccttgagccacact gacccaatggtcttggatgattttggcactagcttacccttcatcgatatctacttttcc actctggactaccaggacacaactgtccacaagcttcaagagctgtactatgagatcgtg catgtgctgagcctaatgctggaagacttgccccaggagaagccccagtccttggaaaag gtggatgagaagtag >gi568815593r:134928782_135133881|GENSCAN_predicted_peptide_6|695_aa MRLGTDARVRPWLLHGSPVAQRALQRAALDLRASTCCWATRHLARKSEAARSEKIKPRED NQQGIQHLGAMETLLGASLGSSWQQGQEGTQGVVHLAPVSPPSPRATGGGGGRSGPSRGS PGLARWDVANQQGPKNRAVPEPELDPGPQPEPKPEPESAAGEPGAHEPQTQRCPGGGTVI AAVTGCYLTQPAGSTLCFGSGHLKRVAASGARATAPRRAQPEKPVAAPRSVGNAAEKQQT SVRALALFAPDPDVPPQRSEVPPAQGSLLSPAQAVLFGKTAKKRMVCEGTLPSTLQHLNI SLKGGIFYCRRLVVKVAKKHGNGRQSWRSSLGLLCLVELITGRSQGGGARPAPAPGAPAA VPTSMDAFKGGMSLERLPEGLRPPPPPPHDMGPAFHLARPADPREPLENSASESSDTELP EKERGGEPKGPEDSGAGGTGCGGADDPAKKKKQRRQRTHFTSQQLQELEATFQRNRYPDM SMREEIAVWTNLTEPRVRPSVGPFAFCPFRKVSRQVWFKNRRAKWRKRERNQQLDLCKGG YVPQFSGLVQPYEDVYAAGYSYNNWAAKSLAPAPLSTKSFTFFNSMSPLSSQSMFSAPSS ISSMTMPSSMGPGAVPGMPNSGLNNINNLTGSSLNSAMSPGACPYGTPASPYSVYRDTCN SSLASLRLKSKQHSSFGYGGLQGPASGLNACQYNS >gi568815593r:134928782_135133881|GENSCAN_predicted_CDS_6|2088_bp atgaggctggggacagacgcccgagtgcggccctggctgctccacggttctccggtggca cagcgggctctgcaacgcgccgctctggacctgcgcgcgagcacctgctgctgggccacc cggcacctcgcgcggaagagtgaggcggcgcgtagtgagaaaataaagcccagagaggac aatcagcaaggaatccagcaccttggagccatggaaacccttcttggtgcctctttaggc tcctcatggcagcaggggcaggagggcacacagggtgttgtgcacctagccccagtgtcc ccgccttccccgcgggcgacgggcggcggcggcgggaggagcgggccgagccgaggaagc cccggcctcgcgcgctgggatgtagcgaaccagcaggggccgaagaaccgtgcagtgcca gagccagagctggatccggggccccagccggagccgaaacctgagccagagtccgcggcg ggcgagcccggagcccacgagccgcagacgcagcgctgcccaggtgggggcacagtcata gccgccgtcaccgggtgctacctcacccaaccggcgggatcaaccctctgctttggctcc gggcacctcaagagggtagcagcctcgggggcacgggccacggccccgcgaagggcacaa cctgagaagcccgtggcagcccctcgcagcgtcggcaatgccgcagaaaagcagcagacg tcggtccgcgccctggctctcttcgccccggaccccgacgtcccgccgcagcgctcggag gtgcccccagcccaaggcagcctgctctcgccggcacaggctgttctttttggtaagaca gctaaaaaaagaatggtctgtgaagggacactccctagcacgctgcaacacctgaatatc tccttgaaaggagggatcttctactgcaggagactcgtggtaaaggtggccaagaaacat ggcaacggcagacaatcctggcgcagctcccttgggttgctgtgtctggtggagctgatc acaggtcgcagccagggcggcggggcgcgcccagccccggcccctggagcgcccgccgcg gtccccacctccatggacgccttcaaggggggcatgagcctggagcggctgccggagggg ctccggccgccgccgccgccaccccatgacatggggcccgccttccacctggcccggccc gccgacccccgcgagccgctcgagaactccgccagcgagtcgtctgacacggagctgcca gagaaggagcgcggcggggaacccaaggggcccgaggacagtggtgcgggaggcacgggc tgcggcggcgcagacgacccagccaagaagaagaagcagcggcggcaacgtacgcacttc acaagccagcagttgcaagagctagaggccacgttccagaggaaccgctaccccgacatg agcatgagggaggagatcgccgtgtggaccaacctcaccgagccgcgcgtgcggccatca gtaggaccctttgccttctgccctttccggaaggtctcccgtcaggtctggttcaagaac cggcgagccaagtggcgtaagcgcgagcgtaaccagcagctggacctgtgcaagggtggc tacgtgccgcagttcagcggcctagtgcagccctacgaggacgtgtacgccgccggctac tcctacaacaactgggccgccaagagcctggcgccagcgccgctctccaccaagagcttc accttcttcaactccatgagcccgctgtcgtcgcagtccatgttctcagcacccagctcc atctcctccatgaccatgccgtccagcatgggcccaggcgccgtgcctggcatgcccaac tcgggcctcaacaacatcaacaacctcaccggctcctcgctcaactcggccatgtcgccg ggcgcttgcccgtacggcactcccgcctcgccctacagcgtctaccgggacacgtgcaac tcgagcctagccagcctgcggctcaagtccaaacagcactcgtcgtttggctacggcggc ctgcagggcccggcctcgggcctcaacgcgtgccagtacaacagctga >gi568815593r:134928782_135133881|GENSCAN_predicted_peptide_7|177_aa MVAFQQSEEARPAPVTLLPHAGLLCMAGMAPEGGYQNSHCRSLHSQGQIIWAELPSKGQE VPGEQLQPLPFLRTTGSQTSTAQPETQAQRQPTPTHCHLGELAGSAYEERLVHRLTPTLC PGASPEGQQTFGKPIFGSQKIDSLHWQLELCFWQTPLNVLTEERPTVCSANASAQGS >gi568815593r:134928782_135133881|GENSCAN_predicted_CDS_7|534_bp atggtggccttccagcaaagtgaagaagcccggccagctcctgtgacgctcttgccccac gcaggccttctctgcatggcgggcatggccccagaaggtggctaccaaaactcacactgc cgttctctccacagccagggccagatcatctgggcagagctgcccagcaagggccaggag gtgccaggagagcagctccagcctctgcccttcctccgtactactggttcgcagacctcc acagcccaacctgagacccaggcccaacggcagcccactcccacacactgccacctggga gagctggcaggctcagcctacgaggagcggctcgttcacaggctgactccaaccctttgc cccggagcaagcccagaaggccagcagaccttcgggaagcccatctttggctcacagaaa atagactcactccattggcagctggagctgtgcttctggcagaccccactgaatgtgctg accgaggaaaggcctactgtgtgttcagccaatgccagtgctcaagggagctga >gi568815593r:134928782_135133881|GENSCAN_predicted_peptide_8|202_aa MDMVMVQAMQGHGEACAGFLEEEALKGWENPWLSLGSYPPIFTPSPAGICEFQGQLKVKL YHVGDEMQADVARLADGGHTLRTVLSPLVEANLQKLVNRGHSRDHRQLAAGIAQAPCLLG VSFRTQSMGFVGESQIEKDRDGGSVVWKLLPYGGHGKLFSLSNMMMMPITMSITEDTQLC DSQDPSHQVQSRTGSVSSSAAQ >gi568815593r:134928782_135133881|GENSCAN_predicted_CDS_8|609_bp atggacatggtcatggtgcaggctatgcagggtcatggtgaagcctgtgcaggcttcctg gaggaggaggccttgaagggttgggaaaacccttggctttcacttggctcctatcccccc attttcaccccctcccctgctggaatttgtgaatttcaaggtcagctgaaagtcaagctt taccatgttggtgatgaaatgcaagcagatgtggctcgtttggcggatggtggccacacc ctgagaactgtgctcagtcctcttgtggaggcaaaccttcagaaattagtcaacagaggc cattcccgggaccacaggcagctggctgcaggcattgcgcaggcgccgtgcctgctggga gtgtcttttcggacacagagcatgggcttcgtgggtgagagccagatagagaaagacaga gatggtggctctgtggtttggaagctcctcccatatggtggacatggaaagttgtttagt ctgtctaatatgatgatgatgccgattaccatgagcatcactgaagacactcagctctgt gacagccaggaccccagtcaccaggttcaaagccgcacaggtagtgtgtccagcagcgct gcccagtga