GENSCAN 1.0 Date run: 4-Nov-116 Time: 22:16:21 Sequence gi568815591r:21802978_22016897 : 213920 bp : 40.06% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 4730 4831 102 1 0 59 86 89 0.540 6.09 1.02 Intr + 4902 5072 171 2 0 49 115 200 0.927 18.02 1.03 Intr + 13490 13725 236 1 2 23 75 250 0.633 12.76 1.04 Intr + 15240 15362 123 0 0 19 106 146 0.843 8.28 1.05 Intr + 32520 32786 267 0 0 39 108 82 0.130 1.02 1.06 Intr + 39567 39771 205 0 1 82 93 268 0.979 24.98 1.07 Intr + 49490 49654 165 1 0 48 97 100 0.966 6.14 1.08 Intr + 51338 51478 141 1 0 80 90 74 0.973 6.43 1.09 Intr + 58876 59046 171 0 0 47 110 189 0.823 16.22 1.10 Intr + 61558 61680 123 0 0 65 95 37 0.794 1.96 1.11 Intr + 63493 63686 194 0 2 63 75 278 0.941 21.37 1.12 Intr + 64882 65030 149 1 2 81 97 145 0.999 13.66 1.13 Intr + 65887 66014 128 2 2 109 89 95 0.995 11.18 1.14 Intr + 70297 70524 228 0 0 61 110 149 0.414 11.54 1.15 Intr + 77725 77916 192 0 0 116 106 21 0.964 5.47 1.16 Intr + 81314 81433 120 1 0 108 99 146 0.999 17.47 1.17 Intr + 89448 89690 243 2 0 61 75 284 0.932 21.17 1.18 Intr + 91646 91828 183 1 0 23 121 124 0.977 8.26 1.19 Intr + 91907 92022 116 1 2 35 91 95 0.995 2.83 1.20 Intr + 96359 96471 113 2 2 51 95 180 0.980 14.10 1.21 Intr + 96871 97143 273 2 0 57 90 156 0.469 9.29 1.22 Term + 98030 98277 248 0 2 60 45 225 0.975 10.37 1.23 PlyA + 98835 98840 6 1.05 2.12 PlyA - 98864 98859 6 -1.75 2.11 Term - 99375 99345 31 2 1 80 48 43 0.732 -4.15 2.10 Intr - 100137 100001 137 0 2 62 107 89 0.725 6.65 2.09 Intr - 101282 101133 150 2 0 112 74 209 0.935 21.34 2.08 Intr - 102654 102529 126 0 0 67 63 167 0.990 12.06 2.07 Intr - 103479 103312 168 0 0 72 20 107 0.740 1.52 2.06 Intr - 105530 105153 378 2 0 86 77 445 0.883 37.44 2.05 Intr - 108774 108571 204 0 0 71 63 164 0.696 10.67 2.04 Intr - 108929 108829 101 0 2 26 51 78 0.453 -3.19 2.03 Intr - 112208 112079 130 2 1 101 65 50 0.514 3.35 2.02 Intr - 113917 113777 141 1 0 62 80 204 0.601 16.63 2.01 Init - 118034 117954 81 2 0 48 81 7 0.303 -2.98 2.00 Prom - 120635 120596 40 -7.75 3.00 Prom + 121001 121040 40 -5.05 3.01 Sngl + 125364 125831 468 2 0 78 48 363 0.879 27.18 3.02 PlyA + 126249 126254 6 1.05 4.00 Prom + 126800 126839 40 -7.05 4.01 Init + 127176 127244 69 2 0 70 55 81 0.057 4.10 4.02 Intr + 142648 142767 120 0 0 54 65 110 0.138 5.07 4.03 Term + 146025 146150 126 2 0 85 43 88 0.192 1.30 4.04 PlyA + 146697 146702 6 1.05 5.06 PlyA - 147066 147061 6 1.05 5.05 Term - 160800 160553 248 1 2 63 40 133 0.191 0.97 5.04 Intr - 165972 165816 157 0 1 43 64 123 0.438 4.06 5.03 Intr - 183291 183160 132 0 0 18 36 137 0.021 1.42 5.02 Intr - 185681 185508 174 2 0 6 91 135 0.047 4.81 5.01 Init - 206939 206871 69 2 0 11 93 102 0.443 4.10 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815591r:21802978_22016897|GENSCAN_predicted_peptide_1|1296_aa MMDFKHMKDYNSHTNPLHSVPYSDCVEGRKPCHKSEKIRWGQSIKSFEAQEKTLCGDVLL TAAFVSYVGPFTRQYRQELVHCKWVPFLQQKVSIPLTEGLDLISMLTDDATIAAWNNEGL PSDRMSTENAAILTHCERWPLVIDPQQQGIKWIKNKYGMDLKVTHLGQKGFLNAIETALA FGDVILIENLEETIDPVLDPLLGRNTIKKGKIKGKKHTIISINTENSFDKIQDPFMRKIL NKLCIEGTYLKTIKAIYDKPTANILNREKLKAFPVKSETRQGFSLPPLLFSTILEVLGSK YIRIGDKECEFNKNFRLILHTKLANPHYKPELQAQTTLLNFTVTEDGLEAQLLAEVVSIE RPDLEKLKLVLTKHQNDFKIELKYLEDDLLLRLSAAEGSFLDDTKLVERLEATKTTVAEI EHKVIEAKENERKINEARECYRPVAARASLLYFVINDLQKINPLYQFSLKAFNVLFHRAI EQADKVEDMQGRISILMESITHAVFLYTSQALFEKDKLTFLSQMAFQILLRKKEIDPLEL DFLLRFTVEHTHLSPVDFLTSQSWSAIKAIAVMEEFRGIDRDVEGSAKQWRKWVESECPE KEKLPQEWKKKSLIQKLILLRAMRPDRMTYALRNFVEEKLGAKYVERTRLDLVKAFEESS PATPIFFILSPGVDALKDLEILGKRLGFTIDSGKFHNVSLGQGQETVAEVALEKASKGGH WVILQNVHLVAKWLGTLEKLLERFSQGSHRDYRVFMSAESAPTPDEHIIPQGLLENSIKI TNEPPTGMLANLHAALYNFDQDTLEICSKEQEFKSILFSLCYFHACVAGRLRFGPQGWSR SYPFNPGDLTICASVLYNYLEANSKVPWEDLRYLFGEIMYGGHITDDWDRKLCRVYLEEF MNPSLTEDELMLAPGFAAPPYLDYAGYHQYIEEMLPPESPALYGLHPNAEIEFLTVTSNT LFRTLLEMQPRNALSGDELGQSTEEKVKNVLDDILEKLPEEFNMAEIMQKNSNRSPYVLV CFQECERMNILIREIRISLEQLDLSLKGELALSPAVEAQQFALSYDTVPDTWSKLAYPST YGLAQWFNDLLLRCRELDTWTQDLTLPAVVWLSGFFNPQSFLTAQPCANAQEPKTPYGTK GCLNLLVAPGNLTCNTFILFNFCYISKAIMQTMARKNEWPLDKTRLTADVTKKTKEDYGH PPREGAYLHGLFMEGARWDTQAGTIVEARLKELACPMPVIFAKATPVDRQETKQTYECPV YRTKLRGPSYIWTFRLKSEEKTAKWVLAGVALLLEA >gi568815591r:21802978_22016897|GENSCAN_predicted_CDS_1|3891_bp atgatggatttcaaacacatgaaagattataacagtcacaccaacccgttacactctgtt ccttatagtgactgtgtggaaggacgtaaaccttgccataagtcagagaagattcgctgg ggtcaatccattaagtcctttgaagctcaagagaagacactctgtggagatgttcttctc acggcggcatttgtgtcttacgtcggacccttcacaaggcagtatcgccaggagctggtg cactgcaagtgggttccctttcttcaacagaaggtttccattccactaaccgaaggcctg gacttgatatccatgttgacggatgatgctacaattgccgcctggaataacgaaggactg cccagtgacagaatgtccaccgaaaatgccgctatcctaacacactgtgagcgctggcct ctggtgatagatccccagcaacagggaattaagtggatcaagaataagtatggaatggac ctgaaagtcacacatttgggccagaaagggtttttgaatgccattgaaactgctttggcc tttggtgatgtcatcttaattgaaaatctcgaggaaacgatagatccagtcctggatcca ctacttggcaggaacacaattaaaaaaggaaaaatcaaggggaaaaaacatacaattatt tcaatcaacacagaaaacagcttcgataaaattcaagatcctttcatgaggaaaattctc aacaaattgtgtatagaaggaacatacctcaaaacaataaaggccatatatgacaaaccc acagctaacatattaaacagggaaaaactgaaagcctttcctgtaaaatctgaaacaaga caaggattctcacttccaccacttctattcagcacaatactggaagttctaggcagcaag tatatcaggattggagataaagaatgtgaatttaacaagaactttcgccttatccttcac acaaaattggcaaatcctcactataagccggaattacaagctcagacaactctcctcaat ttcacagtcacagaagatggtctagaagcccagctgctggcagaggttgtcagtattgaa aggccagatttggagaaacttaagttggtattgacaaagcaccaaaatgattttaaaatt gagctcaagtatctggaagacgatctccttttgcgcctttctgcggcagagggaagcttt ctggatgacaccaaactggtagagagattggaggcaacaaagaccaccgtggcagagata gagcacaaggtgattgaagccaaagaaaatgaaagaaaaatcaacgaggcccgagaatgt tacagaccagtggcagcaagagcatctcttctttattttgttattaatgacctccaaaaa atcaaccccctctaccaattctctttgaaggcttttaacgtgctgttccacagagcgatc gagcaggctgacaaggtggaagacatgcagggacgcatctctatcctgatggagagcatc acccatgctgtcttcctctacaccagccaggcgctgtttgagaaggacaagctcaccttc ctgtcccagatggcttttcagattttgttgagaaagaaagagatagaccctcttgaattg gatttcctgcttcgattcacagttgaacacactcatctgagtcccgttgacttcctaact tctcagtcatggagtgctatcaaggcaattgccgtcatggaagaatttcgaggcatagac cgagatgtggaaggatctgccaagcagtggaggaagtgggtagaatccgagtgtccagaa aaagaaaaattacctcaagaatggaagaagaaaagtttaatacagaagctgattcttctg agagcaatgcgccctgacagaatgacgtatgctctcagaaattttgtagaggaaaaactg ggtgcgaagtatgtggagaggaccagattggacttagttaaagcattcgaagaaagcagc ccagccacccccatattcttcatcctgtctccgggggtagatgcccttaaagacctggag attcttggcaaaagacttggctttacaattgactctggaaaattccacaatgtgtcttta ggacaaggtcaggagacggtggcagaagtggccctggagaaagcttccaaaggaggacac tgggtcatcctccaaaatgttcatttggtagccaagtggctaggaaccttggagaagctc cttgaaagattcagccaaggaagccacagagattacagggttttcatgagtgctgagtct gcacctacaccagatgagcatatcatccctcaaggactcctggaaaattccattaagatc actaatgaacccccaacagggatgctggccaatttgcatgccgccctgtacaactttgat caggatacacttgaaatatgctccaaggagcaggagtttaaaagcatccttttttctctc tgctacttccacgcctgtgttgctgggagactgaggtttggcccccagggctggagccga agctatccttttaatcctggagacctcaccatttgtgccagtgtcctctacaactactta gaggcaaactctaaagtcccatgggaagatctccgttatctctttggtgagatcatgtat ggaggccacatcacagatgactgggatcgcaaactgtgtcgggtgtatttagaagaattc atgaatccatctctgactgaagatgaactgatgctggcaccaggttttgctgccccaccc tacctagattatgcaggctaccaccagtacatagaggagatgcttcctccagaaagcccg gcactgtatggcctccacccaaatgctgaaatagaattcctgacagtgacatccaacact ctcttcagaactttgctggagatgcagcccaggaatgcactcagtggtgatgaactgggg cagtctacagaagaaaaggttaagaatgtcttggatgacattttggagaaacttccagaa gagttcaacatggcagagataatgcaaaaaaattcaaatagaagcccatatgttcttgtt tgcttccaagaatgtgagaggatgaatattctcattcgggaaatacgtatatcacttgaa caactggaccttagtttgaagggggaattggcattatctcctgctgtggaagcccagcag tttgcattgagttatgacacggtaccagacacttggagcaaactggcttatccttctact tatggcctagcccagtggttcaatgacctcctcctgcgatgccgagaactcgatacttgg acacaagaccttacccttccggctgtcgtgtggctctccggcttcttcaaccctcagtcc ttcttaactgcgcagccatgtgccaatgcccaagaacctaaaacaccctatgggactaaa ggatgcctcaatttactggtagctcccgggaaccttacatgcaacacttttatcctattc aatttttgttatatttccaaagcaatcatgcagacgatggctcgaaaaaatgagtggccc ctggataaaacgcgcttgactgctgatgttaccaaaaaaacaaaggaagattatggacac ccgccaagggaaggtgcatacctccacggactcttcatggagggcgcccgctgggacacc caagcaggaaccattgttgaagcccgtctcaaggagctggcatgccctatgccggtcatc tttgcaaaagccacccccgtggacagacaagaaaccaaacagacctacgagtgccctgtg tatagaaccaaactgagaggccccagctacatctggaccttcaggctgaagagcgaagag aagactgcaaaatgggttctggctggagtggctctgcttctagaagcgtaa >gi568815591r:21802978_22016897|GENSCAN_predicted_peptide_2|548_aa MGCSSSFKPFMLKEASGPAGLFEFCRMIPKEVADIFNAPSDDEEFVGFRDDVPMETLSSE ESCDSFDSLESGKQELMKLLLCSWCPSFPSLCSCRSPRSSYPNAHPFLLGSVIGLTSVQC IKYIHIVVHPFSSSPSEALYPFSSSSFYPSLDVRFHSKYFTEELRRIFIEDTDSETEDFA GFTQSDLNGKTNPEVMVIITTGRDVGGNVFNPVSERVLLVVESDLSDDGKASLVSEEEED EEEDKATPRRSRSRRSSIGLRVAFQFPTKKLANKPDKNSSSEQLFSSARLQNEKKTILER KKDCRQVIQREDSTSESEDDSRDESQESSDALLKRTMNIKENKAMRKKTVRRAFSEGQIT RRMNPTRSARPPEKFALENFTVSAAKFAEEFYSFRRRKTIGGKCREYRRRHRISSFRPVE DITEEDLENVAITVRDKIYDKVLGNTCHQCRQKTIDTKTVCRNQGCCGVRGQFCGPCLRN RYGEDVRSALLDPDWVCPPCRGICNCSYCRKRDGRCATGILIHLAKFYGYDNVKEYLESL QKELVEDN >gi568815591r:21802978_22016897|GENSCAN_predicted_CDS_2|1647_bp atggggtgctcctcctcctttaaaccatttatgttaaaagaagcttctgggccagcaggg ctgtttgaattttgtagaatgatccctaaagaagtggctgacatctttaacgcccccagt gatgatgaagagtttgttggcttccgagatgatgttcccatggaaaccctctcgtcagag gagagctgcgatagttttgactcactagagtcagggaaacaggagttgatgaaacttctg ctgtgttcctggtgtccatccttcccttccctctgcagctgccgtagccctcgtagctct tacccgaatgctcatcccttcctactaggatctgttataggattgacctccgttcagtgc attaagtacattcacattgttgtccatccattttcgtcatctccatcagaagctctgtac ccgttcagtagctcctcattctacccttccctggatgtgcgctttcattccaaatacttc acagaagagctaagaagaatttttatagaggacactgactcagagactgaggattttgca ggatttacgcagagtgatctgaatggaaagactaacccagaagtaatggtaataattaca acaggaagggatgtgggtggtaacgtttttaatcctgtttctgaaagagttttacttgtc gtggagtcagatttgagtgatgatggcaaagcatctttggtgagcgaggaagaggaagat gaagaagaagataaggctacccctagaagaagcaggtctagaagaagtagtattggtctt cgagtagcctttcagttccccaccaagaagctggccaacaaaccagataaaaacagttct tccgagcagttgttttctagcgcacgcttacagaatgagaaaaaaacaattcttgaaaga aagaaagactgtagacaggtgatacaaagggaagattctacctctgagtctgaggatgac tctcgggatgagagccaggagagttcagatgctttgctgaaaaggaccatgaacatcaag gagaacaaagccatgaggaagaagacagtgaggcgggccttctcggagggacagatcacg cggcgtatgaacccaacccggagtgcgcggcctcctgagaagtttgctctagagaacttc actgtctcagccgctaaatttgcggaagagttttacagcttccgaagaaggaagacaatt ggggggaaatgccgggagtacagacgacgtcaccgtatatcttcttttcggccagtggag gatatcaccgaagaggacttagaaaatgttgccataactgttcgagataaaatctatgat aaagttctgggtaacacgtgccatcagtgtcgacaaaagaccatcgacaccaagacagtg tgtcggaaccagggttgctgtggtgtgcgaggacagttctgtggaccatgcctgcggaac cgctatggggaggatgtcagatcggcattgctggacccggattgggtgtgtcccccctgt cgtgggatctgcaattgcagctactgtcggaagcgtgacggccgctgtgccacaggaatc ctcattcatctggccaagttttatggttatgacaatgttaaggaatatctggagagctta caaaaggagctggtagaagacaattaa >gi568815591r:21802978_22016897|GENSCAN_predicted_peptide_3|155_aa MRKNQHKKAENSKNQNASSPPKDHNSLPAREQNWMENEFDELTEVRFRRWVINSSELKKH VLTQYKETKNLEKRLEELLTRITRLEKNINDLMELKNTARKLHEAYTSINSQIDQAEERI SETEDQLNEIKHEEKIREKKSEKGTKKASKKYGTM >gi568815591r:21802978_22016897|GENSCAN_predicted_CDS_3|468_bp atgaggaaaaaccagcacaaaaaggctgaaaattccaaaaaccagaatgcctcttctcct ccaaaggatcacaactccttaccagcaagggaacaaaactggatggagaatgagtttgat gaattgacagaagtacgcttcagaaggtgggtaataaactcctccgagctaaagaagcat gttctaactcaatacaaggaaactaagaaccttgaaaaaaggttagaggaattgctaacg agaataaccaggttagaaaagaacataaatgacctgatggagctgaaaaacacagcacga aaacttcatgaagcatacacaagtatcaatagccaaatcgatcaagcagaagaaaggata tcagagactgaagatcaacttaatgaaataaagcatgaagagaagattagagaaaaaaag agtgaaaaaggaacaaaaaaagcctccaagaaatatgggactatgtga >gi568815591r:21802978_22016897|GENSCAN_predicted_peptide_4|104_aa MKKLTQNRTTTLKLNNLLLNDYRGAPTPALNRLGAPDPQCRSQGPRFQPKASRTGWQREV KLWSLVVGKLPSPPHITENRAGLVLWTTHQRICLLQRAGSSMSL >gi568815591r:21802978_22016897|GENSCAN_predicted_CDS_4|315_bp atgaagaaactcactcaaaaccgcacaactacattgaaactgaacaacctgctcctgaat gactacaggggcgcgcccactccggcactcaaccggctgggcgcgccagatccccagtgc cgcagccaagggccacgtttccagcccaaggccagcaggaccgggtggcaaagggaagtc aaactgtggagcttagttgtgggcaaactccctagtcctccccacataacagaaaatcgt gctggtttggtgctctggacaactcaccagagaatctgccttctgcagagagctggaagc agcatgagtctatag >gi568815591r:21802978_22016897|GENSCAN_predicted_peptide_5|259_aa MNLQFGYVIDEESQSNLTGGITKKLRVTLPSETFKSQYTIYNTPFSPTSVIEEVSVEMVP CHPGSLMATMSDVSLLTRARQLSDSSSEVDLECGGTGQKTSYRSNCEVGPAGIRCDRDSS GNGERYSSCHGLLGIPYSSFYSVPQAISQSSFQPGGQDPKQSMIMGYNFAKILMALPGSS SYFPKRQRNLAWVFPTLVKPAVLGPPQRHQHQLSSMALRGVCVSFLQALFTAPVDNGNLF PSFYPSPRVAGCFLQLSSL >gi568815591r:21802978_22016897|GENSCAN_predicted_CDS_5|780_bp atgaacctgcagtttggttatgttatcgatgaagagtcacagtcaaacctgacgggtgga atcacaaagaaattacgcgtgacacttccaagcgaaacctttaaaagccagtacacaatt tacaacactcccttttctcctacttccgtaattgaggaagtcagtgttgagatggtcccc tgtcatcctggatccctgatggctacaatgagtgacgtttccctgctgacccgtgccagg cagctctctgacagcagttcagaggtggatttggaatgtgggggcacagggcagaaaacc agttacaggagcaattgtgaagtaggcccagcagggataaggtgtgaccgagacagtagc ggcaatggagagaggtacagctcctgtcatggccttttgggaataccctacagcagcttc tactcagttccccaagcaatcagtcagtcttctttccaacctggaggacaggacccaaaa caatccatgatcatgggatacaactttgccaaaatcttaatggccttaccaggcagcagc agttacttcccaaagaggcagcggaatctagcttgggtttttcccacccttgtaaaacca gctgtattgggcccacctcagagacatcagcatcagctgagcagcatggcccttcggggt gtgtgtgtttcatttctgcaggccctcttcacagcacctgtggataatggtaacctcttc ccctccttctaccccagccctagggttgctggctgcttcctgcagttgtcctctctgtga