GENSCAN 1.0 Date run: 4-Nov-116 Time: 23:13:12 Sequence gi568815588f:88889469_89114447 : 224979 bp : 38.48% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 8896 8998 103 1 1 31 93 115 0.670 5.56 1.02 Intr + 15975 16192 218 2 2 74 111 172 0.431 14.48 1.03 Intr + 19234 19309 76 1 1 93 67 70 0.997 4.00 1.04 Intr + 21448 21543 96 0 0 25 106 81 0.880 2.89 1.05 Intr + 23633 23990 358 1 1 65 89 384 0.999 30.10 1.06 Intr + 25066 25190 125 2 2 104 64 59 0.884 4.48 1.07 Intr + 27212 27349 138 1 0 91 65 113 0.994 9.04 1.08 Term + 43363 43743 381 0 0 51 51 191 0.027 5.55 1.09 PlyA + 44779 44784 6 1.05 2.10 PlyA - 45632 45627 6 1.05 2.09 Term - 45898 45755 144 1 0 83 41 134 0.932 5.13 2.08 Intr - 48774 48593 182 1 2 100 79 220 0.647 20.97 2.07 Intr - 50230 50039 192 2 0 58 74 293 0.921 23.44 2.06 Intr - 51922 51761 162 2 0 91 81 224 0.992 21.03 2.05 Intr - 52401 52317 85 0 1 106 82 63 0.943 5.97 2.04 Intr - 54439 54329 111 1 0 57 108 115 0.991 10.06 2.03 Intr - 57918 57790 129 0 0 76 86 143 0.973 12.87 2.02 Intr - 59485 59334 152 2 2 87 94 151 0.428 14.56 2.01 Init - 66115 65980 136 1 1 42 81 89 0.040 3.95 2.00 Prom - 91973 91934 40 -3.95 3.00 Prom + 95562 95601 40 -2.55 3.01 Init + 110354 110396 43 1 1 52 115 7 0.700 0.43 3.02 Intr + 113357 113487 131 0 2 58 55 137 0.764 6.89 3.03 Intr + 113561 113726 166 1 1 108 91 127 0.999 13.61 3.04 Intr + 118232 118369 138 0 0 113 95 132 0.999 15.91 3.05 Intr + 119421 119529 109 1 1 63 108 37 0.914 1.62 3.06 Intr + 121071 121132 62 0 2 106 86 85 0.943 7.56 3.07 Intr + 122531 122629 99 0 0 100 37 61 0.466 1.36 3.08 Term + 124651 124982 332 2 2 89 40 352 0.993 24.33 3.09 PlyA + 125798 125803 6 -0.45 4.02 PlyA - 126598 126593 6 1.05 4.01 Sngl - 129482 129069 414 2 0 76 37 265 0.349 14.28 4.00 Prom - 139468 139429 40 -4.55 5.03 PlyA - 141167 141162 6 1.05 5.02 Term - 142575 142446 130 2 1 77 55 128 0.593 5.07 5.01 Init - 171211 171150 62 1 2 73 59 76 0.327 3.97 5.00 Prom - 185258 185219 40 -3.45 6.02 PlyA - 185667 185662 6 1.05 6.01 Term - 220142 220078 65 0 2 121 43 91 0.726 4.97 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815588f:88889469_89114447|GENSCAN_predicted_peptide_1|498_aa XLPDRIVGNLDHILRATELENIPVYVDLAIVYNATKKLAAMPDHTDVSLSPEERVRALSK LGCNITISEDITPRRYFRSGVEMERMASVYLEEGNLENAFVLYNKFITLFVEKLPNHRDY QQCAVPEKQDIMKKLKEIAFPRTDELKNDLLKKYNVEYQEYLQSKNKYKAEILKKLEHQR LIEAERKRIAQMRQQQLESEQFLFFEDQLKKQELARGQMRSQQTSGLSEQIDGSALSCFS THQNNSLLNVFADQPNKSDATNYASHSPPVNRALTPAATLSAVQNLVVEGLRCVVLPEDL CHKFLQLAESNTVRGIETCGILCGKLTHNEFTITHVIVPKQSAGPDYCDMENVEELFNVQ DQHDLLTLGWIHELMPPGGKYEVQQCNRPVLLQGNPVEMTAMVGDFCLNYESDLVASICT LILKFKSSGYEVPLHGKHEFAAGRFPSIIFQWFLNFDVHKNHQGNLIEVQVSSFSSRDSS SAGVRCDPGLYPFKMSFR >gi568815588f:88889469_89114447|GENSCAN_predicted_CDS_1|1497_bp nnactccctgacaggatcgtggggaacctggaccacattttgagagccactgaactagaa aacatacctgtttatgttgatcttgcaattgtttataatgcaactaaaaagttagctgct atgcctgaccatacagatgtttccctaagcccagaagagcgagtccgtgccctaagcaag cttggttgtaatatcaccatcagtgaagacatcactccacgacgttactttaggtctgga gtagagatggagaggatggcgtctgtgtatttggaagaaggaaatttggaaaatgccttt gttctttataataaatttataaccttatttgtagaaaagcttcctaaccatcgagattac cagcaatgtgcagtacctgaaaagcaggatattatgaagaaactgaaggagattgcattc ccaaggacagatgaattgaaaaacgaccttttaaagaaatataacgtagaataccaagaa tatttgcaaagcaaaaacaaatataaagctgaaattctcaaaaaattggagcatcagaga ttgatagaggcagaaaggaagcggattgctcagatgcgccagcagcagctagaatcggag cagtttctgtttttcgaagatcaactcaagaagcaagagttagcccgaggtcaaatgcga agtcagcaaacctcagggctgtcagagcagattgatgggagcgctttgtcctgcttttcc acacaccagaacaattccttgctgaatgtatttgcagatcaacctaataaaagtgatgca accaattatgctagccactctcctcctgtaaacagggccttaacgccagctgctactcta agtgctgttcagaatttagtggttgaaggactgcgatgtgtagttttgccagaagatctt tgccacaaatttctgcaactggcagaatctaatacagtgagaggaatagaaacctgtgga atactctgtggaaaactgacacataatgaatttactattacccatgtaattgtgccaaag cagtctgcgggaccagactattgtgacatggagaatgtagaggaattattcaatgttcag gatcaacatgatctcctcactctaggatggatccatgagctcatgccgcctggtggtaaa tatgaagtacagcagtgcaacagaccagttttactccaaggaaaccctgtagagatgaca gcaatggttggtgatttctgcctcaattatgaaagtgatctggtggcaagtatctgcaca ctaatccttaagtttaaatctagtggctatgaggtgcccctgcatggcaagcatgaattt gcagctggaagatttccctcgatcatattccagtggttcttaaattttgatgtacataag aatcaccagggaaatctgattgaagttcaggtttccagcttcagctccagagattctagc tcagcgggtgttagatgtgacccagggctctacccatttaaaatgagcttccgatga >gi568815588f:88889469_89114447|GENSCAN_predicted_peptide_2|430_aa MRNRAEPGKSLQMWQDGHCASVDSMPLSSWAILGELFVTLLGNSKESCEAAPAMCEEEDS TALVCDNGSGLCKAGFAGDDAPRAVFPSIVGRPRHQGVMVGMGQKDSYVGDEAQSKRGIL TLKYPIEHGIITNWDDMEKIWHHSFYNELRVAPEEHPTLLTEAPLNPKANREKMTQIMFE TFNVPAMYVAIQAVLSLYASGRTTGIVLDSGDGVTHNVPIYEGYALPHAIMRLDLAGRDL TDYLMKILTERGYSFVTTAEREIVRDIKEKLCYVALDFENEMATAASSSSLEKSYELPDG QVITIGNERFRCPETLFQPSFIGMESAGIHETTYNSIMKCDIDIRKDLYANNVLSGGTTM YPGIADRMQKEITALAPSTMKIKIIAPPERKYSVWIGGSILASLSTFQQMWISKQEYDEA GPSIVHRKCF >gi568815588f:88889469_89114447|GENSCAN_predicted_CDS_2|1293_bp atgaggaacagagcagagcctggcaaatccctgcagatgtggcaggatggccattgtgcc agcgtagactccatgcctttgtcttcatgggctatcttaggagaactgtttgtgactctc ctggggaattccaaagaatcctgtgaagcagctccagctatgtgtgaagaagaggacagc actgccttggtgtgtgacaatggctctgggctctgtaaggccggctttgctggggacgat gctcccagggctgttttcccatccattgtgggacgtcccagacatcagggggtgatggtg ggaatgggacaaaaagacagctacgtgggtgacgaagcacagagcaaaagaggaatcctg accctgaagtacccgatagaacatggcatcatcaccaactgggacgacatggaaaagatc tggcaccactctttctacaatgagcttcgtgttgcccctgaagagcatcccaccctgctc acggaggcacccctgaaccccaaggccaaccgggagaaaatgactcaaattatgtttgag actttcaatgtcccagccatgtatgtggctatccaggcggtgctgtctctctatgcctct ggacgcacaactggcatcgtgctggactctggagatggtgtcacccacaatgtccccatc tatgagggctatgccttgccccatgccatcatgcgtctggatctggctggccgagatctc actgactacctcatgaagatcctgactgagcgtggctattccttcgttactactgctgag cgtgagattgtccgggacatcaaggagaaactgtgttatgtagctctggactttgaaaat gagatggccactgccgcatcctcatcctcccttgagaagagttacgagttgcctgatggg caagtgatcaccatcggaaatgaacgtttccgctgcccagagaccctgttccagccatcc ttcatcgggatggagtctgctggcatccatgaaaccacctacaacagcatcatgaagtgt gatattgacatcaggaaggacctctatgctaacaatgtcctatcagggggcaccactatg taccctggcattgccgaccgaatgcagaaggagatcacggccctagcacccagcaccatg aagatcaagatcattgcccctccggagcgcaaatactctgtctggatcggtggctccatc ctggcctctctgtccaccttccagcagatgtggatcagcaaacaggaatacgatgaagcc gggccttccattgtccaccgcaaatgcttctaa >gi568815588f:88889469_89114447|GENSCAN_predicted_peptide_3|359_aa MGIYQGVVNKVHFHVWFEEPEIQTAIQVTCCFLGERNLKDSGALTLSLPVHSRYCQFWVL TSVARLSSKSVNAQVTDINSKGLELRKTVTTVETQNLEGLHHDGQFCHKPCPPGERKARD CTVNGDEPDCVPCQEGKEYTDKAHFSSKCRRCRLCDEGHGLEVEINCTRTQNTKCRCKPN FFCNSTVCEHCDPCTKCEHGIIKECTLTSNTKCKEEVKRKEVQKTCRKHRKENQGSHESP TLNPVGIEIDVDLSKYITTIAGVMTLSQVKGFVRKNGVNEAKIDEIKNDNVQDTAEQKVQ LLRNWHQLHGKKEAYDTLIKDLKKANLCTLAEKIQTIILKDITSDSENSNFRNEIQSLV >gi568815588f:88889469_89114447|GENSCAN_predicted_CDS_3|1080_bp atgggaatctatcagggtgtggttaataaagtacatttccatgtgtggtttgaagaacct gagatccaaactgctatacaagtgacctgctgctttcttggagagagaaatctgaaagac agtggagccctcacattgtctttgcctgtgcacagcagatactgccaattttgggttctt acgtctgttgctagattatcgtccaaaagtgttaatgcccaagtgactgacatcaactcc aagggattggaattgaggaagactgttactacagttgagactcagaacttggaaggcctg catcatgatggccaattctgccataagccctgtcctccaggtgaaaggaaagctagggac tgcacagtcaatggggatgaaccagactgcgtgccctgccaagaagggaaggagtacaca gacaaagcccatttttcttccaaatgcagaagatgtagattgtgtgatgaaggacatggc ttagaagtggaaataaactgcacccggacccagaataccaagtgcagatgtaaaccaaac tttttttgtaactctactgtatgtgaacactgtgacccttgcaccaaatgtgaacatgga atcatcaaggaatgcacactcaccagcaacaccaagtgcaaagaggaagtgaagagaaag gaagtacagaaaacatgcagaaagcacagaaaggaaaaccaaggttctcatgaatctcca actttaaatcctgtaggtattgaaatagatgttgacttgagtaaatatatcaccactatt gctggagtcatgacactaagtcaagttaaaggctttgttcgaaagaatggtgtcaatgaa gccaaaatagatgagatcaagaatgacaatgtccaagacacagcagaacagaaagttcaa ctgcttcgtaattggcatcaacttcatggaaagaaagaagcgtatgacacattgattaaa gatctcaaaaaagccaatctttgtactcttgcagagaaaattcagactatcatcctcaag gacattactagtgactcagaaaattcaaacttcagaaatgaaatccaaagcttggtctag >gi568815588f:88889469_89114447|GENSCAN_predicted_peptide_4|137_aa MGGNRGCTWCLWASMSSRWAWAQQTLHWERPASPAGPGAVRGLAPGPAAAVLDFSPGLSC LPMGQGSGPAAHHAQASPTITAPCSKAPSRIDHPRAEECGSMARDWQAAPPAAPVQDPRG EASWASESGRDLGNLYV >gi568815588f:88889469_89114447|GENSCAN_predicted_CDS_4|414_bp atgggcgggaaccggggctgcacatggtgcttgtgggccagcatgagttccaggtgggcg tgggctcagcagaccctgcactgggagcggccagccagccccgccggccctggggcagtg aggggcttagcacctgggccagcagctgctgtgcttgacttctcaccaggccttagctgc ctccccatggggcagggctcgggacctgcagcccaccatgcccaagcctccccaacgatc accgccccctgctctaaggctcccagtcgcattgaccatccaagggctgaggagtgcggg agcatggcgcgggactggcaggcagctccacctgcagccccggtgcaggatccacggggt gaagccagctgggcttctgagtctggtagggacttagggaacctttatgtctag >gi568815588f:88889469_89114447|GENSCAN_predicted_peptide_5|63_aa MEEGKGEAGPSYTARAGSRERGAHSLKARPSALSCGTTSPKALSYPEANPFEPLVQAKHN IRP >gi568815588f:88889469_89114447|GENSCAN_predicted_CDS_5|192_bp atggaggaaggcaaaggggaagcaggcccttcttacacagccagagcaggatcaagagag aggggtgcccattccctgaaggctagaccttcagctctttcctgtggcacaacctccccc aaggctttatcctatcctgaggccaatccattcgagcctctggtgcaagcaaaacacaat attcgtccttga >gi568815588f:88889469_89114447|GENSCAN_predicted_peptide_6|21_aa XLLNNNKEDEDLYDDPFPLNE >gi568815588f:88889469_89114447|GENSCAN_predicted_CDS_6|66_bp ncgctactcaacaataacaaggaggatgaagacctttatgatgatccatttccacttaat gaatag