GENSCAN 1.0 Date run: 4-Nov-116 Time: 16:27:44 Sequence gi568815589r:70287779_70513363 : 225585 bp : 41.13% C+G : Isochore 1 ( 0 - 43 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 10188 10443 256 2 1 58 89 323 0.966 25.49 1.02 Intr + 12268 12422 155 2 2 56 107 61 0.987 3.67 1.03 Intr + 26964 27058 95 2 2 96 97 79 0.995 7.44 1.04 Intr + 27668 27800 133 2 1 45 57 97 0.969 2.03 1.05 Intr + 30736 30909 174 0 0 57 81 123 0.993 7.71 1.06 Intr + 31016 31185 170 1 2 69 65 125 0.998 6.12 1.07 Intr + 35705 35828 124 2 1 48 116 86 0.950 7.07 1.08 Term + 36788 36994 207 1 0 4 28 135 0.364 -4.54 1.09 PlyA + 37574 37579 6 1.05 2.02 PlyA - 38247 38242 6 1.05 2.01 Sngl - 41864 41661 204 2 0 73 43 237 0.555 12.64 2.00 Prom - 50779 50740 40 -4.95 3.00 Prom + 54139 54178 40 -3.65 3.01 Init + 58265 58327 63 1 0 51 98 63 0.711 4.80 3.02 Intr + 59288 59383 96 1 0 72 19 116 0.839 2.39 3.03 Intr + 59835 59939 105 2 0 48 91 73 0.893 3.09 3.04 Intr + 60141 60260 120 2 0 83 83 49 0.917 3.67 3.05 Intr + 62336 62515 180 1 0 71 80 137 0.946 10.34 3.06 Intr + 62598 62693 96 2 0 54 100 47 0.624 1.79 3.07 Term + 84295 84438 144 0 0 75 44 85 0.086 -0.27 3.08 PlyA + 86502 86507 6 1.05 4.08 PlyA - 88499 88494 6 1.05 4.07 Term - 100227 99998 230 1 2 93 51 332 0.965 25.81 4.06 Intr - 101025 100920 106 0 1 58 91 94 0.966 5.57 4.05 Intr - 102506 102408 99 2 0 68 68 51 0.015 0.49 4.04 Intr - 105803 105754 50 0 2 56 91 17 0.003 -3.72 4.03 Intr - 120651 120558 94 0 1 95 93 66 0.765 6.42 4.02 Intr - 120911 120832 80 0 2 47 59 64 0.630 -2.25 4.01 Init - 125585 125081 505 2 1 87 71 613 0.801 54.69 4.00 Prom - 127850 127811 40 -9.65 5.00 Prom + 128364 128403 40 -2.85 5.01 Init + 129495 129509 15 2 0 60 93 11 0.739 -0.95 5.02 Intr + 130853 131033 181 1 1 88 103 85 0.167 8.52 5.03 Intr + 133430 133534 105 0 0 96 78 30 0.536 2.07 5.04 Intr + 133610 133757 148 0 1 85 71 65 0.915 2.87 5.05 Intr + 134200 134347 148 1 1 92 -4 89 0.768 -0.58 5.06 Intr + 136269 136425 157 2 1 79 81 61 0.751 3.16 5.07 Intr + 137532 137716 185 1 2 76 63 103 0.584 5.19 5.08 Intr + 139870 140088 219 0 0 7 59 143 0.415 0.88 5.09 Term + 143726 143926 201 1 0 47 46 118 0.298 -0.09 5.10 PlyA + 145561 145566 6 1.05 6.00 Prom + 147235 147274 40 -4.05 6.01 Init + 151596 151760 165 2 0 72 32 143 0.309 6.78 6.02 Intr + 164793 164962 170 2 2 52 82 71 0.192 0.72 6.03 Intr + 168151 168275 125 1 2 68 80 102 0.189 6.71 6.04 Term + 177388 177494 107 2 2 107 39 62 0.064 0.79 6.05 PlyA + 177539 177544 6 -3.74 7.04 PlyA - 178154 178149 6 1.05 7.03 Term - 180047 179832 216 2 0 79 52 177 0.166 9.36 7.02 Intr - 191571 191475 97 2 1 84 39 113 0.647 5.09 7.01 Init - 193084 192939 146 1 2 65 100 147 0.952 11.32 7.00 Prom - 195909 195870 40 -7.05 8.00 Prom + 200688 200727 40 -5.55 8.01 Init + 212406 212465 60 2 0 92 34 87 0.790 5.00 8.02 Intr + 213035 213210 176 1 2 15 72 162 0.087 5.12 8.03 Intr + 223467 223542 76 0 1 50 87 81 0.003 2.80 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 17469 17639 171 2 0 83 45 101 0.810 2.14 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:70287779_70513363|GENSCAN_predicted_peptide_1|437_aa IEELQQALIVKQNEELDRQRRIGNTRKMIEDLQNELKTTENCENLQPQIDAITNDLRRIQ DEKALCEGEIIDKRRERETLEKEKKSVDDHIVRFDNLMNQKEDKLRQRFRDTYDAVLWLR NNRDKFKQRVCEPIMLTVRDNKKLRVNAVIAPKSSYADKAPSRSLNELKQYGFFSYLREL FDAPDPVMSYLCCQYHIHEVPVGTEKTRERIERVIQETRLKQIYTAEEKYVVKTSFYSNK VISSNTSLKVAQFLTVTVDLEQRRHLEEQLKEIHRKLQAVDSGLIALRETSKHLEHKDNE LRQKKKELLERKTKKRQLEQKISSKLGSLKLMEQDTCNLEEEERKASTKIKEINVQKAKL VTELTNLIKGIHVGILQYYSDANHPGHYQTVYHRGSVPKKTTLTSDTVFKFRGLQVTCIS DQLAANSRVPMTPTQVQ >gi568815589r:70287779_70513363|GENSCAN_predicted_CDS_1|1314_bp attgaggaacttcagcaggctttaatagtaaagcaaaatgaagagcttgaccgacagagg agaataggtaatacccgcaaaatgatagaggatttgcaaaatgaactaaagaccacggaa aactgcgagaatcttcagccccagattgatgccattacaaatgatctgagacggattcag gatgaaaaggcattatgtgaaggcgaaataattgataagcgaagagagagggaaactcta gagaaggagaaaaagagtgtggacgatcatattgtacgttttgacaatcttatgaatcag aaggaagataagctaagacagagattccgtgacacgtatgatgctgttttatggctaaga aataacagagacaaatttaaacaaagagtctgtgagcccataatgctcacggttcgtgac aataaaaaattaagagtaaatgctgttattgctcccaagagttcatatgcagacaaagca ccttcaagatctttgaatgaacttaaacaatacggatttttctcttatttgagagaatta tttgatgcacctgatcctgtaatgagttacctttgctgtcagtatcatattcatgaagtt cctgtaggaactgaaaagaccagagaaagaattgaacgggtaatacaagaaacccgatta aaacagatttatacagcagaagaaaagtatgtggtgaaaacttctttttattcaaacaaa gttatttctagtaacacatctctaaaagtagcgcagtttctcactgtcactgtggaccta gagcagagaagacacttagaagaacagctaaaggaaattcatagaaaattgcaagcagtg gattcagggttgattgccttacgtgaaacaagcaaacatctggagcacaaagacaatgaa cttagacaaaagaagaaggagcttcttgagagaaaaaccaagaaaagacaactggaacaa aaaatcagttccaaactaggaagtttaaagctgatggaacaggatacttgcaatcttgaa gaggaagagcgaaaagcaagtaccaaaatcaaagaaataaatgttcaaaaagcgaaactt gttaccgaattaacaaacctaataaagggtatccatgtgggcatactgcagtactattct gacgctaaccatcctggtcactatcaaactgtataccataggggctcggttcccaaaaag accacccttacttcagacaccgtcttcaagtttaggggtctccaggtcacttgtatttct gaccaactagctgcaaattcaagagttcctatgacccccactcaggttcaataa >gi568815589r:70287779_70513363|GENSCAN_predicted_peptide_2|67_aa MRESLELRDLLSSCDQNADKDMDSEVQIKVVSDGDEELIGNSSKRHSCYVLAETGSTVPL LKRSLEF >gi568815589r:70287779_70513363|GENSCAN_predicted_CDS_2|204_bp atgagggaaagtttggaacttcgagacttgttgagcagttgtgaccaaaatgctgacaag gatatggacagtgaagtccagattaaggtggtctcagatggagatgaggaacttattgga aactcgagtaaacgtcactcttgctatgttttagcagagactggcagcactgtgcctctg ctcaagagatctctggaattctga >gi568815589r:70287779_70513363|GENSCAN_predicted_peptide_3|267_aa MPFTGMEKIEGETGLLKVIGKVFQDLPNTLDEIDALLTEERSRASCFTGLNPTIVQEYTK REEEIEQLTEELKGKKVELDQYRENISQVKERWLNPLKELVEKINEKFSNFFSSMQCAGE VDLHTENEEDYDKYGIRIRVKFRSSTQLHELTPHHQSGGERSVSTMLYLMALQELNRCPF RVVDEINQGMDPINERRVFEMVVNTACKENTSQYFFITPKFSVPKDGHLQFQTYTVQLSK CSRQKHFSFPTLVGLSEYLAGDSIWSP >gi568815589r:70287779_70513363|GENSCAN_predicted_CDS_3|804_bp atgccatttactggaatggagaagattgaaggagaaacagggttgcttaaagtgataggg aaggttttccaagaccttccaaacacattggatgaaattgatgctttattaactgaagaa agatcaagagcttcctgcttcacgggactgaatcctacaattgttcaggaatatacaaaa agagaagaagaaatagaacagttaactgaggaactaaagggaaagaaagttgaactagat caatacagggaaaacatttcacaggtaaaagaaaggtggcttaatcctttaaaagagctg gtagaaaaaattaatgaaaaattcagcaatttttttagttccatgcagtgtgctggtgaa gttgatctccatacagaaaatgaggaagattatgataaatatggaattcgaattagagtc aaatttcgaagtagtactcaactgcatgaattaactcctcatcatcaaagtggaggtgaa agaagtgtttctaccatgttatacttgatggcacttcaggagctaaatagatgtccattc agagtagttgatgaaatcaatcagggaatggacccaatcaatgaacggagagtgtttgaa atggttgtaaatactgcctgtaaagaaaatacatctcaatactttttcataacaccaaag ttttctgtgccaaaagatggccatttgcagttccagacttacacagtacaactcagcaaa tgtagcagacaaaaacacttctctttcccaacattggtgggcttgtctgagtatctggcg ggggacagcatatggagtccttag >gi568815589r:70287779_70513363|GENSCAN_predicted_peptide_4|387_aa MSAAAYMDFVAAQCLVSISNRAAVPEHGVAPDAERLRLPEREVTKEHGDPGDTWKDYCTL VTIAKSLLDLNKYRPIQTPSVCSDSLESPDEDMGSDSDVTTESGSSPSHSPEERQDPGSA PSPLSLLHPGVAAKGKHASEKRHKCPYSGCGKVYGKSSHLKAHYRVHTGFGVNCLVKESL MSSEVSGKTHFIAWERAPQSDVEWHFITQLASSNIDVQLGLSGACTVRPIPGLGWGREDE KNQLFSSELHKGLYVDLHIENQLLQIRFTHIGFPRKEGRFYINKRCDRDAQEQPDAEWPL HSGAQGEMLVPGERPFPCTWPDCLKKFSRSDELTRHYRTHTGEKQFRCPLCEKRFMRSDH LTKHARRHTEFHPSMIKRSKKALANAL >gi568815589r:70287779_70513363|GENSCAN_predicted_CDS_4|1164_bp atgtccgcggccgcctacatggacttcgtggctgcccagtgtctggtttccatttcgaac cgcgctgcggtgccggagcatggggtcgctccggacgccgagcggctgcgactacctgag cgcgaggtgaccaaggagcacggtgacccgggggacacctggaaggattactgcacactg gtcaccatcgccaagagcttgttggacctgaacaagtaccgacccatccagaccccctcc gtgtgcagcgacagtctggaaagtccagatgaggatatgggatccgacagcgacgtgacc accgaatctgggtcgagtccttcccacagcccggaggagagacaggatcctggcagcgcg cccagcccgctctccctcctccatcctggagtggctgcgaaggggaaacacgcctccgaa aagaggcacaagtgcccctacagtggctgtgggaaagtctatggaaaatcctcccatctc aaagcccattacagagtgcatacaggttttggagtcaactgtttggtcaaggagagtctg atgtcatcggaggtatcaggaaaaactcacttcattgcctgggagagggctccccagagt gatgtggaatggcatttcataacccaacttgcttcatccaacatagatgtacaactaggc ctgagtggggcttgcacagttagacccatccctggacttgggtggggcagggaagatgag aagaaccagcttttcagctcagagctccataagggtttatacgtagatctgcacatagag aatcaactgctacagattcgttttacccacattgggtttcctcgaaaagaaggcagattt tatatcaataaaagatgtgacagagatgctcaggaacaaccagatgctgagtggcccctg cactcaggagctcagggagaaatgctcgttccaggtgaacggccctttccctgcacgtgg ccagactgccttaaaaagttctcccgctcagacgagctgacccgccactaccggacccac actggggaaaagcagttccgctgtccgctgtgtgagaagcgcttcatgaggagtgaccac ctcacaaagcacgcccggcggcacaccgagttccaccccagcatgatcaagcgatcgaaa aaggcgctggccaacgctttgtga >gi568815589r:70287779_70513363|GENSCAN_predicted_peptide_5|452_aa MKIQTGPAPPAPATPPRHTEHMWRKPVFILLKAGELEKEASLRRGEAKGIKLPTMAMFLD FEIQSATFLWGPCLAVFFGQDSGPPPRVATCPGRSGETANGCPKFSADQGFQAEPQNLLL GTHPLLMESGKTFPFKACFTAGKNAGSLPNCFEVPAIRLYHLRFVAVICRLSCTPPHSPK ILAPDSQSPTSTQLLLTFLALLKSSCPGALSLFLFHIHTHSLPGFMQPQGFNIFRVGSVV HTHNPNTVEGQAPLPALLIRELTSWTPVLQAFTVALPSTWNALLQDLSVAHILTFAEPLL TCTLLRELCPGWESRPDVITRFYKKEAGGWRERRCHTAGSEDGGGAVRQGMKAASGSIKG KETGSPRASKRNKALQTHCRRLASRADVSLAENLEKFGDGEEKSHGNSCLVASVFSVKSD WSMVACASEVMGNMAEDMSRVAGVCLECLSLG >gi568815589r:70287779_70513363|GENSCAN_predicted_CDS_5|1359_bp atgaaaattcagactggcccagcccctccagctccagcgaccccacctaggcacacggaa catatgtggaggaaaccagtgttcattttgttgaaagcaggggaattagagaaggaggcg agcttgaggaggggagaggctaaagggataaagcttccaacaatggcgatgttcctggat tttgaaatacaatcagctaccttcctgtgggggccctgcctggctgtcttttttggccag gattctgggccacccccacgagtggccacctgtccagggagaagtggtgagacagctaac gggtgcccaaaattctctgccgaccaaggttttcaggcagaaccccaaaacctgctccta gggacacaccctctccttatggagtctggcaaaacctttcctttcaaagcttgcttcaca gcggggaagaatgcaggctctctccctaactgctttgaagtacctgccatcagactatac cacctccgctttgtggctgtcatctgcaggctctcctgcactccccctcattccccgaag attctagcacctgactcacagtcccccacctccacccaactcctattaacatttttggct ctcttaaagtcgagttgccctggagccctgtccttgtttctcttccacatccatactcac tcccttcctggtttcatgcagcctcaaggctttaatatcttcagggttggttccgtggtt cacacccataatcccaacaccgtggaaggccaagccccactacccgccttgctcatccgg gaattaaccagctggactcctgtcttacaagcattcacagtggctcttccttctacctgg aatgcccttctccaagatctcagcgtggctcacatcctgaccttcgctgagcctttgctc acatgtaccctcctaagagagctctgtcctggttgggagagtaggcctgatgtaatcaca agattttataagaaggaggcgggaggctggagagagagaagatgccatacagctggctct gaagatggaggaggagctgtgagacaaggaatgaaggcagcctctggtagcataaaaggc aaggaaacaggctcccctagagcctctaaaaggaataaagctctgcagacacattgtaga cgtctggcttctagagctgatgtgagcctagctgaaaatttggagaagtttggagatgga gaagagaaaagccatgggaattcatgcctagtggcatctgttttctctgtgaagtcggac tggagcatggtagcatgtgcaagtgaggtgatggggaatatggcggaggacatgagtaga gtggcaggggtttgtctggaatgcctgtcactgggatga >gi568815589r:70287779_70513363|GENSCAN_predicted_peptide_6|188_aa MDEAWESDLVDKGKGRDEDDSWVSGLDNGGNHKSIGEEEELGLEELSLRSLWDHHEKCVI IVAGTSVGCSGPSLANEKRLSKERWPLQHRLCRPEMTSIHHNWKGIERKQFRLLELRALA LPYTCNTFICGRKELCGERACLDERLDDPDVWSEIWMELQVPALRSLGLSGDSPHPEALQ GPYPKSPH >gi568815589r:70287779_70513363|GENSCAN_predicted_CDS_6|567_bp atggacgaagcatgggaatccgatttggtggataagggcaagggaagggatgaggatgac tcctgggtttctggcttggacaatggtggaaatcacaaatctataggtgaggaggaagaa ttgggtttggaggagttgagtttgagatctctgtgggaccatcacgagaagtgtgtaatt atagttgctggcaccagtgttggctgctcaggcccaagcctggccaatgagaagaggctt agcaaagaaagatggcctctgcagcatcgtctttgtaggccagaaatgaccagtattcac cataattggaagggaattgaaaggaaacagtttaggctgttagagttgcgggctctggcc ttaccttatacatgcaacactttcatttgtggaagaaaggagttgtgtggagagagggcc tgcttggatgaaagactggatgacccagatgtctggtcagagatctggatggagctgcaa gttccagcccttagatcacttggtctttctggtgacagcccccatcctgaggctctccag gggccctaccctaagtcacctcattag >gi568815589r:70287779_70513363|GENSCAN_predicted_peptide_7|152_aa MISAAAAAAAGTTPPTHSPAFECRAVKDDLWTVHCEGGDPHIHTHTRTRSKIISMIVTSH PIGKEASRDTGEQGLFRDSPDLFVPEVSFPIYGVSQAEDAGRKVCSCCSQTAAAKANLPW REGNSNCQGVHGSPQGRHKKKMGFVQCKLCNG >gi568815589r:70287779_70513363|GENSCAN_predicted_CDS_7|459_bp atgatctcagctgccgcggcggcagcagcaggcaccacgccgccgacccacagcccggcg tttgaatgcagggctgtcaaagacgatctctggacagtacactgcgaaggtggggaccca cacatacacacacacacacgcacacggtccaagatcatctctatgattgtgaccagccat cccataggaaaagaagcttccagagatactggggaacaaggattgttcagggacagccct gatctctttgttcctgaagtctccttccccatctatggtgttagtcaagcagaagatgct ggcagaaaggtctgcagctgctgcagccaaacagcagcagcaaaggctaacctcccatgg agggaagggaacagcaactgtcagggagtgcatggcagtcctcaaggcaggcacaagaag aaaatgggatttgtccagtgcaaactgtgtaacggctga >gi568815589r:70287779_70513363|GENSCAN_predicted_peptide_8|104_aa MKRNWNNIAEDQYGGDMKDLIWSLPFKKDLLSLEKVQRWLELREAGKLGGSADTQHSCEI MTVAENEQSCKGFNQSCQGQRQAQEREDSSGGLCQDEEKEPQQQ >gi568815589r:70287779_70513363|GENSCAN_predicted_CDS_8|312_bp atgaagaggaactggaataacatagcagaggaccagtatggaggtgacatgaaggacctg atctggtcactgcccttcaagaaggacctgctgtctctggagaaggttcaaagatggcta gaactgagagaagcaggaaaattggggggcagtgctgatacacagcacagctgtgagatt atgaccgtggctgagaatgaacaaagctgtaagggatttaatcagagttgtcaaggtcag aggcaggcacaggagagagaggacagctctggtggtctctgtcaggatgaggagaaagag ccccaacagcag