GENSCAN 1.0 Date run: 8-Nov-116 Time: 06:52:41 Sequence gi568815589r:109279559_109563434 : 283876 bp : 45.06% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 3534 3613 80 2 2 74 53 61 0.688 1.73 1.02 Intr + 3957 4064 108 0 0 80 65 94 0.949 5.70 1.03 Intr + 8573 8725 153 2 0 72 66 97 0.429 5.09 1.04 Intr + 22427 22530 104 2 2 89 69 88 0.436 6.82 1.05 Intr + 28268 28385 118 0 1 56 101 53 0.061 2.92 1.06 Term + 37898 38024 127 2 1 87 49 68 0.066 0.56 1.07 PlyA + 38029 38034 6 1.05 2.05 PlyA - 38117 38112 6 1.05 2.04 Term - 39966 39949 18 0 0 83 46 11 0.119 -5.28 2.03 Intr - 41235 40583 653 1 2 60 117 665 0.836 58.23 2.02 Intr - 41639 41446 194 1 2 22 46 114 0.200 -0.26 2.01 Init - 47077 47037 41 1 2 68 110 19 0.392 1.85 2.00 Prom - 47197 47158 40 -7.86 3.00 Prom + 56128 56167 40 -4.86 3.01 Init + 58002 58058 57 2 0 35 88 36 0.272 -0.39 3.02 Intr + 65758 65934 177 0 0 104 44 93 0.933 6.62 3.03 Term + 66001 66198 198 0 0 30 47 150 0.544 2.50 3.04 PlyA + 69111 69116 6 1.05 4.24 PlyA - 69527 69522 6 1.05 4.23 Term - 91358 91208 151 1 1 104 37 159 0.882 9.78 4.22 Intr - 99556 99504 53 1 2 116 52 6 0.024 -2.49 4.21 Intr - 102229 102094 136 0 1 114 70 102 0.922 11.57 4.20 Intr - 102889 102744 146 1 2 75 80 91 0.997 6.08 4.19 Intr - 103993 103865 129 1 0 103 99 241 0.973 27.59 4.18 Intr - 109821 109675 147 0 0 74 114 116 0.996 13.23 4.17 Intr - 111641 111580 62 0 2 122 111 17 0.983 5.85 4.16 Intr - 112003 111913 91 1 1 61 70 91 0.983 4.17 4.15 Intr - 125050 124890 161 2 2 53 94 132 0.775 10.01 4.14 Intr - 127060 126904 157 1 1 20 80 177 0.628 9.68 4.13 Intr - 130518 130441 78 0 0 82 95 58 0.979 5.65 4.12 Intr - 130857 130671 187 2 1 17 105 266 0.957 20.89 4.11 Intr - 141042 140866 177 2 0 77 52 196 0.800 13.93 4.10 Intr - 143294 143160 135 1 0 61 84 40 0.547 0.58 4.09 Intr - 147564 147392 173 0 2 94 87 117 0.924 10.94 4.08 Intr - 149126 149063 64 1 1 40 109 3 0.332 -3.68 4.07 Intr - 153603 153515 89 0 2 86 111 32 0.747 4.07 4.06 Intr - 157412 157325 88 1 1 102 75 74 0.993 7.47 4.05 Intr - 158676 158556 121 1 1 89 94 63 0.993 6.55 4.04 Intr - 165734 165682 53 1 2 106 101 22 0.255 3.85 4.03 Intr - 176518 176376 143 1 2 101 9 51 0.062 -2.35 4.02 Intr - 177841 177734 108 1 0 90 95 133 0.958 14.68 4.01 Init - 182699 182640 60 2 0 96 43 27 0.348 0.57 4.00 Prom - 184179 184140 40 -3.46 5.00 Prom + 192720 192759 40 -2.26 5.01 Init + 199085 199153 69 1 0 76 75 80 0.789 6.55 5.02 Intr + 203304 203351 48 2 0 110 48 30 0.152 0.08 5.03 Term + 212607 212690 84 2 0 70 49 100 0.357 1.95 5.04 PlyA + 213372 213377 6 1.05 6.00 Prom + 213994 214033 40 -4.46 6.01 Init + 216256 216270 15 0 0 81 91 27 0.329 2.50 6.02 Intr + 218101 218466 366 0 0 87 99 75 0.402 3.84 6.03 Intr + 218616 218690 75 2 0 50 96 74 0.518 4.11 6.04 Intr + 241038 241175 138 2 0 116 116 74 0.992 13.66 6.05 Term + 241473 241484 12 2 0 106 44 7 0.295 -3.70 6.06 PlyA + 242522 242527 6 1.05 7.06 PlyA - 242986 242981 6 1.05 7.05 Term - 254014 253891 124 0 1 12 40 141 0.343 -0.44 7.04 Intr - 254317 254204 114 0 0 59 72 67 0.531 1.76 7.03 Intr - 254839 254479 361 2 1 41 67 388 0.662 26.28 7.02 Intr - 281081 280820 262 2 1 99 94 38 0.118 2.56 7.01 Intr - 282884 282768 117 2 0 -24 -36 303 0.176 7.36 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815589r:109279559_109563434|GENSCAN_predicted_peptide_1|229_aa MKMQRLQMHTWVTKLEKNKEAATIKSGDAEPRYINTPEYCSAIIQDNYEDEIKNKENFND NVRNGTSAFMKDIEASSLASAAARGCSNRVPSTSTSPVTESTGSVILDIQAPEPPMLQMP VPAPKVQQNNYARALATKSQLSTSPEKQEEMAESQRGKEARWMSLISAGGLRAGSYTCYT LVTAHLPRTQLGTLETPNNCGLDWTEVPQHGHTEGVIVFQTISEHKHPN >gi568815589r:109279559_109563434|GENSCAN_predicted_CDS_1|690_bp atgaaaatgcaacgtctacagatgcacacttgggtgacaaaactggagaagaacaaggaa gcagccacaataaagtcaggggacgcagaaccacgatacatcaacactccagaatactgt tcagccattatacaagataattacgaagatgagataaaaaacaaggaaaattttaatgac aatgttaggaatgggactagtgcctttatgaaagacattgaagccagcagccttgcctct gctgctgcgagaggatgcagcaacagggtgccatccacaagtacctcaccagtcactgaa tctactggctctgtgatcttggatatccaggccccagaacctcccatgttacagatgcca gttcctgcacctaaagtgcagcaaaacaactatgccagagctctggccaccaagagtcaa ctttcaacatccccagagaagcaagaagagatggcagagtctcagaggggcaaagaggct cgctggatgtcgctcataagcgctggggggctacgtgcaggaagctacacttgttacacc cttgttactgctcatttgccacgcacacagcttggcacattggagacgcccaataactgt ggtctggactggactgaggtgccacagcacggtcacactgagggtgttattgtgttccag acaatttcagaacacaagcatcctaactag >gi568815589r:109279559_109563434|GENSCAN_predicted_peptide_2|301_aa MGPQIGTQAKAAIRALGAGNTCVLTYVYTREHTYSSRGRSGRQGAAQTRLAITRRSPEGG GRAAVPAAAAAAAAAAAGAPWQPAAARPEQPLGSPRRPPARESARRRRRRDGPERALPPA ATRSRRWPGRRAPQAARARPRPRRGLAPGRSPREAARAAEGTDGVAGRSRAAAQRQRQRQ RQRRGRRQGAPPARMLRFLRRTFGRRSMQRYARGAAGRGAAGLGDERDGGPRGGPAAAAS SSALPAAPGGSVFPAGGGPLLTGGAAVHISAAGAAKATLYCRVFLLDGTEVSVDLPPPRS G >gi568815589r:109279559_109563434|GENSCAN_predicted_CDS_2|906_bp atgggtcctcagataggaacccaggctaaggctgccatcagggctctgggcgccggaaac acgtgtgtgctcacctacgtgtacacacgtgagcacacctacagcagcagggggcgctcg ggccggcagggggcagcacagacccggcttgcaattacccggcgcagccccgagggagga gggcgtgctgcagtcccggcggcagcggcggcggcggcggcggcggcggctggagccccc tggcagcccgcagccgcccgtccggagcagcccctcggcagcccgcgcaggccgccggcc cgggagagcgcgcggcggcggcggcggcgggacggccccgagcgcgccctcccgcctgcg gccactcgcagccggcgctggccgggccggcgcgccccgcaggcggctagagcgcggcct cggcctcggcgcgggcttgccccgggccgtagcccgcgagaggcggcgcgggcggccgag ggcactgacggcgtcgcgggacgctcccgggcggcggcgcagcggcagcggcagcggcag cggcagcgcagggggcggaggcagggggcgcccccagccaggatgctgcggttcctgcgc cggacctttggccgccgctccatgcagcgctacgcgcggggcgcggcggggcgcggggcc gccgggctgggggacgagcgcgatggggggccacgggggggcccggccgccgccgcctcc tcctcggcgctgcccgccgcgcccgggggcagcgtgttcccggcgggcggcgggcccctg ctcaccggcggcgcggccgtgcacatctccgccgccggcgccgccaaggccaccctctac tgccgcgtcttcctgctcgacgggaccgaagtgagcgtggacctgccgcctcctcgcagc gggtag >gi568815589r:109279559_109563434|GENSCAN_predicted_peptide_3|143_aa MNKKPECPGLQDKRGTQAELHLSSYYGLLGLWQDLPVSESSTVRTLSKGVEGETEEELKA QELRSLPPPRPPQHSPDHSLKQLLRTCPASSAARSGLLNSPQDCGSLRLERSLFSEASQI DFLYKLNVAFAVTLLVIHRSKDS >gi568815589r:109279559_109563434|GENSCAN_predicted_CDS_3|432_bp atgaacaagaagcctgaatgtccagggttgcaagataaaagaggaacccaggcagagctg catttatccagctactatgggctgctcgggctctggcaggatctccctgtgtcagagtct tctacagtaagaaccctctcaaaaggtgtggaaggggaaacagaagaggagctaaaggct caggagctcaggtccctgcccccacctcgtcctcctcagcacagccccgatcactctctc aagcagctgcttagaacttgcccagcgagttctgcagcaaggtctggcctgctgaactca ccccaggactgtggctccctgaggctggagcggagccttttttcagaagcctctcagatc gactttctttataagctgaatgtggcctttgcggtcaccctcttggtcatccacagatct aaagattcctga >gi568815589r:109279559_109563434|GENSCAN_predicted_peptide_4|902_aa MGRWPSALAADVSLSVQKTQKQDTGQVLLDMVHNHLGVTEKEYFGLQHDDDSVDSPGARF SKFFPPMSCILCVQKYLWEPQGHLQTNTCLIQVPRNESTQGMNRLTCPLNSAVVLASYAV QSHFGDYNSSIHHPGYLSDSHFIPDQNEDFLTKVESLHEQHSGLKQSEAESCYINIARTL DFYGVELHSGRDLHNLDLMIGIASAGVAVYRKYICTSFYPWVNILKISFKRKKFFIHQRQ KQAESREHIVAFNMLNYRSCKNLWKSCVEHHTFFQAKKLLPQEKNVLSQYWTMGSRNTKK SVNNQYCKKVIGGMVWNPAMRRSLSVEHLETKSLPSRSPPITPNWRSPRLRHEIRKPRHS SADNLANEMTYITETEDVFYTYKGSLAPQDSDSEVSQNRSPHQESLSENNPAQSYLTQKS SSSVSPSSNAPGSCSPDGVDQQLLDDFHRVTKGGSTEDASQYYCDKNDNGDSYLVLIRIT PDEDGKFGFNLKADTCIPKLNEGDQIVLINGRDISEHTHDQVVMFIKASRESHSRELALV IRRRAVRSFADFKSEDELNQLFPEAIFPMCPEGGDTLEGSMAQLKKGLESGTVLIQFEQL YRKKPGLAITFAKLPQNLDKNRYKDVLPYDTTRVLLQGNEDYINASYVNMEIPAANLVNK YIATQGPLPHTCAQFWQVVWDQKLSLIVMLTTLTERGRTKCHQYWPDPPDVMNHGGFHIQ CQSEDCTIAYVSREMLVTNTQTGEEHTVTHLQYVAWPDHGVPDDSSDFLEFVNYVRSLRV DSEPVLVHCSAGIGRTGVLVTMETAMCLTERNLPIYPLDIVRKMRDQRAMMVQTSVELEQ FRFTRPDAKETLTRQCKGGPPELCKVAVLTMLPHIDHKNHCAKFCTALELRRYRITTDGF LA >gi568815589r:109279559_109563434|GENSCAN_predicted_CDS_4|2709_bp atggggcggtggcccagtgccttggctgcagacgtctccctgtcagtgcagaagactcag aaacaagacactggccaggttcttctggatatggtgcacaaccacctgggtgtgactgaa aaggaatattttggtttacagcatgatgacgactccgtggactctcctggagcccgattc tcaaagttcttccctcccatgagctgcatcctctgcgttcagaagtacctgtgggagcct cagggccacttgcagacaaatacctgcctcatccaagtccccaggaatgagtccacacaa ggcatgaataggttaacctgccctcttaactcagcagtggttctagcgtcctatgccgta caatctcattttggagactataattcttccatacatcatccaggctatctttccgatagt cactttatacccgatcaaaatgaggactttttaacaaaagtcgaatctctgcatgagcag cacagtgggctaaaacaatcagaagcagaatcctgctatatcaacatagcgcggaccctc gacttctatggagtagaactgcacagtggtagggatctgcacaatttagacctaatgatt ggaattgcttccgcgggtgttgctgtgtaccgaaaatacatttgcacaagtttctatcct tgggtgaacattctcaaaatttctttcaaaaggaaaaagttcttcatacatcagcgacag aaacaggctgaatccagggaacatattgtggccttcaacatgctgaattaccgatcttgc aaaaacttgtggaaatcctgtgttgagcaccatacgttctttcaggcaaagaagctacta cctcaggaaaagaatgttctgtctcagtactggactatgggctctcggaacaccaaaaag tcggtaaataaccaatattgcaaaaaggtgattggcgggatggtgtggaacccagccatg cggagatccttatcagtggagcacttagaaaccaagagtctgccttctcgttcccctccc attactcccaactggcgaagtcctcggctccggcacgaaatccgaaagccacgccactct tctgcagataaccttgcaaatgaaatgacctacatcacggaaacggaagatgtattttac acgtacaagggctctctggcccctcaagacagcgattctgaagtttctcagaaccgaagc ccgcaccaagagagtttatccgagaacaatccggcacaaagctacctgacccagaagtca tccagttctgtgtctccatcttcaaatgctccaggctcctgctcacctgacggcgttgat cagcagctcttagatgacttccacagggtgaccaaagggggctccaccgaggacgccagc cagtactactgtgacaagaatgataatggtgacagctacttagtcttgatccgtatcaca ccagatgaagatggaaaatttggatttaatcttaaggcggacacctgcattcctaagctg aacgaaggggatcaaatcgtgttaatcaatggccgggacatctcagaacacacgcatgac caagtggtgatgttcatcaaagccagccgggagtcccactcacgggagctggccctggtg atcaggaggagagctgtccgctcatttgctgacttcaagtctgaagatgaactgaaccag cttttccccgaagccattttccccatgtgtccggagggtggggacactttggagggatcc atggcacagctaaagaagggcctcgaaagcgggacggtgctgatccagtttgagcaactc tacagaaaaaagccaggtttggccatcacgtttgcaaagctgcctcaaaatttggacaaa aaccgatataaagatgtgctgccttatgacaccacccgggtattattgcagggaaatgaa gattatattaatgcaagttacgtgaacatggaaattcctgctgctaaccttgtgaacaag tacatcgccactcaggggcccctgccgcatacctgtgcacagttttggcaggttgtctgg gatcagaagttgtcactcattgtcatgttgacgactctcacagaacgagggcggaccaaa tgtcaccagtactggccagatccccccgacgtcatgaaccacggcggctttcacatccag tgtcagtcagaggactgcaccatcgcctatgtgtcccgagaaatgctggtcacaaacacc cagaccggggaagaacacacagtgacacatctccagtacgtcgcatggcctgaccacggt gtgcccgatgactcctccgactttctggaatttgtaaactatgtgaggtctctgagagtg gacagcgagcccgtcctagttcactgcagtgctggaataggtcgaaccggtgtgttggtc actatggaaacagccatgtgcctaactgagaggaacctgcccatttacccactggatatt gtccgaaaaatgcgagaccagcgcgccatgatggtgcagacatcagtggagctggaacaa ttcagattcacccgccctgatgctaaggaaaccctgacacggcaatgtaaaggaggcccc ccggagctatgcaaggtagcagtactgaccatgcttcctcacatagaccacaagaaccac tgtgccaaattctgcactgccctggagctccgcaggtatcgcatcacaactgatggcttt ctggcctaa >gi568815589r:109279559_109563434|GENSCAN_predicted_peptide_5|66_aa MSKVTDDDLRKIAKLTVFCLFYQENETDINPQGPRQHSQAQWWLAVKKQLMELVAASLTE NLWASP >gi568815589r:109279559_109563434|GENSCAN_predicted_CDS_5|201_bp atgagcaaggtaactgatgatgatttgcgcaaaatcgccaagttaactgtgttctgtctt ttctaccaggaaaatgaaactgacattaacccacagggtcctcgtcagcactcgcaggcc caatggtggctggcggtcaaaaaacagctgatggagctggttgctgcctccctcaccgag aacctctgggctagcccttga >gi568815589r:109279559_109563434|GENSCAN_predicted_peptide_6|201_aa MTDDMVPDKSFHIVRGTGRIYWLWSFVPGPGNAIFESREADSPWEPSAGSRSRLAPRGLC APSEPRAGVAAARGPPEVPDARAPRVAPAPARLERTPTLRRRVPAASPPRPSPTRRPARA LPPRPRQPRTLPPARWGRTGPYLQHPGARSAQSLLDSVSFWIGKYVGQFVSSTQSLCALY CSVIALSVAWFVKFTYVKESV >gi568815589r:109279559_109563434|GENSCAN_predicted_CDS_6|606_bp atgactgatgatatggtacctgataaatcattccatatcgtccgcggaacggggaggatt tattggctctggagtttcgtgccaggcccaggaaatgccatctttgaaagccgagaggcg gattcgccctgggagccgagcgctggcagccgctctcggctagctccccgcgggctgtgt gccccctcggagccccgagccggggtcgcggcggcacggggtcctccggaggtgccggac gcacgggctccgcgggttgcgcccgcccctgcccggctggagcgcacgcccacgctccgg cgccgcgtcccggccgccagcccgccccggccgagccccacccggcgccccgcccgcgcc ctcccgccccggccccgccagccccgcaccctgcccccggcccgctggggtcgcacgggc ccgtacctgcagcatccgggggcgcggagcgcccagagccttcttgacagtgtttccttt tggattggcaagtatgtgggacagtttgtgtcaagcacacagtctctttgcgccctctac tgttcagttatagcactgtccgtggcctggtttgttaaattcacgtacgttaaggagagt gtgtga >gi568815589r:109279559_109563434|GENSCAN_predicted_peptide_7|325_aa KEEEEEEEKEEEEEEEEEKEEEEEEEEEEEEKLSLEGLIVGPVSRLVLGPLFTSLCMALL CMMSLAQGSNSKMGNQKGHSGHLSLAKTLGSDSTGVLGSFHFCGHKKAKTAQSCNQVHFM YLLLKPTTITPREEPQLPQPAPVTITTTLSSEAETQQPPAACRSPPPALSAGDTTPGTTG SGTGNGGPGGFTSAAPAGGDKKVIATKVLGTVKWFNLRNGYAFINRNDIKEDTFVPQTAK TKNNPRRGPPPNYQQNYQNSESGGKNEGWERAPQGQAQPRRPYSRQPREDGNEEDKENQR GETQGQQPPQRRYRRNLPTQTHRKP >gi568815589r:109279559_109563434|GENSCAN_predicted_CDS_7|978_bp aaggaagaagaagaagaagaagaaaaagaagaagaagaggaagaggaagaagaaaaagaa gaagaagaagaagaagaagaagaagaagaagaaaaattgtccttggagggtcttatagtg gggcctgtgagccgcctggtgctcggaccgttgttcacttctctgtgcatggctttgctc tgcatgatgtcattagcacaaggaagcaactcaaaaatgggcaaccagaaggggcactca ggccatctgtccttagccaagacccttgggtcagattccactggtgtccttggttcattt cacttctgtggtcacaagaaagccaaaactgcacagtcttgtaaccaagtacatttcatg tacctactgctgaaacctactaccatcacaccccgggaggagccgcagctgccgcagccg gccccagtcaccatcaccacaaccttgagcagcgaggccgagacccagcagccgcccgcc gcttgccgctcgccgccccccgccctcagcgccggtgacaccacgcccggcactacgggc agcggcacaggaaacggtggcccgggaggcttcacatcagcagcacctgccggcggggac aagaaggtcatcgcaacgaaggttttgggaacagtaaaatggttcaacttaaggaacgga tatgctttcatcaacaggaatgacatcaaggaagatacatttgtaccccagactgccaaa acgaagaataaccccaggaggggtcctccacccaattaccagcaaaactaccagaatagc gagagtgggggaaagaacgagggatgggagcgtgctccccaaggccaggctcaaccacgc cggccctacagcagacagcctagagaggacggcaatgaagaggataaagaaaatcaacga ggtgagacccaaggtcagcagccacctcaacgtcggtaccgccgcaacttaccgacgcag acacacagaaaaccctaa