GENSCAN 1.0	Date run: 4-Aug-121	Time: 20:33:11

Sequence gi568815582f:10643960_10869102 : 225143 bp : 46.49% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +  12098  12249  152  0  2   43   44   132 0.236   2.37
 1.02 PlyA +  12685  12690    6                               1.05

 2.09 PlyA -  12954  12949    6                               1.05
 2.08 Term -  15192  15136   57  0  0   83   40    53 0.176  -2.31
 2.07 Intr -  18772  18523  250  0  1   46   89    75 0.209   0.84
 2.06 Intr -  23620  23517  104  1  2    4   92   119 0.071   2.87
 2.05 Intr -  32222  32000  223  1  1  127  105   386 0.998  42.33
 2.04 Intr -  38177  38034  144  1  0   76  113   197 0.990  20.40
 2.03 Intr -  45364  45294   71  1  2   98   91    -5 0.917  -1.22
 2.02 Intr -  46066  45983   84  1  0   82   90   102 0.972   9.82
 2.01 Init -  50914  50351  564  1  0   85  109   790 0.992  75.84
 2.00 Prom -  54589  54550   40                              -5.76

 3.00 Prom +  58174  58213   40                              -8.76
 3.01 Init +  63821  63891   71  1  2   40   64    95 0.713   2.82
 3.02 Intr +  63966  64097  132  0  0   50   68   107 0.538   4.66
 3.03 Intr +  67985  68149  165  2  0   44  103   127 0.274   8.88
 3.04 Intr +  72985  73078   94  1  1   23   46   130 0.116   2.27
 3.05 Term +  73487  73612  126  1  0   62   45    81 0.408  -0.52
 3.06 PlyA +  75211  75216    6                               1.05

 4.00 Prom +  79377  79416   40                              -2.36
 4.01 Init +  98308  98324   17  0  2   72  127    35 0.719   5.49
 4.02 Intr +  99796  99923  128  1  2   25  115    95 0.979   6.22
 4.03 Intr + 100002 100106  105  1  0   94   80    44 0.942   4.39
 4.04 Intr + 103184 103317  134  0  2  116   94    86 0.997  12.36
 4.05 Intr + 108651 108719   69  2  0  115   90    34 0.960   5.68
 4.06 Intr + 112731 112821   91  2  1   93   14    99 0.982   2.57
 4.07 Intr + 113914 114068  155  2  2   60   99   258 0.979  23.89
 4.08 Intr + 117405 117515  111  2  0   71  100   176 0.856  17.68
 4.09 Intr + 117798 117900  103  2  1   79   80    24 0.539   0.45
 4.10 Intr + 123990 124073   84  1  0  105   89     5 0.088   2.09
 4.11 Term + 125088 125146   59  1  2  110   47    23 0.090  -1.75
 4.12 PlyA + 125372 125377    6                               1.05

 5.12 PlyA - 125826 125821    6                              -0.45
 5.11 Term - 127669 127622   48  1  0   89   38    48 0.508  -2.80
 5.10 Intr - 127839 127711  129  0  0   34  113   109 0.841   8.89
 5.09 Intr - 129482 129354  129  2  0   94   95   -19 0.462   0.29
 5.08 Intr - 130169 130080   90  2  0   86   61    82 0.944   5.49
 5.07 Intr - 131137 130993  145  0  1   52  100   210 0.760  18.88
 5.06 Intr - 174223 174144   80  1  2  120   89   188 0.994  20.45
 5.05 Intr - 182013 181903  111  0  0   65   62    73 0.334   2.98
 5.04 Intr - 185122 185110   13  0  1   93  105     0 0.047  -3.02
 5.03 Intr - 190560 190424  137  0  2   56  110    58 0.161   4.17
 5.02 Intr - 197187 196518  670  2  1   94   66   263 0.627  16.41
 5.01 Init - 204133 204102   32  1  2   93   98    40 0.916   4.71
 5.00 Prom - 211468 211429   40                              -3.86

 6.03 PlyA - 212901 212896    6                               1.05
 6.02 Term - 215731 215087  645  1  0   37   54  1739 0.923 159.53
 6.01 Init - 215936 215883   54  2  0   42    7    93 0.710  -4.09
 6.00 Prom - 217990 217951   40                              -3.16

 7.00 Prom + 219819 219858   40                              -4.06
 7.01 Init + 222268 222613  346  0  1   36  103   293 0.613  20.98

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815582f:10643960_10869102|GENSCAN_predicted_peptide_1|50_aa
XSLIKFVEETVLCIEGCLVSSLPGLYPLDASSTHFQCDNQNYLQTLPTTP

>gi568815582f:10643960_10869102|GENSCAN_predicted_CDS_1|153_bp
nactctttgataaaatttgtggaggagaccgtcctgtgcattgaaggatgtttagttagc
agcctccctggtctctacccactagatgccagtagcacccacttccagtgtgacaaccaa
aattatctccagacgttgccaactaccccctag

>gi568815582f:10643960_10869102|GENSCAN_predicted_peptide_2|498_aa
MEFLGTTQTASYCGPKKCCGLTSLPAVQAPVIQECYQPYYLPGYRYLNSWRPSLFYKIAN
VQTCPDESTSTLRPPTILPTLRSALFSRYSPHDWDQSNQLQVRGAEASRLWASRLTDDSM
RLLQDKDQLTHQMQEGTCRNLGQRLSDIGFWKSELSYELDRLLTENQNLETVKRRLECAA
NEVNCPLQVALECLYHREKRIGIDLVHDNVEKNLIREVDLLKCCQEQMRKLAQRIDIQMR
DNRDAQHVLERDLEDKSSAQCIDEKCFNLRNTSDCISFFHGMEKIDGTISVPETWAKFSN
DNIKHSQNMRANSIQLREEAEHLFETLSDQMWRQFTDTNLAFNARISEVTDVKNKLQTQL
AKPPEPEWIQSNPKAATWSDDIYRQKKESDAQKTEVRWSCHGCNWLGPVLWMVKEFAKIV
MGKERQMYWRKYKDTLQESNGQHSTEGAACKEAGAGRKFYRVVVPGLRAEQSIWEQNAVQ
GIFRNVVVTACGVDATGI

>gi568815582f:10643960_10869102|GENSCAN_predicted_CDS_2|1497_bp
atggagtttcttgggactactcagaccgccagttactgtggtcccaagaaatgctgtggc
ttgacctcactgccagctgtacaggcgccagtgatccaggaatgctatcagccctactac
ctgcccgggtaccgctacctcaattcatggaggcctagcctcttctacaagatagccaac
gtccagacctgcccggacgagagcaccagtaccctgcggccgcccaccatcctgcccaca
ctgcgctccgcactcttctctcgctatagcccccacgactgggaccagtccaaccagctg
caggtgcgtggggccgaggcctcccggctgtgggccagccggctgacggatgactccatg
aggctcttgcaggacaaggaccagctgacgcaccagatgcaggagggcacctgccggaac
ctgggccagaggctgtcggacattggcttctggaagtcagagctgagctatgagctggac
aggcttctgactgagaaccagaacttggagacggtcaagaggcggctggagtgcgcggcc
aatgaggtgaactgcccattgcaggtggccttggagtgtctgtaccatcgagagaagagg
attgggattgatttggtccatgacaacgtggagaaaaaccttatccgggaagtggatttg
ctaaaatgttgccaagaacagatgagaaaattagctcaaagaattgatatccagatgcgg
gataaccgggatgctcagcacgtgctggagagggacctcgaagacaaaagctcggcccag
tgtatcgatgagaagtgctttaacctgagaaatacgtcagactgcatcagcttcttccac
ggcatggagaaaattgacggcacgatctccgtacctgagacctgggccaagttcagtaac
gacaacatcaaacactctcagaacatgcgggccaactccatccagctgcgggaggaggcg
gagcacctctttgagaccttgtcggatcagatgtggaggcagttcacagacaccaacctg
gccttcaacgcccgcatctctgaggtgacggatgtgaagaataagctgcagacgcagctg
gcgaagccccctgaaccagaatggattcagagcaaccccaaagctgccacgtggtcggat
gatatttacagacagaaaaaggaaagtgatgcacagaagacggaagtcaggtggtcctgt
catgggtgcaactggctggggccagtgctgtggatggtaaaagaatttgccaagatagtc
atgggtaaagaaaggcagatgtattggagaaagtacaaggatacgttgcaagaaagcaat
gggcagcacagcacagaaggggctgcctgcaaagaggcaggagctggaaggaagttttat
agggtggtggtgccggggctacgtgcagaacaaagtatttgggaacagaatgctgtgcaa
gggatatttcgcaatgtggttgtcacagcttgcggggtggatgctactggcatttag

>gi568815582f:10643960_10869102|GENSCAN_predicted_peptide_3|195_aa
MGSNIILSTPGYYEQYHMKVYTMSDMESNSIVSHPGYYEQHHKKVYTPRYTGSKYHLPAS
VCDEPYRRTNITEGVYIPCNTVSNNILSLPGYSEQYQGKVYALCDIVSNIILFLPEYYEQ
YYRGLYTYCDMESNIILFLPEYYGQYCEGGTTPMGVYIPCDIRSNITLFPPGYYEPYHGE
LYKPSIGKVISPYPP

>gi568815582f:10643960_10869102|GENSCAN_predicted_CDS_3|588_bp
atgggtagcaacatcatcctctccacccctggatattatgaacaatatcatatgaaggtg
tacaccatgagcgatatggagagcaatagtatcgtctctcatcctggatattacgaacaa
catcataaaaaagtgtacaccccccgctatacgggtagtaaataccacctccccgcttct
gtatgtgatgaaccctatcgcaggaccaatatcacagaaggggtgtacatcccctgcaat
actgtgagtaataacatcctctccctccctggatattcggaacaatatcaagggaaggtg
tacgctctctgcgatattgtaagtaacatcatcctctttctccctgaatattatgaacaa
tattacaggggactgtacacctactgcgatatggagagtaacatcatcctcttccttcct
gaatattacggacaatattgcgaggggggtacaacccctatgggggtgtacatcccgtgc
gatataaggagtaatattacactctttcctcctggatattacgaaccatatcatggggaa
ttgtacaagccctccatagggaaagtaatatcaccctatcccccttag

>gi568815582f:10643960_10869102|GENSCAN_predicted_peptide_4|351_aa
MTAAKRPLSETYFARSENQIVPTRRKCTTAGSGDHEGGKGDGMEEVPHDCPGADSAQAGR
GASCQGCPNQRLCASGAGATPDTAIEEIKEKMKTVKHKILVLSGKGGVGKSTFSAHLAHG
LAEDENTQIALLDIDICGPSIPKIMGLEGEQYVEDNLGVMSVGFLLSSPDDAVIWRGPKK
NGMIKQFLRDVDWGEVDYLIVDTPPGTSDEHLSVVRYLATAHIDGAVIITTPQEVSLQDV
RKEINFCRKVKLPIIGVVENMSGFICPKCKKESQIFPPTTGGAELMCQDLEVPLLGRVPL
DPLIGKNCDKGQSFFIDAPDSPATLAYRSIIQRIQEFCNLHQSKEENLISS

>gi568815582f:10643960_10869102|GENSCAN_predicted_CDS_4|1056_bp
atgacagctgccaaacgccccctttccgagacttatttcgcgcggtcggagaatcagatt
gtgcccacaaggcggaaatgtacgacagcgggttccggtgaccacgaaggcggcaaaggc
gacggaatggaggaggtgcctcacgactgtccaggggccgacagcgcccaggcgggcaga
ggggcttcatgtcagggatgccccaaccagcggctgtgcgcttctggagcgggggccact
ccggacacggctatagaggaaatcaaagagaaaatgaagactgtaaaacacaaaatcttg
gtattgtctgggaaaggcggtgttgggaaaagcacattcagcgcccaccttgcccatggc
ctagcagaggatgaaaacacacagattgctcttctagacatcgatatatgtgggccatcg
attcccaagataatgggattggaaggagagcagtacgtggaagacaacctgggggtgatg
tcagtgggcttcctgctcagcagtcctgatgatgctgttatctggaggggacccaagaaa
aacggcatgatcaagcagttcctccgagatgtggactggggagaggtcgactacctcatt
gtggacaccccacctgggacgtcggatgaacacctctcggtcgtccggtacctggccaca
gcacacatcgatggagcagtgatcatcaccactccccaggaggtgtcactccaggatgtc
cggaaagaaatcaacttctgccgcaaggtgaagctgcccatcatcggggtggtggagaac
atgagtggcttcatctgtcctaagtgcaagaaagaatctcagatattccctcccacaacc
gggggcgcggagctcatgtgccaggacttggaggtccctctcctcggcagagtgcccctg
gatccgctcataggtaagaattgtgacaaaggccagtcttttttcattgacgccccagat
tccccagccacgttagcctacagaagtataattcagagaatccaagagttttgtaatctc
catcagtcaaaagaagagaacctcatcagttcctga

>gi568815582f:10643960_10869102|GENSCAN_predicted_peptide_5|527_aa
MVYETPVPSTQKKSARFKSDSGSLGDAKNEKETPSLTKVFDVMKKGKSTVSLLTPTRGGS
EKQESTWKTKKADQLKLRPRAPADDVFGVGNHKANAATAKRKSTRRRHTLGGHRDATEIS
VWNFWKAHERSWERESELSAVSRLKPKCSAQDVSISDWLARNCLHTSTSDLSSGESGDPQ
AENPRTREIATTDTPLACQYDTGSSSSTLASTNRPLLSIPPQSPDQINGESFQNKVTSEL
PPEGHGRAGRVMSMPGHVEGMPGKGDGEGIVPKACGECSSGAFPKVGLKRVSNQQLISLA
KSCVHVHTSVLGTAGVRICTQALVDDTEDVSLDFGNEEELAFRKAKIRHPLATFFHLFFR
VSAIVTYVSCDWFSKSFVGCFVMVLLLLSLDFWSVKNVTGRLLVGLRWWNQIDEDGKSHW
IFEARKVSPNSIAATEAEARIFWLGLIICPMIWIVFFFSTLFSLKLKWLALVVAGISLQA
ANLYGYILCKMGGNSDIGKVTASFLSQTVFQTVVKVKVACWQLRFVM

>gi568815582f:10643960_10869102|GENSCAN_predicted_CDS_5|1584_bp
atggtctatgaaacaccagtgcccagtacacagaaaaaatcagcgagattcaagtcagac
agtggaagtctaggagatgccaagaacgagaaagaaacaccttcattaactaaagtgttt
gatgttatgaaaaaaggaaagtcaactgtgagtttactgacacccaccagaggtggatcc
gaaaaacaggaatccacatggaaaacgaaaaaagcagatcagctaaaactgagacccaga
gcccctgcggatgacgtgtttggagtagggaatcacaaagcgaatgccgcgactgctaaa
aggaaaagcacccggcgcagacatacgctaggagggcacagagatgctactgaaatcagc
gtttggaatttttggaaagcgcatgagcggagttgggagagagaatctgaactttcagct
gtaagccggttaaaaccaaaatgctcagcccaggacgtttccatctcagactggctggcc
aggaattgcctacacactagtacctctgaccttagcagcggagaaagcggagatccccag
gcagagaacccaaggacacgagaaatagccacgaccgacacacctttggcttgtcagtac
gacacaggcagttcttccagcaccttggcttcaacaaacaggccccttctttccatacca
ccgcagtcacctgaccaaataaacggagaaagcttccagaacaaagtgacctctgagctg
ccacctgaaggacatggaagagctggccgtgtgatgagcatgccagggcatgtggagggc
atgccaggcaagggtgatggcgagggcatagtccccaaagcgtgtggagagtgttccagt
ggagcatttcccaaagttggcctaaagagggtcagcaatcagcagcttatatctcttgct
aagagctgcgtccatgtccacaccagcgtgctgggcacagctggtgtcaggatttgcacc
caggccctggtggacgataccgaggatgtgtccctggactttggaaacgaggaggagctg
gcctttaggaaagccaagatcagacaccccttggccacctttttccacctgtttttccga
gtgagtgccatcgtcacctacgtgagctgcgactggttcagcaagagctttgtgggctgt
tttgtcatggtgctgctcctcctgtccctggacttctggtctgtgaagaatgtaaccgga
agactcctggtgggccttcgatggtggaaccagatagatgaagatgggaagagccactgg
atctttgaagccaggaaggtctctccgaatagcattgctgccacagaagctgaagcacga
atcttctggctgggcctcataatctgccccatgatatggattgtgtttttttttagcacc
ttattttccttgaagctaaagtggctggctctggtggttgctgggatctctctccaagct
gcaaacctgtatggctacatcctttgtaagatgggaggcaacagtgacattggcaaggtc
acagccagtttcctgtcccagacagtgttccagacggtagtcaaagtcaaggtggcctgc
tggcaactgcggtttgtaatgtaa

>gi568815582f:10643960_10869102|GENSCAN_predicted_peptide_6|232_aa
MGTSRLAPVAAACLPTDQDDITTSVITTSIITTTIITITIITTTIVTTTITIITTIIITT
TITTAIVTTITTTITTTTIITITATITTIATITTIITITTTITTTIIITTTNTTTINITT
NITTITTTITTTITTIIITTTIATITIITTTITTVITTTITAITTITTTIMTIIITNTIT
TTTITTTTTITTTITTTTTTTIITTIIITTTVITTIITTTIPATFISILLFF

>gi568815582f:10643960_10869102|GENSCAN_predicted_CDS_6|699_bp
atgggcacgtccaggctggcccctgtggctgctgcctgtctgcccacagaccaggatgac
atcaccacctccgtcatcaccacctccatcatcaccaccaccatcatcaccatcaccatc
atcaccaccactatcgtcaccaccaccatcaccatcatcaccaccatcatcatcaccacc
accatcaccaccgctatcgtcaccaccatcaccaccaccatcaccaccaccaccatcatc
accatcaccgccaccatcaccaccatcgccaccatcaccaccatcatcaccatcaccacc
accatcaccaccaccatcatcatcaccaccaccaacaccaccaccatcaacatcaccacc
aacatcactaccatcaccaccaccatcaccaccaccatcaccaccatcattatcaccacc
accattgccaccatcaccatcatcaccacaaccatcaccaccgtcatcaccaccaccatc
accgccatcaccaccatcactaccaccatcatgaccatcatcatcaccaacaccatcacc
accaccactatcaccaccaccaccaccatcaccaccaccatcaccaccaccaccaccacc
accatcatcaccaccatcatcatcaccaccactgtcatcaccaccatcatcaccaccacc
attcctgccacctttatcagtattcttctatttttctga

>gi568815582f:10643960_10869102|GENSCAN_predicted_peptide_7|116_aa
MLGERRFQALARAAALTPRAAMNNFQAILTQVRMLLSSHQPSLVQALLDNLLKEDLLSRE
YHCTLLHEPDSEALARKISLTLLEKGDLDLALLGWARSGLQPPAAERGPGHSDHGX

>gi568815582f:10643960_10869102|GENSCAN_predicted_CDS_7|348_bp
atgctgggtgagcggagattccaggcactggccagggcagctgccctgactccaagggct
gccatgaacaacttccaggccatcctgactcaggtgagaatgctgctctccagccatcag
cccagcctggtgcaggccctcttggacaacctgctgaaggaggacctcctctccagggaa
taccactgcactctgctccatgagcctgatagtgaggctctggccaggaagatctctttg
accctactagagaaaggagacctggatttggccctcctggggtgggcccggagtgggctg
cagcccccagcagccgagaggggccccggccacagtgaccatggtgnn