GENSCAN 1.0 Date run: 3-Nov-116 Time: 00:21:49 Sequence gi568815596f:6896927_7139757 : 242831 bp : 43.86% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 1662 1657 6 1.05 1.01 Sngl - 7018 6419 600 1 0 79 49 312 0.994 22.60 1.00 Prom - 19018 18979 40 -3.56 2.10 PlyA - 19217 19212 6 1.05 2.09 Term - 24713 24349 365 0 2 91 44 132 0.467 3.83 2.08 Intr - 26753 26599 155 1 2 47 44 53 0.221 -3.48 2.07 Intr - 27511 27304 208 2 1 56 32 162 0.322 5.54 2.06 Intr - 28156 27966 191 0 2 19 70 53 0.235 -4.17 2.05 Intr - 29798 29643 156 1 0 26 69 121 0.431 3.13 2.04 Intr - 32073 31888 186 2 0 28 94 118 0.120 5.20 2.03 Intr - 34241 34133 109 0 1 55 76 39 0.062 -1.26 2.02 Intr - 39136 39067 70 1 1 95 77 75 0.111 5.85 2.01 Init - 59550 59389 162 0 0 85 57 101 0.284 6.43 2.00 Prom - 65205 65166 40 -3.16 3.03 PlyA - 65314 65309 6 1.05 3.02 Term - 71957 71822 136 1 1 79 49 119 0.973 4.59 3.01 Init - 76994 76906 89 2 2 78 87 67 0.928 5.72 3.00 Prom - 81945 81906 40 -4.66 4.02 PlyA - 82378 82373 6 -0.45 4.01 Sngl - 84766 84476 291 1 0 59 48 166 0.578 5.35 4.00 Prom - 92616 92577 40 -3.66 5.00 Prom + 94851 94890 40 -4.56 5.01 Init + 95617 95674 58 0 1 80 69 -5 0.155 -1.66 5.02 Intr + 99990 100135 146 1 2 127 16 167 0.067 13.40 5.03 Term + 109168 109389 222 0 0 66 47 109 0.075 1.52 5.04 PlyA + 110335 110340 6 1.05 6.00 Prom + 119718 119757 40 -1.26 6.01 Init + 122400 122454 55 2 1 64 81 56 0.748 3.95 6.02 Intr + 123547 123754 208 2 1 51 80 145 0.732 8.14 6.03 Term + 127443 127743 301 0 1 71 48 349 0.681 23.89 6.04 PlyA + 128750 128755 6 1.05 7.05 PlyA - 129548 129543 6 1.05 7.04 Term - 135492 135199 294 0 0 5 41 251 0.342 7.41 7.03 Intr - 136152 135988 165 0 0 106 31 64 0.565 2.66 7.02 Intr - 148337 148286 52 1 1 76 71 49 0.418 0.91 7.01 Init - 148653 148394 260 0 2 65 70 161 0.334 8.61 7.00 Prom - 155458 155419 40 -1.86 8.06 PlyA - 155983 155978 6 1.05 8.05 Term - 181521 181508 14 1 2 141 53 9 0.030 1.06 8.04 Intr - 202538 202509 30 0 0 78 106 15 0.562 0.40 8.03 Intr - 202820 202701 120 0 0 72 115 57 0.793 7.27 8.02 Intr - 211246 211101 146 0 2 54 109 27 0.045 1.33 8.01 Init - 236748 236216 533 0 2 52 63 316 0.621 19.83 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term + 99990 100153 164 1 2 127 38 169 0.927 13.90 S.002 Init - 203607 203589 19 0 1 106 69 28 0.805 3.04 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:6896927_7139757|GENSCAN_predicted_peptide_1|199_aa MRKNQHKNAENSKNHNASSPPKDHNSPPARKQNWMQNEFDRLTEVDFRRWVTTNSSKLKE HVLTQCKEAKNLDKRLQELLTRITSLEKNINDLIELKNTAQELHEAYTSINSRIDQVEEM ISKIEDQLNEIKREDKIREKRMQRNKQSLQEIWDYVKRPNLRLIGVPERDEDNGTKLENT SGYHPGELPQPSKTGQHSN >gi568815596f:6896927_7139757|GENSCAN_predicted_CDS_1|600_bp atgaggaaaaaccagcacaaaaatgctgaaaattccaaaaaccacaatgcctcttctcct ccaaaggatcacaactccccaccagcaaggaaacaaaactggatgcagaatgagtttgac agactgacagaggtagacttcagaaggtgggtaacaacaaactcctccaagctaaaggag catgtcctaacccaatgcaaggaagctaagaaccttgataaaaggttacaggaactgcta actagaataaccagtttagagaagaacataaatgacctgatagagctgaaaaacacagca caagaacttcatgaagcatacacaagtatcaatagccgaatcgatcaagtggaagaaatg atatcaaagattgaagatcaacttaatgaaataaagcgtgaagacaagattagagaaaaa agaatgcaaaggaacaaacaaagcctccaagaaatatgggattatgtgaaaagaccaaat ctacgtttaattggtgtacctgaaagagacgaggataatggaaccaagttggaaaacact tcaggatatcatccaggagaacttccccaacctagcaagacaggccaacattcaaattag >gi568815596f:6896927_7139757|GENSCAN_predicted_peptide_2|533_aa MGIYATTYRNDIQKDVWCTDEIRELKAASSGGTSDSLVGLLHNHFRYTPWVQTKAEPPKF PKGTVKAVKNRNLTDIEALTTTAALGDASGLCRAHCPSSDMNSTQSDLQVHGIREQMDAD FIPCHPQNAKLLRELTGSEGATQNPQKWQEEDGPVSDHHRRQAATAGDTGNAGKSREKTQ IHFNKYLLSASYEKPVCRVLEGYKDNQDRTGIPSPHCTHTEPIDGTHRLQSSSAYAIKIR RVAIEVARKLTEGPRCFLTFSIRQRASPKVSGSPHPSQSKGSLGAHSLCYAYSQDMRSDG HIQALGSNHSTLCISLHIAGKGHVLCFPCIAQCLGPPSNTATEVSPAVTLDLTYMVSTQP RSAPCGSIFSSVQCQCQRRASPQSPTLAILQLEGKAAFVMSLQESLGSWHMDDYSLLKVM GSILICRPLGSKPGFKIDDVCAMLDNIPALCFTLLLCQLGIATYACAAGLCENCIHDASK ALKIRQIHNEYQPPPPPLVPNTPEMIKYWLTHCMPPRSLLLSGTLHPAYKPAP >gi568815596f:6896927_7139757|GENSCAN_predicted_CDS_2|1602_bp atggggatctatgcaacaacctacagaaatgacatccaaaaagatgtgtggtgtacagat gagattagggagctgaaagctgcttcttctggaggcacctctgactctcttgttgggcta cttcacaaccatttcaggtacacgccatgggtacagacgaaggccgagccgcccaagttc cctaaaggaacagtgaaagcagtgaaaaacagaaacctgacagatattgaagcactgacc acgaccgctgcactaggagatgcttctggactctgccgggcacactgtcccagttctgac atgaacagtactcagtcagacctgcaggtgcatggtatcagggagcaaatggatgctgac ttcattccatgtcacccccaaaatgccaagctgctcagggaactgacagggtctgaaggg gccacgcagaacccccagaagtggcaggaagaggatgggccagtttctgaccatcaccga aggcaggctgccacagcaggagacactgggaatgcaggaaaaagcagggaaaagacgcag attcatttcaacaaatatttattgagcgcctcctatgagaagcctgtgtgccgagtgcta gaggggtacaaagacaaccaggacaggactgggattcccagcccacactgtacccacacc gaacctattgatggaacacacaggctacaaagctccagtgcatacgctatcaaaattcgc cgtgtggcaatcgaggtcgccagaaaactcacagaaggacccaggtgcttcctgacattc tccatccgtcaaagagccagccccaaggtctctggaagcccacacccttcccaaagcaaa ggctcccttggagcacacagcctgtgttatgcttactcccaggacatgcgctcggatggg cacattcaggcgcttggttctaaccattccacattatgcatctcccttcacatcgcaggc aaaggccacgtcctatgcttcccgtgcatcgcacagtgcctcgggccccccagcaacacg gccacggaagtctcgccagctgttactctggatctcacttacatggtgtctacacaacca cgttctgctccgtgcggctccatcttctcatctgtgcaatgccagtgtcagaggagagca tctccacagtcacccacattggcaattttacagctggaaggaaaggcagcatttgtgatg agccttcaagaatcattaggaagctggcacatggatgactacagcctcctcaaggtcatg ggatctatccttatctgcaggcctctgggttcaaagccaggcttcaaaattgacgacgtg tgtgcgatgttggacaacatccctgctctctgcttcactctgctcctctgtcagctaggg atagcaacatacgcctgtgctgcagggctgtgtgagaactgcatccacgatgcaagcaaa gccctgaagatcaggcaaatccataatgagtatcaaccacctcctccacctctagtgcca aacacacctgagatgataaaatactggttgacacactgcatgccaccaaggagcttgctt ttgtctggaacactgcatccagcttataagcctgctccttag >gi568815596f:6896927_7139757|GENSCAN_predicted_peptide_3|74_aa MENISGFVDHGVSVEKTSLMLEHEAARDGTDSMGQPEHTTPENTVGSDHQSCLRRMMPRD TGNALAKFTVPALP >gi568815596f:6896927_7139757|GENSCAN_predicted_CDS_3|225_bp atggaaaatatttcaggctttgtggatcatggggtctctgtagaaaagacctctctgatg ctggagcacgaagcagccagagacggcacggactccatgggacagcctgaacacacaaca cctgagaacaccgtgggctctgaccaccagtcatgcttgaggaggatgatgccaagagac actggcaatgccctcgccaagttcacagtgcctgctttgccctga >gi568815596f:6896927_7139757|GENSCAN_predicted_peptide_4|96_aa MWESLELPRDLLNGFDQHADSRIDNEVQAEVVSDGDEEPNGNWSKGHSHYALVKRLVAFS PCPRDLWNFEIERDNLRYLAEEISKQQSIEMRPGFS >gi568815596f:6896927_7139757|GENSCAN_predicted_CDS_4|291_bp atgtgggaaagtttggaacttcctagagacttgttgaatggttttgaccaacatgctgac agtcgtatcgacaatgaagttcaggctgaggtggtctcagatggagatgaggaacctaat gggaactggagcaaaggtcactctcactatgctttagtaaagagactggtggcattttcc ccctgccctagagatctgtggaactttgaaattgagagagataatttaaggtatctggca gaggaaatttctaagcagcaaagcatcgagatgcgacctggcttttcctga >gi568815596f:6896927_7139757|GENSCAN_predicted_peptide_5|141_aa MRVSQATGGEQGLAGGRHENCSAMTTTRYRPTWDLALDPLVSCKLCLGEYPVEQMTTIAQ CQCIFCTLSKARIPTISVTQNVAQNLHEAEQLDEGLELSEFLHINLGRGLHSEPRDAVVI TFSNLMCIFPNQEYEKLKCGG >gi568815596f:6896927_7139757|GENSCAN_predicted_CDS_5|426_bp atgagagtgtcccaggcaactggaggagagcagggacttgctggaggaaggcatgaaaac tgttctgcgatgaccacaacaaggtaccggcccacctgggacctggccctcgacccgctg gtgtcttgcaagctctgtcttggggagtacccagtggagcagatgacaaccatagcccag tgccaatgcatcttctgtactctgtctaaagcccggatcccaacaatctctgttacccag aatgtagcccagaacctccatgaagcagaacagcttgacgagggtctggagttgtcggag ttcctgcacattaatcttggcagaggacttcactcagagcccagggatgctgtagttata actttttctaacttgatgtgtatttttcctaatcaagagtatgaaaaactaaaatgcgga ggatga >gi568815596f:6896927_7139757|GENSCAN_predicted_peptide_6|187_aa MRVGEVPVHVRRQHVVDSKVLFDPCRTWCPASTCQAVCQLQDVGLQTPQPVQCKACRMEF CSTCKASWHPGQGCPETMPITFLPGETSAAFKMEEDDAPIKRCPKCKVYIERDEGCAQMM CKNCKHAFCWYCLESLDVSTAFSFTLRHLALRFRFGMRKIDQLFPKELGAYSCVTVKQHR LVFLLGL >gi568815596f:6896927_7139757|GENSCAN_predicted_CDS_6|564_bp atgcgtgtaggtgaagtcccggtccacgtgcggcgccagcatgttgtggattccaaggtg ctgtttgatccctgtcggacttggtgcccggcgtccacctgccaagctgtgtgtcagctc caggacgtggggctgcagaccccccagccagtgcagtgcaaagcctgccgtatggaattc tgctccacctgcaaagccagctggcaccctggccagggctgcccggagaccatgccgatc accttcctccccggggagaccagtgctgctttcaaaatggaagaagatgacgcgcccatc aagcgctgccccaagtgcaaagtctacatcgagcgagacgaaggctgcgcgcagatgatg tgcaagaactgcaagcacgccttctgctggtactgcctggagtctctggacgtgagtacg gccttcagcttcaccttgcggcatttagctttgaggtttagatttggaatgagaaaaata gaccagctgtttcctaaagagcttggtgcctactcctgtgtaacagtgaagcaacaccgt ttggtgtttctcctgggtttgtga >gi568815596f:6896927_7139757|GENSCAN_predicted_peptide_7|256_aa MTLPLDEGAVSTGVHTGRTPWEPAGRDWGCIYQPRNTQDCQQYQEVAEQHGHPPYRSQKI PALQIPEPSVSNLQTWKTVKLCCCSPCLYHEGPVPEFGTLCPHQGNCQSWAMVSEFSPSS DSVSAKVHTTASTHTVAVQMPGPPSQHPVTPYLQPRENQRFVTAARAELQGARIRAELQG ARIPAELQGARIPAELQGARIPAELQGARIPAELQGARIPAELQGARIPAELQGAWIQAV PGHVQREQSEELKPTL >gi568815596f:6896927_7139757|GENSCAN_predicted_CDS_7|771_bp atgactttgcccttagatgaaggagctgtgtccacaggcgtgcacacagggagaacgccg tgggaacctgcaggcagagactgggggtgcatctaccagccaaggaacacccaggactgc cagcagtaccaggaagttgcagagcaacacgggcatcctccttaccgctctcagaagatt ccagccctgcagatacctgaaccttcagtttccaacctccagacctggaagacagtaaag ctctgttgttgtagcccctgcctttatcatgaaggtccagttcctgagttcggcacactg tgcccccaccagggaaactgccagagttgggcaatggtttccgagttctcaccaagctct gactcagtttctgccaaggtgcacacaacagcatccactcacacggtggctgtgcaaatg ccgggtcccccgagtcagcacccagtcaccccctacctacagcccagggaaaaccaacgc tttgtcacagcagccagagcagagctgcaaggggcgcggattcgagcagagctgcaaggg gcgcggattccagcagagctgcaaggggcgcggattccagcagagctgcaaggggcgcgg attccagcagagctgcaaggggcgcggattccagcagagctgcaaggggcgcggattcca gcagagctgcaaggggcgcggattccagcagagctgcaaggggcgtggattcaagccgtg ccgggccacgtgcaaagagaacagtcagaggagctcaaaccaactctctag >gi568815596f:6896927_7139757|GENSCAN_predicted_peptide_8|280_aa MRLAEQQGARPAEYQGTSPAEYQGTSPAEYQGTSPAEYQGTSPAEYQGTSPAEYQGTSPA EYQGTSPAEYQGTSPAEYQGTSPAEYQGTSPAEYQGTSPAEYQGTSPAEYQGTSPAEYQG TSPAEYQGTSPAEYQGTSPAEYQGTSPAEYQGTSTGGQATEQQGVTSGLWGVVETIRRIM DFYRNAQVQSTRPEVWIFTQNGSETHLRTKEHSQSSTIQVERAELSAPGDYNSRGDLGGD KKPNHITYMVASKNLTAMLIEEIDFQGLYISNFREESIKS >gi568815596f:6896927_7139757|GENSCAN_predicted_CDS_8|843_bp atgaggctagcagagcagcagggggccaggccagcagagtaccagggaaccagcccagca gagtaccagggaaccagcccagcagagtaccagggaaccagcccagcagagtaccaggga accagcccagcagagtatcagggaaccagcccagcagagtatcagggaaccagcccagca gagtaccagggaaccagcccagcagagtatcagggaaccagcccagcagagtatcaggga accagcccagcagagtaccagggaaccagcccagcagagtaccagggaaccagcccagca gagtatcagggaaccagcccagcagagtaccagggaaccagcccagcagagtatcaggga accagcccagcagagtatcagggaaccagcccagcagagtaccagggaaccagcccagca gagtaccagggaaccagcccagcagagtatcagggaaccagcacagggggccaggccaca gagcagcagggagtcacatcaggcctgtggggtgtggtggaaaccatcagaagaatcatg gacttttataggaatgcacaagtacagagcactaggccagaggtatggattttcacacag aatgggtctgaaacacatctcaggaccaaagaacacagccagtcatctactattcaggta gaaagagctgagctgtcagcacctggggattacaattcaagaggagatttgggtggggac aaaaagcctaaccatatcacctatatggtagctagtaaaaacctcacagcaatgctcatc gaggaaatagattttcaagggctctatattagcaactttagagaagaaagtatcaagtcc tga