GENSCAN 1.0    Date run: 5-Nov-116    Time: 22:42:37

Sequence gi568815585f:27170945_27373491 : 202547 bp : 44.27% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Term +  11890  11990  101  2  2  135   37    64 0.860   4.29
 1.02 PlyA +  12032  12037    6                               1.05

 2.00 Prom +  18298  18337   40                              -3.96
 2.01 Init +  37034  37108   75  1  0   89   56    62 0.081   4.21
 2.02 Intr +  43999  44290  292  0  1   14   52   142 0.036  -0.09
 2.03 Intr +  45762  46430  669  1  0   56   69   162 0.054   2.77
 2.04 Term +  48324  48490  167  1  2  107   43    59 0.427   1.38
 2.05 PlyA +  49404  49409    6                               1.05

 3.00 Prom +  50569  50608   40                               0.24
 3.01 Init +  52069  52222  154  0  1   95   60    21 0.073   0.14
 3.02 Intr +  80522  80641  120  0  0   66   91    48 0.005   3.37
 3.03 Intr +  83276  83337   62  0  2   79   92    40 0.257   1.95
 3.04 Intr +  84298  84410  113  0  2   61   97    48 0.580   2.18
 3.05 Intr +  85240  85390  151  1  1   66  110   141 0.994  14.26
 3.06 Intr +  90473  90675  203  1  2   80   97    15 0.103  -0.32
 3.07 Intr +  98398  98497  100  1  1   99   93    57 0.617   7.41
 3.08 Intr +  99929 100124  196  1  1   34  101   234 0.677  18.29
 3.09 Intr + 100540 100596   57  2  0   92   94    84 0.994   8.26
 3.10 Intr + 100695 100774   80  1  2   60  110   130 0.994  11.67
 3.11 Term + 102083 102550  468  1  0   70   43   392 0.914  27.87
 3.12 PlyA + 102650 102655    6                              -0.45

 4.03 PlyA - 102738 102733    6                               1.05
 4.02 Term - 104479 104378  102  1  0   82   48    81 0.223   1.78
 4.01 Init - 124435 124316  120  1  0   44   43   119 0.276   3.19
 4.00 Prom - 127840 127801   40                              -2.36

 5.03 PlyA - 130307 130302    6                               1.05
 5.02 Term - 138238 137964  275  2  2   15   42   195 0.585   3.23
 5.01 Init - 139886 139649  238  2  1   90   19   158 0.512   7.51
 5.00 Prom - 146989 146950   40                              -5.36

 6.00 Prom + 151694 151733   40                              -4.36
 6.01 Init + 153104 153203  100  1  1   81   48   106 0.555   6.32
 6.02 Intr + 162008 162129  122  0  2    1  -73   321 0.488   7.51
 6.03 Intr + 191464 191563  100  0  1   83   58    23 0.015  -1.52
 6.04 Term + 196485 196720  236  1  2   72   36   106 0.119   0.28
 6.05 PlyA + 197845 197850    6                               1.05

 7.02 PlyA - 198149 198144    6                               1.05
 7.01 Term - 201263 201155  109  1  1  107   41   100 0.373   5.18

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr -  80455  80276  180  1  0   11   50   199 0.896   8.46
S.002 Intr -  80861  80747  115  1  1  114   66    30 0.877   3.85
S.003 Init -  81979  81948   32  1  2   72   93    20 0.870   0.11
S.004 Term +  85492  85581   90  0  0   88   38    83 0.907   1.02

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815585f:27170945_27373491|GENSCAN_predicted_peptide_1|33_aa
XPQSTITSKTVTCLQIHSSAPTVHMSTLGAMAY

>gi568815585f:27170945_27373491|GENSCAN_predicted_CDS_1|102_bp
ngtccccagagcaccatcacttctaagactgtgacctgcttacagatccactcctctgcc
cccacagtccacatgtccaccctgggggccatggcttattga

>gi568815585f:27170945_27373491|GENSCAN_predicted_peptide_2|400_aa
MKAVIGMKALTRCGALKLDFSTSITVHVPFSLYDLKQIKADLGKFSEDPDRYIDVLQGLG
QTFNLAWRDVMLLLDQTLAFNEKNVALAAAREFGDTWYLSQVNDRMIAKERDKFPTGQQA
VPRSHRPHVQLTWMVLPQRFRDSPHLFGQALAQDLGHFSSPGTLVLQYVDDLLLATSSEA
LCQQATVDLLNFLADQGYKVSRSKAKLCLHQIKYLGLIFAKGTRALSKERIQPILAYPHP
KTLKQLRGFLGITGFCRLWIPRYSKIARPLYPLIKETQRANTHLVEWEPEAETAFKTLKQ
ALIHAPGLSFPTGQNFSLYITERAGIALGVLTQTRGTTSQPMAYLSPVAAILLLLTFGLC
IFNLLVKFVSSRIEAIKLQMVLQMEPQMSSINSFYQGPLD

>gi568815585f:27170945_27373491|GENSCAN_predicted_CDS_2|1203_bp
atgaaggctgtcatcggaatgaaggctctcaccagatgtggtgccttgaagttggacttc
tcaacctccatcactgtacatgtccccttctccctctatgatttaaagcagatcaaggca
gaccttgggaagttttcagaagatcctgataggtacatagatgtcctacagggtctaggg
caaaccttcaaccttgcttggagagatgtcatgctactattagatcaaaccctggccttt
aatgaaaagaatgtggctttagctgcagcccgagagtttggagatacctggtatcttagt
caagtaaatgatagaatgatagccaaagagagggataaattccctaccggtcagcaagcc
gtcccccgatcccacagaccacacgtccaacttacgtggatggtcttgccccaacggttt
agggatagccctcacctgtttggtcaggccctggcccaagatcttggtcacttctcaagt
ccaggcactctggtccttcagtatgtggatgacttacttttggctaccagttcggaagcc
ttgtgccagcaggctactgtagatctcttgaactttttagctgatcaagggtacaaggtg
tctaggtcaaaggccaagctttgcctacatcagatcaaatatctaggcctaatctttgcc
aaagggaccagggccctcagcaaggaacgaatacagcctatactggcttatcctcaccct
aagacattaaaacagttgcgggggttccttggaatcactggcttttgccgactatggatc
cccagatacagcaagatagccaggcccctctatcctctaatcaaggagacccagagagca
aatactcatctagtagaatgggaaccagaggcagaaacagccttcaaaaccttaaagcag
gccctaatacatgctccaggtttaagctttcccacaggacaaaacttctctttatacatc
acagagagagcagggatagctcttggagtccttactcagactcgtgggacaacctcacaa
ccaatggcatacctaagtcccgtggcagccatcttgctgttactcacctttggactctgt
atttttaacctacttgtcaaatttgtttcctctagaatcgaggccatcaagctacagatg
gtcttacaaatggaaccccaaatgagctcaattaacagcttctaccaaggacccctggac
tga

>gi568815585f:27170945_27373491|GENSCAN_predicted_peptide_3|567_aa
MDTAGGHYPKLSNTGTESQILQVLISKWELNNEHTDTRRGTTDIGAFLRVKERSRLRLAF
PSGSAVGSPQALVRRAPRPRLFPFGRNRHLPGVVPLATYMRIYKKGDIVDIKGMGTVQKG
MPHKCYHGKTGRVYNVTQHAVGIVVNKQVKGKILAKRINVRIEHIKHSKSRDSFLKRVKE
NDQKKKEAKEKGTWVQLKRQVRFSLMALNTIYILMIIMAYKTLHYLYYSYLCDLVLYHFL
PQSQIQFHEPSRCAFNMPSAFVPEGFCNQQLPDMSKAIQGQPAPTSRTWKLPADASKGST
TLASRLRVRRGPALSDCAPDPGTRSMRPLSMSGHFLLAPIPESSSDYLLPKDIKLAVLGA
GRVGKSAMIVRFLTKRFIGDYEPNTGKLYSRLVYVEGDQLSLQIQDTPGGVQIQDSLPQV
VDSLSKCVQWAEGFLLVYSITDYDSYLSIRPLYQHIRKVHPDSKAPVIIVGNKGDLLHAR
QVQTQDGIQLANELGSLFLEISTSENYEDVCDVFQHLCKEVSKMHGLSGERRRASIIPRP
RSPNMQDLKRRFKQALSPKVKAPSALG

>gi568815585f:27170945_27373491|GENSCAN_predicted_CDS_3|1704_bp
atggatacagctggaggtcattatcctaagctaagtaacacaggaacagaaagccaaata
ctgcaagttctcatttctaagtgggagctaaacaacgagcacacagatactagaagggga
acaacagacattggggcctttttgagggtgaaggaaagaagtcgcttacggttggcgttt
ccctcaggcagcgccgtgggcagcccacaggcccttgtaaggcgcgcgccgcggccccgc
ctctttcctttcggccggaaccgccatcttccaggagttgttcctttggccacatatatg
cgaatctataagaaaggtgatattgtagacatcaagggaatgggtactgttcaaaaagga
atgccccacaagtgttaccatggcaaaactggaagagtctacaatgttacccagcatgct
gttggcattgttgtaaacaaacaagttaagggcaagattcttgccaagagaattaatgtg
cgtattgagcacattaagcactctaagagccgagatagcttcctgaaacgtgtgaaggaa
aatgatcagaaaaagaaagaagccaaagagaaaggtacctgggttcaactaaagcgccag
gtgagatttagcctcatggctttgaataccatttatattctgatgattatcatggcctac
aaaaccctacattatctgtactatagctacctttgcgacctagtcttgtaccacttcctg
ccccaaagtcagatccaatttcatgagccttctcgctgtgcctttaatatgccaagtgca
ttcgtgcctgagggcttttgtaaccaacagctgccagacatgagtaaggctatccaaggg
cagccagcccccacttccagaacctggaagctccctgcagatgcatcaaagggctcaaca
actctggcgagccggctccgggtgcggcgaggcccagccctctcggattgcgcgccggac
cccgggacgcgctccatgcggccgctcagcatgtccgggcactttctgctcgcacccatc
cccgagtcctcctcggactacctactgcccaaggacatcaaactggcggtgctgggcgcc
ggccgcgtgggcaagagcgcaatgatcgtgcgcttcctgaccaagagattcattggagac
tatgaaccgaatacaggcaagctgtattcacggctggtctatgtcgagggggaccagctc
tccctgcagatccaggatactcccgggggcgtccagatccaagacagcctcccccaggtc
gtcgattccctgtccaaatgcgtgcagtgggccgagggttttctgctggtctattccatc
acagactatgacagctacttgtccatccgacccctttatcagcacatccggaaggtccac
cctgactctaaagcccctgtcatcatcgtgggcaacaagggggaccttttgcatgcccgg
caggtgcagacacaggacggtattcagctagccaatgagctgggcagcctgttccttgaa
atttccactagcgaaaactacgaagatgtctgtgatgtgtttcagcatctctgcaaagaa
gtgagcaagatgcacggcctcagtggggaaagaagaagagcctccatcatccctcggccc
cgctctcccaacatgcaggacctgaagagacgcttcaagcaggctctgtctcccaaagtc
aaagccccctctgcactggggtga

>gi568815585f:27170945_27373491|GENSCAN_predicted_peptide_4|73_aa
MPLEVAKRVHVSTYVGVMRQASGDSGSENTESAGNVGVEWLLDYDLTFSHGMFDLSSLKA
EKPLVTGGSLDML

>gi568815585f:27170945_27373491|GENSCAN_predicted_CDS_4|222_bp
atgcccttggaagtggcaaagcgtgtgcacgtgagcacctatgttggagtgatgagacaa
gccagcggagacagtggatcagagaacactgaatcagctgggaacgtgggagttgagtgg
cttttagattatgatttaactttttcccatgggatgtttgatctcagttccttgaaggca
gagaagcccctggtgacaggtggcagtctggacatgttgtag

>gi568815585f:27170945_27373491|GENSCAN_predicted_peptide_5|170_aa
MAGPYWTGLEERWEMAQAVLQEDGIPQSGQKLRGLSDQAAHNDGRLNSDRRGPVIGDQRT
PPLNFYLPSLGKTTRCLARGSASGDGRITGGTEAAGTRRRLDVDGNGPTGTRRQDRQKRK
DEKFGWGSQRRKPRPLSGLTPWEDLSTLLPSDLPRVTSTRNLALVLQARV

>gi568815585f:27170945_27373491|GENSCAN_predicted_CDS_5|513_bp
atggctggaccgtactggactggactggaggagaggtgggagatggctcaggcagttctt
caggaagatggcattccccagtctggccagaagctcagaggcctgtcggaccaggctgcc
cacaatgacggccgtctcaacagtgatcgccgtgggcccgtcattggtgaccagaggacc
ccacccctcaacttttatctcccctcacttggcaaaacaacacggtgcctggcacgtggc
tcagcaagtggagacgggcgcatcaccggaggaaccgaggcggctggaacacgcaggcgg
ctggatgttgacgggaatggacccacaggtactaggagacaggaccggcagaagcggaag
gatgagaagtttggctggggcagtcagaggagaaagcccaggcctttgagcggcctgact
ccatgggaagatctctccactctactcccttctgacttaccccgagttacttccactcga
aaccttgcactcgttctccaagcccgggtgtga

>gi568815585f:27170945_27373491|GENSCAN_predicted_peptide_6|185_aa
MTSHYAPIGMAKIQTLTTPNDGEDVEQQELSLAEEDEDEDEDEDEEEEEEEEEEEEEEEE
EEEEEEGWMQWLMPSPELPGVRPDPWLRLASLPRPERRSQVGRCLHRELPQDLPHSDGKQ
MALEERFISCICGISPSTGKNLMLSYQPLWGFHLCSKKMDSFKNSKINWNGASSVQRVIN
GPGAD

>gi568815585f:27170945_27373491|GENSCAN_predicted_CDS_6|558_bp
atgacctcccactatgcacctattggaatggccaaaatccaaacactgacaacaccaaat
gatggtgaggatgtggagcaacaggaactctcacttgctgaagaagatgaagatgaagac
gaagacgaagatgaagaagaagaggaagaggaggaggaagaggaagaagaagaagaagag
gaagaagaagaagaagaaggctggatgcagtggctcatgcctagtccagagctccccggg
gtccgccctgacccgtggctccgcctcgcctcgctcccgcgcccagagcggcgctcacag
gtcggtcgctgcctgcacagagagctgccgcaggacctgcctcattcagatggaaagcaa
atggcactggaagaacgtttcatctcctgcatctgtgggatcagccccagtacaggcaaa
aatctcatgctttcataccagcccctctggggctttcatctctgctccaaaaaaatggac
agcttcaaaaattcaaaaatcaattggaatggagccagctctgtccagcgggttattaat
ggccctggagctgactaa

>gi568815585f:27170945_27373491|GENSCAN_predicted_peptide_7|36_aa
XCVSAGCRHSCRLPRELPSAKGIVPPGYAWKVVPND

>gi568815585f:27170945_27373491|GENSCAN_predicted_CDS_7|111_bp
nngtgtgtgagtgccggctgcagacacagctgccgacttcccagagagctgccctctgca
aagggcattgtaccacctggctatgcctggaaggttgtacccaatgactga