GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:17:55 Sequence gi568815581r:36277849_36428929 : 151081 bp : 44.90% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 19461 19538 78 1 0 88 110 55 0.967 7.22 1.02 Intr + 29449 29570 122 2 2 80 64 50 0.734 2.01 1.03 Intr + 35589 35789 201 2 0 92 84 171 0.941 16.58 1.04 Intr + 42108 42212 105 2 0 73 93 108 0.803 10.21 1.05 Intr + 47798 47883 86 1 2 81 79 118 0.909 8.82 1.06 Intr + 49167 49206 40 0 1 80 116 -10 0.919 -0.77 1.07 Intr + 49689 49769 81 2 0 52 92 42 0.589 0.83 1.08 Intr + 50213 50320 108 1 0 102 101 162 0.993 19.38 1.09 Intr + 50650 50759 110 0 2 102 82 150 0.999 14.88 1.10 Intr + 52008 52056 49 0 1 97 117 62 0.988 8.68 1.11 Intr + 52600 52720 121 0 1 92 110 146 0.994 17.27 1.12 Intr + 53036 53088 53 0 2 100 38 77 0.355 2.53 1.13 Intr + 54442 54541 100 0 1 91 89 78 0.429 7.88 1.14 Intr + 54912 55064 153 1 0 85 80 82 0.436 7.14 1.15 Intr + 56028 56240 213 1 0 101 49 81 0.305 4.19 1.16 Term + 57956 58113 158 0 2 95 39 129 0.375 6.80 1.17 PlyA + 61413 61418 6 1.05 2.00 Prom + 79376 79415 40 -4.06 2.01 Init + 85218 85220 3 2 0 113 81 0 0.577 1.80 2.02 Term + 95617 95748 132 0 0 105 43 126 0.556 7.89 2.03 PlyA + 96687 96692 6 1.05 3.14 PlyA - 96963 96958 6 1.05 3.13 Term - 98638 98481 158 2 2 95 39 129 0.048 6.80 3.12 Intr - 100566 100354 213 1 0 101 49 81 0.041 4.19 3.11 Intr - 101700 101548 153 1 0 85 80 82 0.058 7.14 3.10 Intr - 102170 102071 100 2 1 91 89 78 0.062 7.88 3.09 Intr - 103576 103524 53 2 2 100 38 77 0.048 2.53 3.08 Intr - 104012 103892 121 2 1 92 110 146 0.994 17.27 3.07 Intr - 104604 104556 49 2 1 97 117 62 0.988 8.68 3.06 Intr - 105962 105853 110 2 2 102 82 150 0.999 14.88 3.05 Intr - 106399 106292 108 1 0 102 101 153 0.993 18.48 3.04 Intr - 106923 106843 81 0 0 52 92 42 0.605 0.83 3.03 Intr - 107445 107406 40 2 1 80 116 -10 0.949 -0.77 3.02 Intr - 108813 108728 86 0 2 81 79 118 0.907 8.82 3.01 Init - 109047 108976 72 0 0 71 82 55 0.670 4.27 3.00 Prom - 109640 109601 40 -14.47 4.00 Prom + 109866 109905 40 -14.86 4.01 Init + 110311 110362 52 0 1 123 105 85 0.998 15.02 4.02 Intr + 112725 112757 33 1 0 80 110 35 0.887 3.09 4.03 Intr + 113444 113582 139 0 1 70 59 112 0.801 6.02 4.04 Intr + 116751 116819 69 0 0 87 102 28 0.272 2.40 4.05 Intr + 141431 141446 16 2 1 138 55 2 0.040 -2.55 4.06 Term + 146698 146829 132 0 0 105 43 126 0.574 7.89 4.07 PlyA + 147768 147773 6 1.05 5.02 PlyA - 148044 148039 6 1.05 5.01 Term - 149719 149562 158 2 2 95 39 133 0.949 7.20 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 18402 18450 49 2 1 83 58 48 0.912 0.41 S.002 Term - 103576 103419 158 2 2 100 48 128 0.937 8.10 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:36277849_36428929|GENSCAN_predicted_peptide_1|592_aa XIHATGFNYQNEDEKVTLSFPSTLQTGTGTLKIDFVGELNDKMKGFYRSKYTTPSGEVRY AAVTQFENVIDRKPYPDDENLVEVKFARTPVTSTYLVAFVVGEYDFVETRSKDGVCVCVY TPVGKAEQGKFALEVSVGHPSEVDEICDAISYSKGASVIRMLHDYIGDKGHRAGLPEDKG PKPFRSYNNNVDHLGIVHETELPPLTAREAKQIRREISRKSKWVDMLGDWEKYKSSRKLI DRAYKGMPMNIRGPMWSVLLNIEEMKLKNPGRYQIMKEKGKRSSEHIQRIDRDISGTLRK HMFFRDRYGTKQRELLHILLAYEEYNPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLA SERHSLQAARAPAAIGAHEWADQAQISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKR LTKTSRCGPWARFCNRFVDTWARDEDTVLKHLRASMKKLTRKQGDLPPPAKPEQGSSASR PVPASRGGKTLCKGDRQAPPGPPARFPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGTQ ACRKAGVNAIVNARRRNLTVRPGFSRVARLLGDGCDPEDRAQASVMPGWNEL >gi568815581r:36277849_36428929|GENSCAN_predicted_CDS_1|1779_bp naaatacatgctacaggatttaactatcagaatgaagatgaaaaagtcaccttgtctttc cctagtactctgcaaacaggtacgggaaccttaaagatagattttgttggagagctgaat gacaaaatgaaaggtttctatagaagtaagtatactaccccttctggagaggtgcgctat gctgctgtaacacagtttgagaatgtaattgaccggaaaccataccctgatgatgaaaat ttagtggaagtgaagtttgcccgcacacctgttacatctacatatctggtggcatttgtt gtgggtgaatatgactttgtagaaacaaggtcaaaagatggtgtgtgtgtctgtgtttac actcctgttggcaaagcagaacaaggaaaatttgcattagaggtcagtgtgggccatcca tctgaggttgatgagatatgtgatgctatatcatatagcaaaggtgcatctgtcatccga atgctgcatgactacattggggataagggacaccgagctgggctgccagaggacaagggg cctaagccttttcgaagctacaacaacaacgtcgatcatttggggattgtacatgagacg gagctgcctcctctgactgcgcgggaggcgaagcaaattcggcgggagatcagccgaaag agcaagtgggtggatatgctgggagactgggagaaatacaaaagcagcagaaagctcata gatcgagcgtacaagggaatgcccatgaacatccggggcccgatgtggtcagtcctcctg aacattgaggaaatgaagttgaaaaaccccggaagataccagatcatgaaggagaagggc aagaggtcatctgagcacatccagcgcatcgaccgggacataagcgggacattaaggaag catatgttcttcagggatcgatacggaaccaagcagcgggaactactccacatcctcctg gcatatgaggagtataacccggaggtgggctactgcagggacctgagccacatcgccgcc ttgttcctcctctatcttcctgaggaggatgcattctgggcactggtgcagctgctggcc agtgagaggcactccctgcaggctgcccgggctcctgctgccatcggtgcccacgaatgg gccgaccaagcccagatctctctcgggctcaccctgcgcctgtgggacgtgtatctggta gaaggcgaacaggcgttgatgccgataacaagaatcgcctttaaggttcagcagaagcgc ctcacgaagacgtccaggtgtggcccgtgggcacgtttttgcaaccggttcgttgatacc tgggccagggatgaggacactgtgctcaagcatcttagggcctctatgaagaaactaaca agaaagcagggggacctgccacccccagccaaacccgagcaagggtcgtcggcatccagg cctgtgccggcttcacgtggcgggaagaccctctgcaagggggacaggcaggcccctcca ggcccaccagcccggttcccgcggcccatttggtcagcttccccgccacgggcacctcgt tcttccacaccctgtcctggtggggctgtccgggaagacacctaccctgtgggcactcag gcgtgccgcaaagcaggcgtcaacgccattgttaatgcacggaggaggaacctgactgtt agacctgggttttccagggttgcacggcttctgggagacggatgtgaccctgaggacagg gcacaggccagtgtaatgccaggatggaatgagctgtga >gi568815581r:36277849_36428929|GENSCAN_predicted_peptide_2|44_aa MAVCEMFHVRGKQHIQIPKLYTSSVTRHLHHFRLMQDSQPLDLS >gi568815581r:36277849_36428929|GENSCAN_predicted_CDS_2|135_bp atggcagtctgtgagatgtttcatgtccgaggcaaacagcacattcagatccccaagctc tacacctccagtgtgaccaggcacctgcaccacttcaggctcatgcaggactcacagcct ttggacctcagctaa >gi568815581r:36277849_36428929|GENSCAN_predicted_peptide_3|447_aa MDVVEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL TAREAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEM KLKNPGRYQIMKEKGKRSSEHIQRIDRDISGTLRKHMFFRDRYGTKQRELLHILLAYEEY NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHEWADQAQ ISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDE DTVLKHLRASMKKLTRKQGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPAR FPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRNLTVRPGFS RVARLLGDGCDPEDRAQASVMPGWNEL >gi568815581r:36277849_36428929|GENSCAN_predicted_CDS_3|1344_bp atggacgtggtagaggtcgcgggcagttggtgggcacaagagcgagaggacatcattatg aaatacgaaaagggacaccgagctgggctgccagaggacaaggggcctaagccttttcga agctacaacaacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctg actgcgcgggaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggat atgctgggagactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaag ggaatgcccatgaacatccggggcccgatgtggtcagtcctcctgaacactgaggaaatg aagttgaaaaaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgag cacatccagcgcatcgaccgggacataagcgggacattaaggaagcatatgttcttcagg gatcgatacggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtat aacccggaggtgggctactgcagggacctgagccacatcgccgccttgttcctcctctat cttcctgaggaggatgcattctgggcactggtgcagctgctggccagtgagaggcactcc ctgcaggctgcccgggctcctgctgccatcggtgcccacgaatgggccgaccaagcccag atctctctcgggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcg ttgatgccgataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtcc aggtgtggcccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgag gacactgtgctcaagcatcttagggcctctatgaagaaactaacaagaaagcagggggac ctgccacccccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttca cgtggcgggaagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccgg ttcccgcggcccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgt cctggtggggctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagca ggcgtcaacgccattgttaatgcacggaggaggaacctgactgttagacctgggttttcc agggttgcacggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgta atgccaggatggaatgagctgtga >gi568815581r:36277849_36428929|GENSCAN_predicted_peptide_4|146_aa METKEPIVYTGSVERAPGPFGALATSSPDHTMEAGSPVGTTRASCDLQFLLPGVWGAHSG EQGIPHTPHFEDAGRGWNICPEKANTSIGHVSLSPFATPLVMYAVCEMFHVRGKQHIQIP KLYTSSVTRHLHHFRLMQDSQPLDLS >gi568815581r:36277849_36428929|GENSCAN_predicted_CDS_4|441_bp atggagacgaaagagccaatcgtctacacgggcagtgtagaacgggcgcctgggcctttt ggagccctggccacgtcctccccagatcacacgatggaagctggcagccccgtgggcacc actcgagccagctgtgacctgcagtttctgcttcctggagtgtggggcgcccactcagga gagcagggcataccccacacccctcattttgaggatgctgggaggggatggaatatttgt cctgagaaggccaatacatccatcggacacgtgtctctatccccatttgctacgcctttg gttatgtatgcagtctgtgagatgtttcatgtccgaggcaaacagcacattcagatcccc aagctctacacctccagtgtgaccaggcacctgcaccacttcaggctcatgcaggactca cagcctttggacctcagctaa >gi568815581r:36277849_36428929|GENSCAN_predicted_peptide_5|52_aa XCRKAGVNAIVNARRRNLTVRPGFSRVARLLGDGCDPEDRAQASVMPGWDEL >gi568815581r:36277849_36428929|GENSCAN_predicted_CDS_5|159_bp ncgtgccgcaaagcaggcgtcaacgccattgttaatgcacggaggaggaacctgactgtt agacctgggttttccagggttgcacggcttctgggagacggatgtgaccctgaggacagg gcacaggccagtgtaatgccaggatgggatgagctgtga