GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:17:53 Sequence gi568815581r:36175051_36362985 : 187935 bp : 45.01% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 1264 1315 52 0 1 123 105 85 0.887 15.02 1.02 Intr + 3680 3712 33 0 0 80 82 35 0.546 0.29 1.03 Intr + 4401 4539 139 1 1 70 59 125 0.809 7.32 1.04 Intr + 7404 7528 125 0 2 94 84 22 0.735 2.63 1.05 Intr + 7582 7926 345 2 0 29 59 283 0.436 14.86 1.06 Term + 12105 12208 104 1 2 54 42 75 0.212 -2.06 1.07 PlyA + 13581 13586 6 1.05 2.04 PlyA - 18967 18962 6 1.05 2.03 Term - 20326 20236 91 0 1 128 55 78 0.976 5.69 2.02 Intr - 20862 20748 115 1 1 126 103 111 0.999 15.91 2.01 Init - 21623 21548 76 2 1 82 94 103 0.940 9.66 2.00 Prom - 34956 34917 40 -0.66 3.00 Prom + 35526 35565 40 -7.66 3.01 Init + 36092 36167 76 1 1 68 108 73 0.943 6.59 3.02 Intr + 44713 44889 177 2 0 51 58 85 0.166 1.69 3.03 Intr + 45230 45492 263 0 2 69 77 138 0.359 8.01 3.04 Intr + 66649 66738 90 0 0 110 0 65 0.075 0.09 3.05 Term + 74523 74654 132 2 0 105 43 163 0.842 11.59 3.06 PlyA + 75593 75598 6 1.05 4.14 PlyA - 75869 75864 6 1.05 4.13 Term - 77544 77387 158 1 2 95 39 96 0.006 3.50 4.12 Intr - 79472 79260 213 0 0 101 49 81 0.005 4.19 4.11 Intr - 80587 80435 153 2 0 85 80 82 0.020 7.14 4.10 Intr - 81057 80958 100 0 1 91 89 78 0.022 7.88 4.09 Intr - 82463 82411 53 0 2 100 38 72 0.016 2.03 4.08 Intr - 82899 82779 121 0 1 92 110 146 0.994 17.27 4.07 Intr - 83491 83443 49 0 1 97 117 62 0.987 8.68 4.06 Intr - 84849 84740 110 0 2 102 82 146 0.996 14.48 4.05 Intr - 85286 85179 108 2 0 99 101 160 0.975 18.88 4.04 Intr - 85810 85730 81 1 0 52 92 42 0.602 0.83 4.03 Intr - 86332 86293 40 0 1 80 116 -10 0.939 -0.77 4.02 Intr - 87701 87616 86 2 2 81 79 118 0.923 8.82 4.01 Init - 87935 87864 72 2 0 71 82 55 0.663 4.27 4.00 Prom - 93611 93572 40 -3.26 5.00 Prom + 93900 93939 40 -6.96 5.01 Init + 97689 97691 3 2 0 71 101 0 0.685 -0.40 5.02 Intr + 99149 99233 85 1 1 131 116 54 0.961 11.89 5.03 Intr + 122259 122336 78 1 0 88 110 55 0.971 7.22 5.04 Intr + 132247 132368 122 2 2 80 64 50 0.736 2.01 5.05 Intr + 138387 138587 201 2 0 92 84 171 0.941 16.58 5.06 Intr + 144906 145010 105 2 0 73 93 108 0.803 10.21 5.07 Intr + 150596 150681 86 1 2 81 79 118 0.909 8.82 5.08 Intr + 151965 152004 40 0 1 80 116 -10 0.919 -0.77 5.09 Intr + 152487 152567 81 2 0 52 92 42 0.589 0.83 5.10 Intr + 153011 153118 108 1 0 102 101 162 0.993 19.38 5.11 Intr + 153448 153557 110 0 2 102 82 150 0.999 14.88 5.12 Intr + 154806 154854 49 0 1 97 117 62 0.988 8.68 5.13 Intr + 155398 155518 121 0 1 92 110 146 0.994 17.27 5.14 Intr + 155834 155886 53 0 2 100 38 77 0.293 2.53 5.15 Intr + 157240 157339 100 0 1 91 89 78 0.355 7.88 5.16 Intr + 157710 157862 153 1 0 85 80 82 0.361 7.14 5.17 Intr + 158826 159038 213 1 0 101 49 81 0.257 4.19 5.18 Term + 160754 160911 158 0 2 95 39 129 0.318 6.80 5.19 PlyA + 164211 164216 6 1.05 6.03 PlyA - 165089 165084 6 1.05 6.02 Term - 174110 174096 15 2 0 69 55 14 0.085 -5.56 6.01 Intr - 183378 183269 110 1 2 49 111 112 0.927 9.60 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 82463 82306 158 0 2 100 48 123 0.977 7.60 S.002 Init + 89199 89250 52 2 1 123 105 85 0.939 15.02 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:36175051_36362985|GENSCAN_predicted_peptide_1|265_aa METKEPIVYTGSVERAPGPFGALATSSLDHTMEAGSPVGTTRASCDLQFLLPGVWGAHSG EHGTPHTPHFEDAGRCERVDASTTCLHMCVCTCIVLAHLRVSRESGAQHVLAPAGTLSRR RGWGPYLRSLRIQHRSSPVQPPAKPPEDEPDAEGYEWTIAVSFQLADFAPLHWLRLDGPG FGVLSVPPHRVVAFSLGITPSRGAADPKSSMSSPRPAHRPRPRPLPRPDQRALEQCPLEK AERSHPPCTRAAKNSHVPEYVSAAV >gi568815581r:36175051_36362985|GENSCAN_predicted_CDS_1|798_bp atggagacgaaagagccaatcgtctacacgggcagtgtagaacgggcgcctgggcctttt ggagccctggccacgtcctccctggatcacacgatggaagctggcagccccgtgggcacc actcgagccagctgtgacctgcagtttctgcttcctggagtgtggggcgcccactcagga gagcacggcacaccccacacccctcattttgaggatgctgggaggtgtgagcgagtggat gcttccactacgtgtctgcacatgtgtgtgtgcacttgcattgtccttgcacacctgcgt gtgtcccgtgagagcggagcccagcatgtgctggcacctgcaggaacattgtcacgacga cgaggatggggaccatatcttcggagcctgaggatccaacacaggtccagccctgtgcag ccgcccgccaagccgccagaggacgagccggacgccgaaggctacgagtggacgattgca gttagtttccaactcgccgacttcgcgcccctccactggctccggcttgatggtcccggc ttcggggtgctctcggtccctccccatcgcgtcgtcgctttctcccttggcataaccccc agccgcggggccgcagaccctaagagctccatgagctctccgcgccctgcccaccggccc cggccccgacccctccccagaccggaccagagagctttggaacaatgtcctctagaaaaa gctgaacgtagccacccaccttgtacgagagccgcaaagaatagccatgtgcctgaatat gtgtcagccgctgtgtga >gi568815581r:36175051_36362985|GENSCAN_predicted_peptide_2|93_aa MQVSTAALAVLLCTMALCNQVLSAPLAADTPTACCFSYTSRQIPQNFIADYFETSSQCSK PSVIFLTKRGRQVCADPSEEWVQKYVSDLELSA >gi568815581r:36175051_36362985|GENSCAN_predicted_CDS_2|282_bp atgcaggtctccactgctgcccttgccgtcctcctctgcaccatggctctctgcaaccag gtcctctctgcaccacttgctgctgacacgccgaccgcctgctgcttcagctacacctcc cgacagattccacagaatttcatagctgactactttgagacgagcagccagtgctccaag cccagtgtcatcttcctaaccaagagaggccggcaggtctgtgctgaccccagtgaggag tgggtccagaaatacgtcagtgacctggagctgagtgcctga >gi568815581r:36175051_36362985|GENSCAN_predicted_peptide_3|245_aa MKLCVTVLSLLVLVAAFCSLALSAPIKLGAYGGQVINGVLAQVQLTVGPVGPWIHPVVIS PMTECVIGIDIFNSWQNPYIGSLTGIDLANAFFSIPVHKAHQKQFAFSWQGQQYTFTVLP HGYINSPALRHNLIQRDLDHFLLLKDITLVHYNDIMMIGSSEQKVANTLDLLVETELKLI CGDVLDVLDKHLIPAATTGKSKAVCEMFDVRGKQHIQIPKLYTSSVTRHLHHFRLMQDSQ PLDRS >gi568815581r:36175051_36362985|GENSCAN_predicted_CDS_3|738_bp atgaagctctgcgtgactgtcctgtctctcctcgtgctagtagctgccttctgctctcta gcactctcagcaccaattaaattaggggcttatggaggtcaggtaattaatggagtttta gctcaggtccaacttacagtgggcccagtgggtccctggattcatcctgtggtcatttcc ccaatgacagaatgtgtaattggcatagatatattcaacagctggcagaacccctacatt ggctccctgactggtattgacttagcaaatgcctttttctccattcctgtccataaggcc caccagaagcaatttgccttcagctggcaaggtcagcaatatacctttactgtcctacct cacgggtatatcaactctccagctttgcgtcataatcttattcagagagaccttgatcac tttttgcttctgaaagatatcacactggtccattacaatgacattatgatgattggatcc agtgagcaaaaagtagcaaacacactggacttattggttgagactgagctaaagttaatc tgtggcgacgttctggatgtactggacaaacacctcattccagcagctacaactggcaag tccaaggcagtctgtgagatgtttgatgtccgaggcaaacagcacattcagatccccaag ctctacacctccagtgtgaccaggcacctgcaccacttcaggctcatgcaggactcacag cctttggaccgcagctaa >gi568815581r:36175051_36362985|GENSCAN_predicted_peptide_4|447_aa MDVVEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL TAREAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEM KMKNPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEY NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAAQAPAAIGAHEWADQAQ ISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDE DTVLKHLRASMKKLTRKQGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPAR FPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNSIVNARRRNLTVRPGFS RVARLLGDGCDPEDRAQASVMPGWNEL >gi568815581r:36175051_36362985|GENSCAN_predicted_CDS_4|1344_bp atggacgtggtagaggtcgcgggcagttggtgggcacaagagcgagaggacatcattatg aaatacgaaaagggacaccgagctgggctgccagaggacaaggggcctaagccttttcga agctacaacaacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctg actgcgcgggaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggat atgctgggagactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaag ggaatgcccatgaacatccggggcccgatgtggtcagtcctcctgaacactgaggaaatg aagatgaaaaaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgag cacatccagcgcatcgaccgggacgtaagcgggacattaaggaagcatatattcttcagg gatcgatacggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtat aacccggaggtgggctactgcagggacctgagccacatcgccgccttgttcctcctctat cttcctgaggaggatgcattctgggcactggtgcagctgctggccagtgagaggcactcc ctgcaggctgcccaggctcctgctgccatcggtgcccacgaatgggccgaccaagcccag atctctctcgggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcg ttgatgccgataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtcc aggtgtggcccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgag gacactgtgctcaagcatcttagggcctctatgaagaaactaacaagaaagcagggggac ctgccacccccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttca cgtggcgggaagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccgg ttcccgcggcccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgt cctggtggggctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagca ggcgtcaactccattgttaatgcacggaggaggaacctgactgttagacctgggttttcc agggttgcacggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgta atgccaggatggaatgagctgtga >gi568815581r:36175051_36362985|GENSCAN_predicted_peptide_5|621_aa MVRQATNQIVMNCADIDIITASYAPEGDEEIHATGFNYQNEDEKVTLSFPSTLQTGTGTL KIDFVGELNDKMKGFYRSKYTTPSGEVRYAAVTQFENVIDRKPYPDDENLVEVKFARTPV TSTYLVAFVVGEYDFVETRSKDGVCVCVYTPVGKAEQGKFALEVSVGHPSEVDEICDAIS YSKGASVIRMLHDYIGDKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPLTAREAK QIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNIEEMKLKNPG RYQIMKEKGKRSSEHIQRIDRDISGTLRKHMFFRDRYGTKQRELLHILLAYEEYNPEVGY CRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHEWADQAQISLGLT LRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDEDTVLKH LRASMKKLTRKQGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPARFPRPIW SASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRNLTVRPGFSRVARLL GDGCDPEDRAQASVMPGWNEL >gi568815581r:36175051_36362985|GENSCAN_predicted_CDS_5|1866_bp atggtgaggcaggcgactaatcagattgtgatgaattgtgctgatattgatattattaca gcttcatatgcaccagaaggagatgaagaaatacatgctacaggatttaactatcagaat gaagatgaaaaagtcaccttgtctttccctagtactctgcaaacaggtacgggaacctta aagatagattttgttggagagctgaatgacaaaatgaaaggtttctatagaagtaagtat actaccccttctggagaggtgcgctatgctgctgtaacacagtttgagaatgtaattgac cggaaaccataccctgatgatgaaaatttagtggaagtgaagtttgcccgcacacctgtt acatctacatatctggtggcatttgttgtgggtgaatatgactttgtagaaacaaggtca aaagatggtgtgtgtgtctgtgtttacactcctgttggcaaagcagaacaaggaaaattt gcattagaggtcagtgtgggccatccatctgaggttgatgagatatgtgatgctatatca tatagcaaaggtgcatctgtcatccgaatgctgcatgactacattggggataagggacac cgagctgggctgccagaggacaaggggcctaagccttttcgaagctacaacaacaacgtc gatcatttggggattgtacatgagacggagctgcctcctctgactgcgcgggaggcgaag caaattcggcgggagatcagccgaaagagcaagtgggtggatatgctgggagactgggag aaatacaaaagcagcagaaagctcatagatcgagcgtacaagggaatgcccatgaacatc cggggcccgatgtggtcagtcctcctgaacattgaggaaatgaagttgaaaaaccccgga agataccagatcatgaaggagaagggcaagaggtcatctgagcacatccagcgcatcgac cgggacataagcgggacattaaggaagcatatgttcttcagggatcgatacggaaccaag cagcgggaactactccacatcctcctggcatatgaggagtataacccggaggtgggctac tgcagggacctgagccacatcgccgccttgttcctcctctatcttcctgaggaggatgca ttctgggcactggtgcagctgctggccagtgagaggcactccctgcaggctgcccgggct cctgctgccatcggtgcccacgaatgggccgaccaagcccagatctctctcgggctcacc ctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcgttgatgccgataacaaga atcgcctttaaggttcagcagaagcgcctcacgaagacgtccaggtgtggcccgtgggca cgtttttgcaaccggttcgttgatacctgggccagggatgaggacactgtgctcaagcat cttagggcctctatgaagaaactaacaagaaagcagggggacctgccacccccagccaaa cccgagcaagggtcgtcggcatccaggcctgtgccggcttcacgtggcgggaagaccctc tgcaagggggacaggcaggcccctccaggcccaccagcccggttcccgcggcccatttgg tcagcttccccgccacgggcacctcgttcttccacaccctgtcctggtggggctgtccgg gaagacacctaccctgtgggcactcaggcgtgccgcaaagcaggcgtcaacgccattgtt aatgcacggaggaggaacctgactgttagacctgggttttccagggttgcacggcttctg ggagacggatgtgaccctgaggacagggcacaggccagtgtaatgccaggatggaatgag ctgtga >gi568815581r:36175051_36362985|GENSCAN_predicted_peptide_6|41_aa XQSEEDVSQFDSKFTRQTPVDSPDDTTLSESANQVFLPPEL >gi568815581r:36175051_36362985|GENSCAN_predicted_CDS_6|126_bp nagcaatctgaagaggatgtaagtcagtttgattccaagtttacacgtcagacacctgtc gacagcccagatgacacaactctcagtgaaagtgccaatcaggtgtttttgcctccggaa ctgtga