GENSCAN 1.0 Date run: 4-Nov-116 Time: 14:18:03 Sequence gi568815581r:36065998_36253956 : 187959 bp : 45.27% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4450 4561 112 2 1 108 103 76 0.973 10.54 1.02 Term + 4954 5044 91 1 1 128 55 146 0.999 12.49 1.03 PlyA + 6012 6017 6 1.05 2.05 PlyA - 8251 8246 6 1.05 2.04 Term - 22765 22675 91 0 1 128 55 104 0.999 8.29 2.03 Intr - 23300 23186 115 0 1 135 103 127 0.999 18.41 2.02 Intr - 24182 23989 194 1 2 75 94 104 0.793 8.84 2.01 Init - 32478 32234 245 0 2 86 55 123 0.588 6.01 2.00 Prom - 36787 36748 40 -0.16 3.00 Prom + 37344 37383 40 -5.46 3.01 Init + 37909 37984 76 0 1 68 108 85 0.987 7.77 3.02 Intr + 39228 39287 60 1 0 110 45 46 0.041 1.21 3.03 Intr + 46280 46368 89 0 2 69 39 46 0.037 -2.51 3.04 Intr + 47357 47512 156 1 0 75 77 60 0.142 3.81 3.05 Intr + 51728 51781 54 1 0 108 72 30 0.233 2.58 3.06 Intr + 52995 53070 76 2 1 22 113 55 0.371 0.49 3.07 Intr + 53291 53348 58 0 1 109 101 42 0.399 5.54 3.08 Term + 55184 55298 115 2 1 53 44 78 0.248 -2.06 3.09 PlyA + 56689 56694 6 1.05 4.00 Prom + 57586 57625 40 -1.56 4.01 Init + 63487 63598 112 0 1 68 72 69 0.537 3.67 4.02 Intr + 81583 81698 116 2 2 68 109 20 0.083 2.27 4.03 Intr + 83583 83714 132 2 0 91 72 21 0.021 1.64 4.04 Intr + 90954 91004 51 2 0 62 79 51 0.049 0.70 4.05 Term + 95616 95747 132 2 0 105 43 104 0.247 5.69 4.06 PlyA + 96686 96691 6 1.05 5.14 PlyA - 96962 96957 6 1.05 5.13 Term - 98638 98481 158 2 2 95 39 129 0.728 6.80 5.12 Intr - 100566 100354 213 1 0 101 49 71 0.634 3.19 5.11 Intr - 101699 101547 153 0 0 85 80 82 0.818 7.14 5.10 Intr - 102169 102070 100 1 1 91 89 78 0.700 7.88 5.09 Intr - 103578 103526 53 1 2 100 38 77 0.764 2.53 5.08 Intr - 104014 103894 121 1 1 92 110 146 0.994 17.27 5.07 Intr - 104606 104558 49 1 1 97 117 62 0.988 8.68 5.06 Intr - 105967 105858 110 1 2 92 82 150 0.999 13.88 5.05 Intr - 106404 106297 108 0 0 107 101 162 0.994 19.88 5.04 Intr - 106928 106848 81 2 0 52 92 42 0.599 0.83 5.03 Intr - 107450 107411 40 1 1 80 116 -10 0.935 -0.77 5.02 Intr - 108819 108734 86 0 2 81 79 118 0.921 8.82 5.01 Init - 109053 108982 72 0 0 71 82 50 0.673 3.77 5.00 Prom - 109646 109607 40 -14.47 6.00 Prom + 109872 109911 40 -14.86 6.01 Init + 110317 110368 52 0 1 123 105 85 0.997 15.02 6.02 Intr + 112733 112765 33 0 0 80 82 35 0.546 0.29 6.03 Intr + 113454 113592 139 1 1 70 59 125 0.809 7.32 6.04 Intr + 116457 116581 125 0 2 94 84 22 0.735 2.63 6.05 Intr + 116635 116979 345 2 0 29 59 283 0.436 14.86 6.06 Term + 121158 121261 104 1 2 54 42 75 0.212 -2.06 6.07 PlyA + 122634 122639 6 1.05 7.04 PlyA - 128020 128015 6 1.05 7.03 Term - 129379 129289 91 0 1 128 55 78 0.976 5.69 7.02 Intr - 129915 129801 115 1 1 126 103 111 0.999 15.91 7.01 Init - 130676 130601 76 2 1 82 94 103 0.940 9.66 7.00 Prom - 144009 143970 40 -0.66 8.00 Prom + 144579 144618 40 -7.66 8.01 Init + 145145 145220 76 1 1 68 108 73 0.943 6.59 8.02 Intr + 153766 153942 177 2 0 51 58 85 0.166 1.69 8.03 Intr + 154283 154545 263 0 2 69 77 138 0.359 8.01 8.04 Intr + 175702 175791 90 0 0 110 0 65 0.075 0.09 8.05 Term + 183576 183707 132 2 0 105 43 163 0.853 11.59 8.06 PlyA + 184646 184651 6 1.05 9.02 PlyA - 184922 184917 6 1.05 9.01 Term - 186597 186440 158 1 2 95 39 96 0.601 3.50 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 38531 38645 115 0 1 82 59 48 0.873 0.81 S.002 Term + 39228 39315 88 0 1 110 49 89 0.876 4.33 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:36065998_36253956|GENSCAN_predicted_peptide_1|67_aa XGTNKELCCLVYTSWQIPQKFIVDYSETSPQCPKPGVILLTKRGRQICADPNKKWVQKYI SDLKLNA >gi568815581r:36065998_36253956|GENSCAN_predicted_CDS_1|204_bp nttggtaccaacaaagagctctgctgcctcgtctatacctcctggcagattccacaaaag ttcatagttgactattctgaaaccagcccccagtgccccaagccaggtgtcatcctccta accaagagaggccggcagatctgtgctgaccccaataagaagtgggtccagaaatacatc agcgacctgaagctgaatgcctga >gi568815581r:36065998_36253956|GENSCAN_predicted_peptide_2|214_aa MVSKSLWRKWEQKLDCELKKQIHNYPECRILQDEICRTMSWLEQLISDCHTLTTELASPT LLTMEMDLRRLPQPALACERQRAAYKEESWFQTSEGHGQQTVVSPFLALLTLEPTFRHLL RIMQVSTAALAVLLCTMALCNQFSASLAADTPTACCFSYTSRQIPQNFIADYFETSSQCS KPGVIFLTKRSRQVCADPSEEWVQKYVSDLELSA >gi568815581r:36065998_36253956|GENSCAN_predicted_CDS_2|645_bp atggtgtccaagtcactttggaggaaatgggagcagaagctggattgtgaattgaagaaa caaatacacaattatccagagtgcagaattcttcaggatgaaatctgtaggacaatgagc tggttggagcagctcatcagtgactgtcacactctgactacggagctggccagccccacg cttcttaccatggaaatggaccttcgtcgcctgccacaacctgctcttgcttgtgaaagg cagagggctgcctataaagaggagagctggtttcagacttcagaaggacacgggcagcag acagtggtcagtcctttcttggctctgctgacactcgagcccacattccgtcacctgctc agaatcatgcaggtctccactgctgcccttgctgtcctcctctgcaccatggctctctgc aaccagttctctgcatcacttgctgctgacacgccgaccgcctgctgcttcagctacacc tcccggcagattccacagaatttcatagctgactactttgagacgagcagccagtgctcc aagcccggtgtcatcttcctaaccaagcgaagccggcaggtctgtgctgaccccagtgag gagtgggtccagaaatatgtcagcgacctggagctgagtgcctga >gi568815581r:36065998_36253956|GENSCAN_predicted_peptide_3|227_aa MKLCVTVLSLLMLVAAFCSPALSAPNSKPKEASKSVLIPVNPGSRKRAEIVEKQTQAFIM QVADLQQKGHAQPHQGYINSPVLCHNLIQRDLEHFLLLQDITLVHYNDDIMMTRSSEQEV ANTLDLLAVSMDDDDCWQNRGVVVKNSSPQAQATDWYWSMTCYEPSHTAEDGTIKLQENK LRAPTDSTLCTMYNSKDMEPTQIAINDRRDKENVAYKHPGILRSHKK >gi568815581r:36065998_36253956|GENSCAN_predicted_CDS_3|684_bp atgaagctctgcgtgactgtcctgtctctcctcatgctagtagctgccttctgctctcca gcgctctcagcaccaaattccaaaccaaaagaagcaagcaagtctgtgctgatcccagtg aatcctgggtccaggaaaagagctgaaattgtagaaaaacagacacaagcttttatcatg caagtggctgacctgcaacaaaaggggcatgcacagcctcaccaggggtatatcaactct ccggttttgtgtcataatcttattcagagagaccttgaacactttttgcttctgcaagat atcacactggtccattacaatgatgacattatgatgactagatccagtgaacaagaagta gcaaacacactggacttattggcagtgagcatggatgatgatgattgctggcaaaacagg ggagtggtagtgaagaacagcagtccccaggcccaggccacagattggtactggtccatg acctgttacgaaccgagccacacagcagaagatgggaccatcaagttgcaggaaaacaag ctcagggctcccactgattctacattatgcactatgtacaatagcaaagacatggaacca acccaaattgccatcaacgatagaagggataaagaaaatgtggcatataaacaccctgga atactacgcagccataaaaaatga >gi568815581r:36065998_36253956|GENSCAN_predicted_peptide_4|180_aa MEEVRVSWIEDQTSHNIPLSQNLIKALPVFNSVKAERSICVHQCRKDGDYKEGKCCIRKG CWHIFVPPPQNLIPKQTHAFLLSAKSELEKTNSVPVGFPIFGWLAGLPPQPLSRLLSRSW ASVCISEEKTGTVEGKTAVCEMFHVRGKQHIQIPKLSTSSVTRHLHHFRLMQDSQPLDLS >gi568815581r:36065998_36253956|GENSCAN_predicted_CDS_4|543_bp atggaggaagttcgagtgtcctggatagaagatcaaaccagccacaacattcccttaagc caaaacctaatcaaagccctacctgtcttcaactctgtcaaggctgagagaagcatctgt gtgcaccaatgcagaaaagacggggactacaaggaaggaaaatgttgcatcaggaagggc tgctggcacatctttgttccccctccccaaaacctcatccccaagcagacccatgcgttc ctgctctctgccaagtccgaactggagaaaacaaattctgtcccagtggggtttcccatc tttggctggctggctggacttcctccacaaccactctctcgccttctaagcaggagctgg gcttctgtgtgcatcagtgaagagaagacggggactgtggaagggaaaacagcagtctgt gagatgtttcatgtccgaggcaaacagcacattcagatccccaagctctccacctccagt gtgaccaggcacctgcaccacttcaggctcatgcaggactcacagcctttggacctcagc taa >gi568815581r:36065998_36253956|GENSCAN_predicted_peptide_5|447_aa MDVVEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL TAREAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNIEEM KLKNPGRYQIMKEKGKRSSEHIQRIDRDISGTLRKHMFFRDRYGTKQRELLHILLAYEEY NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHEWADQAQ ISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDE DTVLKHLRASMKKLTRKQGDLPPPAKPEQGSSASRPVPASRGRKTLCKGDRQAPPGPPAR FPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRNLTVRPGFS RVARLLGDGCDPEDRAQASVMPGWNEL >gi568815581r:36065998_36253956|GENSCAN_predicted_CDS_5|1344_bp atggacgtggtagaggtcgcgggtagttggtgggcacaagagcgagaggacatcattatg aaatacgaaaagggacaccgagctgggctgccagaggacaaggggcctaagccttttcga agctacaacaacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctg actgcgcgggaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggat atgctgggagactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaag ggaatgcccatgaacatccggggcccgatgtggtcagtcctcctgaacattgaggaaatg aagttgaaaaaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgag cacatccagcgcatcgaccgggacataagcgggacattaaggaagcatatgttcttcagg gatcgatacggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtat aacccggaggtgggctactgcagggacctgagccacatcgccgccttgttcctcctctat cttcctgaggaggatgcattctgggcactggtgcagctgctggccagtgagaggcactcc ctgcaggctgcccgggctcctgctgccatcggtgcccacgaatgggccgaccaagcccag atctctctcgggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcg ttgatgccgataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtcc aggtgtggcccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgag gacactgtgctcaagcatcttagggcctctatgaagaaactaacaagaaagcagggggac ctgccacccccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttca cgtggcaggaagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccgg ttcccgcggcccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgt cctggtggggctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagca ggcgtcaacgccattgttaatgcacggaggaggaacctgactgttagacctgggttttcc agggttgcacggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgta atgccaggatggaatgagctgtga >gi568815581r:36065998_36253956|GENSCAN_predicted_peptide_6|265_aa METKEPIVYTGSVERAPGPFGALATSSLDHTMEAGSPVGTTRASCDLQFLLPGVWGAHSG EHGTPHTPHFEDAGRCERVDASTTCLHMCVCTCIVLAHLRVSRESGAQHVLAPAGTLSRR RGWGPYLRSLRIQHRSSPVQPPAKPPEDEPDAEGYEWTIAVSFQLADFAPLHWLRLDGPG FGVLSVPPHRVVAFSLGITPSRGAADPKSSMSSPRPAHRPRPRPLPRPDQRALEQCPLEK AERSHPPCTRAAKNSHVPEYVSAAV >gi568815581r:36065998_36253956|GENSCAN_predicted_CDS_6|798_bp atggagacgaaagagccaatcgtctacacgggcagtgtagaacgggcgcctgggcctttt ggagccctggccacgtcctccctggatcacacgatggaagctggcagccccgtgggcacc actcgagccagctgtgacctgcagtttctgcttcctggagtgtggggcgcccactcagga gagcacggcacaccccacacccctcattttgaggatgctgggaggtgtgagcgagtggat gcttccactacgtgtctgcacatgtgtgtgtgcacttgcattgtccttgcacacctgcgt gtgtcccgtgagagcggagcccagcatgtgctggcacctgcaggaacattgtcacgacga cgaggatggggaccatatcttcggagcctgaggatccaacacaggtccagccctgtgcag ccgcccgccaagccgccagaggacgagccggacgccgaaggctacgagtggacgattgca gttagtttccaactcgccgacttcgcgcccctccactggctccggcttgatggtcccggc ttcggggtgctctcggtccctccccatcgcgtcgtcgctttctcccttggcataaccccc agccgcggggccgcagaccctaagagctccatgagctctccgcgccctgcccaccggccc cggccccgacccctccccagaccggaccagagagctttggaacaatgtcctctagaaaaa gctgaacgtagccacccaccttgtacgagagccgcaaagaatagccatgtgcctgaatat gtgtcagccgctgtgtga >gi568815581r:36065998_36253956|GENSCAN_predicted_peptide_7|93_aa MQVSTAALAVLLCTMALCNQVLSAPLAADTPTACCFSYTSRQIPQNFIADYFETSSQCSK PSVIFLTKRGRQVCADPSEEWVQKYVSDLELSA >gi568815581r:36065998_36253956|GENSCAN_predicted_CDS_7|282_bp atgcaggtctccactgctgcccttgccgtcctcctctgcaccatggctctctgcaaccag gtcctctctgcaccacttgctgctgacacgccgaccgcctgctgcttcagctacacctcc cgacagattccacagaatttcatagctgactactttgagacgagcagccagtgctccaag cccagtgtcatcttcctaaccaagagaggccggcaggtctgtgctgaccccagtgaggag tgggtccagaaatacgtcagtgacctggagctgagtgcctga >gi568815581r:36065998_36253956|GENSCAN_predicted_peptide_8|245_aa MKLCVTVLSLLVLVAAFCSLALSAPIKLGAYGGQVINGVLAQVQLTVGPVGPWIHPVVIS PMTECVIGIDIFNSWQNPYIGSLTGIDLANAFFSIPVHKAHQKQFAFSWQGQQYTFTVLP HGYINSPALRHNLIQRDLDHFLLLKDITLVHYNDIMMIGSSEQKVANTLDLLVETELKLI CGDVLDVLDKHLIPAATTGKSKAVCEMFDVRGKQHIQIPKLYTSSVTRHLHHFRLMQDSQ PLDRS >gi568815581r:36065998_36253956|GENSCAN_predicted_CDS_8|738_bp atgaagctctgcgtgactgtcctgtctctcctcgtgctagtagctgccttctgctctcta gcactctcagcaccaattaaattaggggcttatggaggtcaggtaattaatggagtttta gctcaggtccaacttacagtgggcccagtgggtccctggattcatcctgtggtcatttcc ccaatgacagaatgtgtaattggcatagatatattcaacagctggcagaacccctacatt ggctccctgactggtattgacttagcaaatgcctttttctccattcctgtccataaggcc caccagaagcaatttgccttcagctggcaaggtcagcaatatacctttactgtcctacct cacgggtatatcaactctccagctttgcgtcataatcttattcagagagaccttgatcac tttttgcttctgaaagatatcacactggtccattacaatgacattatgatgattggatcc agtgagcaaaaagtagcaaacacactggacttattggttgagactgagctaaagttaatc tgtggcgacgttctggatgtactggacaaacacctcattccagcagctacaactggcaag tccaaggcagtctgtgagatgtttgatgtccgaggcaaacagcacattcagatccccaag ctctacacctccagtgtgaccaggcacctgcaccacttcaggctcatgcaggactcacag cctttggaccgcagctaa >gi568815581r:36065998_36253956|GENSCAN_predicted_peptide_9|52_aa XCRKAGVNSIVNARRRNLTVRPGFSRVARLLGDGCDPEDRAQASVMPGWNEL >gi568815581r:36065998_36253956|GENSCAN_predicted_CDS_9|159_bp ncgtgccgcaaagcaggcgtcaactccattgttaatgcacggaggaggaacctgactgtt agacctgggttttccagggttgcacggcttctgggagacggatgtgaccctgaggacagg gcacaggccagtgtaatgccaggatggaatgagctgtga