GENSCAN 1.0 Date run: 6-Nov-116 Time: 06:37:31 Sequence gi568815581r:36095289_36296673 : 201385 bp : 45.92% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.02 PlyA - 1628 1623 6 1.05 1.01 Sngl - 3187 2900 288 1 0 86 42 114 0.422 2.19 1.00 Prom - 7496 7457 40 -0.16 2.00 Prom + 8053 8092 40 -5.46 2.01 Init + 8618 8693 76 1 1 68 108 85 0.985 7.77 2.02 Intr + 9937 9996 60 2 0 110 45 46 0.041 1.21 2.03 Intr + 16989 17077 89 1 2 69 39 46 0.036 -2.51 2.04 Intr + 18066 18221 156 2 0 75 77 60 0.142 3.81 2.05 Intr + 22437 22490 54 2 0 108 72 30 0.233 2.58 2.06 Intr + 23704 23779 76 0 1 22 113 55 0.371 0.49 2.07 Intr + 24000 24057 58 1 1 109 101 42 0.399 5.54 2.08 Term + 25893 26007 115 0 1 53 44 78 0.248 -2.06 2.09 PlyA + 27398 27403 6 1.05 3.00 Prom + 28295 28334 40 -1.56 3.01 Init + 34196 34307 112 1 1 68 72 69 0.537 3.67 3.02 Intr + 52292 52407 116 0 2 68 109 20 0.083 2.27 3.03 Intr + 54292 54423 132 0 0 91 72 21 0.021 1.64 3.04 Intr + 61663 61713 51 0 0 62 79 51 0.049 0.70 3.05 Term + 66325 66456 132 0 0 105 43 104 0.247 5.69 3.06 PlyA + 67395 67400 6 1.05 4.14 PlyA - 67671 67666 6 1.05 4.13 Term - 69347 69190 158 0 2 95 39 129 0.728 6.80 4.12 Intr - 71275 71063 213 2 0 101 49 71 0.634 3.19 4.11 Intr - 72408 72256 153 1 0 85 80 82 0.818 7.14 4.10 Intr - 72878 72779 100 2 1 91 89 78 0.700 7.88 4.09 Intr - 74287 74235 53 2 2 100 38 77 0.764 2.53 4.08 Intr - 74723 74603 121 2 1 92 110 146 0.994 17.27 4.07 Intr - 75315 75267 49 2 1 97 117 62 0.988 8.68 4.06 Intr - 76676 76567 110 2 2 92 82 150 0.999 13.88 4.05 Intr - 77113 77006 108 1 0 107 101 162 0.994 19.88 4.04 Intr - 77637 77557 81 0 0 52 92 42 0.599 0.83 4.03 Intr - 78159 78120 40 2 1 80 116 -10 0.935 -0.77 4.02 Intr - 79528 79443 86 1 2 81 79 118 0.921 8.82 4.01 Init - 79762 79691 72 1 0 71 82 50 0.673 3.77 4.00 Prom - 80355 80316 40 -14.47 5.00 Prom + 80581 80620 40 -14.86 5.01 Init + 81026 81077 52 1 1 123 105 85 0.997 15.02 5.02 Intr + 83442 83474 33 1 0 80 82 35 0.546 0.29 5.03 Intr + 84163 84301 139 2 1 70 59 125 0.809 7.32 5.04 Intr + 87166 87290 125 1 2 94 84 22 0.735 2.63 5.05 Intr + 87344 87688 345 0 0 29 59 283 0.436 14.86 5.06 Term + 91867 91970 104 2 2 54 42 75 0.212 -2.06 5.07 PlyA + 93343 93348 6 1.05 6.04 PlyA - 98729 98724 6 1.05 6.03 Term - 100088 99998 91 1 1 128 55 78 0.976 5.69 6.02 Intr - 100624 100510 115 2 1 126 103 111 0.999 15.91 6.01 Init - 101385 101310 76 0 1 82 94 103 0.940 9.66 6.00 Prom - 114718 114679 40 -0.66 7.00 Prom + 115288 115327 40 -7.66 7.01 Init + 115854 115929 76 2 1 68 108 73 0.943 6.59 7.02 Intr + 124475 124651 177 0 0 51 58 85 0.166 1.69 7.03 Intr + 124992 125254 263 1 2 69 77 138 0.359 8.01 7.04 Intr + 146411 146500 90 1 0 110 0 65 0.075 0.09 7.05 Term + 154285 154416 132 0 0 105 43 163 0.842 11.59 7.06 PlyA + 155355 155360 6 1.05 8.14 PlyA - 155631 155626 6 1.05 8.13 Term - 157306 157149 158 2 2 95 39 96 0.006 3.50 8.12 Intr - 159234 159022 213 1 0 101 49 81 0.005 4.19 8.11 Intr - 160349 160197 153 0 0 85 80 82 0.020 7.14 8.10 Intr - 160819 160720 100 1 1 91 89 78 0.022 7.88 8.09 Intr - 162225 162173 53 1 2 100 38 72 0.016 2.03 8.08 Intr - 162661 162541 121 1 1 92 110 146 0.994 17.27 8.07 Intr - 163253 163205 49 1 1 97 117 62 0.987 8.68 8.06 Intr - 164611 164502 110 1 2 102 82 146 0.996 14.48 8.05 Intr - 165048 164941 108 0 0 99 101 160 0.975 18.88 8.04 Intr - 165572 165492 81 2 0 52 92 42 0.602 0.83 8.03 Intr - 166094 166055 40 1 1 80 116 -10 0.939 -0.77 8.02 Intr - 167463 167378 86 0 2 81 79 118 0.923 8.82 8.01 Init - 167697 167626 72 0 0 71 82 55 0.663 4.27 8.00 Prom - 173373 173334 40 -3.26 9.00 Prom + 173662 173701 40 -6.96 9.01 Init + 177451 177453 3 0 0 71 101 0 0.679 -0.40 9.02 Intr + 178911 178995 85 2 1 131 116 54 0.949 11.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Intr + 9240 9354 115 1 1 82 59 48 0.872 0.81 S.002 Term + 9937 10024 88 1 1 110 49 89 0.875 4.33 S.003 Term - 162225 162068 158 1 2 100 48 123 0.977 7.60 S.004 Init + 168961 169012 52 0 1 123 105 85 0.939 15.02 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581r:36095289_36296673|GENSCAN_predicted_peptide_1|95_aa MVSKSLWRKWEQKLDCELKKQIHNYPECRILQDEICRTMSWLEQLISDCHTLTTELASPT LLTMEMDLRRLPQPALACERQRSGTSGNAKFLREG >gi568815581r:36095289_36296673|GENSCAN_predicted_CDS_1|288_bp atggtgtccaagtcactttggaggaaatgggagcagaagctggattgtgaattgaagaaa caaatacacaattatccagagtgcagaattcttcaggatgaaatctgtaggacaatgagc tggttggagcagctcatcagtgactgtcacactctgactacggagctggccagccccacg cttcttaccatggaaatggaccttcgtcgcctgccacaacctgctcttgcttgtgaaagg cagaggtcaggaacttcaggcaatgctaaattcctgagagagggataa >gi568815581r:36095289_36296673|GENSCAN_predicted_peptide_2|227_aa MKLCVTVLSLLMLVAAFCSPALSAPNSKPKEASKSVLIPVNPGSRKRAEIVEKQTQAFIM QVADLQQKGHAQPHQGYINSPVLCHNLIQRDLEHFLLLQDITLVHYNDDIMMTRSSEQEV ANTLDLLAVSMDDDDCWQNRGVVVKNSSPQAQATDWYWSMTCYEPSHTAEDGTIKLQENK LRAPTDSTLCTMYNSKDMEPTQIAINDRRDKENVAYKHPGILRSHKK >gi568815581r:36095289_36296673|GENSCAN_predicted_CDS_2|684_bp atgaagctctgcgtgactgtcctgtctctcctcatgctagtagctgccttctgctctcca gcgctctcagcaccaaattccaaaccaaaagaagcaagcaagtctgtgctgatcccagtg aatcctgggtccaggaaaagagctgaaattgtagaaaaacagacacaagcttttatcatg caagtggctgacctgcaacaaaaggggcatgcacagcctcaccaggggtatatcaactct ccggttttgtgtcataatcttattcagagagaccttgaacactttttgcttctgcaagat atcacactggtccattacaatgatgacattatgatgactagatccagtgaacaagaagta gcaaacacactggacttattggcagtgagcatggatgatgatgattgctggcaaaacagg ggagtggtagtgaagaacagcagtccccaggcccaggccacagattggtactggtccatg acctgttacgaaccgagccacacagcagaagatgggaccatcaagttgcaggaaaacaag ctcagggctcccactgattctacattatgcactatgtacaatagcaaagacatggaacca acccaaattgccatcaacgatagaagggataaagaaaatgtggcatataaacaccctgga atactacgcagccataaaaaatga >gi568815581r:36095289_36296673|GENSCAN_predicted_peptide_3|180_aa MEEVRVSWIEDQTSHNIPLSQNLIKALPVFNSVKAERSICVHQCRKDGDYKEGKCCIRKG CWHIFVPPPQNLIPKQTHAFLLSAKSELEKTNSVPVGFPIFGWLAGLPPQPLSRLLSRSW ASVCISEEKTGTVEGKTAVCEMFHVRGKQHIQIPKLSTSSVTRHLHHFRLMQDSQPLDLS >gi568815581r:36095289_36296673|GENSCAN_predicted_CDS_3|543_bp atggaggaagttcgagtgtcctggatagaagatcaaaccagccacaacattcccttaagc caaaacctaatcaaagccctacctgtcttcaactctgtcaaggctgagagaagcatctgt gtgcaccaatgcagaaaagacggggactacaaggaaggaaaatgttgcatcaggaagggc tgctggcacatctttgttccccctccccaaaacctcatccccaagcagacccatgcgttc ctgctctctgccaagtccgaactggagaaaacaaattctgtcccagtggggtttcccatc tttggctggctggctggacttcctccacaaccactctctcgccttctaagcaggagctgg gcttctgtgtgcatcagtgaagagaagacggggactgtggaagggaaaacagcagtctgt gagatgtttcatgtccgaggcaaacagcacattcagatccccaagctctccacctccagt gtgaccaggcacctgcaccacttcaggctcatgcaggactcacagcctttggacctcagc taa >gi568815581r:36095289_36296673|GENSCAN_predicted_peptide_4|447_aa MDVVEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL TAREAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNIEEM KLKNPGRYQIMKEKGKRSSEHIQRIDRDISGTLRKHMFFRDRYGTKQRELLHILLAYEEY NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAARAPAAIGAHEWADQAQ ISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDE DTVLKHLRASMKKLTRKQGDLPPPAKPEQGSSASRPVPASRGRKTLCKGDRQAPPGPPAR FPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNAIVNARRRNLTVRPGFS RVARLLGDGCDPEDRAQASVMPGWNEL >gi568815581r:36095289_36296673|GENSCAN_predicted_CDS_4|1344_bp atggacgtggtagaggtcgcgggtagttggtgggcacaagagcgagaggacatcattatg aaatacgaaaagggacaccgagctgggctgccagaggacaaggggcctaagccttttcga agctacaacaacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctg actgcgcgggaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggat atgctgggagactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaag ggaatgcccatgaacatccggggcccgatgtggtcagtcctcctgaacattgaggaaatg aagttgaaaaaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgag cacatccagcgcatcgaccgggacataagcgggacattaaggaagcatatgttcttcagg gatcgatacggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtat aacccggaggtgggctactgcagggacctgagccacatcgccgccttgttcctcctctat cttcctgaggaggatgcattctgggcactggtgcagctgctggccagtgagaggcactcc ctgcaggctgcccgggctcctgctgccatcggtgcccacgaatgggccgaccaagcccag atctctctcgggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcg ttgatgccgataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtcc aggtgtggcccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgag gacactgtgctcaagcatcttagggcctctatgaagaaactaacaagaaagcagggggac ctgccacccccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttca cgtggcaggaagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccgg ttcccgcggcccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgt cctggtggggctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagca ggcgtcaacgccattgttaatgcacggaggaggaacctgactgttagacctgggttttcc agggttgcacggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgta atgccaggatggaatgagctgtga >gi568815581r:36095289_36296673|GENSCAN_predicted_peptide_5|265_aa METKEPIVYTGSVERAPGPFGALATSSLDHTMEAGSPVGTTRASCDLQFLLPGVWGAHSG EHGTPHTPHFEDAGRCERVDASTTCLHMCVCTCIVLAHLRVSRESGAQHVLAPAGTLSRR RGWGPYLRSLRIQHRSSPVQPPAKPPEDEPDAEGYEWTIAVSFQLADFAPLHWLRLDGPG FGVLSVPPHRVVAFSLGITPSRGAADPKSSMSSPRPAHRPRPRPLPRPDQRALEQCPLEK AERSHPPCTRAAKNSHVPEYVSAAV >gi568815581r:36095289_36296673|GENSCAN_predicted_CDS_5|798_bp atggagacgaaagagccaatcgtctacacgggcagtgtagaacgggcgcctgggcctttt ggagccctggccacgtcctccctggatcacacgatggaagctggcagccccgtgggcacc actcgagccagctgtgacctgcagtttctgcttcctggagtgtggggcgcccactcagga gagcacggcacaccccacacccctcattttgaggatgctgggaggtgtgagcgagtggat gcttccactacgtgtctgcacatgtgtgtgtgcacttgcattgtccttgcacacctgcgt gtgtcccgtgagagcggagcccagcatgtgctggcacctgcaggaacattgtcacgacga cgaggatggggaccatatcttcggagcctgaggatccaacacaggtccagccctgtgcag ccgcccgccaagccgccagaggacgagccggacgccgaaggctacgagtggacgattgca gttagtttccaactcgccgacttcgcgcccctccactggctccggcttgatggtcccggc ttcggggtgctctcggtccctccccatcgcgtcgtcgctttctcccttggcataaccccc agccgcggggccgcagaccctaagagctccatgagctctccgcgccctgcccaccggccc cggccccgacccctccccagaccggaccagagagctttggaacaatgtcctctagaaaaa gctgaacgtagccacccaccttgtacgagagccgcaaagaatagccatgtgcctgaatat gtgtcagccgctgtgtga >gi568815581r:36095289_36296673|GENSCAN_predicted_peptide_6|93_aa MQVSTAALAVLLCTMALCNQVLSAPLAADTPTACCFSYTSRQIPQNFIADYFETSSQCSK PSVIFLTKRGRQVCADPSEEWVQKYVSDLELSA >gi568815581r:36095289_36296673|GENSCAN_predicted_CDS_6|282_bp atgcaggtctccactgctgcccttgccgtcctcctctgcaccatggctctctgcaaccag gtcctctctgcaccacttgctgctgacacgccgaccgcctgctgcttcagctacacctcc cgacagattccacagaatttcatagctgactactttgagacgagcagccagtgctccaag cccagtgtcatcttcctaaccaagagaggccggcaggtctgtgctgaccccagtgaggag tgggtccagaaatacgtcagtgacctggagctgagtgcctga >gi568815581r:36095289_36296673|GENSCAN_predicted_peptide_7|245_aa MKLCVTVLSLLVLVAAFCSLALSAPIKLGAYGGQVINGVLAQVQLTVGPVGPWIHPVVIS PMTECVIGIDIFNSWQNPYIGSLTGIDLANAFFSIPVHKAHQKQFAFSWQGQQYTFTVLP HGYINSPALRHNLIQRDLDHFLLLKDITLVHYNDIMMIGSSEQKVANTLDLLVETELKLI CGDVLDVLDKHLIPAATTGKSKAVCEMFDVRGKQHIQIPKLYTSSVTRHLHHFRLMQDSQ PLDRS >gi568815581r:36095289_36296673|GENSCAN_predicted_CDS_7|738_bp atgaagctctgcgtgactgtcctgtctctcctcgtgctagtagctgccttctgctctcta gcactctcagcaccaattaaattaggggcttatggaggtcaggtaattaatggagtttta gctcaggtccaacttacagtgggcccagtgggtccctggattcatcctgtggtcatttcc ccaatgacagaatgtgtaattggcatagatatattcaacagctggcagaacccctacatt ggctccctgactggtattgacttagcaaatgcctttttctccattcctgtccataaggcc caccagaagcaatttgccttcagctggcaaggtcagcaatatacctttactgtcctacct cacgggtatatcaactctccagctttgcgtcataatcttattcagagagaccttgatcac tttttgcttctgaaagatatcacactggtccattacaatgacattatgatgattggatcc agtgagcaaaaagtagcaaacacactggacttattggttgagactgagctaaagttaatc tgtggcgacgttctggatgtactggacaaacacctcattccagcagctacaactggcaag tccaaggcagtctgtgagatgtttgatgtccgaggcaaacagcacattcagatccccaag ctctacacctccagtgtgaccaggcacctgcaccacttcaggctcatgcaggactcacag cctttggaccgcagctaa >gi568815581r:36095289_36296673|GENSCAN_predicted_peptide_8|447_aa MDVVEVAGSWWAQEREDIIMKYEKGHRAGLPEDKGPKPFRSYNNNVDHLGIVHETELPPL TAREAKQIRREISRKSKWVDMLGDWEKYKSSRKLIDRAYKGMPMNIRGPMWSVLLNTEEM KMKNPGRYQIMKEKGKRSSEHIQRIDRDVSGTLRKHIFFRDRYGTKQRELLHILLAYEEY NPEVGYCRDLSHIAALFLLYLPEEDAFWALVQLLASERHSLQAAQAPAAIGAHEWADQAQ ISLGLTLRLWDVYLVEGEQALMPITRIAFKVQQKRLTKTSRCGPWARFCNRFVDTWARDE DTVLKHLRASMKKLTRKQGDLPPPAKPEQGSSASRPVPASRGGKTLCKGDRQAPPGPPAR FPRPIWSASPPRAPRSSTPCPGGAVREDTYPVGTQACRKAGVNSIVNARRRNLTVRPGFS RVARLLGDGCDPEDRAQASVMPGWNEL >gi568815581r:36095289_36296673|GENSCAN_predicted_CDS_8|1344_bp atggacgtggtagaggtcgcgggcagttggtgggcacaagagcgagaggacatcattatg aaatacgaaaagggacaccgagctgggctgccagaggacaaggggcctaagccttttcga agctacaacaacaacgtcgatcatttggggattgtacatgagacggagctgcctcctctg actgcgcgggaggcgaagcaaattcggcgggagatcagccgaaagagcaagtgggtggat atgctgggagactgggagaaatacaaaagcagcagaaagctcatagatcgagcgtacaag ggaatgcccatgaacatccggggcccgatgtggtcagtcctcctgaacactgaggaaatg aagatgaaaaaccccggaagataccagatcatgaaggagaagggcaagaggtcatctgag cacatccagcgcatcgaccgggacgtaagcgggacattaaggaagcatatattcttcagg gatcgatacggaaccaagcagcgggaactactccacatcctcctggcatatgaggagtat aacccggaggtgggctactgcagggacctgagccacatcgccgccttgttcctcctctat cttcctgaggaggatgcattctgggcactggtgcagctgctggccagtgagaggcactcc ctgcaggctgcccaggctcctgctgccatcggtgcccacgaatgggccgaccaagcccag atctctctcgggctcaccctgcgcctgtgggacgtgtatctggtagaaggcgaacaggcg ttgatgccgataacaagaatcgcctttaaggttcagcagaagcgcctcacgaagacgtcc aggtgtggcccgtgggcacgtttttgcaaccggttcgttgatacctgggccagggatgag gacactgtgctcaagcatcttagggcctctatgaagaaactaacaagaaagcagggggac ctgccacccccagccaaacccgagcaagggtcgtcggcatccaggcctgtgccggcttca cgtggcgggaagaccctctgcaagggggacaggcaggcccctccaggcccaccagcccgg ttcccgcggcccatttggtcagcttccccgccacgggcacctcgttcttccacaccctgt cctggtggggctgtccgggaagacacctaccctgtgggcactcaggcgtgccgcaaagca ggcgtcaactccattgttaatgcacggaggaggaacctgactgttagacctgggttttcc agggttgcacggcttctgggagacggatgtgaccctgaggacagggcacaggccagtgta atgccaggatggaatgagctgtga >gi568815581r:36095289_36296673|GENSCAN_predicted_peptide_9|30_aa MVRQATNQIVMNCADIDIITASYAPEGDEX >gi568815581r:36095289_36296673|GENSCAN_predicted_CDS_9|90_bp atggtgaggcaggcgactaatcagattgtgatgaattgtgctgatattgatattattaca gcttcatatgcaccagaaggagatgaagnn