GENSCAN 1.0	Date run: 8-Nov-116	Time: 01:34:46

Sequence gi568815587f:117994816_118217455 : 222640 bp : 48.06% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    774    895  122  1  2   64  105   230 0.773  22.51
 1.02 Term +   3900   4826  927  2  0  132   41   624 0.591  54.26
 1.03 PlyA +   6645   6650    6                               1.05

 2.05 PlyA -   8838   8833    6                               1.05
 2.04 Term -  19065  18966  100  2  1   76   44   176 0.958   9.60
 2.03 Intr -  19926  19893   34  1  1  105  117    -2 0.965   1.68
 2.02 Intr -  20994  20878  117  1  0   68   96   173 0.969  16.54
 2.01 Init -  22933  22924   10  1  1   85   74    16 0.711  -0.37
 2.00 Prom -  24276  24237   40                              -3.86

 3.03 PlyA -  25651  25646    6                               1.05
 3.02 Term -  42522  42286  237  0  0   90   54   201 0.940  13.07
 3.01 Init -  56933  56889   45  2  0   79   91    33 0.113   1.43
 3.00 Prom -  59908  59869   40                              -3.76

 4.00 Prom +  74701  74740   40                              -2.66
 4.01 Init +  80181  80242   62  2  2   93  103    45 0.249   5.74
 4.02 Intr +  90698  90725   28  2  1   71   87    27 0.023  -0.98
 4.03 Intr + 100007 100040   34  1  1  142  123    16 0.967   8.30
 4.04 Intr + 104170 104283  114  2  0  113   89   115 0.997  14.52
 4.05 Intr + 104888 104943   56  0  2  107   42    96 0.887   5.60
 4.06 Intr + 105366 105435   70  2  1   97   17    47 0.324  -2.75
 4.07 Intr + 108286 108438  153  2  0  100  109   148 0.517  18.14
 4.08 Intr + 109876 110005  130  2  1   82   91   102 0.996   9.65
 4.09 Intr + 112926 113060  135  0  0   78  106   135 0.729  13.98
 4.10 Intr + 114041 114081   41  2  2   79   97    -8 0.799  -2.93
 4.11 Intr + 116926 117085  160  2  1   89  110   146 0.965  15.95
 4.12 Intr + 118454 118620  167  2  2  104   94   263 0.999  28.10
 4.13 Intr + 120014 120112   99  0  0   85  107    36 0.971   5.28
 4.14 Intr + 120323 120465  143  0  2   49   75   232 0.999  18.07
 4.15 Intr + 122490 122639  150  2  0  111  116   102 0.987  15.66
 4.16 Intr + 130480 130562   83  0  2  102   61    51 0.223   2.24
 4.17 Term + 134639 134693   55  2  1  113   43    18 0.113  -3.17
 4.18 PlyA + 138505 138510    6                               1.05

 5.14 PlyA - 138582 138577    6                               1.05
 5.13 Term - 142305 142212   94  2  1  144   52   107 0.645  10.00
 5.12 Intr - 146521 146392  130  2  1   39   81   317 0.996  25.85
 5.11 Intr - 149246 149018  229  2  1   94   26   331 0.987  24.94
 5.10 Intr - 150414 150242  173  1  2   42   87   333 0.977  28.26
 5.09 Intr - 151103 150866  238  2  1   87   31   120 0.777   3.39
 5.08 Intr - 160066 160029   38  2  2  104   82    36 0.262   2.58
 5.07 Intr - 161598 161493  106  0  1  101   47    26 0.197  -0.41
 5.06 Intr - 171391 171355   37  0  1   91   91    41 0.234   2.96
 5.05 Intr - 172271 172076  196  0  1  105   11   346 0.051  27.07
 5.04 Intr - 173480 173270  211  2  1   70   67   378 0.978  32.39
 5.03 Intr - 173936 173770  167  0  2   90  101   247 0.830  25.88
 5.02 Intr - 174773 174733   41  1  2   97   97    -6 0.229  -0.93
 5.01 Init - 179683 179595   89  1  2   89   80    22 0.106   1.64
 5.00 Prom - 182987 182948   40                              -5.36

 6.12 PlyA - 183364 183359    6                               1.05
 6.11 Term - 185845 185725  121  0  1   39   47   171 0.923   6.05
 6.10 Intr - 197273 197173  101  2  2   55   73    58 0.361  -0.09
 6.09 Intr - 198788 198579  210  2  0   44   59    86 0.306   0.41
 6.08 Intr - 202006 201920   87  1  0   86   81   133 0.951  12.57
 6.07 Intr - 203276 203183   94  1  1   95   51    68 0.937   3.77
 6.06 Intr - 205797 205659  139  1  1   65   77    94 0.887   5.52
 6.05 Intr - 208850 208613  238  2  1   99   -6   261 0.779  14.99
 6.04 Intr - 211176 211067  110  1  2   95   87   -11 0.508  -0.50
 6.03 Intr - 215897 215672  226  2  1  103  107   230 0.869  23.96
 6.02 Intr - 217746 217592  155  1  2   73   92    21 0.796   0.79
 6.01 Init - 220051 220009   43  1  1   56  106    70 0.500   4.32

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:117994816_118217455|GENSCAN_predicted_peptide_1|349_aa XFTVTNVIIFFAFVLLLSGALAYCLALQLYVRRRKKLPSVLLFKKPSPFIFISQRPSPET QDTIHPLDEEAFLKVSPELKNLDLHGSTDSGFGSTKPSLQTEEPQFLLPDPHPQADRTLG NREPPVLGDSCSSGSSNSTDSGICLQEPSLSPSTGPTWEQQVGSNSRGQDDSGIDLVQNS EGRAGDTQGGSALGHHSPPEPEVPGEEDPAAVAFQGYLRQTRCAEEKATKTGCLEEESPL TDGLGPKFGRCLVDEAGLHPPALAKGYLKQDPLEMTLASSGAPTGQWNQPTEEWSLLALS SCSDLGISDWSFAHDLAPLGCVAAPGGLLGSFNSDLVTLPLISSLQSSE >gi568815587f:117994816_118217455|GENSCAN_predicted_CDS_1|1050_bp natttcaccgtgaccaacgtcatcatcttctttgcctttgtcctgctgctctccggagcc ctcgcctactgcctggccctccagctgtatgtgcggcgccgaaagaagctacccagtgtc ctgctcttcaagaagcccagccccttcatcttcatcagccagcgtccctccccagagacc caagacaccatccacccgcttgatgaggaggcctttttgaaggtgtccccagagctgaag aacttggacctgcacggcagcacagacagtggctttggcagcaccaagccatccctgcag actgaagagccccagttcctcctccctgaccctcacccccaggctgacagaacgctggga aacagggagccccctgtgctgggggacagctgcagtagtggcagcagcaatagcacagac agcgggatctgcctgcaggagcccagcctgagccccagcacagggcccacctgggagcaa caggtggggagcaacagcaggggccaggatgacagtggcattgacttagttcaaaactct gagggccgggctggggacacacagggtggctcggccttgggccaccacagtcccccggag cctgaggtgcctggggaagaagacccagctgctgtggcattccagggttacctgaggcag accagatgtgctgaagagaaggcaaccaagacaggctgcctggaggaagaatcgcccttg acagatggccttggccccaaattcgggagatgcctggttgatgaggcaggcttgcatcca ccagccctggccaagggctatttgaaacaggatcctctagaaatgactctggcttcctca ggggccccaacgggacagtggaaccagcccactgaggaatggtcactcctggccttgagc agctgcagtgacctgggaatatctgactggagctttgcccatgaccttgcccctctaggc tgtgtggcagccccaggtggtctcctgggcagctttaactcagacctggtcaccctgccc ctcatctctagcctgcagtcaagtgagtga >gi568815587f:117994816_118217455|GENSCAN_predicted_peptide_2|86_aa MLNCEDSISTLGLILGVGLLLLLVSILGYSLAKWYQRGYCWEGPNFVFNLYQIRNLKDLE MGPPFTISGHISSTDGGYMKFSNGLV >gi568815587f:117994816_118217455|GENSCAN_predicted_CDS_2|261_bp atgctgaactgtgaggactccatcagcaccttgggcctgatccttggcgtggggctcttg 
ctgctgctcgtgtccatcctcggctacagcctggccaagtggtaccagcgcgggtactgc tgggaggggcctaattttgtcttcaacttatatcaaatccggaacctgaaggatctggag atgggtccacccttcaccatcagtggtcacatcagcagcacagatggtggctacatgaag ttctccaacgggctagtctga >gi568815587f:117994816_118217455|GENSCAN_predicted_peptide_3|93_aa MAGFVILPFLWCVLMNQSCKATNGSSNGAPDAVHGPLDRPASLCSDVNDIEGILLRKSQL HNRYYTPVQQEAVRVVAGQPPQQHLGFPVERGD >gi568815587f:117994816_118217455|GENSCAN_predicted_CDS_3|282_bp atggctggctttgttattctccccttcctgtggtgtgttttgatgaatcaaagctgtaaa gctacaaatggttcttcaaatggagccccagatgcagtccatggacccctggaccggcct gctagcctatgctctgatgttaatgacatcgaaggcatcctcctgagaaaatctcaactg cacaaccgctactacaccccagttcagcaggaagcagttagagtggtcgctggccaacct ccccaacagcacttgggttttcctgttgagaggggggactga >gi568815587f:117994816_118217455|GENSCAN_predicted_peptide_4|559_aa MPSSLHAQTAASFLVHHLSTSLKDNSGQAKDPDSDQPLNSLDVKPLRKPRIPMETFRKVG IPIIIALLSLASIIIVVVLNCKLSGSEQRLEEQGEFLLSDIMSTSEITHVKDQAKNQVLQ KIKVILDKYYFLCGQPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRS TLQVLDSATGNWFSACFDNFTEALAETACRQMGYSSSQLSLPLDVSSKPTFRAVEIGPDQ DLDVVEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSLKTPRVVGVEEASVDSWPWQV SIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKLGSFPSLAVAKIIIIE FNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWGFTKQNGGKMS DILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQSDQWHV VGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKDRTIQRSCNSPGTGLVIQQPAVPLME LRNDNSLLPIVQGLKTIVS >gi568815587f:117994816_118217455|GENSCAN_predicted_CDS_4|1680_bp atgcccagcagccttcatgcccagacagctgcctctttcctggttcatcacctgagcacc agtttaaaggacaactcaggacaagcaaaagatcctgacagtgatcaacctctgaacagc ctcgatgtcaaacccctgcgcaaaccccgtatccccatggagaccttcagaaaggtgggg atccccatcatcatagcactactgagcctggcgagtatcatcattgtggttgtcctcaat tgcaagctgagtggctctgagcagcggctggaagaacagggcgagtttctgcttagcgat atcatgagcaccagtgagataacacatgtaaaggaccaagctaaaaaccaagttttgcaa aagatcaaggtgattctggataaatactacttcctctgcgggcagcctctccacttcatc ccgaggaagcagctgtgtgacggagagctggactgtcccttgggggaggacgaggagcac tgtgtcaagagcttccccgaagggcctgcagtggcagtccgcctctccaaggaccgatcc 
acactgcaggtgctggactcggccacagggaactggttctctgcctgtttcgacaacttc acagaagctctcgctgagacagcctgtaggcagatgggctacagcagctcacaactctct ctccctcttgatgtgagcagcaaacccactttcagagctgtggagattggcccagaccag gatctggatgttgttgaaatcacagaaaacagccaggagcttcgcatgcggaactcaagt gggccctgtctctcaggctccctggtctccctgcactgtcttgcctgtgggaagagcctg aagaccccccgtgtggtgggtgtggaggaggcctctgtggattcttggccttggcaggtc agcatccagtacgacaaacagcacgtctgtggagggagcatcctggacccccactgggtc ctcacggcagcccactgcttcaggaaacataccgatgtgttcaactggaaggtgcgggca ggctcagacaaactgggcagcttcccatccctggctgtggccaagatcatcatcattgaa ttcaaccccatgtaccccaaagacaatgacatcgccctcatgaagctgcagttcccactc actttctcaggcacagtcaggcccatctgtctgcccttctttgatgaggagctcactcca gccaccccactctggatcattggatggggctttacgaagcagaatggagggaagatgtct gacatactgctgcaggcgtcagtccaggtcattgacagcacacggtgcaatgcagacgat gcgtaccagggggaagtcaccgagaagatgatgtgtgcaggcatcccggaagggggtgtg gacacctgccagggtgacagtggtgggcccctgatgtaccaatctgaccagtggcatgtg gtgggcatcgttagttggggctatggctgcgggggcccgagcaccccaggagtatacacc aaggtctcagcctatctcaactggatctacaatgtctggaaggatagaactattcagaga agctgtaactccccagggacaggtcttgtgattcagcaaccagctgtaccgctgatggag ctacggaatgacaattctctgctgcctattgtccagggtctgaaaaccattgtatcatag >gi568815587f:117994816_118217455|GENSCAN_predicted_peptide_5|582_aa MEDATSDEGGDSVGAGLVGMTMEVAKRKGSLSALFLVCCKPEEVPPGRSMEVTVPATLNV LNGSDARLPCTFNSCYTVNHKQFSLNWTYQECNNCSEEMFLQFRMKIINLKLERFQDRVE FSGNPSKYDVSVMLRNVQPEDEGIYNCYIMNPPDRHRGHGKIHLQVLMEEPPERDSTVAV IVGASVGGFLAVVILVLMVVKCVRRKKEQKLSTDDLKTEEEGKTDGEGNPDDGAKTVDEA RLEEQLQGLDAFLRTDRGGSIPCTHSDHLYKVILLQGVNAFQSRSTFDVASPGAKAVFLR FPGSRLAARGRRTRLAQRGPARTCPSASLPLGTREEPPGRGNERLLDGGPGLRRPRFSSS PYGTRLFPLLDRPARLFLLPVTLSLEVSVGKATDIYAVNGTEILLPCTFSSCFGFEDLHF RWTYNSSDAFKILIEGTVKNEKSDPKVTLKDDDRITLVGSTKEKMNNISIVLRDLEFSDT GKYTCHVKNPKENNLQHHATIFLQVVDRLEEVDNTVTLIILAVVGGVIGLLILILLIKKL IIFILKKTREKKKECLVSSSGNDNTENGLPGSKAEEKPPSKV >gi568815587f:117994816_118217455|GENSCAN_predicted_CDS_5|1749_bp atggaggatgccacaagtgacgagggaggggactctgtcggggctgggcttgtggggatg 
actatggaggtggcaaagagaaagggcagcttgtctgccctctttcttgtctgttgcaag cctgaggaagtgccaccaggacggagcatggaggtcacagtacctgccaccctcaacgtc ctcaatggctctgacgcccgcctgccctgcaccttcaactcctgctacacagtgaaccac aaacagttctccctgaactggacttaccaggagtgcaacaactgctctgaggagatgttc ctccagttccgcatgaagatcattaacctgaagctggagcggtttcaagaccgcgtggag ttctcagggaaccccagcaagtacgatgtgtcggtgatgctgagaaacgtgcagccggag gatgaggggatttacaactgctacatcatgaacccccctgaccgccaccgtggccatggc aagatccatctgcaggtcctcatggaagagccccctgagcgggactccacggtggccgtg attgtgggtgcctccgtcgggggcttcctggctgtggtcatcttggtgctgatggtggtc aagtgtgtgaggagaaaaaaagagcagaagctgagcacagatgacctgaagaccgaggag gagggcaagacggacggtgaaggcaacccggatgatggcgccaagactgtggatgaggcc agactggaagagcagctccagggtctcgatgccttcctgagaactgacaggggaggaagc attccatgtacccactctgatcatctgtacaaggtgatcctgctccagggggtgaatgcc ttccagtctaggtccacttttgatgttgcaagtcctggagcaaaggccgtcttcctgcgg ttcccagggtcccgtttggcggccagagggcgtcggactcggctggcccagcgaggtcca gcccgaacgtgtccttctgcctctctgcccctggggaccagggaggagcctccagggcgg gggaacgagaggctactggacggcggcccgggactgcggcggccgcgtttctcttcttca ccttacgggacccggctcttccccctcctcgaccgccccgcccgcctcttcctgctcccc gtaaccctgtcgctggaggtgtctgtgggaaaggccaccgacatctacgctgtcaatggc acggagatcctgctgccctgcaccttctccagctgctttggcttcgaggacctccacttc cggtggacctacaacagcagtgacgcattcaagattctcatagaggggactgtgaagaat gagaagtctgaccccaaggtgacgttgaaagacgatgaccgcatcactctggtaggctct actaaggagaagatgaacaacatttccattgtgctgagggacctggagttcagcgacacg ggcaaatacacctgccatgtgaagaaccccaaggagaataatctccagcaccacgccacc atcttcctccaagtcgttgatagactggaagaagtggacaacacagtgacactcatcatc ctggctgtcgtgggcggggtcatcgggctcctcatcctcatcctgctgatcaagaaactc atcatcttcatcctgaagaagactcgggagaagaagaaggagtgtctcgtgagctcctcg gggaatgacaacacggagaacggcttgcctggctccaaggcagaggagaaaccaccttca aaagtgtga >gi568815587f:117994816_118217455|GENSCAN_predicted_peptide_6|507_aa MFCPLKLILLPVLLDYSLGLNDLNVSPPELTVHVGDSALMGCVFQSTEDKCIFKIDWTLS PGEHAKDEYVLYYYSNLSVPIGRFQNRVHLMGDILCNDGSLLLQDVQEADQGTYICEIRL KGESQVFKKAVVLHVLPEEPKELMVHVGGLIQMGCVFQSTEVKHVTKVEWIFSGRRAKEE 
IVFRYYHKLRMSVEYSQSWGHFQNRVNLVGDIFRNDGSIMLQGVRESDGGNYTCSIHLGN LVFKKTIVLHVSPEEPRTLVTPAALRPLVLGGNQLVIIVGIVCATILLLPVLILIVKKTC GNKSSVNSTVLVKNTKKTNPEIKEKPCHFERCEGEKHIYSPIIVREVIEEEEPSEKSEAT YMTMWTATDLILGLKCAVPILLVAIVRFDKVSLKNPLAGGIQIGNEEVKLSLFADDIIIN LENPKDSSKKLLELVTLMQEVGSHGLRQLCPVALQGTASFLAVFMGWQKKEEAGYVKPKI SGNGAELTCVCGSMDGCAAEWDLAVHA >gi568815587f:117994816_118217455|GENSCAN_predicted_CDS_6|1524_bp atgttttgcccactgaaactcatcctgctgccagtgttactggattattccttgggcctg aatgacttgaatgtttccccgcctgagctaacagtccatgtgggtgattcagctctgatg ggatgtgttttccagagcacagaagacaaatgtatattcaagatagactggactctgtca ccaggagagcacgccaaggacgaatatgtgctatactattactccaatctcagtgtgcct attgggcgcttccagaaccgcgtacacttgatgggggacatcttatgcaatgatggctct ctcctgctccaagatgtgcaagaggctgaccagggaacctatatctgtgaaatccgcctc aaaggggagagccaggtgttcaagaaggcggtggtactgcatgtgcttccagaggagccc aaagagctcatggtccatgtgggtggattgattcagatgggatgtgttttccagagcaca gaagtgaaacacgtgaccaaggtagaatggatattttcaggacggcgcgcaaaggaggag attgtatttcgttactaccacaaactcaggatgtctgtggagtactcccagagctggggc cacttccagaatcgtgtgaacctggtgggggacattttccgcaatgacggttccatcatg cttcaaggagtgagggagtcagatggaggaaactacacctgcagtatccacctagggaac ctggtgttcaagaaaaccattgtgctgcatgtcagcccggaagagcctcgaacactggtg accccggcagccctgaggcctctggtcttgggtggtaatcagttggtgatcattgtggga attgtctgtgccacaatcctgctgctccctgttctgatattgatcgtgaagaagacctgt ggaaataagagttcagtgaattctacagtcttggtgaagaacacgaagaagactaatcca gagataaaagaaaaaccctgccattttgaaagatgtgaaggggagaaacacatttactcc ccaataattgtacgggaggtgatcgaggaagaagaaccaagtgaaaaatcagaggccacc tacatgaccatgtggacagccactgatttaattcttggtctgaaatgtgcggttcctata cttttagttgctattgtaagatttgataaggttagtcttaaaaatcctttggcggggggc atccaaattggtaatgaggaagtcaaactgtctctgtttgctgatgacataatcataaac cttgaaaaccctaaagactcatccaaaaagctcctagaactggtcacactgatgcaagaa gtgggttcccatggtcttcggcagctctgccctgtggctttgcagggtacagcctccttc ctggctgttttcatgggctggcaaaaaaaagaggaggccggctacgtcaaaccaaaaatc tctggcaacggggcagagctgacatgtgtctgcggcagcatggatggctgtgctgcagag tgggacttggcagtgcacgcctaa