GENSCAN 1.0    Date run: 7-Nov-116    Time: 22:34:46

Sequence gi568815587r:118066890_118276431 : 209542 bp : 46.65% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.00 Prom +   2627   2666   40                              -2.66
 1.01 Init +   8107   8168   62  0  2   93  103    45 0.281   5.74
 1.02 Intr +  18624  18651   28  0  1   71   87    27 0.026  -0.98
 1.03 Intr +  27933  27966   34  2  1  142  123    16 0.967   8.30
 1.04 Intr +  32096  32209  114  0  0  113   89   115 0.997  14.52
 1.05 Intr +  32814  32869   56  1  2  107   42    96 0.887   5.60
 1.06 Intr +  33292  33361   70  0  1   97   17    47 0.324  -2.75
 1.07 Intr +  36212  36364  153  0  0  100  109   148 0.517  18.14
 1.08 Intr +  37802  37931  130  0  1   82   91   102 0.996   9.65
 1.09 Intr +  40852  40986  135  1  0   78  106   135 0.729  13.98
 1.10 Intr +  41967  42007   41  0  2   79   97    -8 0.799  -2.93
 1.11 Intr +  44852  45011  160  0  1   89  110   146 0.965  15.95
 1.12 Intr +  46380  46546  167  0  2  104   94   263 0.999  28.10
 1.13 Intr +  47940  48038   99  1  0   85  107    36 0.971   5.28
 1.14 Intr +  48249  48391  143  1  2   49   75   232 0.999  18.07
 1.15 Intr +  50416  50565  150  0  0  111  116   102 0.987  15.66
 1.16 Intr +  58406  58488   83  1  2  102   61    51 0.223   2.24
 1.17 Term +  62565  62619   55  0  1  113   43    18 0.113  -3.17
 1.18 PlyA +  66431  66436    6                               1.05

 2.14 PlyA -  66508  66503    6                               1.05
 2.13 Term -  70231  70138   94  0  1  144   52   107 0.645  10.00
 2.12 Intr -  74447  74318  130  0  1   39   81   317 0.996  25.85
 2.11 Intr -  77172  76944  229  0  1   94   26   331 0.987  24.94
 2.10 Intr -  78340  78168  173  2  2   42   87   333 0.977  28.26
 2.09 Intr -  79029  78792  238  0  1   87   31   120 0.777   3.39
 2.08 Intr -  87992  87955   38  0  2  104   82    36 0.262   2.58
 2.07 Intr -  89524  89419  106  1  1  101   47    26 0.197  -0.41
 2.06 Intr -  99317  99281   37  1  1   91   91    41 0.234   2.96
 2.05 Intr - 100197 100002  196  1  1  105   11   346 0.051  27.07
 2.04 Intr - 101406 101196  211  0  1   70   67   378 0.978  32.39
 2.03 Intr - 101862 101696  167  1  2   90  101   247 0.830  25.88
 2.02 Intr - 102699 102659   41  2  2   97   97    -6 0.229  -0.93
 2.01 Init - 107609 107521   89  2  2   89   80    22 0.106   1.64
 2.00 Prom - 110913 110874   40                              -5.36

 3.18 PlyA - 111290 111285    6                               1.05
 3.17 Term - 113771 113651  121  1  1   39   47   171 0.923   6.05
 3.16 Intr - 125199 125099  101  0  2   55   73    58 0.361  -0.09
 3.15 Intr - 126714 126505  210  0  0   44   59    86 0.306   0.41
 3.14 Intr - 129932 129846   87  2  0   86   81   133 0.951  12.57
 3.13 Intr - 131202 131109   94  2  1   95   51    68 0.937   3.77
 3.12 Intr - 133723 133585  139  2  1   65   77    94 0.887   5.52
 3.11 Intr - 136776 136539  238  0  1   99   -6   261 0.779  14.99
 3.10 Intr - 139102 138993  110  2  2   95   87   -11 0.508  -0.50
 3.09 Intr - 143823 143598  226  0  1  103  107   230 0.807  23.96
 3.08 Intr - 145672 145518  155  2  2   73   92    21 0.738   0.79
 3.07 Intr - 147997 147935   63  2  0   85  106    34 0.243   3.59
 3.06 Intr - 150130 150025  106  1  1   66   50    31 0.033  -3.11
 3.05 Intr - 166634 166571   64  1  1   81  119    20 0.727   3.12
 3.04 Intr - 168700 168535  166  2  1  135   80    75 0.935  10.42
 3.03 Intr - 170371 170161  211  1  1   81   86   122 0.972   9.79
 3.02 Intr - 173488 173322  167  2  2   45   89   168 0.998  12.28
 3.01 Init - 185405 185333   73  2  1   85   96   102 0.875   9.95
 3.00 Prom - 189980 189941   40                              -5.26

 4.05 PlyA - 190044 190039    6                               1.05
 4.04 Term - 190424 190361   64  1  1  112   42    54 0.817   0.56
 4.03 Intr - 193312 193165  148  2  1   87   69   152 0.845  12.49
 4.02 Intr - 195759 195549  211  0  1   91   82   267 0.999  24.89
 4.01 Intr - 196208 196042  167  0  2  128   78    94 0.984  12.08

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815587r:118066890_118276431|GENSCAN_predicted_peptide_1|559_aa
MPSSLHAQTAASFLVHHLSTSLKDNSGQAKDPDSDQPLNSLDVKPLRKPRIPMETFRKVG
IPIIIALLSLASIIIVVVLNCKLSGSEQRLEEQGEFLLSDIMSTSEITHVKDQAKNQVLQ
KIKVILDKYYFLCGQPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRS
TLQVLDSATGNWFSACFDNFTEALAETACRQMGYSSSQLSLPLDVSSKPTFRAVEIGPDQ
DLDVVEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSLKTPRVVGVEEASVDSWPWQV
SIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKLGSFPSLAVAKIIIIE
FNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWGFTKQNGGKMS
DILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQSDQWHV
VGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKDRTIQRSCNSPGTGLVIQQPAVPLME
LRNDNSLLPIVQGLKTIVS

>gi568815587r:118066890_118276431|GENSCAN_predicted_CDS_1|1680_bp
atgcccagcagccttcatgcccagacagctgcctctttcctggttcatcacctgagcacc
agtttaaaggacaactcaggacaagcaaaagatcctgacagtgatcaacctctgaacagc
ctcgatgtcaaacccctgcgcaaaccccgtatccccatggagaccttcagaaaggtgggg
atccccatcatcatagcactactgagcctggcgagtatcatcattgtggttgtcctcaat
tgcaagctgagtggctctgagcagcggctggaagaacagggcgagtttctgcttagcgat
atcatgagcaccagtgagataacacatgtaaaggaccaagctaaaaaccaagttttgcaa
aagatcaaggtgattctggataaatactacttcctctgcgggcagcctctccacttcatc
ccgaggaagcagctgtgtgacggagagctggactgtcccttgggggaggacgaggagcac
tgtgtcaagagcttccccgaagggcctgcagtggcagtccgcctctccaaggaccgatcc
acactgcaggtgctggactcggccacagggaactggttctctgcctgtttcgacaacttc
acagaagctctcgctgagacagcctgtaggcagatgggctacagcagctcacaactctct
ctccctcttgatgtgagcagcaaacccactttcagagctgtggagattggcccagaccag
gatctggatgttgttgaaatcacagaaaacagccaggagcttcgcatgcggaactcaagt
gggccctgtctctcaggctccctggtctccctgcactgtcttgcctgtgggaagagcctg
aagaccccccgtgtggtgggtgtggaggaggcctctgtggattcttggccttggcaggtc
agcatccagtacgacaaacagcacgtctgtggagggagcatcctggacccccactgggtc
ctcacggcagcccactgcttcaggaaacataccgatgtgttcaactggaaggtgcgggca
ggctcagacaaactgggcagcttcccatccctggctgtggccaagatcatcatcattgaa
ttcaaccccatgtaccccaaagacaatgacatcgccctcatgaagctgcagttcccactc
actttctcaggcacagtcaggcccatctgtctgcccttctttgatgaggagctcactcca
gccaccccactctggatcattggatggggctttacgaagcagaatggagggaagatgtct
gacatactgctgcaggcgtcagtccaggtcattgacagcacacggtgcaatgcagacgat
gcgtaccagggggaagtcaccgagaagatgatgtgtgcaggcatcccggaagggggtgtg
gacacctgccagggtgacagtggtgggcccctgatgtaccaatctgaccagtggcatgtg
gtgggcatcgttagttggggctatggctgcgggggcccgagcaccccaggagtatacacc
aaggtctcagcctatctcaactggatctacaatgtctggaaggatagaactattcagaga
agctgtaactccccagggacaggtcttgtgattcagcaaccagctgtaccgctgatggag
ctacggaatgacaattctctgctgcctattgtccagggtctgaaaaccattgtatcatag

>gi568815587r:118066890_118276431|GENSCAN_predicted_peptide_2|582_aa
MEDATSDEGGDSVGAGLVGMTMEVAKRKGSLSALFLVCCKPEEVPPGRSMEVTVPATLNV
LNGSDARLPCTFNSCYTVNHKQFSLNWTYQECNNCSEEMFLQFRMKIINLKLERFQDRVE
FSGNPSKYDVSVMLRNVQPEDEGIYNCYIMNPPDRHRGHGKIHLQVLMEEPPERDSTVAV
IVGASVGGFLAVVILVLMVVKCVRRKKEQKLSTDDLKTEEEGKTDGEGNPDDGAKTVDEA
RLEEQLQGLDAFLRTDRGGSIPCTHSDHLYKVILLQGVNAFQSRSTFDVASPGAKAVFLR
FPGSRLAARGRRTRLAQRGPARTCPSASLPLGTREEPPGRGNERLLDGGPGLRRPRFSSS
PYGTRLFPLLDRPARLFLLPVTLSLEVSVGKATDIYAVNGTEILLPCTFSSCFGFEDLHF
RWTYNSSDAFKILIEGTVKNEKSDPKVTLKDDDRITLVGSTKEKMNNISIVLRDLEFSDT
GKYTCHVKNPKENNLQHHATIFLQVVDRLEEVDNTVTLIILAVVGGVIGLLILILLIKKL
IIFILKKTREKKKECLVSSSGNDNTENGLPGSKAEEKPPSKV

>gi568815587r:118066890_118276431|GENSCAN_predicted_CDS_2|1749_bp
atggaggatgccacaagtgacgagggaggggactctgtcggggctgggcttgtggggatg
actatggaggtggcaaagagaaagggcagcttgtctgccctctttcttgtctgttgcaag
cctgaggaagtgccaccaggacggagcatggaggtcacagtacctgccaccctcaacgtc
ctcaatggctctgacgcccgcctgccctgcaccttcaactcctgctacacagtgaaccac
aaacagttctccctgaactggacttaccaggagtgcaacaactgctctgaggagatgttc
ctccagttccgcatgaagatcattaacctgaagctggagcggtttcaagaccgcgtggag
ttctcagggaaccccagcaagtacgatgtgtcggtgatgctgagaaacgtgcagccggag
gatgaggggatttacaactgctacatcatgaacccccctgaccgccaccgtggccatggc
aagatccatctgcaggtcctcatggaagagccccctgagcgggactccacggtggccgtg
attgtgggtgcctccgtcgggggcttcctggctgtggtcatcttggtgctgatggtggtc
aagtgtgtgaggagaaaaaaagagcagaagctgagcacagatgacctgaagaccgaggag
gagggcaagacggacggtgaaggcaacccggatgatggcgccaagactgtggatgaggcc
agactggaagagcagctccagggtctcgatgccttcctgagaactgacaggggaggaagc
attccatgtacccactctgatcatctgtacaaggtgatcctgctccagggggtgaatgcc
ttccagtctaggtccacttttgatgttgcaagtcctggagcaaaggccgtcttcctgcgg
ttcccagggtcccgtttggcggccagagggcgtcggactcggctggcccagcgaggtcca
gcccgaacgtgtccttctgcctctctgcccctggggaccagggaggagcctccagggcgg
gggaacgagaggctactggacggcggcccgggactgcggcggccgcgtttctcttcttca
ccttacgggacccggctcttccccctcctcgaccgccccgcccgcctcttcctgctcccc
gtaaccctgtcgctggaggtgtctgtgggaaaggccaccgacatctacgctgtcaatggc
acggagatcctgctgccctgcaccttctccagctgctttggcttcgaggacctccacttc
cggtggacctacaacagcagtgacgcattcaagattctcatagaggggactgtgaagaat
gagaagtctgaccccaaggtgacgttgaaagacgatgaccgcatcactctggtaggctct
actaaggagaagatgaacaacatttccattgtgctgagggacctggagttcagcgacacg
ggcaaatacacctgccatgtgaagaaccccaaggagaataatctccagcaccacgccacc
atcttcctccaagtcgttgatagactggaagaagtggacaacacagtgacactcatcatc
ctggctgtcgtgggcggggtcatcgggctcctcatcctcatcctgctgatcaagaaactc
atcatcttcatcctgaagaagactcgggagaagaagaaggagtgtctcgtgagctcctcg
gggaatgacaacacggagaacggcttgcctggctccaaggcagaggagaaaccaccttca
aaagtgtga

>gi568815587r:118066890_118276431|GENSCAN_predicted_peptide_3|776_aa
MQQRGAAGSRGCALFPLLGVLFFQGVYIVFSLEIRADAHVRGYVGEKIKLKCTFKSTSDV
TDKLTIDWTYRPPSSSHTVSIFHYQSFQYPTTAGTFRDRISWVGNVYKGDASISISNPTI
KDNGTFSCAVKNPPDVHHNIPMTELTVTERGFGTMLSSVALLSILVFVPSAVVVALLLVR
MGRKAAGLKKRSRSGYKKSSIEVSDDTDQEEEEACMARLCVRCAECLRREHNQLPGEGNR
CPRCDELMLGHTIKEAPYDLHPYLKVESSMFCPLKLILLPVLLDYSLGLNDLNVSPPELT
VHVGDSALMGCVFQSTEDKCIFKIDWTLSPGEHAKDEYVLYYYSNLSVPIGRFQNRVHLM
GDILCNDGSLLLQDVQEADQGTYICEIRLKGESQVFKKAVVLHVLPEEPKELMVHVGGLI
QMGCVFQSTEVKHVTKVEWIFSGRRAKEEIVFRYYHKLRMSVEYSQSWGHFQNRVNLVGD
IFRNDGSIMLQGVRESDGGNYTCSIHLGNLVFKKTIVLHVSPEEPRTLVTPAALRPLVLG
GNQLVIIVGIVCATILLLPVLILIVKKTCGNKSSVNSTVLVKNTKKTNPEIKEKPCHFER
CEGEKHIYSPIIVREVIEEEEPSEKSEATYMTMWTATDLILGLKCAVPILLVAIVRFDKV
SLKNPLAGGIQIGNEEVKLSLFADDIIINLENPKDSSKKLLELVTLMQEVGSHGLRQLCP
VALQGTASFLAVFMGWQKKEEAGYVKPKISGNGAELTCVCGSMDGCAAEWDLAVHA

>gi568815587r:118066890_118276431|GENSCAN_predicted_CDS_3|2331_bp
atgcagcagagaggagcagctggaagccgtggctgcgctctcttccctctgctgggcgtc
ctgttcttccagggtgtttatatcgtcttttccttggagattcgtgcagatgcccatgtc
cgaggttatgttggagaaaagatcaagttgaaatgcactttcaagtcaacttcagatgtc
actgacaaacttactatagactggacatatcgccctcccagcagcagccacacagtatca
atatttcattatcagtctttccagtacccaaccacagcaggcacatttcgggatcggatt
tcctgggttggaaatgtatacaaaggggatgcatctataagtataagcaaccctaccata
aaggacaatgggacattcagctgtgctgtgaagaatcccccagatgtgcatcataatatt
cccatgacagagctaacagtcacagaaaggggttttggcaccatgctttcctctgtggcc
cttctttccatccttgtctttgtgccctcagccgtggtggttgctctgctgctggtgaga
atggggaggaaggctgctgggctgaagaagaggagcaggtctggctataagaagtcatct
attgaggtttccgatgacactgatcaggaggaggaagaggcgtgtatggcgaggctttgt
gtccgttgcgctgagtgcctgaggagggagcacaaccaactccctggggaaggcaacaga
tgtcccagatgtgatgaacttatgttaggacacaccataaaggaagccccttatgacctt
catccatatttgaaagttgagagcagcatgttttgcccactgaaactcatcctgctgcca
gtgttactggattattccttgggcctgaatgacttgaatgtttccccgcctgagctaaca
gtccatgtgggtgattcagctctgatgggatgtgttttccagagcacagaagacaaatgt
atattcaagatagactggactctgtcaccaggagagcacgccaaggacgaatatgtgcta
tactattactccaatctcagtgtgcctattgggcgcttccagaaccgcgtacacttgatg
ggggacatcttatgcaatgatggctctctcctgctccaagatgtgcaagaggctgaccag
ggaacctatatctgtgaaatccgcctcaaaggggagagccaggtgttcaagaaggcggtg
gtactgcatgtgcttccagaggagcccaaagagctcatggtccatgtgggtggattgatt
cagatgggatgtgttttccagagcacagaagtgaaacacgtgaccaaggtagaatggata
ttttcaggacggcgcgcaaaggaggagattgtatttcgttactaccacaaactcaggatg
tctgtggagtactcccagagctggggccacttccagaatcgtgtgaacctggtgggggac
attttccgcaatgacggttccatcatgcttcaaggagtgagggagtcagatggaggaaac
tacacctgcagtatccacctagggaacctggtgttcaagaaaaccattgtgctgcatgtc
agcccggaagagcctcgaacactggtgaccccggcagccctgaggcctctggtcttgggt
ggtaatcagttggtgatcattgtgggaattgtctgtgccacaatcctgctgctccctgtt
ctgatattgatcgtgaagaagacctgtggaaataagagttcagtgaattctacagtcttg
gtgaagaacacgaagaagactaatccagagataaaagaaaaaccctgccattttgaaaga
tgtgaaggggagaaacacatttactccccaataattgtacgggaggtgatcgaggaagaa
gaaccaagtgaaaaatcagaggccacctacatgaccatgtggacagccactgatttaatt
cttggtctgaaatgtgcggttcctatacttttagttgctattgtaagatttgataaggtt
agtcttaaaaatcctttggcggggggcatccaaattggtaatgaggaagtcaaactgtct
ctgtttgctgatgacataatcataaaccttgaaaaccctaaagactcatccaaaaagctc
ctagaactggtcacactgatgcaagaagtgggttcccatggtcttcggcagctctgccct
gtggctttgcagggtacagcctccttcctggctgttttcatgggctggcaaaaaaaagag
gaggccggctacgtcaaaccaaaaatctctggcaacggggcagagctgacatgtgtctgc
ggcagcatggatggctgtgctgcagagtgggacttggcagtgcacgcctaa

>gi568815587r:118066890_118276431|GENSCAN_predicted_peptide_4|196_aa
XLWPIAAVEIYTSRVLEAVNGTDARLKCTFSSFAPVGDALTVTWNFRPLDGGPEQFVFYY
HIDPFQPMSGRFKDRVSWDGNPERYDASILLWKLQFDDNGTYTCQVKNPPDVDGVIGEIR
LSVVHTVRFSEIHFLALAIGSACALMIIIVIVVVLFQHYRKKRWAERAHKVVEIKSKEEE
RLNQEKKVSVYLEDTD

>gi568815587r:118066890_118276431|GENSCAN_predicted_CDS_4|591_bp
nctctttggcctatagcagctgtggaaatttatacctcccgggtgctggaggctgttaat
gggacagatgctcggttaaaatgcactttctccagctttgcccctgtgggtgatgctcta
acagtgacctggaattttcgtcctctagacgggggacctgagcagtttgtattctactac
cacatagatcccttccaacccatgagtgggcggtttaaggaccgggtgtcttgggatggg
aatcctgagcggtacgatgcctccatccttctctggaaactgcagttcgacgacaatggg
acatacacctgccaggtgaagaacccacctgatgttgatggggtgataggggagatccgg
ctcagcgtcgtgcacactgtacgcttctctgagatccacttcctggctctggccattggc
tctgcctgtgcactgatgatcataatagtaattgtagtggtcctcttccagcattaccgg
aaaaagcgatgggccgaaagagctcataaagtggtggagataaaatcaaaagaagaggaa
aggctcaaccaagagaaaaaggtctctgtttatttagaagacacagactaa