GENSCAN 1.0 Date run: 4-Nov-116 Time: 10:40:01 Sequence gi568815594f:4287630_4517432 : 229803 bp : 45.72% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 2305 2627 323 0 2 51 -39 228 0.048 3.23 1.02 Intr + 12298 12468 171 1 0 67 80 87 0.324 4.86 1.03 Intr + 14360 15462 1103 2 2 70 93 451 0.199 33.34 1.04 Intr + 18509 18555 47 0 2 105 75 11 0.177 -0.37 1.05 Intr + 25412 25485 74 1 2 119 78 94 0.397 9.70 1.06 Intr + 26087 26210 124 2 1 77 20 111 0.374 3.79 1.07 Intr + 27708 27780 73 2 1 85 75 35 0.953 0.88 1.08 Intr + 28180 28341 162 2 0 106 101 90 0.996 12.05 1.09 Term + 33011 33687 677 0 2 123 41 715 0.993 64.18 1.10 PlyA + 34132 34137 6 1.05 2.00 Prom + 34742 34781 40 -8.26 2.01 Init + 34907 34916 10 1 1 84 94 16 0.145 2.25 2.02 Intr + 42217 42284 68 2 2 123 88 0 0.248 2.12 2.03 Intr + 47157 47310 154 2 1 72 86 137 0.319 11.55 2.04 Intr + 51962 52070 109 0 1 -14 103 109 0.143 1.54 2.05 Intr + 60763 60842 80 1 2 69 101 76 0.220 6.19 2.06 Intr + 61055 61205 151 0 1 45 37 75 0.034 -2.68 2.07 Intr + 76320 76450 131 0 2 93 81 31 0.400 3.24 2.08 Intr + 77837 77967 131 0 2 57 82 67 0.184 3.41 2.09 Intr + 85015 85154 140 0 2 74 -3 84 0.297 -2.84 2.10 Term + 89670 89880 211 0 1 45 37 219 0.354 9.27 2.11 PlyA + 90099 90104 6 -0.45 3.00 Prom + 92705 92744 40 -2.06 3.01 Init + 100001 100129 129 1 0 94 121 250 0.829 29.05 3.02 Intr + 103846 103962 117 0 0 51 113 196 0.991 19.06 3.03 Intr + 105827 105952 126 1 0 60 42 90 0.039 2.48 3.04 Intr + 109838 109953 116 1 2 53 101 28 0.050 -0.15 3.05 Intr + 113783 113891 109 2 1 111 107 39 0.074 8.39 3.06 Intr + 115500 115583 84 2 0 78 105 7 0.036 1.42 3.07 Intr + 119207 119365 159 1 0 69 53 79 0.082 2.68 3.08 Intr + 121944 122054 111 2 0 131 93 250 0.998 30.38 3.09 Intr + 122148 122230 83 2 2 103 96 40 0.996 4.74 3.10 Intr + 124580 124756 177 2 0 111 60 38 0.821 2.33 3.11 Intr + 125486 125514 29 2 2 72 92 28 0.734 -0.64 3.12 Intr + 127000 127091 92 2 2 64 42 125 0.606 5.21 3.13 Term + 129606 129806 201 2 0 95 44 252 0.968 18.89 3.14 PlyA + 129949 129954 6 1.05 4.11 PlyA - 130070 130065 6 1.05 4.10 Term - 132500 132405 96 2 0 95 42 167 0.957 10.77 4.09 Intr - 133315 133235 81 1 0 121 80 79 0.999 10.23 4.08 Intr - 135958 135889 70 0 1 33 98 34 0.388 -1.92 4.07 Intr - 137593 137535 59 1 2 92 87 80 0.274 5.98 4.06 Intr - 147229 147141 89 2 2 79 50 60 0.147 0.99 4.05 Intr - 150880 150765 116 0 2 80 96 62 0.301 6.29 4.04 Intr - 169628 169562 67 0 1 79 87 33 0.279 0.26 4.03 Intr - 169871 169773 99 0 0 103 43 30 0.345 0.08 4.02 Intr - 171858 171743 116 2 2 96 98 83 0.601 10.19 4.01 Init - 178250 178219 32 2 2 86 61 50 0.512 1.54 4.00 Prom - 179493 179454 40 -2.76 5.04 PlyA - 180497 180492 6 1.05 5.03 Term - 180824 180766 59 0 2 68 51 68 0.795 -1.05 5.02 Intr - 181962 181905 58 0 1 96 84 31 0.918 1.96 5.01 Init - 182092 182024 69 1 0 81 86 89 0.940 9.05 5.00 Prom - 193437 193398 40 -5.06 6.00 Prom + 204586 204625 40 -4.06 6.01 Init + 208569 208607 39 2 0 77 87 33 0.917 2.28 6.02 Term + 208698 208820 123 2 0 63 32 126 0.946 2.88 6.03 PlyA + 210223 210228 6 1.05 7.00 Prom + 213597 213636 40 -3.86 7.01 Sngl + 219540 219938 399 2 0 74 48 280 0.738 18.87 7.02 PlyA + 220168 220173 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 22757 22838 82 1 1 73 92 75 0.907 7.53 S.002 Intr + 24747 24883 137 1 2 47 93 29 0.925 -0.41 S.003 Term + 105827 105985 159 1 0 60 41 138 0.816 4.24 S.004 Init - 119039 118978 62 2 2 92 37 116 0.846 5.70 S.005 Init + 121226 121288 63 1 0 56 72 98 0.893 4.15 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815594f:4287630_4517432|GENSCAN_predicted_peptide_1|917_aa MRNPEGLTVEIPSLPYSLSEPQPPRSDLWATTPHLSASARQRFQPAAGSTWTRRRQPSAA ATCLSGFAVQNSLLNHPARLRTAATPPPVEVTPTEAGRRFRQAKGALSSPEWLSMDPVAT HSCHLLQQLHEQRIQGLLCDCMLVVKGVCFKAHKNVLAAFSQYFRSLFQNSSSQKNDVFH LDVKNVSGIGQILDFMYTSHLDLNQDNIQVMLDTAQCLQVQNVLSLCHTFLKSATVVQPP GMPCNSTLSLQSTLTPDATCVISENYPPHLLQECSADAQQNKTLDESHPHASPSVNRHHS AGEISKQAPDTSDGSCTELPFKQPNYYYKLRNFYSKQYHKHAAGPSQERVVEQPFAFSTS TDLTTVESQPCAVSHSECILESPEHLPSNFLAQPVNDSAPHPESDATCQQPVKQMRLKKA IHLKKLNFLKSQKYAEQVSEPKSDDGLTKRLESASKNTLEKASSQSAEEKESEEVVSCEN FNCISETERPEDPAALEDQSQTLQSQRQYACELCGKPFKHPSNLELHKRSHTGEKPFECN ICGKHFSQAGNLQTHLRRHSGEKPYICEICGKRLAPSTLLPVLGLYVNQATYSIIWWLCQ DNRHQPGLCQSRSSVVMLNRTVSDLVLCKTEEESVMKSGFSNFSNLKEHKKTHTADKVFT CDECGKSFNMQRKLVKHRIRHTGERPYSCSACGKCFGGSGDLRRHVRTHTGEKPYTCEIC NKCFTRSAVLRRHKKMHCKAGDESPDVLEELSQAIETSDLEKSQSSDSFSQDTSVTLMPV SVKLPVHPVENSVAEFDSHSGGSYCKLRSMIQPHGVSDQEKLSLDPGKLAKPQMQQTQPQ AYAYSDVDTPAGGEPLQADGMAMIRSSLAALDNHGGDPLGSRASSTTYRNSEGQFFSSMT LWGLAMKTLQNENELDQ >gi568815594f:4287630_4517432|GENSCAN_predicted_CDS_1|2754_bp atgcgtaacccagaggggctgacggtggaaatcccgtccctcccctactcactctctgag ccccagcctcccaggtcggacctctgggccaccacccctcacctctcagccagcgcccgc caacggtttcaaccggccgcggggagcacgtggactcgccgccgccagcccagcgccgca gctacctgcctctcaggcttcgcggtgcagaattcgcttctgaaccacccagcgcgcctg cgcaccgcggccacgccccccccagtggaagtgacgccaacggaagcaggaaggcggttc cggcaagccaagggggcgttgtcgtcacctgaatggttgagcatggaccctgttgctacc cacagctgccatctgctccagcaactgcatgagcagcgaatccaaggcctgctttgtgac tgtatgttggtggtaaaaggagtctgctttaaagcgcataagaatgtcctggcagcattc agccagtattttaggagcctctttcagaattcttcaagccagaagaatgatgtttttcac ttggatgttaaaaatgtcagtggcatagggcagatcctggacttcatgtacacttctcat ctagatcttaaccaggacaatatacaagtaatgctggacacagcacagtgtttgcaagtt caaaatgttctgagtctgtgtcacacatttttaaaatcagccactgtagtacagccacct ggcatgccttgtaatagtacattgtctctacaaagcaccctgaccccagatgccacttgt gttatcagtgaaaactacccccctcatttactgcaggaatgttcagcagatgcacagcag aacaaaacgttggatgaatcgcatccgcatgcttcaccatcagttaatcgtcatcactcc gcaggtgaaatctcaaaacaagctcctgatacttcagatggcagctgcacagaactgcct ttcaaacagccaaattactattacaaactcagaaacttttacagtaagcagtaccataaa cacgcagctggtcccagtcaggagagagttgttgagcagccttttgctttcagcacctct acagaccttaccacggtagagagccagccttgtgccgtcagtcattctgaatgcatcctg gagtctcccgagcacttaccttccaacttcctggcccagcctgtgaatgactctgcccca caccctgagtcagacgccacatgccaacaacctgtcaagcagatgaggctcaaaaaggcc attcatctgaagaagctcaatttcctgaagtcacagaaatacgcagagcaagtatctgaa cccaagtcagatgatggtttgacaaagaggttggaatctgctagtaaaaataccctagag aaagctagcagccaaagtgctgaagaaaaagaaagtgaagaagtcgtcagttgtgagaat tttaattgcattagtgagacggagaggcctgaagacccggctgccctggaagaccagtcc cagacacttcagtcccagagacaatacgcgtgtgaattatgcgggaaaccttttaaacac ccaagcaacttggagcttcacaaacggtctcatacaggtgagaaaccttttgaatgtaac atttgtgggaaacatttctctcaggcaggtaacttgcagactcacttacgacggcattct ggtgaaaaaccatacatctgcgagatctgtggaaagaggctggcaccctcgaccctcctt cctgtccttgggctgtacgtcaatcaggccacgtactccatcatctggtggttgtgccag gacaacaggcatcagccaggactatgccagtcacgaagctctgtagtgatgctgaaccgt acggtatctgatcttgtgctctgtaaaactgaggaagaaagtgtgatgaaatcagggttt agtaacttcagtaatttgaaggagcacaaaaagacacacacggctgataaagtcttcacc tgtgatgagtgtggaaagtcttttaatatgcaaaggaagttagtaaagcacagaattcgg cacacgggggagcggccttacagctgctctgcctgcgggaaatgttttgggggatcaggt gacctccgcaggcatgtccgcactcacactggggagaagccgtacacatgtgagatctgt aacaagtgctttacccgctctgcggtgctccggcggcacaagaagatgcactgcaaagct ggtgacgagagcccagatgtgctggaggagctcagccaagccatcgagacctccgacctc gagaaatctcagagctcagactctttctcccaagacacgtctgtgacgctgatgccagtg tcggttaaactccctgtccacccagtggaaaattctgtggcagaatttgatagccactct ggcggctcctattgtaagttacggtccatgatccaacctcatggagttagtgaccaggag aagctgagtttggatcctggtaaacttgccaagccccagatgcagcagacacagcctcag gcctatgcttactcggatgtggacaccccagccggtggcgaaccactgcaggccgatggc atggccatgatccgttcctctctggctgctttggacaaccacggcggtgaccccctgggc agtcgagcatcttccaccacttataggaactcagagggtcagtttttctccagcatgact ctctgggggctagcgatgaagacgctgcagaatgaaaacgagttagaccagtga >gi568815594f:4287630_4517432|GENSCAN_predicted_peptide_2|394_aa MAARAKGTGFPSFSFTASPPIPAKHQRREIVEIKTQDKEVEEKTAGPEGHYHLDVETSSG PECLAAQLFIGYKTRGQGRSQSCCQCFGGDASAGRMFFRERLICVAAVLQNVECAGVSGI GYFWCVLRLADFKNEATDPRGVKPQTFTVSVTIKLVGTQRVSSSKIYCEEPKIKASTSCK GTPTVCSCWLRGVLFWVEALFKYKPTNPESTLQPPVFLGSTAPATSTCPHQLRAGTPRAS CLHTSPLMQNLNPGPVQDSHQKGHIDEEAHVLPGVTLKHQHIFTNQKYGADVNMFEAPYS LRSQESGTPPSNSLDSGVRVICVTARLETVHWKERGVHYEDRVCNRCGDGGHRGSDKGRG KPSQAYSLWTASVGHESQPIAPNPVRSMVFMKLT >gi568815594f:4287630_4517432|GENSCAN_predicted_CDS_2|1185_bp atggccgcacgtgccaagggaacaggattcccgtccttctcattcactgcatctcctcca atcccagcaaagcaccagagacgagagattgtagaaataaagacacaagacaaagaggta gaagaaaagacagctgggcccgagggccactaccacctagacgtggagaccagtagtggc cccgaatgcctggctgcacagttatttattggatacaagacaagggggcagggtcgaagc cagagttgttgtcagtgctttggtggagacgccagtgctgggcgaatgttcttccgagag cgccttatctgcgttgctgcagtcctgcagaatgtcgagtgtgctggtgtgtctggaatt ggttacttctggtgcgttcttcgtctcgctgacttcaagaatgaagccacggaccctcgc ggagtgaagccacagaccttcacagtgagtgttacaataaagctagtggggacccaaaga gtgagcagcagcaagatttattgtgaagagccaaagatcaaagcttccacatcctgcaag gggaccccaacggtttgcagctgctggctcagaggagtcctcttctgggtagaggcactt ttcaaatacaaaccaaccaacccagagtccacgctgcagccacctgtttttctaggctct actgctccagccacatctacctgtcctcatcagctcagggcaggcacccccagagcatcg tgtctgcacacctcacctctcatgcagaatctgaatcctgggccagttcaggactcacac cagaaagggcacattgatgaagaagcacatgtcttacctggggtgaccctgaagcaccaa cacatcttcaccaaccaaaaatatggggccgatgtcaacatgttcgaagcaccttattct ctaagatctcaggagagtggaactccgccctccaactctctagactctggagtgagagtt atctgtgtgacagccagattagaaacagtgcactggaaagagcgaggggtgcactatgag gatcgcgtctgcaaccgctgtggggacggcggccacagaggcagcgacaagggcaggggc aagcctagccaagcctacagcctgtggacagccagtgtggggcacgagtcccagcccata gcacccaatccagttcggtctatggtgttcatgaagctgacctga >gi568815594f:4287630_4517432|GENSCAN_predicted_peptide_3|510_aa MVKLGNNFAEKGTKQPLLEDGFDTIPLMTPLDVNQLQFPPPDKVVVKTKTEYEPDRKKGK ARPPQIAEFTVSITEGVTERFKNLKAQKTMGDYAHPNRLAAGSRSPASSPCLYLLWLSEK AMYAWGVQGNSPVDPELGSGPRPERADGSTASAEAAHASLSRRVCDAAATLVVDTGFFDA SGPWHKPFLPAGLPFPSSLEASSLCQMCESQPCPRGLHLWEDELKGEAPSGLAAIIARQP SLAGLLPLPLAATCLRRLRLLSPASSACFITALRSAALAQVSVLVLFALAFLTCVVFLVV YKVYKYDRACPDGFVLKLAHGQALEQQGASVRSVPVEGPPGEQGREEGVAKTAFPILQAQ VPLDTWPGWEMTQEMKPQSCVRLQMRLAATMEGAWHLLFKRRSRRWNTAESQSECIQKPF EVDAVISPFYQWLEGVQRLFVEPKNTQCIPEGLESYYAEQDSSAREKFYTVINHYNLAKQ SITRSVSPWMSVLSEEKLSEQETEAAEKSA >gi568815594f:4287630_4517432|GENSCAN_predicted_CDS_3|1533_bp atggtgaagttggggaacaatttcgcagagaagggcaccaagcagccgctgctggaggat ggcttcgacaccattcccctgatgacgcccctcgatgtcaatcagctgcagttcccgccc ccggataaggtggtcgtgaaaactaagaccgagtatgaacctgaccgcaagaaagggaaa gcacgtcctccccaaattgctgagttcaccgtcagcatcacggagggtgtcaccgagagg tttaagaacctgaaggctcagaagaccatgggtgattatgctcatcccaatagactggct gctggcagcagaagccctgcctcatctccctgcctctacctcctgtggctgtctgagaag gcgatgtatgcatggggtgtgcagggcaactcccctgtggaccctgagctgggcagtggc cccagacctgagagggcagacgggagcacagccagtgctgaggccgctcacgcttccctg tccagacgggtgtgcgatgctgcagccacactggtcgtagacactgggttttttgatgct tcaggcccgtggcacaaaccctttcttcctgctggactcccatttccctcctccctggag gcatcttctctgtgccagatgtgtgaatcacagccctgccctcgggggctccatctgtgg gaagatgaactgaaaggagaggcaccatcagggcttgcggctatcatcgccaggcagccc tccctggctgggctcctgccactgcctctggctgccacgtgcctcagacggctgcgcctt ctttctccggccagctctgcctgcttcatcacagctctgcgctctgctgccctggcccag gtctccgtgttggtcctcttcgccctggccttcctcacctgcgtcgtcttcctggttgtc tacaaggtgtacaagtatgaccgcgcctgccccgatgggttcgtcctcaagctggcccac ggacaggccttagagcagcagggggccagtgtcaggagtgtccctgtggaaggcccacct ggagagcagggcagggaggagggggtggctaaaactgccttccctatcctgcaagcccaa gtcccgctggatacctggcctgggtgggagatgacccaggagatgaagccccagtcctgt gtccggctgcagatgagactggcggccacaatggaaggggcctggcaccttttgtttaaa aggcgctccaggcgctggaacacagccgagagccagtcagaatgcatccagaagcccttt gaggtggatgctgttatatccccattttaccagtggttagagggtgtacagaggctcttc gtggagcccaagaacacccagtgcatcccagaaggcttggagagctactacgcggagcaa gactccagtgcccgggagaaattttacacagtcataaaccactacaacctggccaagcag agcatcacgcgctccgtatcgccctggatgtcagttctgtcagaagagaagctgtccgag caggagactgaagcggctgagaagtcagcttag >gi568815594f:4287630_4517432|GENSCAN_predicted_peptide_4|274_aa MVVPAFMELLSHTMSEYGRMTDTERDQIDQDAQIFMRTCSEAIQQLRTEAHKEIHSQQVK EHRTAVLDFIEDYLKSISFSFKGVCKLYSEQRAIRVKRVVDKKRLSKLEPEPNTKTREST SSEKVSQSPSKDSEENPATEERPEKILAETQPELGTWGDGKGEDELSPEEIQMFEQENQR LIGEMNSLFDEVRQIEGRVVEISRLQEIFTEKVLQQEAEIDSIHQLVVGATENIKEGNED IREAIKNNAGFRVWILFFLVMCSFSLLFLDWYDS >gi568815594f:4287630_4517432|GENSCAN_predicted_CDS_4|825_bp atggttgtccctgccttcatggagctgctaagccataccatgtctgaatatgggaggatg acagacacagaacgagaccagatagaccaggatgcccagatattcatgaggacctgttca gaagcaattcagcaactacgaacagaagctcacaaggagatacattcccagcaagtgaag gagcacaggaccgctgttttggatttcattgaagattacttgaaaagtatttcattttct tttaaaggagtatgtaaactttactcagaacagagagccatccgagttaaaagagtggtg gataagaaaagattatctaagctggaaccagaaccaaatacaaagacaagagaatccaca tcttctgagaaagtttcacagagtccttcaaaagactctgaagaaaaccctgccactgaa gaacgtccagaaaaaattttggctgaaacacaacctgaattgggaacgtggggagatggc aaaggcgaagatgagttatccccagaagaaatacaaatgtttgaacaggaaaatcagcga ctaattggtgaaatgaacagcttgtttgatgaagtgaggcaaatcgaagggagagtggtt gagatttccagactccaagagatattcacggaaaaggttttgcaacaggaagctgagatt gacagcattcaccagttagttgtgggggcaactgaaaatatcaaggaaggcaacgaagac ataagagaggccattaaaaacaacgctggcttccgcgtgtggatcctcttcttcctcgtg atgtgctccttctccttgctcttcctcgactggtacgacagctag >gi568815594f:4287630_4517432|GENSCAN_predicted_peptide_5|61_aa MNAAVPEKWCECILYVITHTASQFIHSAVQAYPLSHPSHRFSAIIAGWRHVSNGRVVDYP P >gi568815594f:4287630_4517432|GENSCAN_predicted_CDS_5|186_bp atgaatgcagcagtacctgagaagtggtgcgagtgcatcctttatgtcatcactcacacc gcctctcagtttatacattcagcggtacaagcatatcctttatctcatccctcacaccgc ttctcagctatcatcgcaggctggcgacatgtcagcaatggccgggttgtagactacccg ccttga >gi568815594f:4287630_4517432|GENSCAN_predicted_peptide_6|53_aa MNTDFLNSNASGKSSEILCLGYENNLTADHCLLLNHCSQPAIKSSSFYIFKGS >gi568815594f:4287630_4517432|GENSCAN_predicted_CDS_6|162_bp atgaacactgatttcttaaatagcaacgccagtggtaagtccagtgagatcctgtgtctt ggttatgagaacaacctcacagctgatcactgtctcctgctgaaccactgcagccaaccc gccatcaagtcctccagcttctacatctttaagggctcctaa >gi568815594f:4287630_4517432|GENSCAN_predicted_peptide_7|132_aa MTLPLDAQQQLSVSHAIMRVNNPYFTVYYKESGISQALLELEKNSDLKAQLTELNIMAAK EIEVGGGQKAITIFVPVPPLKSFQKIQVRLVCELEKKFSGKHIIFIAQRRILPKPIRKSR TKNKQKRPRSAL >gi568815594f:4287630_4517432|GENSCAN_predicted_CDS_7|399_bp atgacactccctcttgatgctcagcagcagctctcagtcagccatgcaattatgagggta aacaacccatactttacagtgtactataaagagtccggcatctcccaggctcttctggag ctggaaaagaactcggacctcaaggctcagctcacagagctgaatattatggcagccaag gaaattgaagttggtggtggtcagaaagctatcacaatctttgttcccgttcctccactg aaatctttccagaaaatccaagtccggctagtatgcgaactggagaaaaagttcagtgga aagcatattatctttatcgctcagaggagaattctgcctaagccaattcgaaaaagccgc acaaaaaataagcaaaagcgtcccaggagtgcactctga