GENSCAN 1.0 Date run: 4-Nov-116 Time: 11:52:02 Sequence gi568815575f:52652681_52860441 : 207761 bp : 43.35% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.04 Intr - 304 190 115 1 1 97 60 133 0.913 11.42 1.03 Intr - 812 724 89 0 2 79 100 107 0.916 10.69 1.02 Intr - 3851 3683 169 2 1 90 110 93 0.330 11.22 1.01 Init - 13932 13885 48 0 0 82 60 36 0.291 1.15 1.00 Prom - 19512 19473 40 -3.46 2.03 PlyA - 19713 19708 6 1.05 2.02 Term - 22605 22499 107 1 2 58 49 104 0.776 2.07 2.01 Init - 25226 25136 91 2 1 92 106 70 0.852 9.94 2.00 Prom - 32289 32250 40 -7.36 3.11 PlyA - 33734 33729 6 1.05 3.10 Term - 34858 34678 181 0 1 -36 55 200 0.011 1.38 3.09 Intr - 44838 44749 90 2 0 95 57 82 0.123 4.91 3.08 Intr - 45408 45312 97 1 1 92 -1 105 0.210 1.07 3.07 Intr - 47898 47763 136 0 1 29 99 157 0.597 11.04 3.06 Intr - 49936 49839 98 2 2 37 99 104 0.941 6.13 3.05 Intr - 51912 51817 96 1 0 94 96 71 0.993 8.48 3.04 Intr - 52562 52448 115 2 1 133 60 145 0.999 16.22 3.03 Intr - 53136 53001 136 2 1 123 100 139 0.953 19.17 3.02 Intr - 56534 56362 173 2 2 77 37 84 0.721 0.94 3.01 Init - 59463 59350 114 0 0 91 93 78 0.955 8.81 3.00 Prom - 76119 76080 40 -2.26 4.00 Prom + 76947 76986 40 -2.26 4.01 Init + 93603 93716 114 2 0 91 93 78 0.955 8.81 4.02 Intr + 96536 96708 173 1 2 77 37 84 0.721 0.94 4.03 Intr + 99934 100069 136 1 1 123 100 139 0.953 19.17 4.04 Intr + 100508 100622 115 1 1 133 60 145 0.999 16.22 4.05 Intr + 101158 101253 96 2 0 94 96 71 0.993 8.48 4.06 Intr + 103134 103231 98 1 2 37 99 104 0.940 6.13 4.07 Intr + 105174 105309 136 2 1 29 99 157 0.597 11.04 4.08 Term + 107664 107764 101 1 2 100 35 125 0.775 6.69 4.09 PlyA + 108836 108841 6 1.05 5.14 PlyA - 108969 108964 6 1.05 5.13 Term - 113763 113667 97 2 1 96 43 95 0.094 3.24 5.12 Intr - 127924 127835 90 0 0 95 57 109 0.069 7.61 5.11 Intr - 136338 136223 116 0 2 67 75 20 0.261 -2.15 5.10 Intr - 138375 138162 214 2 1 97 53 140 0.881 10.12 5.09 Intr - 143261 143189 73 0 1 87 61 6 0.291 -3.74 5.08 Intr - 143788 143558 231 2 0 19 72 316 0.536 20.84 5.07 Intr - 144448 144433 16 1 1 61 80 -5 0.292 -8.88 5.06 Intr - 144782 144594 189 2 0 42 94 106 0.279 6.38 5.05 Intr - 158893 158627 267 1 0 32 51 187 0.005 7.13 5.04 Intr - 171713 171589 125 0 2 88 69 40 0.266 2.40 5.03 Intr - 175469 175289 181 2 1 19 49 188 0.604 7.44 5.02 Intr - 178249 178178 72 1 0 91 51 61 0.542 2.30 5.01 Init - 181219 180878 342 1 0 56 86 270 0.386 20.84 5.00 Prom - 181577 181538 40 -3.16 6.03 PlyA - 181720 181715 6 1.05 6.02 Term - 183659 183130 530 0 2 67 43 126 0.082 0.42 6.01 Init - 191158 191026 133 1 1 78 47 87 0.357 3.90 6.00 Prom - 191282 191243 40 -1.76 7.04 PlyA - 192047 192042 6 1.05 7.03 Term - 193286 192447 840 2 0 -25 43 232 0.128 0.34 7.02 Intr - 193573 193380 194 2 2 57 41 198 0.093 11.21 7.01 Init - 195684 195474 211 0 1 81 0 153 0.012 4.75 7.00 Prom - 199726 199687 40 -1.96 8.00 Prom + 202404 202443 40 -2.06 8.01 Sngl + 206209 207105 897 0 0 82 47 263 0.562 17.77 8.02 PlyA + 207153 207158 6 -0.45 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 127858 127948 91 0 1 92 106 70 0.822 9.94 S.002 Intr + 162412 162537 126 2 0 78 87 94 0.903 8.95 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815575f:52652681_52860441|GENSCAN_predicted_peptide_1|141_aa MFLITLQVCVDVDAREHCRHRVSAPRKGLSEEDYIRSDIFMINSQPSTSSGQSYLYFSSN EQNQAGSTRWDEGQTAPGAMNGDDAFARRPRAGAQIPEKIQKSFDDIAKYFSKKEWEKMK SLEKISYVYMKRKYEAMTKLX >gi568815575f:52652681_52860441|GENSCAN_predicted_CDS_1|423_bp atgtttcttatcacacttcaagtctgtgttgatgttgatgccagagagcactgcagacat agagtatcagcccccagaaaaggcctttccgaggaggactatatcaggtctgacattttc atgatcaacagccagccatctaccagttctggccaatcctatctgtatttcagcagcaat gaacagaaccaagctgggagcacgagatgggatgagggtcagactgctcctggtgccatg aacggagacgacgcctttgcaaggagacctagggctggtgctcaaataccagagaagatc caaaagtccttcgatgatattgccaaatacttctctaagaaagagtgggaaaagatgaaa tccttggagaaaatcagctatgtgtatatgaagagaaagtatgaggccatgactaaacta gnn >gi568815575f:52652681_52860441|GENSCAN_predicted_peptide_2|65_aa MPVFMKGHHVLLLIMGMCRIPEAEARREKEKTATATPIFSNHHLDQPAAINTKARPSTSK KSVTH >gi568815575f:52652681_52860441|GENSCAN_predicted_CDS_2|198_bp atgcccgtgttcatgaaaggtcaccacgttctgcttctcatcatgggcatgtgtcgtatc cccgaggctgaggcaagaagagagaaggaaaaaactgccacagccactccaatcttcagc aaccaccaccttgatcagccagcagccattaacaccaaggcaagaccctccaccagcaaa aagagtgtgactcactga >gi568815575f:52652681_52860441|GENSCAN_predicted_peptide_3|411_aa MRYHYTLITIAKIKDIVTTPNGDKDAEKLDTSLSAAGKHCRHRVSTSRKGLSKEDYIRTD IFMINSQPSTSSGQSYLYFINNEQNQAGSMRWDEGKIKAAVAGKTQAISLAGQTAPGAMN GDDAFARRPTVGAQIPEKIQKAFDDIAKYFSKEEWEKMKASEKIFYVYMKRKYEAMTKLG FKATLPPFMCNKRAEDFQGNDLDNDPNRGNQDDFRQAPGNLPEGECLSDLKDQRTFVPPR MRTLIMPKKPAEEGNDSEEVPEASGPQNDGKELCPPGKPTTSEKIHERSGPKRGEHAWTH RLRERKQLVIYEEISDPEEDDDLRDTTHAHDEKQNVVTFHEHGHGCGPLVIRTGCLHEEP EAPSVFDHMMKKLDLTSDGQLDFQECLHLMDGMTVAYHDSFLKAAHSKKRI >gi568815575f:52652681_52860441|GENSCAN_predicted_CDS_3|1236_bp atgaggtatcactacacacttattacaatagctaaaataaaagacatagtgacaacacca aatggtgacaaggatgcagagaaactggacacctcattaagtgctgctgggaagcactgc agacacagagtatcaacctccagaaagggcctttccaaggaggactatatcaggactgac attttcatgatcaacagccagccatctaccagttctggccaatcctatctgtatttcatc aacaatgaacagaaccaagctgggagcatgagatgggacgagggcaagatcaaagctgct gtggctggaaagactcaggctatttctcttgcaggtcagactgctcccggtgccatgaac ggagacgacgcctttgcaaggagacccacggttggtgctcaaataccagagaagatccaa aaggccttcgatgatattgccaaatacttctctaaggaagagtgggaaaagatgaaagcc tcggagaaaatcttctatgtgtatatgaagagaaagtatgaggctatgactaaactaggt ttcaaggccaccctcccacctttcatgtgtaataaacgggccgaagacttccaggggaat gatttggataatgaccctaaccgtgggaatcaggatgactttcggcaggctccagggaat ctccccgaaggtgagtgtctctcagatctaaaggaccagagaacctttgtccctccacgg atgcgaacactgatcatgcccaagaagccagcagaggaaggaaatgattcggaggaagtg ccagaagcatctggcccacaaaatgatgggaaagagctgtgccccccgggaaaaccaact acctctgagaagattcacgagagatctggacccaaaaggggggaacatgcctggacccac agactgcgtgagagaaaacagctggtgatttatgaagagatcagcgaccctgaggaagat gacgacctcagggatacgacacatgcccatgatgagaagcagaacgtggtgacctttcac gaacatgggcatggctgcggacccctcgtcatcagaactggatgccttcatgaagaacca gaggcccccagtgtctttgaccacatgatgaagaaactggacctcactagtgatgggcag ctggatttccaagaatgtctgcatctgatggatggcatgactgtggcttaccatgactct tttctcaaggctgcccattccaagaagcggatctga >gi568815575f:52652681_52860441|GENSCAN_predicted_peptide_4|322_aa MRYHYTLITIAKIKDIVTTPNGDKDAEKLDTSLSAAGKHCRHRVSTSRKGLSKEDYIRTD IFMINSQPSTSSGQSYLYFINNEQNQAGSMRWDEGKIKAAVAGKTQAISLAGQTAPGAMN GDDAFARRPTVGAQIPEKIQKAFDDIAKYFSKEEWEKMKASEKIFYVYMKRKYEAMTKLG FKATLPPFMCNKRAEDFQGNDLDNDPNRGNQDDFRQAPGNLPEGECLSDLKDQRTFVPPR MRTLIMPKKPAEEGNDSEEVPEASGPQNDGKELCPPGKPTTSEKIHERSGPKRGEHAWTH RLRERKQLVIYEEISDPEEDDE >gi568815575f:52652681_52860441|GENSCAN_predicted_CDS_4|969_bp atgaggtatcactacacacttattacaatagctaaaataaaagacatagtgacaacacca aatggtgacaaggatgcagagaaactggacacctcattaagtgctgctgggaagcactgc agacacagagtatcaacctccagaaagggcctttccaaggaggactatatcaggactgac attttcatgatcaacagccagccatctaccagttctggccaatcctatctgtatttcatc aacaatgaacagaaccaagctgggagcatgagatgggacgagggcaagatcaaagctgct gtggctggaaagactcaggctatttctcttgcaggtcagactgctcccggtgccatgaac ggagacgacgcctttgcaaggagacccacggttggtgctcaaataccagagaagatccaa aaggccttcgatgatattgccaaatacttctctaaggaagagtgggaaaagatgaaagcc tcggagaaaatcttctatgtgtatatgaagagaaagtatgaggctatgactaaactaggt ttcaaggccaccctcccacctttcatgtgtaataaacgggccgaagacttccaggggaat gatttggataatgaccctaaccgtgggaatcaggatgactttcggcaggctccagggaat ctccccgaaggtgagtgtctctcagatctaaaggaccagagaacctttgtccctccacgg atgcgaacactgatcatgcccaagaagccagcagaggaaggaaatgattcggaggaagtg ccagaagcatctggcccacaaaatgatgggaaagagctgtgccccccgggaaaaccaact acctctgagaagattcacgagagatctggacccaaaaggggggaacatgcctggacccac agactgcgtgagagaaaacagctggtgatttatgaagagatcagcgaccctgaggaagat gacgagtaa >gi568815575f:52652681_52860441|GENSCAN_predicted_peptide_5|670_aa MSGGSMDYNRERGSPEGMDPDGAIESNCNEIVDNFDDMNLKESLLWGIYAYDFEKSSAIQ QRAIITCMKGYDVIAQAQSGTGKTATFAISILQQLETQALMLAPTRELAPQIQKSVQPWL GLPAAASVMAAAAAIMVTSQEPSDEEPREEPPTESRDPTPGQKREDQGAADIQGGGKGKK ERLWGEEAYVCIMHYAMTVPDLEADLQSCLSQRLGMNAEMVLMFRGQFCQNQSNLKCQKE PSFFLRPDPYKTRRPIAAGPHEKDDDIKALLTRSHQSRAPTLRKPAGSSALSLTLQLPVG SACGPTGCVSVGEKESRPQGPEVQDHSSREGIYREGKSSGPLGSFNIAVEVWTLQDPAVD IQQPTRIMEKPTSSTNGEKRKSPCDSNSKNDEPGVLTVQEEEDEGSSQEDEDLDSSAESS KQDEDLQLPEGSSQEDEDLGLSEGSSQEDEDLDSSEGSLMEEEDPDSSEGSSEEGSGDVY MFMQLYPAAHSSAECTYQRVYIARICLILPRSPDDDFICVSSHIHPNGPSQTLKPKIIQR AKAKMPNHLLQNPAPGLKCIVCIKIKTGGQSLAYEPLETLTGFLPYSVEGVYRWMILVVI AQLRGQRESLGDTTHAHDEKQNVVTFHEHGHGCGPLVIRALEPSLRNVNTEEDDVLVSLS PGEFSLDALL >gi568815575f:52652681_52860441|GENSCAN_predicted_CDS_5|2013_bp atgtctggtggctccatggattataacagagaacgtggcagcccagagggaatggaccct gatggtgccattgagagcaactgtaatgagatcgttgataactttgatgatatgaattta aaggagtctcttctttggggcatctatgcttacgattttgagaagtcttctgctattcag cagagagctattattacctgtatgaaagggtatgatgtgattgctcaagctcagtcaggt actggcaagacagccacatttgctatttccatcctgcaacagttggagacccaagcacta atgctggcccccaccagagaactggctccacagatccaaaagagcgtgcagccctggctg ggcctccctgctgcagccagtgtgatggcagcagctgctgctatcatggtgacatcccag gagcccagtgatgaggagcctcgagaggaaccaccaactgaaagtcgggatcctacacct ggtcagaagagggaagatcagggtgcagctgatattcaaggtggtgggaagggaaagaaa gaacgtctatggggggaggaggcctatgtgtgcatcatgcattatgccatgaccgtgcct gacctggaagctgatctccagagctgtctcagtcaaagactggggatgaatgcggagatg gtcctgatgttcaggggacaattctgccaaaatcagagcaatttaaaatgccagaaggag ccctccttcttccttcgtcctgacccctacaagacaagaaggcccatagctgcggggccg cacgagaaggacgacgacatcaaggcccttctcactcggagtcaccaatcacgcgccccg accctccggaagcccgctggctcctccgcactctcactcacacttcaactcccagttgga tcggcctgtggacctactggctgcgtctcagtaggggagaaagaatccagacctcaggga cccgaagtgcaggatcacagctcccgggagggtatatatagggagggcaagagctctggg ccactgggaagcttcaatatagctgtggaagtctggactctacaagatcctgctgtagac attcaacaaccaaccagaatcatggaaaagcccacttcaagcaccaatggggagaagagg aagagcccctgtgactccaacagcaaaaatgatgagcctggggtacttacagtccaagag gaggaggacgaaggatcctcacaggaggacgaagacctagactcatctgcagaatcttca aagcaggatgaagacctacaattacctgaaggatcttcacaggaggatgaagacctaggg ttatctgaaggatcttcacaagaggatgaagacctagactcatctgaaggatctttgatg gaggaagaagacccagactcatctgaaggatcgtcagaggagggaagtggagatgtgtat atgttcatgcaactgtacccggcagcacatagttctgctgaatgtacatatcaaagggtc tacattgcacgcatctgccttattcttccacgttccccagatgacgatttcatctgtgtc tcctcccacatccacccaaatggaccgtcccagaccttgaaaccgaaaatcattcagaga gcaaaggccaagatgcccaaccacctgctacagaatcctgctccaggactgaagtgtata gtctgtatcaaaataaaaactggaggacagtccctagcctatgagcccctggagaccctt acaggatttctgccttacagtgtggagggagtctacaggtggatgatattggttgtaatc gcccagctcagagggcagagggagagcctcggggatacgacacatgcccatgatgagaag cagaacgtggtgacctttcatgaacacgggcatggctgtggacccctcgtcatcagggcc ttggagccatctcttcgaaatgtgaacactgaggaagatgatgtccttgtctccctgtca ccaggggagtttagcctagatgccttgctctaa >gi568815575f:52652681_52860441|GENSCAN_predicted_peptide_6|220_aa MEYYAAIKKDEAMSFVGTWMKLETIILSKVSQGQKTKHLMLLFVVLEVLARAIRQEKEIK GIQLGKEEVKLSLFADDMIVYLENPNVSAQNLLNLIGNFSKVSGYKINVQKSQAFLYTNN RQTESQIMSELPFTIASKRIKYLGIQLTRDMKNLFKENYKPLLNEIKEDTNKWKNIPCSW VGRINIVKMAILPKVIYRFHAIPIKLPMCFVIFQQAKLKT >gi568815575f:52652681_52860441|GENSCAN_predicted_CDS_6|663_bp atggaatactatgcagccataaaaaaggatgaggctatgtcctttgtggggacatggatg aagctggaaaccatcattctgagcaaagtatcacaaggacagaaaaccaaacacctcatg ctcttatttgtagtgttggaagttctggccagggcaatcaggcaggagaaagaaataaag ggtattcaattaggaaaagaggaagtcaaattgtccctgtttgcagatgacatgattgta tatctggaaaaccccaacgtctcagcccaaaatctccttaatctgataggcaacttcagc aaagtctcaggatacaaaatcaatgtgcaaaaatcacaagcattcttatacaccaataat agacaaacagagagccaaatcatgagtgaactcccattcacaatagcttcaaagagaata aaatatctaggaatccaacttacaagggacatgaagaacctcttcaaggagaactacaaa ccactgctcaatgaaataaaagaggatacaaacaaatggaagaacattccatgctcatgg gtaggaagaatcaatatcgtgaaaatggccatactgcccaaggtaatttatagattccat gccatccccatcaagttaccaatgtgttttgtgatttttcaacaagcaaaattaaaaaca tga >gi568815575f:52652681_52860441|GENSCAN_predicted_peptide_7|414_aa MAQELRDTCTSFSSRFDQVEERVMVIEDQINEMKQEEKFREKRVKRNKQSLQEIWDYVKR PNLRLIGVPEKNLEETDKFLDTYTLPRLNQEEVESLNRPITDSEIEAIINSLPTKKSSGP DGFTAEFYQRYKEELPGRDTTKKENFRPISLMNIDAKVLNKILANRIQQHIKKLIHHDQV GFIPGMQGWFNIHKSINVIQHINRTKDKNHMIISVDVEKAFDKIQQPFMLKTLNKLGIDG TYLKTIRAIYDKPTANIILNGQKLEAFSLKTGTREGCPLSPLLFNIVLEVLARAIRQEKE IKGIQLGKEEVKLSLFTDDIIVYLENPIISAQNLLKLVSNFSKVSGYKINVQKSQAFLHT NNRQTESQIMSELPFTIATKRIKYLGIQLTRDVKDLFKGSYKPLLNEVKKDTNK >gi568815575f:52652681_52860441|GENSCAN_predicted_CDS_7|1245_bp atggcacaagaactacgtgacacatgcacaagcttcagtagccgattcgatcaagtggaa gaaagggtaatggtgattgaagatcaaattaatgaaatgaagcaagaagagaagtttaga gaaaaaagagtaaaaagaaacaaacaaagcctccaagaaatatgggactatgtgaaaaga ccaaatctacgtctgattggtgtacctgaaaaaaatctagaagaaacggataaattcctg gacacatacaccctcccaagactaaaccaggaagaagttgaatctctgaatagaccaata acagactctgaaattgaggcaataattaatagcttaccaaccaagaaaagttcaggacca gatggattcacagctgaattctaccagaggtacaaggaggagctgcctggcagagacaca acaaaaaaagagaattttagaccaatatccctgatgaacatcgatgcaaaagtcctcaat aaaatactggcaaaccgaatccagcagcacatcaaaaagcttatccaccatgatcaagtg ggcttcatccctgggatgcaaggctggttcaacatacacaaatcaataaacgtaatccag catataaacagaaccaaggacaaaaaccacatgattatctcagtagatgtagaaaaggcc tttgacaaaattcagcagcccttcatgctaaaaactctcaataaattaggtattgatggg acatatctcaaaacaataagagctatttatgacaaacccacagccaatatcatactgaat gggcaaaagctggaagcattttctttgaaaactggcacaagagagggatgccctctctca ccactcctattcaacatagtgttggaagttctggccagggcaatcaggcaggagaaagaa ataaagggtattcaattaggaaaagaggaagtcaaattgtccctgtttacagatgacata attgtatatctagaaaaccccatcatctcagcccaaaatctccttaagctggtaagcaac ttcagcaaagtctcaggatataaaatcaatgtgcaaaaatcacaagcattcttacacacc aataacagacaaacagagagccaaatcatgagtgaactcccattcacaattgctacaaag agaataaaatatctaggaatccaacttacaagggatgtgaaggacctcttcaaggggagc tacaaaccactgctcaacgaagtaaaaaaggacacaaacaaatag >gi568815575f:52652681_52860441|GENSCAN_predicted_peptide_8|298_aa MKDLFKENYKPLLNEIKEDTNKWKNTPHSWVGTINIVKMAILPKVIYRFNAIPIKLPMTF FTELEKTTLKFIWNQKRAHIAKSILSQKNKARDIKLPDFKLYYKDTVTKTAWYWYQNRAI DQGNRTEPSEIMPHIYNHLIFDKPDKNKQWGKDSLFNKWCLENWLAICRKLKLEPFLPPY TKINSSWIKDLNVRPENIKTLEENLGNTIQDIGMGKDFMSKTLKAMATKAKIDKWDLIKL KSFCTAKETTIRVNRQPTEWEKIFAIYSSDKGLISRIYNELHQIYKKKTTPSKSGQRI >gi568815575f:52652681_52860441|GENSCAN_predicted_CDS_8|897_bp atgaaggacctcttcaaggagaactacaaaccactgctcaatgaaataaaagaggataca aacaaatggaagaacactccacactcatgggtaggaacaatcaatattgtgaaaatggcc atactgcccaaggtaatttatagattcaatgccatccccatcaagctaccaatgactttc ttcacagaattggaaaaaactactttaaagttcatatggaaccaaaaaagagcccacatc gccaagtcaatcctaagccaaaagaacaaagctagagacatcaagctacctgacttcaaa ctatactacaaggatacagtaaccaaaacagcatggtactggtaccaaaacagagctata gatcaagggaacagaacagagccctcagaaataatgccgcatatctataaccatctgatc tttgacaaacctgacaaaaacaagcaatggggaaaggattccctatttaataaatggtgc ttggaaaactggctagccatatgcagaaagctgaaactggaacccttccttccaccttat acaaaaattaattcaagctggattaaagacttaaatgttagacctgaaaacataaaaact ctggaagaaaacctaggcaataccattcaggacataggcatgggcaaggacttcatgtct aaaacactaaaagcaatggcaacaaaagccaaaattgacaaatgggatctaattaaactc aagagcttctgcacagcaaaagaaactaccatcagagtgaacaggcagcctacagaatgg gagaaaatttttgcaatctactcatctgacaaagggctaatatccagaatctacaatgaa ctccatcaaatttacaagaaaaaaacaaccccatcaaaaagtgggcaaaggatatga