GENSCAN 1.0    Date run: 8-Nov-116    Time: 13:10:26

Sequence gi568815591f:151867247_152119798 : 252552 bp : 43.71% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +    652    831  180  1  0  111   71    53 0.687   4.88
 1.02 Term +   2437   2617  181  1  1   75   49   147 0.433   6.58
 1.03 PlyA +   7483   7488    6                               1.05

 2.03 PlyA -   7690   7685    6                               1.05
 2.02 Term -   8153   8130   24  2  0   60   52    -2 0.160  -8.38
 2.01 Init -   9359   9261   99  2  0   68  117   152 0.661  16.36
 2.00 Prom -  14501  14462   40                              -5.46

 3.00 Prom +  14885  14924   40                              -3.16
 3.01 Init +  15884  15942   59  1  2   65   98   114 0.734  10.88
 3.02 Intr +  18076  18174   99  1  0   49   72    65 0.053   0.23
 3.03 Intr +  26449  26598  150  1  0   77   17   104 0.065   1.48
 3.04 Term +  34854  34938   85  0  1   95   43    92 0.187   2.53
 3.05 PlyA +  35818  35823    6                               1.05

 4.00 Prom +  46428  46467   40                              -2.36
 4.01 Init +  52397  52638  242  1  2   68   94    48 0.005   0.75
 4.02 Intr +  63648  63753  106  0  1   77   37    73 0.003   1.32
 4.03 Intr +  68932  69046  115  0  1   94   60    76 0.039   5.42
 4.04 Intr +  77048  77084   37  0  1  107  103    10 0.071   1.72
 4.05 Intr +  83174  83284  111  2  0   73   52   111 0.531   5.49
 4.06 Intr + 115740 115906  167  0  2  109   91    50 0.280   7.00
 4.07 Intr + 135468 135622  155  1  2  111   61   198 0.536  19.19
 4.08 Intr + 138160 138317  158  0  2   90   50    51 0.303   0.31
 4.09 Intr + 140581 140698  118  1  1  121   53    33 0.390   3.57
 4.10 Intr + 147398 147547  150  1  0   71   81    25 0.126   0.46
 4.11 Term + 152400 152555  156  2  0   73   48   121 0.943   4.53
 4.12 PlyA + 152579 152584    6                               1.05

 5.00 Prom + 156707 156746   40                              -2.66
 5.01 Init + 158587 158638   52  0  1   81  101    58 0.704   7.93
 5.02 Term + 161450 161484   35  0  2  110   42     9 0.171  -3.75
 5.03 PlyA + 162866 162871    6                               1.05

 6.05 PlyA - 163845 163840    6                               1.05
 6.04 Term - 165192 164621  572  1  2   34   42   172 0.362   1.80
 6.03 Intr - 166174 165904  271  1  1   63   32   159 0.591   4.91
 6.02 Intr - 167541 167427  115  2  1   63  109    40 0.870   4.05
 6.01 Init - 168779 168574  206  2  2   47   73   120 0.747   4.72
 6.00 Prom - 171039 171000   40                              -5.16

 7.04 PlyA - 171695 171690    6                               1.05
 7.03 Term - 172512 172214  299  1  2   52   47   226 0.757  10.33
 7.02 Intr - 173936 173577  360  0  0   48   98   236 0.867  15.79
 7.01 Init - 176652 176544  109  0  1   72   98    43 0.546   4.08
 7.00 Prom - 181707 181668   40                              -0.36

 8.05 PlyA - 183072 183067    6                               1.05
 8.04 Term - 183642 183514  129  0  0   39   37   114 0.409  -0.42
 8.03 Intr - 185454 185293  162  0  0   38  102    60 0.328   2.57
 8.02 Intr - 191790 191711   80  1  2   43   45    66 0.239  -2.93
 8.01 Init - 193588 193456  133  1  1   78   47    72 0.222   2.40
 8.00 Prom - 193712 193673   40                              -1.56

 9.02 PlyA - 194476 194471    6                               1.05
 9.01 Sngl - 195977 195531  447  2  0   70   48   173 0.879   7.73
 9.00 Prom - 196921 196882   40                              -4.96

10.03 PlyA - 197086 197081    6                               1.05
10.02 Term - 198251 198078  174  2  0   65   44   144 0.880   5.46
10.01 Init - 201058 200993   66  1  0   38   93    45 0.582   1.07
10.00 Prom - 214612 214573   40                              -3.26

11.00 Prom + 214776 214815   40                              -5.66
11.01 Init + 222661 222731   71  0  2   61   64    32 0.284  -1.38
11.02 Intr + 227074 227276  203  1  2   64   97   149 0.628  12.33
11.03 Intr + 233552 233675  124  0  1  111   41    60 0.993   3.24
11.04 Intr + 235866 236032  167  0  2   91   87   126 0.993  12.40
11.05 Intr + 237999 238124  126  1  0   64   69    37 0.462   0.05
11.06 Intr + 240792 240957  166  1  1  134  -12   367 0.550  30.32
11.07 Intr + 243282 243399  118  0  1   78   58    69 0.971   3.37
11.08 Intr + 246000 246152  153  2  0   82  100   117 0.977  12.57
11.09 Intr + 249911 250129  219  1  0   58   90    96 0.972   5.30
11.10 Term + 251432 251650  219  1  0   81   40   185 0.691  10.04
11.11 PlyA + 251919 251924    6                               1.05

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815591f:151867247_152119798|GENSCAN_predicted_peptide_1|120_aa
XTVEVTAGPPPAMNSPGRRPAELQGTPLQDQAFGSWKRRWEPGVTEQTGLCRAFISSFTA
RTSTGPCSSGIGPMCRAPLRTTLADQPSHTQRAAERQPSFIKNPGATGQSEDPITKKLDY

>gi568815591f:151867247_152119798|GENSCAN_predicted_CDS_1|363_bp
nncacagtggaggtcactgcagggccacccccagcaatgaactcccctggaagaaggcca
gcagagctgcagggcacccccctccaggaccaggcctttggctcatggaaaagacgctgg
gagcctggggttacggagcaaactggtctttgtcgtgcattcatcagcagcttcactgcc
aggaccagcactgggccgtgctcaagcggcattgggccaatgtgccgtgccccactgagg
acgaccctcgctgatcagccctctcacacccagcgtgctgcagagcggcaaccaagcttc
atcaaaaaccctggagccacaggccagtctgaggaccccatcaccaagaagcttgactac
tga

>gi568815591f:151867247_152119798|GENSCAN_predicted_peptide_2|40_aa
MDTKKKKDVSSPGGSGGKKNASQKRRSLRVHIPGALPRVC

>gi568815591f:151867247_152119798|GENSCAN_predicted_CDS_2|123_bp
atggacaccaagaagaaaaaagatgtttccagccccggcgggagcggcggcaagaaaaat
gccagccagaagaggcgttcgctgcgcgtgcacattccgggcgcactgccccgtgtttgc
tga

>gi568815591f:151867247_152119798|GENSCAN_predicted_peptide_3|130_aa
MPVNHKEVLIVKSKKYFGLSLTLPQWAQRLPGSLDCGPRVTEYSSRPLTDMVGWGECEGP
KRGSGLSVKDARPPSACIVILPRDGCDPPLRHPPPPHHQGCGSRTKKKGLNTKEMSCIEV
EELTDEQRKN

>gi568815591f:151867247_152119798|GENSCAN_predicted_CDS_3|393_bp
atgccagtcaaccataaggaagtcctcatcgtgaaatccaagaagtactttggattaagc
ctgaccctccctcagtgggcccagcgtctgccgggcagccttgactgtggccctagagtc
acggaatactcgagcagacctctgacggacatggtggggtggggagagtgtgaaggaccc
aagagaggttctgggctgtcagtcaaggatgccagacccccaagtgcctgcattgtgatt
ctcccccgggatggctgtgaccccccgctgcgtcaccccccacctccacaccaccaaggc
tgtggcagccgaacaaagaagaagggcttgaacactaaagaaatgtcatgcattgaggta
gaagaactgactgatgaacagaggaaaaactga

>gi568815591f:151867247_152119798|GENSCAN_predicted_peptide_4|504_aa
MGPQNQLNPGKSQREGQAGSGESRKQQYRAAVLWQVHAFHPWGQERPTPISSAEQPPQQL
HVGCPVCEESKQKLGHRKLSRHYQSITALFTHSQGLQTSLEKDSWKQEWSAASDAAELSN
AKLNGPSMPSGSERLVELKMKSARVFVLFRLRHRAPQPQPRAHALRWAPIKINWTVANDS
NCLKLNQVVAPVVDVLSSVGFGTWCLQKHYPARLPTASIVICFYNEECNALFQTMSSVTN
LTPHYFLEEIILVDDMSKVGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMVVCPLIDVID
DRTLEYKPSPLTTADSNVFGLHTKWTELQCPEMRNTMSGAGFWQEDSLWGIFEMSIWHSS
GDVKSPAMSGGIFAIRRHYFNEIGQYDKDMDFWGRENLELSLRIWMCGGQLFIIPCSRVG
HISKKQTGKPSTIISAMTHNYLRLVHVWLDEYKEQFFLRKPGLKYVTYGNIRERVELRKR
LGCKSFQWYLDNVFPELEASVNSL

>gi568815591f:151867247_152119798|GENSCAN_predicted_CDS_4|1515_bp
atgggccctcagaaccagcttaacccaggcaagagtcagcgggaaggccaggctggtagt
ggggagagcaggaagcagcagtacagggccgctgtcctttggcaagtacatgcctttcac
ccctggggtcaggagaggcccactccgatttcctccgcagagcagccaccgcagcagctg
catgtgggatgccctgtatgtgaggagtccaagcagaagctgggacacaggaagttgtcc
aggcattatcaatcaatcacagcactcttcacgcacagccaagggctgcagacatcactg
gaaaaggattcctggaagcaggaatggtctgcagcgtcagatgccgcggaactaagtaat
gcaaaactcaacgggcctagtatgccatctggtagtgaacgtctagttgaattaaagatg
aaaagtgctagggtgtttgtattgtttcgactaagacacagagccccacagccccagccc
cgggcccacgctttgagatgggcccctatcaaaatcaactggaccgtagcaaatgacagt
aactgtcttaaacttaatcaagtggtagccccagtagtagatgtgctgtcttcagtgggc
tttggcacttggtgtcttcaaaaacattacccagcccgcctcccgactgccagcattgtc
atttgcttctataatgaagaatgtaatgccttgtttcagaccatgtccagtgtcacgaac
ctcacgccacactattttcttgaagaaattattttggtagatgacatgagcaaagttggg
gatgttctggtgttcctggacagccactgtgaggtgaacagagtatggctggagcccctg
ctgcatgccattgccaaggaccccaaaatggtggtgtgccccctgatagatgtcattgat
gatagaactctggagtataagccctctcctcttaccacagctgactccaacgttttcggc
ctgcacaccaaatggacggaactgcagtgccctgagatgagaaataccatgagtggagca
ggtttttggcaggaagattctctttggggcatatttgagatgtctatttggcattcaagt
ggggatgtgaagtcacctgcaatgtctggaggaatttttgctatacgtcggcattatttt
aatgaaattggacagtatgacaaggatatggatttttggggaagagaaaatttggaactt
tcactaaggatctggatgtgtggaggccaactctttataatcccctgctctcgagtagga
catatcagtaagaaacaaactggaaaaccttctacaatcatcagtgctatgacacataac
tacctaagactggtgcacgtttggctggatgaatataaggagcagttttttcttcgaaag
cctggtctgaaatatgtcacctacggaaatattcgcgagcgtgttgagttaaggaaacga
ctgggttgcaagtcatttcagtggtatttggataatgtcttcccagagttggaggcatct
gtgaacagcctgtga

>gi568815591f:151867247_152119798|GENSCAN_predicted_peptide_5|28_aa
MPQPESQKAAILGCGQGGHPNCCWEFGQ

>gi568815591f:151867247_152119798|GENSCAN_predicted_CDS_5|87_bp
atgccgcagcccgagtcccagaaggcggcgatcctgggctgcgggcaaggcggacacccc
aactgctgttgggaatttggccaatga

>gi568815591f:151867247_152119798|GENSCAN_predicted_peptide_6|387_aa
MRKDSCASSMHQQVSRSKKRAGQKTPPEDQEGGQRALRSSHIRLGQFLLIEDCKTPSPSS
LGADAIAKQRKTSVSAAASVSATIPIRRVQGPTVVGSWARGVSAAASGPRGTGPKGKARS
EKGCSLSHGPQTNKPLVVQKGQKMEQANHPVGLVISVVYKDILKKIVQRETSHPLIHVRY
AEAITGRRTAPEDKGSLVVPNPYTLLSQIPEEAEWFPVLDLKDAFFCIPLHYDSHDSQFL
FAFEDPTDHTSQLIWTVLPQGFRDSPHLFGQALAQDLGHFSSPGTLVLQYVDDLLLATSS
EASCQQATLDLLNFLANQGYKASRSKAQLCLQQVKYLGLILARGTRTLGKERIQPILAYP
HPKTLKQLWGFLQITAFANYGSPDRAR

>gi568815591f:151867247_152119798|GENSCAN_predicted_CDS_6|1164_bp
atgcgcaaggacagttgcgccagcagcatgcatcagcaagttagcagaagcaagaagaga
gctgggcagaagacaccccctgaagatcaagagggaggccagcgggcactacgcagcagc
cacatcagactgggacaattcctgcttatagaggactgtaaaaccccttccccatcctca
cttggggctgacgctattgccaagcagaggaaaactagtgtttctgctgctgcgtcagtg
agtgcaactattcccatccgcagggtccagggaccgaccgttgtgggttcttgggcaaga
ggtgtttctgctgctgcatcgggaccaagaggaacaggcccaaaaggaaaagcgagatca
gagaaaggctgcagccttagtcatggccctcagacaaacaaacctttggtggttcagaaa
ggacagaaaatggagcaggccaatcacccggtagggcttgttatcagtgtggtttacaaa
gacattttaaaaaagattgtccaacgagaaacaagccaccccctcatccatgtccgctat
gccgaggcaatcactggaaggcgcactgccccggaggacaaaggttctctggttgtaccc
aacccctataccctgctctctcaaataccagaggaagcagaatggttccctgttctggac
ctcaaggatgccttcttctgtattcccctgcactatgactcccatgactcccagtttctc
tttgcctttgaggatcccacagaccacacatcccaacttatatggaccgtcttgccccaa
gggtttagggatagccctcatctgtttggtcaggcactggcccaagatctaggccacttc
tcaagtccaggcactctggtccttcagtatgtggatgatttacttttggctaccagttcg
gaagcctcatgccagcaggctactctagatctcttgaattttctagctaatcaagggtac
aaggcatctaggtcaaaggcccagctttgcctacagcaggtcaaatatctaggcctaatc
ttagccagagggaccaggacccttggcaaggaacgaatacagcctatactggcttatcct
caccctaagactttaaaacagttgtgggggttccttcaaatcaccgcttttgccaactat
ggatccccagatagagcaagatag

>gi568815591f:151867247_152119798|GENSCAN_predicted_peptide_7|255_aa
MGQVWALVHSTLETFHTDEEEGEYNEVTEQVCLPAKGPNQQPIWIPSRYLKPYHKPDAKE
EIPEGSQGFPVAAMSRLTLRRTPTVTSNTHRTQPPTWGQIEKLPQMAEENLRKAGQPVTI
SNWILPRITKFKPIEGAENVFTDGSSNGKASYSGSKGPNQQPIWIPSKHLKPYHKPDAGE
KIPGESRGPPVAAMSRLTLRRMPTVMSNTHRTQPPTWGQIKKLSQMAEENLRKAGQPVTM
NNLMIAVITTAFNKG

>gi568815591f:151867247_152119798|GENSCAN_predicted_CDS_7|768_bp
atgggacaagtgtgggctctggttcattccaccttggaaacttttcacactgatgaggag
gaaggagagtataatgaagtaacagagcaggtttgtttgccagctaaaggaccaaatcaa
cagccaatttggataccatcaagatacctgaaaccttatcataagccagatgccaaggaa
gagattccggaaggatcccaaggattcccagttgcagccatgtcaagactgacgctgagg
aggaccccaactgtcacgagcaacacccatcgaacacagccacccacctggggacagatc
gagaagctgccacagatggcggaagaaaacctgaggaaagctggacaaccagtcacaata
agtaattggattctccctagaataactaaatttaaaccaattgaaggtgctgagaatgtt
tttacagatgggtctagcaacggtaaagcttcttattctggctcaaaaggaccaaatcaa
cagccgatttggataccatcaaaacacctgaaaccttatcataagccagatgctggggaa
aagattccaggagaatcccgaggacccccggttgcagccatgtcaagactgacgctgagg
aggatgccaactgtcatgagcaacacccatcgaacacagccacccacctggggacagatc
aagaagctgtcacagatggcggaagaaaacctgaggaaagcgggacaaccagtcacaatg
aataatttaatgatagcggtgatcaccactgccttcaacaagggctga

>gi568815591f:151867247_152119798|GENSCAN_predicted_peptide_8|167_aa
MEYYTAIEKGEFMSFVGTWMKLETIILSKLLQGQKTKHRMFSLIELQNVNINGSLEEVGS
NLDRQVQDFSGDCVEEEEPRKGLRRITKQKNKRVYRQQCEVPQRFIEKLRQVRREVEPSD
LATSKWKAKGGRIQLAPTEEAFRAVARGELPILVVETQVPARLTTMG

>gi568815591f:151867247_152119798|GENSCAN_predicted_CDS_8|504_bp
atggaatactatacagccatagaaaagggtgagttcatgtcctttgtagggacatggatg
aagctggaaaccatcattctcagcaaactactgcaaggacaaaaaaccaaacaccgcatg
ttctcactcatagaactacaaaatgtcaatattaacgggagtttagaagaagttggttcc
aaccttgaccgacaggttcaagacttcagtggagactgtgtggaggaggaagaacccagg
aagggcctgagaaggattacaaagcagaagaataagagggtctatcgacagcaatgtgag
gtgccacaaagatttatagagaagctaagacaggtgagaagagaggtagagccatcagat
ttagcaactagtaagtggaaagcaaaagggggcagaattcagctggcgcccacagaggaa
gcatttagagctgtagccagaggggaattgcccatcctggtggtggaaactcaagttcca
gcaagacttaccaccatgggctaa

>gi568815591f:151867247_152119798|GENSCAN_predicted_peptide_9|148_aa
MDKFLDTYTLPRLNQEEVESLNRKITGSEIEAILNSLPTKKSPGPDGFTVEFYQKYKEEL
VPFLLKLFLSIEKEGILPNSFYEASIILIPKLGRDTTKKENFRPISLMNTDAKILNKILA
YQIQQHIKKLIHHDQVGFIPGMQGWFNI

>gi568815591f:151867247_152119798|GENSCAN_predicted_CDS_9|447_bp
atggataaattcctggacacatacaccctcccaagactaaaccaggaagaagttgaatcc
ctgaataggaaaataacaggctctgaaattgaggcaatacttaatagcctaccaacaaaa
aaaagtccaggaccagatggattcacagttgaattctaccagaagtacaaggaggagctg
gtaccattccttctgaaactattcctatcaatagaaaaagagggaatcctccctaactca
ttttatgaagccagcatcatcctgataccaaagcttggcagagacacaacaaaaaaagag
aattttagaccaatatccctgatgaacaccgatgcaaaaatcctcaataaaatacttgca
taccaaatccagcagcacatcaaaaagcttatccaccatgatcaagtgggcttcatccct
gggatgcaaggctggttcaacatatga

>gi568815591f:151867247_152119798|GENSCAN_predicted_peptide_10|79_aa
MKDANLKRLHIECFPLYDILEKECSSSPAIEQSWMENDFDELREGFRRSNYSELKEEVRM
HRKEAKNLEKRLDKWLTRM

>gi568815591f:151867247_152119798|GENSCAN_predicted_CDS_10|240_bp
atgaaggacgccaacttgaaaaggctacatattgaatgctttccattgtatgacattctg
gaaaaggaatgcagttcctcaccagcaatagaacaaagctggatggagaatgactttgac
gagttgagagaagggttcagacgatcaaactactccgagctaaaggaggaagttcgaatg
catcgcaaagaagctaaaaaccttgaaaaaagattagacaaatggctaactagaatgtag

>gi568815591f:151867247_152119798|GENSCAN_predicted_peptide_11|521_aa
MMLSIGTIEEVMNFATSGHMTPEHEVTQPLKNVPVKGSGPHGPSPKKFYPRFTRGPSRVL
EPQFKANKIDDVIDSRVEDPEEGHLKFSSELGMIFNERDQELRDLGYQKHAFNMLISDRL
GYHRDVPDTRNAACKEKFYPPDLPAASVVICFYNEAFSALLRTVHSVIDRTPAHLLHEII
LVDDDSDFDDLKGELDEYVQKYLPGKIKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSH
CEVNVMWLQPLLAAIREDRHTVVCPVIDIISADTLAYSSSPVVRGGSPTMAGGLFAMNRQ
YFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRPYGSPEGQDTM
THNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFKWYLDNVYPEM
QISGSHAKPQQPIFVNRGPKRPKVLQRGRLYHLQTNKCLVAQGRPSQKGGLVVLKACDYS
DPNQVSDTSGLTASVRKEASSVGKLLPVTPEDISVYYSGFS

>gi568815591f:151867247_152119798|GENSCAN_predicted_CDS_11|1566_bp
atgatgttatctataggaacaatcgaggaagtcatgaattttgcgacgtctggccacatg
actcccgagcatgaagtgactcagccacttaagaatgtgcccgtcaaggggtctgggccc
cacggaccatctccaaaaaaattctatccccgtttcactcgaggcccaagtcgagtgctc
gagccacagttcaaagcaaacaaaattgacgatgtgatagacagtcgtgttgaagatcca
gaagaaggccacttgaaattctcttctgaattaggtatgatttttaatgaacgcgatcaa
gagttgagagacttgggctatcagaaacatgcttttaatatgcttatcagtgaccgcttg
ggctaccacagagatgtgccagacacaaggaatgcagcatgtaaagaaaagttctaccca
cctgacctgccagctgctagtgttgttatctgtttctataatgaagcgttttctgccttg
cttcggacagtgcacagtgtcatagaccgcacgccagcacacctgcttcatgagatcatc
cttgtggatgatgatagtgactttgatgatttgaaaggagaactagatgaatatgtccaa
aaatacctccctggaaaaattaaagtcataagaaatacaaagcgtgaggggttgattcga
gggagaatgattggcgcggcccacgcgacaggagaagtccttgtgttcctggacagccac
tgtgaagtgaatgtgatgtggctgcagcccttgctggccgccatccgtgaggaccggcac
accgtggtgtgcccagtgattgacatcatcagcgccgacacgctggcctacagctcgtcc
cctgtcgtccgcggagggtcaccaacaatggctggaggtttgtttgccatgaacagacag
tatttccatgaacttggacagtatgatagtggcatggatatctggggaggagaaaatttg
gaaatatcatttcggatctggatgtgtggcggtaagctcttcatcatcccttgctctaga
gtaggacacattttccgaaaaaggcgaccatatggatctcccgaaggccaggacaccatg
acacacaactctttgcggctggcacatgtctggttggatgaatacaaggagcagtatttt
tccttaagacctgacctgaagacgaaaagctatggcaatatcagtgagcgtgtggaactg
agaaagaagttgggctgtaaatcatttaaatggtatttggataatgtatacccagagatg
cagatatctgggtcccacgccaaaccccaacaacccatttttgtcaatagagggccaaaa
cgacccaaagtccttcaacgtggaaggctctatcacctccagaccaacaaatgcctggtg
gcccagggccgcccaagtcagaagggaggtctcgtggtgcttaaggcctgtgactacagt
gacccaaatcaggtgagtgacacctcggggctcacagccagtgtccgcaaggaggcttcc
agtgtagggaagctgctgccagtcactcctgaggacatcagtgtctactattctggcttt
tcttaa