GENSCAN 1.0    Date run: 8-Nov-116    Time: 13:57:28

Sequence gi568815578f:4624552_4825079 : 200528 bp : 45.10% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init -   1218   1167   52  0  1  106   94    88 0.644  12.53
 1.00 Prom -   3356   3317   40                              -6.36

 2.02 PlyA -   3519   3514    6                               1.05
 2.01 Sngl -   5668   5297  372  1  0  100   48   266 0.535  19.93
 2.00 Prom -   6663   6624   40                              -5.96

 3.00 Prom +  14344  14383   40                              -2.36
 3.01 Init +  15549  15613   65  2  2   69   76    45 0.609   2.02
 3.02 Intr +  54640  54693   54  1  0   80   42    78 0.002   0.49
 3.03 Intr +  68519  68632  114  2  0  106   17    80 0.230   2.26
 3.04 Term +  74660  75431  772  2  1  136   55   886 0.627  82.97
 3.05 PlyA +  79253  79258    6                               1.05

 4.04 PlyA -  82889  82884    6                               1.05
 4.03 Term -  88674  88580   95  1  2   60   47    96 0.577   0.69
 4.02 Intr -  89274  89175  100  0  1  110   89    42 0.627   6.18
 4.01 Init -  94744  94658   87  1  0   82   42    34 0.103  -1.15
 4.00 Prom -  99477  99438   40                              -3.56

 5.00 Prom +  99534  99573   40                              -4.56
 5.01 Sngl + 100001 100531  531  1  0   85   43   792 0.940  68.47
 5.02 PlyA + 101083 101088    6                               1.05

 6.18 PlyA - 103026 103021    6                               1.05
 6.17 Term - 111740 111478  263  0  2   49   48   171 0.948   4.89
 6.16 Intr - 124470 124270  201  1  0   66   60    59 0.011   0.16
 6.15 Intr - 151499 151423   77  1  2   34   75    80 0.092   0.46
 6.14 Intr - 152331 152201  131  0  2   53   63    87 0.168   2.19
 6.13 Intr - 157979 157907   73  1  1   77   78    65 0.292   3.81
 6.12 Intr - 161777 161680   98  2  2  126  108    86 0.992  13.21
 6.11 Intr - 163203 163082  122  1  2   77   86   216 0.938  20.51
 6.10 Intr - 163717 163666   52  1  1   59  103    83 0.795   5.38
 6.09 Intr - 165146 165075   72  2  0   77   34   140 0.784   7.10
 6.08 Intr - 166060 165900  161  2  2   80  116   215 0.999  23.21
 6.07 Intr - 168076 167988   89  0  2   98   88    92 0.619   9.81
 6.06 Intr - 171415 171264  152  1  2   82   84   200 0.999  17.96
 6.05 Intr - 173534 173459   76  1  1   93   94    90 0.934   9.62
 6.04 Intr - 176825 176653  173  2  2   67   52     2 0.190  -6.76
 6.03 Intr - 177796 177675  122  2  2   43   40   109 0.305   1.81
 6.02 Intr - 181428 181236  193  0  1   50   49   176 0.526   8.97
 6.01 Init - 182196 182146   51  0  0   71   78    44 0.403   0.86
 6.00 Prom - 182424 182385   40                              -5.96

 7.00 Prom + 184862 184901   40                              -1.26
 7.01 Init + 188981 189203  223  1  1   76   82   106 0.403   7.56
 7.02 Term + 195334 195449  116  2  2   52   40    79 0.044  -1.77
 7.03 PlyA + 195603 195608    6                               1.05

 8.03 PlyA - 197559 197554    6                               1.05
 8.02 Term - 197852 197777   76  1  1   96   39    63 0.052  -0.49
 8.01 Init - 198785 198697   89  2  2   94   56   105 0.952   6.02

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Intr - 197852 197778   75  1  0   96   92    68 0.912   6.63

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:4624552_4825079|GENSCAN_predicted_peptide_1|18_aa
MAEKEQLQSAAPSEINAX
>gi568815578f:4624552_4825079|GENSCAN_predicted_CDS_1|54_bp
atggccgaaaaggaacagctccagtctgcagctcccagcgagatcaacgcagnn
>gi568815578f:4624552_4825079|GENSCAN_predicted_peptide_2|123_aa
MARGPKKHLKQVAAPKHWMLDKLTGVFAPHPSTSPHKLRECLPLIIFIRNRLKYALTGEE
VKKICMQRFIKIDGKVRTDITYPAGFMDVISIDKMGENFCLICDTKGHFAVHRITPEEAK
YKL
>gi568815578f:4624552_4825079|GENSCAN_predicted_CDS_2|372_bp
atggctcgtggtcccaagaagcatctaaagcaggtagcagctccaaagcattggatgctg
gataaattgactggtgtgtttgctcctcatccatccaccagtccccacaagttgagagag
tgtctccccctcatcattttcataaggaacagacttaagtatgccctgacaggagaggaa
gtaaagaagatttgcatgcagcggttcattaagatcgatggcaaggtccgcactgatata
acctaccctgctggattcatggatgtcatcagcattgacaagatgggagagaatttctgt
ctgatctgtgacaccaagggtcactttgctgtacatcgtattacacctgaggaggccaag
tacaagttgtga
>gi568815578f:4624552_4825079|GENSCAN_predicted_peptide_3|334_aa
MRLESGVNGSVMIRKQISSRARTFYEIEGGSTLQVIQLQEHDPPPVCPISVDGITAISLV
MEVPDPGNSHSSIHSTSRAVIMANLGCWMLVLFVATWSDLGLCKKRPKPGGWNTGGSRYP
GQGSPGGNRYPPQGGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQGGGTHSQW
NKPSKPKTNMKHMAGAAAAGAVVGGLGGYMLGSAMSRPIIHFGSDYEDRYYRENMHRYPN
QVYYRPMDEYSNQNNFVHDCVNITIKQHTVTTTTKGENFTETDVKMMERVVEQMCITQYE
RESQAYYQRGSSMVLFSSPPVILLISFLIFLIVG
>gi568815578f:4624552_4825079|GENSCAN_predicted_CDS_3|1005_bp
atgaggctagaaagtggggtaaatggaagtgtcatgatcagaaaacagatcagcagccgg
gcacgaacattctacgagatagagggtggtagcacacttcaggtcatccagctgcaagag
catgaccctccaccagtttgtcctatttcagtagatggcatcaccgccatctccctggtt
atggaagtcccagacccgggcaactctcactcctccatccactcaaccagcagagcagtc
attatggcgaaccttggctgctggatgctggttctctttgtggccacatggagtgacctg
ggcctctgcaagaagcgcccgaagcctggaggatggaacactgggggcagccgatacccg
gggcagggcagccctggaggcaaccgctacccacctcagggcggtggtggctgggggcag
cctcatggtggtggctgggggcagcctcatggtggtggctgggggcagccccatggtggt
ggctggggacagcctcatggtggtggctggggtcaaggaggtggcacccacagtcagtgg
aacaagccgagtaagccaaaaaccaacatgaagcacatggctggtgctgcagcagctggg
gcagtggtggggggccttggcggctacatgctgggaagtgccatgagcaggcccatcata
catttcggcagtgactatgaggaccgttactatcgtgaaaacatgcaccgttaccccaac
caagtgtactacaggcccatggatgagtacagcaaccagaacaactttgtgcacgactgc
gtcaatatcacaatcaagcagcacacggtcaccacaaccaccaagggggagaacttcacc
gagaccgacgttaagatgatggagcgcgtggttgagcagatgtgtatcacccagtacgag
agggaatctcaggcctattaccagagaggatcgagcatggtcctcttctcctctccacct
gtgatcctcctgatctctttcctcatcttcctgatagtgggatga
>gi568815578f:4624552_4825079|GENSCAN_predicted_peptide_4|93_aa
MAIYEPQSGFSPDTQSAGTLILDLPASRTLPHGNITDFPTGHEPMPLFHLLCSPDSNCKC
YAGTRMKLEGIILTKLTREQKTKHRMFSLTSGS
>gi568815578f:4624552_4825079|GENSCAN_predicted_CDS_4|282_bp
atggccatctatgaaccacaaagtgggttctcaccagacacccaatctgctggcaccttg
atcttggacttgccagcctccagaacactgccacatggaaacattactgattttccaact
ggacatgaaccaatgcctctcttccatcttctctgctctcccgactccaactgcaagtgc
tacgcagggacacgaatgaagctggaaggcatcatcctcaccaaactaacacgggaacag
aaaaccaaacaccgcatgttctcactcacaagtgggagttga
>gi568815578f:4624552_4825079|GENSCAN_predicted_peptide_5|176_aa
MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFI
KQGRKLDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEF
QKPDNKLHQQVLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLTVK
>gi568815578f:4624552_4825079|GENSCAN_predicted_CDS_5|531_bp
atgaggaagcacctgagctggtggtggctggccactgtctgcatgctgctcttcagccac
ctctctgcggtccagacgaggggcatcaagcacagaatcaagtggaaccggaaggccctg
cccagcactgcccagatcactgaggcccaggtggctgagaaccgcccgggagccttcatc
aagcaaggccgcaagctcgacattgacttcggagccgagggcaacaggtactacgaggcc
aactactggcagttccccgatggcatccactacaacggctgctctgaggctaatgtgacc
aaggaggcatttgtcaccggctgcatcaatgccacccaggcggcgaaccagggggagttc
cagaagccagacaacaagctccaccagcaggtgctctggcggctggtccaggagctctgc
tccctcaagcattgcgagttttggttggagaggggcgcaggacttcgggtcaccatgcac
cagccagtgctcctctgccttctggctttgatctggctcacggtgaaataa
>gi568815578f:4624552_4825079|GENSCAN_predicted_peptide_6|701_aa
MDRLQWLTVIPALWEAEACCTSGYLHWKFPWVCLKLSPGPHFSNCEGGTSAYVAVLASSL
CVTRTLPVTPRFIRQHTLSPKVLVIITGVVLLASSEQNPEMLLNTCNAQDSPTTKNSPRP
GARGGLISHPWNQCGCQCALWLGHLCLDCPRVVVFPLDVCSQRLWQAEAAASSVAPHSLG
NELLLHLKTYNLYYEGQNLQLRHREEEDEFIVEGLLNISWGLRRPIRLQMQDDNERIRPP
PSSSSWHSGCNLGAQGTTLKPLTVPKVQISEVDAPPEGDQMPSSTDSRGLKPLQEDTPQL
MRTRSDVGVRRRGNVRTPSDQRRIRRHRFSINGHFYNHKTSVFTPAYGSVTNVRINSTMT
TPQIENSAEEFALYVVHTSGEKQKLKATDYPLIARILQGPCEQISKVFLMEKDQVEEVTY
DVAQYIKFEMPVLKSFIQKLQEEEDREVKKLMRKDSSTSPVGHGVEAQKGFCKLHRTGSL
SPSKLPLMPPTLTCRAIYGVNSTESGVSYNILPWDNPGVLRSSPVMRFFRYTRQEHRDTE
SPLSLRQGREDKTRSQSFLQLARTMPWTALRHAENNGALESLGPASSGQSSSSPDVFAEV
TLLSWKGVGSCALGEKCILIDENDSNIGTETKKNCHENENIGNGLLHQALSVFLLNTKMS
YSDSRDQMLKLPFEPVSPILGCSHPLSNPDKLERNDVIDIS
>gi568815578f:4624552_4825079|GENSCAN_predicted_CDS_6|2106_bp
atggaccggctgcagtggctcactgtaatcccagcactttgggaggctgaggcctgctgt
acttctggttatctgcactggaagtttccatgggtctgcctgaaactcagtcctggtcct
cacttcagcaactgtgaaggtggcacctctgcctacgttgcggtcctcgccagcagcctg
tgcgtcaccaggactctgcctgtcaccccacgtttcatccggcagcacaccctttcccca
aaggttttggtcatcatcactggggtagtgctgctggcatctagtgagcagaatccagag
atgctgctaaacacctgcaatgcacaagacagtcccacgacaaagaacagtccaaggcca
ggggcgagaggaggactcatcagtcatccttggaaccaatgtggctgtcagtgtgctctg
tggctggggcatttgtgcttggattgtccaagggttgtcgtctttcctttggatgtttgt
tctcagaggctgtggcaagccgaggcagcagccagcagtgttgcccctcacagccttgga
aatgaacttctcttgcatctgaagacctacaacttgtactatgaaggccagaatttacag
ctccggcaccgggaggaagaagacgagttcattgtggaggggctcctgaacatctcctgg
ggcctgcgccggcccattcgcctgcagatgcaggatgacaacgaacgcattcgaccccct
ccatcctcctcctcctggcactctggctgtaacctgggggctcagggaaccactctgaag
cccctgactgtgcccaaagttcagatctcagaggtggatgccccgccggagggtgaccag
atgccaagctccacagactccaggggcctgaagcccctgcaggaggacaccccacagctg
atgcgcacacgcagtgatgttggggtgcgtcgccgtggcaatgtgaggacgcctagtgac
cagcggcgaatcagacgccaccgcttctccatcaacggccatttctacaaccataagaca
tccgtgttcacaccagcctatggctctgtcaccaacgtccgcatcaacagcaccatgacc
accccacagattgagaattcagcagaggagtttgccttgtacgtggtccatacgagtggt
gagaaacagaagctgaaggccaccgattacccgctgattgcccgaatcctccagggccca
tgtgagcagatctccaaagtgttcctaatggagaaggaccaggtggaggaagtcacctac
gacgtggcccagtatataaagttcgagatgccggtacttaaaagcttcattcagaagctc
caggaggaagaagatcgggaagtaaagaagctgatgcgcaaggacagcagcaccagccct
gtggggcatggagtggaagcccagaagggcttctgcaagctgcacagaactgggtcacta
agcccctctaagctgcccttgatgccacctaccctcacctgcagagccatctatggggtc
aacagcaccgagagtggggtatcttacaacatccttccctgggacaacccaggggttctg
agaagctcacccgtgatgcgatttttccggtacacaaggcaagaacaccgggatacagaa
agccctctgtccttgcgacaaggaagagaggacaaaacccggagccaaagcttccttcag
ctggcccgcaccatgccctggacagctctgagacatgcagagaacaatggagccctggag
agcctggggccggcttcctctggacaatcttcatcctcacctgatgtttttgctgaagtc
actttgctctcatggaaaggtgttggcagttgtgccttgggggagaagtgtattcttatt
gatgaaaatgacagtaatattggaactgagaccaagaagaattgtcacgagaatgaaaac
attgggaatggattattgcatcaagctcttagtgtcttcttactcaacaccaaaatgagc
tacagtgacagcagagatcagatgctgaaattacctttcgagccagtttcaccaatactt
ggttgtagtcatccattaagtaatccagacaagcttgagagaaatgatgtcattgacata
agttga
>gi568815578f:4624552_4825079|GENSCAN_predicted_peptide_7|112_aa
MASQTGGAATCLQGSPTGIHKCLTFRMDVAEGNSERLATNQRGRALERDEKGSIKEAQTP
EKSSCVCVGVSIAPGMDERVPLWFCHSCLEQQQFPSPKLSPQTAASTCDDRL
>gi568815578f:4624552_4825079|GENSCAN_predicted_CDS_7|339_bp
atggcctcgcagaccggaggagcagccacatgtttacagggaagccccactggcatccac
aagtgcctgacctttaggatggatgttgcagaaggaaattcagagcgcttagccactaat
cagagaggcagggcgctggagcgggacgaaaaaggaagcataaaggaagcacagactcca
gaaaaaagcagctgtgtgtgcgtgggtgtgagcatagctcccggaatggatgagagggtc
cctctctggttctgccacagctgccttgagcagcagcagttccccagccccaagctgtca
ccgcaaacagcagcatcaacctgtgatgacagactttag
>gi568815578f:4624552_4825079|GENSCAN_predicted_peptide_8|54_aa
MQLARGEGKARGAGRGALGARHWRGMPGARYLESRRLERVFRESKGCVYGSLGI
>gi568815578f:4624552_4825079|GENSCAN_predicted_CDS_8|165_bp
atgcagctggcccggggtgaaggcaaggcgcggggcgcggggcgcggggcactgggagcg
aggcactggcgcgggatgcccggcgcaaggtatctggaatcccggcgcctagaacgtgtt
tttcgggagagcaaaggctgtgtctacggcagcctggggatatag