GENSCAN 1.0 Date run: 4-Nov-116 Time: 02:31:14

Sequence gi568815578f:4599221_4799979 : 200759 bp : 44.21% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.04 PlyA -    444    439    6                               1.05
 1.03 Term -   2926   2822  105  1  0   62   43    70 0.172  -1.69
 1.02 Intr -   9453   9393   61  2  1   82  123    33 0.438   4.94
 1.01 Init -   9921   9842   80  0  2   87   44    67 0.207   0.84
 1.00 Prom -  20285  20246   40                              -3.06

 2.03 PlyA -  21033  21028    6                               1.05
 2.02 Term -  25324  24888  437  2  2    0   38   291 0.838  10.75
 2.01 Init -  26549  26498   52  2  1  106   94    88 0.997  12.53
 2.00 Prom -  28687  28648   40                              -6.36

 3.02 PlyA -  28850  28845    6                               1.05
 3.01 Sngl -  30999  30628  372  0  0  100   48   266 0.537  19.93
 3.00 Prom -  31994  31955   40                              -5.96

 4.00 Prom +  39675  39714   40                              -2.36
 4.01 Init +  40880  40944   65  1  2   69   76    45 0.609   2.02
 4.02 Intr +  79971  80024   54  0  0   80   42    78 0.002   0.49
 4.03 Intr +  93850  93963  114  1  0  106   17    80 0.230   2.26
 4.04 Term +  99991 100762  772  1  1  136   55   886 0.627  82.97
 4.05 PlyA + 104584 104589    6                               1.05

 5.04 PlyA - 108220 108215    6                               1.05
 5.03 Term - 114005 113911   95  0  2   60   47    96 0.577   0.69
 5.02 Intr - 114605 114506  100  2  1  110   89    42 0.627   6.18
 5.01 Init - 120075 119989   87  0  0   82   42    34 0.103  -1.15
 5.00 Prom - 124808 124769   40                              -3.56

 6.00 Prom + 124865 124904   40                              -4.56
 6.01 Sngl + 125332 125862  531  0  0   85   43   792 0.940  68.47
 6.02 PlyA + 126414 126419    6                               1.05

 7.14 PlyA - 128357 128352    6                               1.05
 7.13 Term - 137071 136809  263  2  2   49   48   171 0.948   4.89
 7.12 Intr - 149801 149601  201  0  0   66   60    59 0.011   0.16
 7.11 Intr - 176830 176754   77  0  2   34   75    80 0.092   0.46
 7.10 Intr - 177662 177532  131  2  2   53   63    87 0.168   2.19
 7.09 Intr - 183310 183238   73  0  1   77   78    65 0.292   3.81
 7.08 Intr - 187108 187011   98  1  2  126  108    86 0.992  13.21
 7.07 Intr - 188534 188413  122  0  2   77   86   216 0.938  20.51
 7.06 Intr - 189048 188997   52  0  1   59  103    83 0.795   5.38
 7.05 Intr - 190477 190406   72  1  0   77   34   140 0.784   7.10
 7.04 Intr - 191391 191231  161  1  2   80  116   215 0.999  23.21
 7.03 Intr - 193407 193319   89  2  2   98   88    92 0.619   9.81
 7.02 Intr - 196746 196595  152  0  2   82   84   200 0.999  17.96
 7.01 Intr - 198865 198790   76  0  1   93   94    90 0.996   9.62

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815578f:4599221_4799979|GENSCAN_predicted_peptide_1|81_aa
MAASQGLSSQGASTILVHSPRPEGAQGACNMVIGNPFLEEQTCMMSSLAVPLRISGCLRT
LNFNQSARSTGEIDCSQHTGY

>gi568815578f:4599221_4799979|GENSCAN_predicted_CDS_1|246_bp
atggctgcctcccagggcctgtcctcccaaggggccagcactatccttgtgcacagccct
cggcccgagggggcccaaggagcctgcaatatggtgatagggaatcctttcttagaagaa
caaacctgcatgatgagctcgctggcagtgcctctgcggatctcaggatgccttagaaca
ctgaacttcaaccagagtgccaggtccacaggagagattgactgctcccaacacacaggc
tactag

>gi568815578f:4599221_4799979|GENSCAN_predicted_peptide_2|162_aa
MAEKEQLQSAAPSEINAERNSININKKYVHTETPSEGHQHQRPKVDKSMKMRKIQRKKAE
NSKNQNASSPPKDHNSSPAREQNWTEKEFDELTEVGFRRWVITNSSELKEHVITQCKEAK
NLEKRLEELLTRVISLEKNTNDLMELKNIAQELLEAYTSINS

>gi568815578f:4599221_4799979|GENSCAN_predicted_CDS_2|489_bp
atggccgaaaaggaacagctccagtctgcagctcccagcgagatcaacgcagaaagaaat
agcatcaatatcaacaaaaagtatgtccacacagaaaccccatccgaaggtcaccaacat
caaagaccaaaggtagataaatccatgaagatgaggaaaatccagcgcaaaaaggctgaa
aattccaaaaaccagaatgcctcttctcctccaaaggatcacaactcctcgccagcaagg
gaacaaaactggacagagaaagagtttgacgaattgacagaagtaggcttcagaaggtgg
gtaataacaaactcctccgagttaaaggagcatgttataacccaatgcaaggaagctaag
aaccttgaaaaaaggctagaggaattgctaactagagtaatcagtttggagaagaacaca
aatgacctgatggagctgaaaaacatagcacaagaacttcttgaagcatacacaagtatc
aatagctga

>gi568815578f:4599221_4799979|GENSCAN_predicted_peptide_3|123_aa
MARGPKKHLKQVAAPKHWMLDKLTGVFAPHPSTSPHKLRECLPLIIFIRNRLKYALTGEE
VKKICMQRFIKIDGKVRTDITYPAGFMDVISIDKMGENFCLICDTKGHFAVHRITPEEAK
YKL

>gi568815578f:4599221_4799979|GENSCAN_predicted_CDS_3|372_bp
atggctcgtggtcccaagaagcatctaaagcaggtagcagctccaaagcattggatgctg
gataaattgactggtgtgtttgctcctcatccatccaccagtccccacaagttgagagag
tgtctccccctcatcattttcataaggaacagacttaagtatgccctgacaggagaggaa
gtaaagaagatttgcatgcagcggttcattaagatcgatggcaaggtccgcactgatata
acctaccctgctggattcatggatgtcatcagcattgacaagatgggagagaatttctgt
ctgatctgtgacaccaagggtcactttgctgtacatcgtattacacctgaggaggccaag
tacaagttgtga

>gi568815578f:4599221_4799979|GENSCAN_predicted_peptide_4|334_aa
MRLESGVNGSVMIRKQISSRARTFYEIEGGSTLQVIQLQEHDPPPVCPISVDGITAISLV
MEVPDPGNSHSSIHSTSRAVIMANLGCWMLVLFVATWSDLGLCKKRPKPGGWNTGGSRYP
GQGSPGGNRYPPQGGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQPHGGGWGQGGGTHSQW
NKPSKPKTNMKHMAGAAAAGAVVGGLGGYMLGSAMSRPIIHFGSDYEDRYYRENMHRYPN
QVYYRPMDEYSNQNNFVHDCVNITIKQHTVTTTTKGENFTETDVKMMERVVEQMCITQYE
RESQAYYQRGSSMVLFSSPPVILLISFLIFLIVG

>gi568815578f:4599221_4799979|GENSCAN_predicted_CDS_4|1005_bp
atgaggctagaaagtggggtaaatggaagtgtcatgatcagaaaacagatcagcagccgg
gcacgaacattctacgagatagagggtggtagcacacttcaggtcatccagctgcaagag
catgaccctccaccagtttgtcctatttcagtagatggcatcaccgccatctccctggtt
atggaagtcccagacccgggcaactctcactcctccatccactcaaccagcagagcagtc
attatggcgaaccttggctgctggatgctggttctctttgtggccacatggagtgacctg
ggcctctgcaagaagcgcccgaagcctggaggatggaacactgggggcagccgatacccg
gggcagggcagccctggaggcaaccgctacccacctcagggcggtggtggctgggggcag
cctcatggtggtggctgggggcagcctcatggtggtggctgggggcagccccatggtggt
ggctggggacagcctcatggtggtggctggggtcaaggaggtggcacccacagtcagtgg
aacaagccgagtaagccaaaaaccaacatgaagcacatggctggtgctgcagcagctggg
gcagtggtggggggccttggcggctacatgctgggaagtgccatgagcaggcccatcata
catttcggcagtgactatgaggaccgttactatcgtgaaaacatgcaccgttaccccaac
caagtgtactacaggcccatggatgagtacagcaaccagaacaactttgtgcacgactgc
gtcaatatcacaatcaagcagcacacggtcaccacaaccaccaagggggagaacttcacc
gagaccgacgttaagatgatggagcgcgtggttgagcagatgtgtatcacccagtacgag
agggaatctcaggcctattaccagagaggatcgagcatggtcctcttctcctctccacct
gtgatcctcctgatctctttcctcatcttcctgatagtgggatga

>gi568815578f:4599221_4799979|GENSCAN_predicted_peptide_5|93_aa
MAIYEPQSGFSPDTQSAGTLILDLPASRTLPHGNITDFPTGHEPMPLFHLLCSPDSNCKC
YAGTRMKLEGIILTKLTREQKTKHRMFSLTSGS
>gi568815578f:4599221_4799979|GENSCAN_predicted_CDS_5|282_bp
atggccatctatgaaccacaaagtgggttctcaccagacacccaatctgctggcaccttg
atcttggacttgccagcctccagaacactgccacatggaaacattactgattttccaact
ggacatgaaccaatgcctctcttccatcttctctgctctcccgactccaactgcaagtgc
tacgcagggacacgaatgaagctggaaggcatcatcctcaccaaactaacacgggaacag
aaaaccaaacaccgcatgttctcactcacaagtgggagttga

>gi568815578f:4599221_4799979|GENSCAN_predicted_peptide_6|176_aa
MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFI
KQGRKLDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEF
QKPDNKLHQQVLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLTVK

>gi568815578f:4599221_4799979|GENSCAN_predicted_CDS_6|531_bp
atgaggaagcacctgagctggtggtggctggccactgtctgcatgctgctcttcagccac
ctctctgcggtccagacgaggggcatcaagcacagaatcaagtggaaccggaaggccctg
cccagcactgcccagatcactgaggcccaggtggctgagaaccgcccgggagccttcatc
aagcaaggccgcaagctcgacattgacttcggagccgagggcaacaggtactacgaggcc
aactactggcagttccccgatggcatccactacaacggctgctctgaggctaatgtgacc
aaggaggcatttgtcaccggctgcatcaatgccacccaggcggcgaaccagggggagttc
cagaagccagacaacaagctccaccagcaggtgctctggcggctggtccaggagctctgc
tccctcaagcattgcgagttttggttggagaggggcgcaggacttcgggtcaccatgcac
cagccagtgctcctctgccttctggctttgatctggctcacggtgaaataa

>gi568815578f:4599221_4799979|GENSCAN_predicted_peptide_7|522_aa
XNELLLHLKTYNLYYEGQNLQLRHREEEDEFIVEGLLNISWGLRRPIRLQMQDDNERIRP
PPSSSSWHSGCNLGAQGTTLKPLTVPKVQISEVDAPPEGDQMPSSTDSRGLKPLQEDTPQ
LMRTRSDVGVRRRGNVRTPSDQRRIRRHRFSINGHFYNHKTSVFTPAYGSVTNVRINSTM
TTPQIENSAEEFALYVVHTSGEKQKLKATDYPLIARILQGPCEQISKVFLMEKDQVEEVT
YDVAQYIKFEMPVLKSFIQKLQEEEDREVKKLMRKDSSTSPVGHGVEAQKGFCKLHRTGS
LSPSKLPLMPPTLTCRAIYGVNSTESGVSYNILPWDNPGVLRSSPVMRFFRYTRQEHRDT
ESPLSLRQGREDKTRSQSFLQLARTMPWTALRHAENNGALESLGPASSGQSSSSPDVFAE
VTLLSWKGVGSCALGEKCILIDENDSNIGTETKKNCHENENIGNGLLHQALSVFLLNTKM
SYSDSRDQMLKLPFEPVSPILGCSHPLSNPDKLERNDVIDIS

>gi568815578f:4599221_4799979|GENSCAN_predicted_CDS_7|1569_bp
nnaaatgaacttctcttgcatctgaagacctacaacttgtactatgaaggccagaattta
cagctccggcaccgggaggaagaagacgagttcattgtggaggggctcctgaacatctcc
tggggcctgcgccggcccattcgcctgcagatgcaggatgacaacgaacgcattcgaccc
cctccatcctcctcctcctggcactctggctgtaacctgggggctcagggaaccactctg
aagcccctgactgtgcccaaagttcagatctcagaggtggatgccccgccggagggtgac
cagatgccaagctccacagactccaggggcctgaagcccctgcaggaggacaccccacag
ctgatgcgcacacgcagtgatgttggggtgcgtcgccgtggcaatgtgaggacgcctagt
gaccagcggcgaatcagacgccaccgcttctccatcaacggccatttctacaaccataag
acatccgtgttcacaccagcctatggctctgtcaccaacgtccgcatcaacagcaccatg
accaccccacagattgagaattcagcagaggagtttgccttgtacgtggtccatacgagt
ggtgagaaacagaagctgaaggccaccgattacccgctgattgcccgaatcctccagggc
ccatgtgagcagatctccaaagtgttcctaatggagaaggaccaggtggaggaagtcacc
tacgacgtggcccagtatataaagttcgagatgccggtacttaaaagcttcattcagaag
ctccaggaggaagaagatcgggaagtaaagaagctgatgcgcaaggacagcagcaccagc
cctgtggggcatggagtggaagcccagaagggcttctgcaagctgcacagaactgggtca
ctaagcccctctaagctgcccttgatgccacctaccctcacctgcagagccatctatggg
gtcaacagcaccgagagtggggtatcttacaacatccttccctgggacaacccaggggtt
ctgagaagctcacccgtgatgcgatttttccggtacacaaggcaagaacaccgggataca
gaaagccctctgtccttgcgacaaggaagagaggacaaaacccggagccaaagcttcctt
cagctggcccgcaccatgccctggacagctctgagacatgcagagaacaatggagccctg
gagagcctggggccggcttcctctggacaatcttcatcctcacctgatgtttttgctgaa
gtcactttgctctcatggaaaggtgttggcagttgtgccttgggggagaagtgtattctt
attgatgaaaatgacagtaatattggaactgagaccaagaagaattgtcacgagaatgaa
aacattgggaatggattattgcatcaagctcttagtgtcttcttactcaacaccaaaatg
agctacagtgacagcagagatcagatgctgaaattacctttcgagccagtttcaccaata
cttggttgtagtcatccattaagtaatccagacaagcttgagagaaatgatgtcattgac
ataagttga