GENSCAN 1.0	Date run: 4-Nov-116	Time: 10:56:10

Sequence gi568815586f:130772836_130975923 : 203088 bp : 45.73% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +  10714  10874  161  1  2  118   78    52 0.501   6.83
 1.02 Term +  15213  15331  119  1  2  110   49    10 0.253  -2.00
 1.03 PlyA +  16199  16204    6                               1.05

 2.12 PlyA -  16787  16782    6                               1.05
 2.11 Term -  20128  20066   63  1  0  112   55     2 0.198  -2.81
 2.10 Intr -  25800  25690  111  0  0   67   96    62 0.945   5.48
 2.09 Intr -  28455  28318  138  0  0   58   68   207 0.998  16.36
 2.08 Intr -  28653  28580   74  1  2   40   86    95 0.988   3.63
 2.07 Intr -  33855  33676  180  1  0   84   81    14 0.433   0.14
 2.06 Intr -  34255  34147  109  1  1   35   78   211 0.993  14.66
 2.05 Intr -  35869  35796   74  2  2   58   41   125 0.059   3.93
 2.04 Intr -  40196  40122   75  0  0   50  110    92 0.244   7.09
 2.03 Intr -  48953  48854  100  2  1  133  100    53 0.924  10.68
 2.02 Intr -  54432  54358   75  0  0   70  110    81 0.777   8.21
 2.01 Init -  57485  57390   96  2  0   68   61    56 0.292  -0.64
 2.00 Prom -  58007  57968   40                              -6.06

 3.00 Prom +  61910  61949   40                              -5.26
 3.01 Init +  65461  65582  122  0  2   87   68    35 0.929   1.06
 3.02 Intr +  66112  66243  132  1  0   47   41   203 0.792  11.26
 3.03 Intr +  79851  80032  182  0  2   51   71   138 0.725   7.91
 3.04 Intr +  80071  80271  201  2  0   47  110    61 0.693   3.46
 3.05 Term +  82810  82919  110  2  2   27   35    81 0.266  -4.63
 3.06 PlyA +  83399  83404    6                               1.05

 4.00 Prom +  95948  95987   40                              -5.26
 4.01 Init +  99699  99794   96  2  0  114   56   163 0.843  14.01
 4.02 Intr + 100001 100085   85  1  1   43   77   148 0.986   8.59
 4.03 Intr + 100168 100293  126  2  0   96  102    64 0.970   9.25
 4.04 Intr + 101711 101898  188  0  2   41   89   201 0.999  14.91
 4.05 Intr + 102777 102947  171  2  0   62   57   210 0.999  15.44
 4.06 Term + 103047 103091   45  2  0   98   54    79 0.998   2.71
 4.07 PlyA + 103421 103426    6                               1.05

 5.03 PlyA - 103588 103583    6                               1.05
 5.02 Term - 106793 106653  141  2  0  110   42    47 0.485   0.23
 5.01 Init - 109979 109788  192  2  0   74   78   116 0.242   8.17
 5.00 Prom - 111642 111603   40                              -6.76

 6.03 PlyA - 112979 112974    6                               1.05
 6.02 Term - 115079 114549  531  2  0   -8   36   840 0.938  63.45
 6.01 Init - 123113 123033   81  2  0   61   49    53 0.160  -2.29
 6.00 Prom - 124664 124625   40                              -5.16

 7.00 Prom + 124938 124977   40                              -4.86
 7.01 Init + 127574 127725  152  1  2   66   37   173 0.543   9.51
 7.02 Term + 130002 130071   70  0  1   87   48    67 0.516   0.01
 7.03 PlyA + 131294 131299    6                               1.05

 8.03 PlyA - 131533 131528    6                               1.05
 8.02 Term - 132321 131804  518  1  2   45   48   262 0.229  12.38
 8.01 Init - 135563 135485   79  2  1   77   34    27 0.103  -2.58
 8.00 Prom - 140700 140661   40                              -3.76

 9.02 PlyA - 141230 141225    6                               1.05
 9.01 Sngl - 147012 146398  615  0  0   76   33   538 0.602  43.30
 9.00 Prom - 153756 153717   40                              -5.46

10.00 Prom + 162943 162982   40                              -4.86
10.01 Init + 176131 176205   75  0  0   58   82    85 0.282   5.26
10.02 Intr + 181583 181696  114  1  0   18   95    78 0.075   2.14
10.03 Intr + 181789 181825   37  0  1   99  110     4 0.892   1.54
10.04 Intr + 193628 193711   84  0  0   92   65    69 0.250   4.79
10.05 Intr + 196142 196237   96  0  0   76   71    32 0.231   0.28
10.06 Intr + 198623 198745  123  0  0  108  113   145 0.837  19.56
10.07 Intr + 202264 202396  133  2  1   41  109    58 0.155   2.90

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Init + 181631 181696   66  1  0   81   95    61 0.853   7.39

Predicted peptide sequence(s):

Predicted coding sequence(s):


>gi568815586f:130772836_130975923|GENSCAN_predicted_peptide_1|93_aa
XASGIRAQMRGGCGPQAWHLGTKQEEAEGGGKGDTDREEQAHAYSLIDHGGTGESRELRS
QEAQVTPPKEAGPPLHSLEPLPAKKPEALFSQA

>gi568815586f:130772836_130975923|GENSCAN_predicted_CDS_1|282_bp
nnagcctcaggaatccgagcgcagatgaggggtggatgcggccctcaggcctggcacctg
ggcacaaagcaagaggaggctgagggtggcgggaaaggggatacagatagagaagaacag
gctcacgcctactccctcatcgaccatgggggtacaggagaaagcagggaactgaggtcc
caggaggcccaggtcaccccacccaaggaggcagggcctcccctgcactctctggaacct
ctccctgccaagaagccagaggccctgttttctcaggcttag

>gi568815586f:130772836_130975923|GENSCAN_predicted_peptide_2|364_aa
MGRSPWESSLLSQGSVALGSIEMGAASFLELQCRKNDDGDTVVVVEKDHFMDDFFHQVEE
IRNSIDKITQYVEEVKKNHSIILSAPNPEGKIKEELEDLNKEIKKTANKIRAKLKAIEQS
FDQDESGNRTSVDLRIRRTQHSVLSRKFVEAMAEYNEAQTLFRERSKGRIQRQLEIIPPC
PGSGASSWLLTSLLVCEQRQRPYCPRLFSGVSMGTVGCGSGDGAFFGDRRVCFRDKAGRT
TTDDELEEMLESGKPSIFTSDIISDSQITRQALNEIESRHKDIMKLETSIRELHEMFMDM
AMFVETQGEMINNIERNVMNATDYVEHAKEETKKAIKYQSKARRTSFLEETPFIKPKVVS
LCLV

>gi568815586f:130772836_130975923|GENSCAN_predicted_CDS_2|1095_bp
atgggacggtcaccatgggaaagcagcttgctttcacagggaagcgtggcgctggggagc
attgagatgggtgcagcctctttcctggagctgcagtgtaggaagaatgatgatggagac
acagttgttgtggttgagaaagatcatttcatggatgatttcttccatcaggtggaggag
attagaaacagtattgataaaataactcaatatgttgaagaagtaaagaaaaaccacagc
atcattctttctgcaccaaacccggaaggaaaaataaaagaagagcttgaagatctgaac
aaagaaatcaagaaaactgcgaataaaattcgagccaagttaaaggctattgaacaaagt
tttgatcaggatgagagtgggaaccggacttcagtggatcttcggatacgaagaacccag
cattcggtgctgtctcggaagtttgtggaagccatggcggagtacaatgaggcacagact
ctgtttcgggagcggagcaaaggccgcatccagcgccagctggagataatcccgccctgc
cctggttcgggggcatcttcatggctcctcacatctctccttgtctgtgagcagaggcag
cggccctattgccccagactcttttcaggggtgtctatggggacagttggctgtggcagt
ggagatggcgccttcttcggggacagacgggtctgtttcagggataaagctgggagaacc
accacagacgacgagctagaagagatgctggagagcgggaagccatccatcttcacttcc
gacattatatcagattcacaaattactagacaagctctcaatgaaatcgagtcacgtcac
aaggacatcatgaagctggagaccagcatccgagagttgcatgagatgttcatggacatg
gctatgtttgtggagactcagggtgaaatgatcaacaacatagaaagaaatgttatgaat
gccacagactatgtagaacacgctaaagaagaaacaaaaaaagctatcaaatatcagagc
aaggcaagaaggacaagcttcctggaggagactccatttattaagccaaaggtggtctcg
ctttgccttgtgtga

>gi568815586f:130772836_130975923|GENSCAN_predicted_peptide_3|248_aa
MHSPVTFTRVECIEILPKFLVTCFSLAKVNSNLLCARLLGATPEIPALGTLPEPRPRRPE
PPPALQTPPRPGLNRYPRLPLTAVRPIRLKIATPFELLYAQHFRQETRRLFYPSNPVSSS
PDPSRVSNGSIPFCPELPGIGPDPTDANHRSGPPVRLTDWPLIEGSYSPSLLKFDNLLEC
VTEHREVLYLLLLLYYKGFNSGTATKKRCTGPGAVGEVTNFVTPHPQLCDQGSQQLLGKD
VKQSEVIL

>gi568815586f:130772836_130975923|GENSCAN_predicted_CDS_3|747_bp
atgcacagtcctgtcacctttacacgtgtagaatgcatcgaaatccttccaaaattttta
gtgacctgcttttcccttgctaaagtaaacagtaacttgctgtgtgcccggctgctgggc
gcaacgccggagatccccgccctggggaccctcccggagccccgcccaagacgccccgag
cccccgcccgccctccagacgccgccccggccgggcctgaaccgctacccgcggctgccg
ctcaccgccgtcagacccatccggttaaaaattgccaccccctttgaactgctctatgca
cagcacttcagacaagagactcgcaggcttttctaccccagcaacccagtctccagctct
ccagaccccagcagagtgtccaacggttccattccattctgccctgaactacctggaatt
ggtccagaccccacagatgccaatcacagatcggggcctccagtgcgtctgactgactgg
ccattaatagaaggctcctacagcccctccctcctcaagtttgataatttgctagaatgc
gtcacagaacacagggaagtgctttacttactattactactttattacaaaggcttcaac
tcaggaacagccacaaagaagaggtgcacagggccaggagcagttggggaagttacaaat
tttgtgacaccccacccccagttatgtgaccagggcagtcagcaacttctaggaaaagac
gtcaagcagtcggaggtcatcctttaa

>gi568815586f:130772836_130975923|GENSCAN_predicted_peptide_4|236_aa
MGCGAARALASVLCLRRNAAMAAQGEPQVQFKLVLVGDGGTGKTTFVKRHLTGEFEKKYV
ATLGVEVHPLVFHTNRGPIKFNVWDTAGQEKFGGLRDGYYIQAQCAIIMFDVTSRVTYKN
VPNWHRDLVRVCENIPIVLCGNKVDIKDRKVKAKSIVFHRKKNLQYYDISAKSNYNFEKP
FLWLARKLIGDPNLEFVAMPALAPPEVVMDPALAAQYEHDLEVAQTTALPDEDDDL

>gi568815586f:130772836_130975923|GENSCAN_predicted_CDS_4|711_bp
atgggctgcggggccgcgcgagcgctcgcctccgtcctctgcctccgcaggaacgccgcg
atggctgcgcagggagagccccaggtccagttcaaacttgtattggttggtgatggtggt
actggaaaaacgaccttcgtgaaacgtcatttgactggtgaatttgagaagaagtatgta
gccaccttgggtgttgaggttcatcccctagtgttccacaccaacagaggacctattaag
ttcaatgtatgggacacagccggccaggagaaattcggtggactgagagatggctattat
atccaagcccagtgtgccatcataatgtttgatgtaacatcgagagttacttacaagaat
gtgcctaactggcatagagatctggtacgagtgtgtgaaaacatccccattgtgttgtgt
ggcaacaaagtggatattaaggacaggaaagtgaaggcgaaatccattgtcttccaccga
aagaagaatcttcagtactacgacatttctgccaaaagtaactacaactttgaaaagccc
ttcctctggcttgctaggaagctcattggagaccctaacttggaatttgttgccatgcct
gctctcgccccaccagaagttgtcatggacccagctttggcagcacagtatgagcacgac
ttagaggttgctcagacaactgctctcccggatgaggatgatgacctgtga

>gi568815586f:130772836_130975923|GENSCAN_predicted_peptide_5|110_aa
METTQIHRRRRSSQEYLEVTSVMPATQIHSDGGGALSRKYLEVRSVMETTQIHRQRRSSQ
EYLEGRFPLRVFPGLHCPPSQASDQAPAMLPKPQLSVLTLTVALSLIPGT

>gi568815586f:130772836_130975923|GENSCAN_predicted_CDS_5|333_bp
atggagaccacgcagatccacagacggaggaggagttctcaggagtacctagaggtgaca
tcagtgatgccggctacgcagatccacagtgacggaggaggcgctctctctcggaaatac
ctagaggtgagatcagtgatggagaccacgcagatccacagacagaggaggagttctcag
gagtacctagaggggcgcttccccctgagggtctttcctggactccactgccctccttct
caggcctcagaccaggcccctgcgatgctcccaaagcctcagctgtccgtcctcacactc
actgtggcgctcagcctcatcccaggaacctga

>gi568815586f:130772836_130975923|GENSCAN_predicted_peptide_6|203_aa
MLPPMSTVGPALLSPCSNPRLLDTKFQSPVTLTVTYYARHTSLSRSPVTLTVTYYARHTS
LSRSPVTLTVTYYARHTSLSRSPVTLTVTYYARHTSLSRSPVTLTVTYYARHTSLSRSPV
TLTVTYYARHTSLSRSPVTLTVTYYARHTSLSRSPVTLTVTYYARHTSLSRSPVTLTVTY
YARHTSLSRSHQIAIHFFSDPLL

>gi568815586f:130772836_130975923|GENSCAN_predicted_CDS_6|612_bp
atgttgcctccgatgagcaccgtcggccctgccctcctcagcccctgcagtaatcctcgg
ttgttagacacaaaattccagtcacccgtcaccctcacggtcacctactacgccagacac
acgtcactgtcacggtcacccgtcaccctcacggtcacctactacgccagacacacgtca
ctgtcacggtcacccgtcaccctcacggtcacctactacgccagacacacgtcactgtca
cggtcacccgtcaccctcacggtcacctactacgccagacacacgtcactgtcacggtca
cccgtcaccctcacggtcacctactacgccagacacacgtcactgtcacggtcacccgtc
accctcacggtcacctactacgccagacacacgtcactgtcacggtcacccgtcaccctc
acggtcacctactacgccagacacacgtcactgtcacggtcacccgtcaccctcacggtc
acctactacgccagacacacgtcactgtcacggtcacccgtcaccctcacggtcacctac
tacgccagacacacgtcactgtcacggtcacaccagatagcaatacattttttttctgat
cctctgctataa

>gi568815586f:130772836_130975923|GENSCAN_predicted_peptide_7|73_aa
MQGDTRDGRTPRKIPYEDTGKKAAVCKPRREPPGETRPADSLALNLQAPELQQGSILSVC
ATFNFSSPRSPGF

>gi568815586f:130772836_130975923|GENSCAN_predicted_CDS_7|222_bp
atgcaaggagataccagggatggacgaacaccgagaaaaataccctatgaggacacgggc
aagaaggcagccgtctgcaagccaaggagagagcccccaggagaaactcgccctgctgac
tccttggccttgaacctgcaagctccggaactgcagcagggctccattctgagcgtctgt
gctaccttcaacttctccagccctagatccccaggattctga

>gi568815586f:130772836_130975923|GENSCAN_predicted_peptide_8|198_aa
MLQPEYWHSPQNSSVEILTPKMMVSGARAQAAGSQSPCGSARAQAAGSQIPGGSARAQAA
GSQSPCGSARAQAAGSQIPGGSARAQAAGSQSPGGSARAQAAGSQIPGGSARAQAAGSQI
PGGSARAQAAGSQSPGGSARAQAAGSQSPCGSARAQAAGSQIPCGSARAQVAGSQSLCGS
ACVMPPRLGFTPEPDNGH

>gi568815586f:130772836_130975923|GENSCAN_predicted_CDS_8|597_bp
atgctacagcctgaatattggcattccccccaaaattcatctgttgaaatcctgaccccc
aagatgatggtatcaggagcccgtgcccaggcggccggctcccagagcccatgtggatca
gcccgtgcccaggcggccggctcccagatcccgggtggttcagcccgtgcccaggcggcc
ggctcccagagcccatgtggatcagcccgtgcccaggcggccggctcccagatcccgggt
ggttcagcccgtgcccaggcggccggctcccagagcccgggtggttcagcccgtgcccag
gcggccggctcccagatcccgggtggttcagcccgtgcccaggcggccggctcccagatc
ccgggtggttcagcccgtgcccaggcggccggctcccagagcccgggtggttcagcccgt
gcccaggcggccggctcccagagcccatgtggatcagcccgtgcccaggcggccggctcc
cagatcccgtgtggttcagcccgtgcccaggtggccggctcccaaagcctctgtggctca
gcctgtgtgatgccaccgcgcctgggctttacaccagaacctgacaatggtcattga

>gi568815586f:130772836_130975923|GENSCAN_predicted_peptide_9|204_aa
MGEYDSNMEKYDFNMGEYDFNMEKYDFNMEEYDFNIGKYDFNMGEYDFNMGKYDFNMGKY
DFNMGKYDLNIGKYDFNMRECDFNVRKYDLNMGKYDFIMEYDFSMREYDLNMGKYDFNMR
EYDLNMGKYDFNLGEYDFNMGKYDFNMEKYDLNIGKYDFNMRECDFNVRKYDLNMGKYDF
IMEYDFSIREYDLNMGKYDFNMGK

>gi568815586f:130772836_130975923|GENSCAN_predicted_CDS_9|615_bp
atgggagaatatgattccaacatggagaaatatgacttcaacatgggggaatatgatttc
aacatggagaaatatgacttcaacatggaggaatatgatttcaacatagggaaatatgac
ttcaacatgggggaatatgatttcaacatggggaaatatgatttcaacatggggaaatat
gatttcaacatggggaaatatgatttaaacatagggaaatatgatttcaacatgagggaa
tgtgatttcaatgtgaggaaatatgatttaaacatggggaagtatgatttcatcatggaa
tatgatttcagcatgagggaatatgatttaaacatggggaaatatgatttcaacatgagg
gaatatgatttaaacatggggaaatatgacttcaacctgggggaatatgatttcaacatg
gggaaatatgatttcaacatggagaaatatgatttaaacatagggaaatatgatttcaac
atgagggaatgtgatttcaacgtgaggaaatatgatttaaacatggggaagtatgatttc
atcatggaatatgatttcagcataagggaatatgatttaaatatggggaaatatgatttc
aacatggggaagtag

>gi568815586f:130772836_130975923|GENSCAN_predicted_peptide_10|221_aa
MLWLLRASPSGATLVEASVKLIILLTLRNFTWLRALTSERAMEKLLRLCCWYSWLLLFYY
NFQVRGVYSRSQDHPGFQVLASASHYWPLENVDGIHELQDTTGASRTHKLTVLPSRNATF
VYSNDSAYSNLSATVDIVEGKVNKGIYLKEEKGVTLLYYGRYNSSCISKPEQCGPEVSTQ
VGKLKIGRGTCWNVLAPSFTYLETLKAQLKCKLYGDELQSS

>gi568815586f:130772836_130975923|GENSCAN_predicted_CDS_10|663_bp
atgctctggcttctgagggcatcgccatcaggagcaacgctggtggaagccagtgtgaag
ctgatcatcctactgaccctcaggaatttcacttggctccgagctttgacctccgagaga
gccatggaaaagctgctgcggctgtgctgctggtactcctggctgctgctattttattac
aactttcaggtgcgtggcgtctactccagatcgcaggaccatccaggatttcaggtgttg
gcgtctgcttcccattactggccactggagaatgtggatgggatccatgaacttcaggat
acaactggagcgtcccgaacccacaagctcactgtgcttccttcccgtaatgctactttt
gtgtattccaatgattctgcctactcaaatctctctgcaactgtagatattgtggaaggg
aaggtcaacaaaggcatttacctgaaagaggaaaagggagtcacgcttctctattacggc
aggtacaacagctcctgcatcagcaagccagagcagtgtggccctgaagtgagcacacag
gtggggaagctgaaaataggaaggggcacttgctggaatgttctggcaccttctttcacc
tacctggagaccttgaaggctcagctcaagtgtaaactttacggggatgagctccagtcc
agn