GENSCAN 1.0 Date run: 5-Nov-116 Time: 04:48:12 Sequence gi568815597f:206535704_206748962 : 213259 bp : 46.50% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 262 301 40 0 1 81 89 41 0.682 1.30 1.02 Intr + 2469 2590 122 1 2 108 96 102 0.886 13.21 1.03 Term + 11682 11750 69 2 0 80 39 111 0.717 3.34 1.04 PlyA + 11921 11926 6 1.05 2.06 PlyA - 12079 12074 6 1.05 2.05 Term - 16194 16120 75 0 0 94 43 70 0.647 0.94 2.04 Intr - 18032 17982 51 2 0 124 98 -11 0.810 2.60 2.03 Intr - 21953 21801 153 2 0 23 64 191 0.391 10.47 2.02 Intr - 29042 28936 107 0 2 -12 119 61 0.159 -0.87 2.01 Init - 36946 36892 55 1 1 95 91 17 0.564 4.15 2.00 Prom - 37081 37042 40 -4.16 3.00 Prom + 37792 37831 40 -6.96 3.01 Init + 45160 45222 63 0 0 84 93 32 0.619 4.49 3.02 Intr + 47566 47676 111 0 0 97 107 138 0.964 17.18 3.03 Intr + 48684 48981 298 2 1 76 97 578 0.999 53.95 3.04 Intr + 49477 49592 116 2 2 81 92 144 0.837 14.27 3.05 Intr + 51123 51271 149 2 2 82 26 172 0.371 9.43 3.06 Term + 54463 54577 115 1 1 91 49 63 0.117 0.74 3.07 PlyA + 55949 55954 6 1.05 4.17 PlyA - 55962 55957 6 1.05 4.16 Term - 56142 56072 71 1 2 27 48 102 0.727 -1.80 4.15 Intr - 58090 57916 175 1 1 116 85 207 0.905 22.71 4.14 Intr - 60135 60015 121 2 1 81 86 44 0.999 4.00 4.13 Intr - 61492 61397 96 0 0 87 97 30 0.883 2.92 4.12 Intr - 63389 63300 90 1 0 100 87 165 0.999 16.71 4.11 Intr - 63909 63760 150 2 0 85 89 80 0.992 7.08 4.10 Intr - 64133 64030 104 2 2 82 105 141 0.995 14.17 4.09 Intr - 64605 64560 46 2 1 62 121 35 0.979 2.61 4.08 Intr - 66750 66633 118 1 1 99 60 6 0.970 -1.58 4.07 Intr - 67501 67248 254 0 2 29 70 202 0.973 9.58 4.06 Intr - 69804 69697 108 2 0 102 52 72 0.860 4.40 4.05 Intr - 72623 72533 91 0 1 91 84 10 0.960 -0.05 4.04 Intr - 73756 73673 84 2 0 53 110 50 0.770 3.49 4.03 Intr - 75671 75481 191 1 2 77 80 217 0.980 19.03 4.02 Intr - 76654 76584 71 1 2 116 94 135 0.424 14.88 4.01 Init - 88243 88184 60 1 0 71 100 10 0.417 1.75 4.00 Prom - 88328 88289 40 -4.26 5.00 Prom + 90231 90270 40 -3.76 5.01 Init + 100035 100093 59 2 2 73 49 80 0.208 1.44 5.02 Intr + 101947 102058 112 1 1 73 101 38 0.255 3.98 5.03 Intr + 111685 113115 1431 0 0 116 35 963 0.015 83.55 5.04 Intr + 129406 129501 96 0 0 90 82 2 0.071 0.01 5.05 Intr + 137324 137463 140 1 2 67 54 91 0.191 2.86 5.06 Intr + 145813 145887 75 1 0 -40 100 205 0.225 7.53 5.07 Intr + 146887 146923 37 1 1 100 75 12 0.076 -0.64 5.08 Intr + 149545 149805 261 0 0 66 56 482 0.004 40.48 5.09 Intr + 158210 158357 148 1 1 82 70 50 0.063 2.41 5.10 Intr + 170596 170724 129 2 0 96 43 42 0.001 1.17 5.11 Intr + 170863 170911 49 2 1 96 74 23 0.000 -0.56 5.12 Intr + 191350 191446 97 1 1 113 65 49 0.220 5.11 5.13 Intr + 193007 193146 140 1 2 52 103 247 0.410 21.86 5.14 Intr + 193332 193396 65 0 2 44 60 56 0.943 -3.24 5.15 Intr + 193693 193772 80 2 2 119 45 110 0.695 9.07 5.16 Intr + 194269 194395 127 0 1 91 106 135 0.999 15.85 5.17 Intr + 194985 195060 76 1 1 79 76 92 0.988 5.67 5.18 Intr + 195435 195559 125 0 2 85 116 170 0.998 19.73 5.19 Intr + 195937 196022 86 2 2 108 103 114 0.999 14.44 5.20 Intr + 196136 196216 81 1 0 100 100 50 0.991 7.23 5.21 Term + 196872 197015 144 2 0 101 52 227 0.998 18.31 5.22 PlyA + 202194 202199 6 1.05 6.02 PlyA - 204656 204651 6 1.05 6.01 Term - 205662 205592 71 1 2 103 46 76 0.820 3.00 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 171430 171252 179 1 2 41 94 135 0.932 8.13 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:206535704_206748962|GENSCAN_predicted_peptide_1|76_aa SILHRSELYQSHTDCKFTCHPECRSLIQLDCSQQEGLSRDRPSPESTLTVTFSQTSGSIR FSEEPEPYCELRMRGI >gi568815597f:206535704_206748962|GENSCAN_predicted_CDS_1|231_bp agcattcttcatcgctcagagctctaccagagccatacagactgtaaattcacctgtcac ccagaatgccgcagcctgatccagttggactgcagtcagcaggagggtttatcccgggac agaccctctccagaaagcaccctcaccgtgaccttcagccagaccagcggcagcattaga ttctcggaggagcctgaaccctattgtgaactgcgcatgcgagggatctag >gi568815597f:206535704_206748962|GENSCAN_predicted_peptide_2|146_aa MPAVPHGEVTILSSCSVPGRLNHEEEETKDLQDRSQFPTQGAPKARRYTGVGYLVVLAVK KQSSSSSSRLQYPLLMLLSTVIYPGVPGRGRQRIKGAGLGMRRRELPGVFSSPPVYPCSP WGGQSSNIPGSMVTRVRCEQLLHCYI >gi568815597f:206535704_206748962|GENSCAN_predicted_CDS_2|441_bp atgcctgctgtgcctcatggagaagtgaccatactttccagctgttcagtcccaggtaga ctaaaccatgaagaagaagaaaccaaggacctgcaggacaggtctcagttccccacccag ggagctcccaaagccagaagatacacaggtgtggggtacctggtagtcttagcagtgaag aagcagtcttccagttcctcgtccaggctgcagtacccactgctcatgctgctgtccacg gtcatctaccccggagttcccggccgaggccgccagcgcatcaagggggccgggcttggg atgcggcggagggagcttcctggggtcttctcttctccccctgtgtacccctgctctcct tgggggggccaatcctccaacatcccgggcagcatggtcaccagggtgagatgtgaacaa ctattgcactgctacatctag >gi568815597f:206535704_206748962|GENSCAN_predicted_peptide_3|283_aa MVPLGPDPLLRLILDEHFTDQNVCKPVEETQRPPTLQEIKQKIDSYNTREKNCLGMKLSE DGTYTGFIKVHLKLRRPVTVPAGIRPQSIYDAIKEVNLAATTDKRTSFYLPLDAIKQLHI SSTTTVSEVIQGLLKKFMVVDNPQKFALFKRIHKDGQVLFQKLSIADRPLYLRLLAGPDT EVLSFVLKENETGEVEWDAFSIPELQNFLTILEKEEQDKIQQVQKKYDKFRQKLEEALRE SQGKPGGRGKMTQCKCVTVTGKTTQCTGLDSTVSWQNCGSNHN >gi568815597f:206535704_206748962|GENSCAN_predicted_CDS_3|852_bp atggtgcctctggggcctgacccgctgcttagactcatcttagatgaacatttcacagat cagaatgtctgtaaacctgtggaggagacacagcgcccgcccacactgcaggagatcaag cagaagatcgacagctacaacacgcgagagaagaactgcctgggcatgaaactgagtgaa gacggcacctacacgggtttcatcaaagtgcatctgaaactccggcggcctgtgacggtg cctgctgggatccggccccagtccatctatgatgccatcaaggaggtgaacctggcggct accacggacaagcggacatccttctacctgcccctagatgccatcaagcagctgcacatc agcagcaccaccaccgtcagtgaggtcatccaggggctgctcaagaagttcatggttgtg gacaatccccagaagtttgcactttttaagcggatacacaaggacggacaagtgctcttc cagaaactctccattgctgaccgccccctctacctgcgcctgcttgctgggcctgacacg gaggtcctcagctttgtgctaaaggagaatgaaactggagaggtagagtgggatgccttc tccatccctgaacttcagaacttcctaacaatcctggaaaaagaggagcaggacaaaatc caacaagtgcaaaagaagtatgacaagtttaggcagaaactggaggaggccttaagagaa tcccagggcaaacctggggggagagggaagatgactcaatgcaaatgtgtcaccgttact ggaaaaaccactcaatgcacaggtctggacagcactgtgtcatggcagaattgtgggtca aaccacaattaa >gi568815597f:206535704_206748962|GENSCAN_predicted_peptide_4|609_aa MIRLSSPGPPLDMWGLLQFKATPADMFAKAFRVKSNTAIKGSDRRKLRADVTTAFPTLGT DQVSELVPGKEELNIVKLYAHKGDAVTVYVSGGNPILFELEKNLYPTVYTLWSYPDLLPT FTTWPLVLEKLVGGADLMLPGLVMPPAGLPQVQKGDLCAISLVGNRAPVAIGVAAMSTAE MLTSGLKGRGFSVLHTYQDHLWRSGNKSSPPSIAPLALDSADLSEEKGSVQMDSTLQGDM RHMTLEGEEENGEVHQAREDKSLSEAPEDTSTRGLNQDSTDSKTLQEQMDELLQQCFLHA LKCRVKKADLPLLTSTFLGSHMFSCCPEGRQLDIKKSSYKKLSKFLQQMQQEQIIQVKEL SKGVESIVAVDWKHPRITSFVIPEPSPTSQTIQEGSREQPYHPPDIKPLYCVPASMTLLF QESGHKKGSFLEGSEVRTIVINYAKKNDLVDADNKNLVRLDPILCDCILEKNEQHTVMKL PWDSLLTRCLEKLQPAYQVTLPGQEPIVKKGRICPIDITLAQRASNKKVTVVRNLEAYGL DPYSVAAILQQRCQASTTVNPAPGAKDSLQVQIQGNQVHHLGWLLLEEYQLPRKHIQGLE KALKPGKKK >gi568815597f:206535704_206748962|GENSCAN_predicted_CDS_4|1830_bp atgattcgattatcttcacctggtcccccacttgacatgtggggattattacaattcaag gccaccccagcagacatgtttgccaaggcctttcgggtcaagtccaacacggccatcaag gggtcggacaggagaaagcttcgagctgatgtgacaactgctttccccacccttggaact gatcaagtctctgagttagtacctggaaaggaggagctcaacattgtgaagttgtatgct cacaaaggggatgcagtgactgtgtacgtgagtggtggtaaccccatcctctttgaactg gagaaaaatctgtatccaacagtgtacacgctgtggtcctatcctgatcttctgccaacc tttacaacatggcctctggtgctcgagaaactggtagggggagcagatttgatgctgcct ggactggtgatgccccctgctggtctgcctcaggtacagaagggcgacctctgtgccatt tctttggtggggaacagagcccctgtagccattggagttgcagccatgtccacagctgag atgctcacgtcaggcctgaagggaaggggcttctctgtgctccacacttaccaggaccac ttgtggcggtctggaaacaagtcctctccaccttccattgctccactggccctggattca gcagatctcagtgaagagaaggggtctgtccagatggactccaccctgcagggagacatg aggcacatgaccctggagggggaagaggagaatggggaggttcaccaggcacgtgaagac aagtctctctcagaagccccagaagacaccagcaccaggggcctgaaccaagactccaca gatagcaaaacgcttcaagaacaaatggatgagctgttacagcaatgcttcttacatgcc ttgaagtgccgagtcaaaaaggctgacctccctttactcaccagcactttccttggcagc cacatgttctcctgctgccccgaaggacgacaactggacataaagaagtcaagctacaaa aagctctctaagttcctgcagcaaatgcagcaggagcagattatacaggtgaaggagctg agcaaaggggtggagagcattgtggctgtggactggaaacacccgaggattacatctttc gtcatacccgagccctccccgacctcccagactatccaggagggtagcagggaacagccc tatcaccctccagatataaaacccctctactgtgtcccagccagcatgaccctgctcttc caggagtctggccacaagaaggggagctttctggagggcagtgaggtccgaacgatcgtc attaactacgccaagaaaaatgacctggttgatgcagacaacaaaaatcttgtgagattg gatcccatcctatgtgactgcatcttagagaaaaatgaacagcatacagtcatgaagctt ccatgggacagtcttctgaccaggtgtttggaaaaattacagcctgcctatcaagtgacc cttcccggacaagagcccattgtgaagaaagggagaatctgtccaattgacatcacccta gcacaaagagcgtctaataaaaaggtgaccgtggtccggaacttggaggcctatggtctg gacccatactcagtggctgccatccttcagcagcgatgccaggctagcaccaccgtcaat cctgcccctggggccaaggacagccttcaggtgcagatccagggaaaccaggtccaccac ctcggctggctattgcttgaagagtatcagctccctcgaaaacacatccaaggtctagaa aaggccctcaaacctggcaagaagaagtga >gi568815597f:206535704_206748962|GENSCAN_predicted_peptide_5|1185_aa MRGRLGPGSRPSSGGNGATGLGDGVYDTFMMIDETKCPPCSNVLCNPSEPPPPRRLNMTT EQFTGDHTQHFLDGGEMKVEQLFQEFGNRKSNTIQSDGISDSEKCSPTVSQGKSSDCLNT VKSNSSSKAPKVVPLTPEQALKQYKHHLTAYEKLEIINYPEIYFVGPNAKKRHGVIGGPN NGGYDDADGAYIHVPRDHLAYRYEVLKIIGKGSFGQVARVYDHKLRQYVALKMVRNEKRF HRQAAEEIRILEHLKKQDKTGSMNVIHMLESFTFRNHVCMAFELLSIDLYELIKKNKFQG FSVQLVRKFAQSILQSLDALHKNKIIHCDLKPENILLKHHGRSSTKVIDFGSSCFEYQKL YTYIQSRFYRAPEIILGSRYSTPIDIWSFGCILAELLTGQPLFPGEDEGDQLACMMELLG MPPPKLLEQSKRAKYFINSKGIPRYCSVTTQADGRVVLVGGRSRRGKKRGPPGSKDWGTA LKGCDDYLFIEFLKRCLHWDPSARLTPAQALRHPWISKSVPRPLTTIDKVSGKRMLDFVG ETQDREQILVLPKGDTSPMLVTLFPQHEKQEVKLTKMPSDMWLTLSPQLDLKAKEKLAKE NCSPATSFNNFWWKHSKNIVIVIIIIIINIIITIIMRRPIWLLQLREDSQGQSPPVPFPA PAPPPQPPTPALPHPPAQPPPPPPQQFPQFHVKSGLQIKKNAIIDDYKVTSQVLGLGING KVLQIFNKRTQEKFALKLSHMQNVTECMGFFYLLTAVLMCVHGGVASSSTPQGSWRLVEQ DEERGSATETQSHSDGGGAKPVPLIIASLLVEKGQMLLCSYPDHPAGRLGSKPLPRTSEL TEAGMGMSLGHPSPLNVQSSWSVDCVGLILTSHEDAKSMLQDCPKARREVELHWRASQCP HIVRIVDVYENLYAGRKCLLIVMECLDGGELFSRIQDRGDQAFTEREASEIMKSIGEAIQ YLHSINIAHRDVKPENLLYTSKRPNAILKLTDFGFAKETTSHNSLTTPCYTPYYVAPEVL GPEKYDKSCDMWSLGVIMYILLCGYPPFYSNHGLAISPGMKTRIRMGQYEFPNPEWSEVS EEVKMLIRNLLKTEPTQRMTITEFMNHPWIMQSTKVPQTPLHTSRVLKEDKERWEDVKEE MTSALATMRVDYEQIKIKKIEDASNPLLLKRRKKARALEAAALAH >gi568815597f:206535704_206748962|GENSCAN_predicted_CDS_5|3558_bp atgcggggccgcctggggccgggctcccgccccagcagcggaggtaacggcgccacgggg ttgggggatggtgtctatgacaccttcatgatgatagatgaaaccaaatgtcccccctgt tcaaatgtactctgcaatccttctgaaccacctccacccagaagactaaatatgaccact gagcagtttacaggagatcatactcagcactttttggatggaggtgagatgaaggtagaa cagctgtttcaagaatttggcaacagaaaatccaatactattcagtcagatggcatcagt gactctgaaaaatgctctcctactgtttctcagggtaaaagttcagattgcttgaataca gtaaaatccaacagttcatccaaggcacccaaagtggtgcctctgactccagaacaagcc ctgaagcaatataaacaccacctcactgcctatgagaaactggaaataattaattatcca gaaatttactttgtaggtccaaatgccaagaaaagacatggagttattggtggtcccaat aatggagggtatgatgatgcagatggggcctatattcatgtacctcgagaccatctagct tatcgatatgaggtgctgaaaattattggcaaggggagttttgggcaggtggccagggtc tatgatcacaaacttcgacagtacgtggccctaaaaatggtgcgcaatgagaagcgcttt catcgtcaagcagctgaggagatccggattttggagcatcttaagaaacaggataaaact ggtagtatgaacgttatccacatgctggaaagtttcacattccggaaccatgtttgcatg gcctttgaattgctgagcatagacctttatgagctgattaaaaaaaataagtttcagggt tttagcgtccagttggtacgcaagtttgcccagtccatcttgcaatctttggatgccctc cacaaaaataagattattcactgcgatctgaagccagaaaacattctcctgaaacaccac gggcgcagttcaaccaaggtcattgactttgggtccagctgtttcgagtaccagaagctc tacacatatatccagtctcggttctacagagctccagaaatcatcttaggaagccgctac agcacaccaattgacatatggagttttggctgcatccttgcagaacttttaacaggacag cctctcttccctggagaggatgaaggagaccagttggcctgcatgatggagcttctaggg atgccaccaccaaaacttctggagcaatccaaacgtgccaagtactttattaattccaag ggcataccccgctactgctctgtgactacccaggcagatgggagggttgtgcttgtgggg ggtcgctcacgtaggggtaaaaagcggggtcccccaggcagcaaagactgggggacagca ctgaaagggtgtgatgactacttgtttatagagttcttgaaaaggtgtcttcactgggac ccctctgcccgcttgaccccagctcaagcattaagacacccttggattagcaagtctgtc cccagacctctcaccaccatagacaaggtgtcagggaaacggatgctggattttgttggt gagacacaagacagagaacaaattcttgttctccctaagggagatacttcgcccatgctt gtgactctctttccacagcatgagaaacaggaagtgaaactgactaaaatgcccagtgat atgtggctgaccctgtctcctcagctggacctaaaggcaaaggagaaactggccaaagaa aactgtagccctgccacaagctttaacaacttctggtggaagcactcaaaaaacattgtc atcgtcatcatcatcatcatcatcaacatcatcatcaccatcatcatgagaaggcctatt tggctcttgcagctccgggaggactctcagggccagagcccgccggtgccgttccccgcc ccggccccgccgccgcagccccccacccctgccctgccgcaccccccggcgcagccgccg ccgccgcccccgcagcagttcccgcagttccacgtcaagtccggcctgcagatcaagaag aacgccatcatcgatgactacaaggtcaccagccaggtcctggggctgggcatcaacggc aaagttttgcagatcttcaacaagaggacccaggagaaattcgccctcaaactcagccac atgcagaatgtcaccgagtgcatgggtttcttctacctcttgactgcagtattgatgtgt gtccatggaggtgtggcctcttcctctactccccagggatcctggagacttgtggagcaa gatgaggaaaggggctcagccacagagacccagagtcactcagatgggggaggagcaaag ccagtccctctgatcattgcctctttgttagtggagaaagggcagatgctgctctgcagt tatcccgaccaccctgcaggacgtctggggagcaagcccttgccaagaacctctgagctc accgaggcgggtatgggcatgtctcttggacaccccagccctctgaatgtccaaagcagt tggagtgtggactgtgtagggctcatcttgacttcccatgaagatgctaaatctatgctt caggactgccccaaggcccgcagggaggtggagctgcactggcgggcctcccagtgcccg cacatcgtacggatcgtggatgtgtacgagaatctgtacgcagggaggaagtgcctgctg attgtcatggaatgtttggacggtggagaactctttagccgaatccaggatcgaggagac caggcattcacagaaagagaagcatccgaaatcatgaagagcatcggtgaggccatccag tatctgcattcaatcaacattgcccatcgggatgtcaagcctgagaatctcttatacacc tccaaaaggcccaacgccatcctgaaactcactgactttggctttgccaaggaaaccacc agccacaactctttgaccactccttgttatacaccgtactatgtggctccagaagtgctg ggtccagagaagtatgacaagtcctgtgacatgtggtccctgggtgtcatcatgtacatc ctgctgtgtgggtatccccccttctactccaaccacggccttgccatctctccgggcatg aagactcgcatccgaatgggccagtatgaatttcccaacccagaatggtcagaagtatca gaggaagtgaagatgctcattcggaatctgctgaaaacagagcccacccagagaatgacc atcaccgagtttatgaaccacccttggatcatgcaatcaacaaaggtccctcaaacccca ctgcacaccagccgggtcctgaaggaggacaaggagcggtgggaggatgtcaaggaggag atgaccagtgccttggccacaatgcgcgttgactacgagcagatcaagataaaaaagatt gaagatgcatccaaccctctgctgctgaagaggcggaagaaagctcgggccctggaggct gcggctctggcccactga >gi568815597f:206535704_206748962|GENSCAN_predicted_peptide_6|23_aa XICMPVREAGSPVKIKHQEAKQG >gi568815597f:206535704_206748962|GENSCAN_predicted_CDS_6|72_bp ngcatttgcatgccagtcagagaagcaggtagcccggtcaagatcaaacaccaggaggcc aaacaaggctga