GENSCAN 1.0 Date run: 4-Nov-116 Time: 12:40:35 Sequence gi568815586f:123912145_124112927 : 200783 bp : 46.18% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 954 1171 218 2 2 69 71 228 0.942 16.50 1.02 Intr + 2185 2406 222 1 0 90 89 470 0.999 44.54 1.03 Intr + 2708 2855 148 2 1 70 55 213 0.980 16.44 1.04 Intr + 4313 4592 280 1 1 124 99 294 0.999 31.15 1.05 Intr + 5440 5693 254 2 2 63 78 596 0.630 53.25 1.06 Intr + 6532 6805 274 0 1 96 99 512 0.986 50.21 1.07 Intr + 12134 12288 155 0 2 65 83 32 0.496 0.19 1.08 Intr + 12906 13060 155 2 2 75 100 228 0.995 21.57 1.09 Intr + 14493 14676 184 0 1 85 86 90 0.981 8.29 1.10 Intr + 16027 16164 138 0 0 54 33 89 0.491 0.66 1.11 Intr + 16243 16443 201 0 0 113 47 222 0.652 20.08 1.12 Intr + 17131 17340 210 0 0 79 61 297 0.999 25.21 1.13 Intr + 17520 17615 96 2 0 103 75 88 0.987 9.21 1.14 Intr + 18258 18429 172 2 1 73 77 344 0.999 31.32 1.15 Intr + 19197 19328 132 1 0 80 97 138 0.999 14.52 1.16 Intr + 19492 19703 212 2 2 42 84 304 0.977 24.03 1.17 Intr + 19797 19964 168 2 0 52 106 110 0.998 9.34 1.18 Intr + 21187 21367 181 0 1 137 81 293 0.998 32.94 1.19 Intr + 22477 22622 146 2 2 92 72 229 0.743 21.70 1.20 Term + 23191 23337 147 0 0 101 37 101 0.777 4.20 1.21 PlyA + 24239 24244 6 1.05 2.06 PlyA - 24378 24373 6 -3.24 2.05 Term - 25686 24914 773 1 2 84 48 1048 0.923 93.96 2.04 Intr - 30641 30600 42 0 0 77 110 28 0.785 2.11 2.03 Intr - 31349 31203 147 0 0 74 88 237 0.272 22.51 2.02 Intr - 34923 34650 274 0 1 40 40 93 0.003 -3.19 2.01 Init - 42738 42616 123 0 0 72 58 139 0.222 7.47 2.00 Prom - 51835 51796 40 -6.06 3.00 Prom + 52256 52295 40 -4.76 3.01 Init + 61667 61782 116 1 2 101 64 156 0.731 12.21 3.02 Intr + 61810 61876 67 1 1 52 97 81 0.531 4.31 3.03 Intr + 76879 76957 79 0 1 70 88 14 0.012 -1.28 3.04 Term + 78438 78481 44 1 2 107 53 32 0.047 -1.08 3.05 PlyA + 79386 79391 6 1.05 4.00 Prom + 88391 88430 40 -3.36 4.01 Sngl + 100001 100786 786 1 0 71 41 319 0.922 21.45 4.02 PlyA + 101959 101964 6 1.05 5.04 PlyA - 102996 102991 6 1.05 5.03 Term - 116423 116138 286 1 1 37 39 224 0.048 7.28 5.02 Intr - 122270 122193 78 1 0 60 103 45 0.052 1.87 5.01 Init - 131801 131737 65 2 2 90 70 28 0.151 1.15 5.00 Prom - 135110 135071 40 0.54 6.00 Prom + 138754 138793 40 -2.46 6.01 Sngl + 177251 177622 372 1 0 72 39 344 0.660 24.03 6.02 PlyA + 179588 179593 6 -0.45 7.02 PlyA - 179779 179774 6 -1.75 7.01 Sngl - 181546 180983 564 1 0 44 43 946 0.864 81.85 7.00 Prom - 192134 192095 40 -5.16 8.02 PlyA - 192792 192787 6 1.05 8.01 Term - 193766 193557 210 2 0 27 42 183 0.480 4.89 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 93889 93762 128 2 2 57 55 155 0.867 7.54 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:123912145_124112927|GENSCAN_predicted_peptide_1|1230_aa VARLERNFYLTKRELERIQNELAAIQKELETLGAKYEAAILEKQKLQEEAEIMERRLIAA DKLISGLGSENIRWLNDLDELMHRRVKLLGDCLLCAAFLSYEGAFTWEFRDEMVNRIWQN DILEREIPLSQPFRLESLLTDDVEISRWGSQGLPPDELSVQNGILTTRASRFPLCIDPQQ QALNWIKRKEEKNNLRVASFNDPDFLKQLEMSIKYGTPFLFRDVDEYIDPVIDNVLEKNI KVSQGRQFIILGDKEVDYDSNFRLYLNTKLANPRYSPSVFGKAMVINYTVTLKGLEDQLL SVLVAYERRELEEQREHLIQETSENKNLLKDLEDSLLRELATSTGNMLDNVDLVHTLEET KSKATEVATTVEEAVSEKLKLAEKTALDIDRLRDGYRPAARRGAILFFVLSEMALVNSMY QYSLIAFLEVFRLSLKKSLPDSILMKRLRNIMDTLTFSIYNHGCTGNISLEKSKRKKPCA WLSDQGWEDIILLSEMFSDNFGQLPDDVENNQTVWQEWYDLDSLEQFPVPLGYDNNITPF QKLLILRCFRVDRVYRAVTDYVTVTMGEKYVQPPMISFEAIFEQSTPHSPIVFILSPGSD PATDLMKLAERSGFGGNRLKFLAMGQGQEKAGCDQLSAPTGPMGFLKDGLFEGSTCSSWI LGLLLASAGAAIALLWVALQLLETAVARGQWLMLQNCHLLVKWLKDLEKSLERITKPHPD FRLWLTTDPTKGFPIGILQKSLKVVTEPPNGLKLNMRATYFKISHEMLDQCPHPAFKPLV YVLAFFHAVVQERRKFGKIGWNVYYDFNESDFQVCMEILNTYLTKAFQQRDPRIPWGSLK YLIGEVMYGGRAIDSFDRRILTIYMDEYLGDFIFDTFQPFHFFRNKEVDYKIPVGDEKEK FVEAIEALPLANTPEVFGLHPNAEIGYYTQAARDMWAHLLELQPQTGESSSGISRDDYIG QVAKEIENKMPKVFDLDQVRKRLGTGLSPTSVVLLQELERFNKLVVRMTKSLAELQRALA GEVGMSNELDDVARSLFIGHIPNIWRRLAPDTLKSLGNWMVYFLRRFSQYMLWVTESEPS VMWLSGLHIPESYLTALVQATCRKNGWPLDRSTLFTQVTKFQDADEVNERAGQGCFVSGL YLEGADWDIEKGCLIKSKPKVLVVDLPILKIIPIEAHRLKLQNTFRTPVYTTSMRRNAMG VGLVFEADLFTTRHISHWVLQGVCLTLNSD >gi568815586f:123912145_124112927|GENSCAN_predicted_CDS_1|3693_bp gtggccaggctggagcggaatttttacctcactaaacgggaactggaaaggatccagaat gagttggcagcaattcagaaagagctggaaacattgggtgccaaatatgaggccgccata ctggaaaagcagaagctgcaggaagaagccgagatcatggagaggcggctgattgccgca gacaaactcatctcgggtctggggtcagaaaacatcaggtggctgaacgacctggatgag ctgatgcaccggcgcgtgaagctgctgggggactgcctgctctgcgcggctttcctcagc tacgagggagccttcacctgggagttccgtgacgagatggtcaatcggatttggcaaaat gacatcctggagcgggagatccccctgagccagcctttccggctggaaagcctgctcacg gatgatgttgagatcagcagatggggatcccagggccttccccccgatgagctctccgtt cagaatggcatcctcaccacccgggccagccgcttccctctgtgtatcgacccccagcag caggccctcaactggatcaagagaaaagaggagaagaacaatctgcgggtcgcttccttt aatgaccctgacttcctcaagcagctagagatgtccataaagtacgggacccctttcctg ttccgcgatgttgatgaatacatcgatcctgtgattgacaacgtcttagaaaaaaatata aaagtctcccaaggacggcagtttattatcctgggagacaaggaagtggactatgattca aatttcagactgtacctgaacaccaagctggccaatcccagatattccccatccgtgttt gggaaagctatggtgatcaattacactgtcacgctgaagggcctggaggaccagctgctg agcgtgctggtggcttacgagaggcgggagctggaggagcagcgggagcacctcatccag gagaccagcgagaacaagaacctgctcaaggacctggaagattccctccttcgggagctg gccacgtccacggggaacatgctggacaatgtggacctggtgcacaccctggaggagacc aaatccaaggcaacagaggtagcaaccacagtggaagaggccgtctcagagaaactcaag ctggcggagaagacagccttggacatcgacaggctgcgggatggctaccggccagcagcc aggaggggggccatcctgttcttcgtcctgtctgagatggccctggtgaactccatgtac cagtactccctgattgccttcttagaggtcttcaggctgtcactgaagaagtcgctgcct gattccatcctcatgaaacgcctgaggaacatcatggacacgctgaccttcagcatctat aaccacggctgcacaggaaacatttccctggagaaaagcaaaagaaaaaagccctgcgct tggttgtctgaccaaggatgggaagatatcattcttttatcagaaatgttttcagacaac tttgggcaacttcctgatgatgttgagaataatcagactgtctggcaggagtggtatgac ctggattcactggagcagtttcccgtccccttgggttacgataacaacatcacccctttc cagaagttgcttattttgcgctgtttccgtgtggatcgggtctatcgggccgtgactgac tatgtgactgtaacaatgggagagaagtatgtgcagcccccaatgatcagctttgaagct atttttgagcagagcactccacattcgcccattgtgtttatcctgagtcctggctccgac cctgccactgatcttatgaaattagcagagcgaagtggttttggaggaaatcgcctcaaa ttccttgcaatgggtcaaggtcaagaaaaggctgggtgtgaccagctcagtgcacccacg gggcccatggggtttctcaaagatggtttatttgaaggctccacctgtagctcctggatc ctcggcttgcttttggcttctgctggagctgccatcgcccttctgtgggtggccctgcag ctgctggagacggcggtggctcgggggcagtggctgatgctgcagaactgccacctcctg gtcaagtggctgaaagatctggagaagtccctggagaggatcaccaagccccacccagac ttccgcctgtggctcaccacggaccccaccaagggcttccccattgggattctgcagaag tccctaaaggttgtcaccgagccacccaatgggctgaaactcaacatgagggcaacttac ttcaagatctctcacgaaatgctggaccagtgcccgcaccctgccttcaagccgctggtc tacgtgctggcgttctttcatgctgtggtgcaggagagaaggaagtttgggaagattggc tggaacgtgtactatgacttcaatgagtctgacttccaggtctgcatggaaattctgaac acgtacttaacgaaagccttccagcaacgggacccaaggatcccgtggggcagcctcaag tacctaattggagaggtcatgtatggaggacgggccatcgacagctttgatcgccgcatc ctgaccatctacatggatgagtacctgggggacttcatttttgatactttccagccattc cacttcttccggaacaaggaagtggactacaaaatccctgttggtgatgaaaaggagaaa tttgttgaagccatcgaggccctcccgcttgccaacacgccagaagtgtttggtctccac cccaacgctgagattggctattacacgcaggcggctcgagacatgtgggctcacctgctg gagctgcagcctcagacaggggaatccagcagtggtatcagccgcgatgattatattggc caagtggccaaagaaatagaaaacaagatgcccaaagtctttgacttggaccaggtgagg aagcgcctcggaacaggactctcccccacttcggtggtgctcctgcaggaactggaacgc ttcaacaagcttgtggtccggatgacgaagtctctggctgaacttcaaagggccttggct ggagaagttggaatgagcaatgagttagatgatgtggccaggtctctttttatcgggcat atccctaatatctggagaaggcttgctcctgacaccttaaagtcccttggaaactggatg gtctacttcctgcggcggttcagccagtacatgttgtgggtgaccgagagcgagcccagc gtgatgtggctctcggggctgcacatccctgagtcctacctcacggcgctggtgcaggcc acctgccggaagaacggctggccactggaccgctccaccttgttcacacaagtgaccaag ttccaggatgcagatgaagtgaatgagcgggcgggacaaggatgctttgtctcaggactg tacctggaaggtgctgactgggatatagaaaaaggatgtcttatcaagagcaaacccaag gtgctggttgtggacctgccgatcctgaagatcatccccattgaagcccatcgcctcaag ctgcagaatactttccggacccccgtctacaccacctccatgagaaggaacgccatggga gtcggcttggtttttgaagctgatctctttaccacgaggcacatttctcactgggtgctg caaggagtatgcctcaccctgaattctgattaa >gi568815586f:123912145_124112927|GENSCAN_predicted_peptide_2|452_aa MECWCSLLLGWGLPPAGLLAVRNPNFHEAQDTPRRHHTTQMGWQGRPAGCSECGARQAHA HPELQLACKRRRTQPRFCLRLSLHTSLQAEGAGSGLGQPRKGLPQCSGGLKGSSSAAKVG AQAEEVLRASEGCPLDVSMAATNLENQLHSAQKNLLFLQREHASTLKGLHSEIRRLQQHC TDLTYELTVKSSEQTGDGTSKSSELKKRCEELEAQLKVKENENAELLKELEQKNAMITVL ENTIKEREKKYLEELKAKSHKLTLLSSELEQRASTIAYLTSQLHAAKKKLMSSSGTSDAS PSGSPVLASYKPAPPKDKLPETPRRRMKKSLSAPLHPEFEEVYRFGAESRKLLLREPVDA MPDPTPFLLARESAEVHLIKERPLVIPPIASDRSGEQHSPAREKPHKAHVGVAHRIHHAT PPQAQPEVKTLAVDQVNGGKVVRKHSGTDRTV >gi568815586f:123912145_124112927|GENSCAN_predicted_CDS_2|1359_bp atggaatgctggtgctccctcctcctggggtggggtctgccgcccgctggcctcctggcg gttaggaacccaaacttccatgaggcccaggacactcccagaaggcaccacacgacacag atgggctggcagggccggcccgctggctgctctgagtgcggggcccgccaagcccacgcc cacccagaactccagctggcctgcaagcgccgccgcacgcagccccggttctgcttgcgc ctctccctccacacctccctgcaagctgagggagcgggctccggccttggccagcccaga aaggggctcccacagtgcagcggtgggctgaagggctcctcaagtgccgccaaagtggga gcccaggcagaggaggtgctgagagcaagcgagggctgtcctctggatgtcagcatggca gccacaaacctggagaaccagctgcacagcgcacagaagaacctcctgttccttcagcgg gagcatgccagcacgctcaaggggctgcactccgagatcaggcggctgcagcagcactgc acagatttaacatatgagctgacagtcaaaagttcggaacagacaggagacgggacttct aaaagcagtgaattaaagaaaagatgtgaagagctggaagcccaactgaaagtgaaagag aacgaaaatgctgagttgttgaaagaactggagcagaaaaacgcgatgatcacagtgctg gagaacaccatcaaggagcgagagaagaagtacctggaggagctgaaggccaagagtcac aagctgaccctgctgtctagcgagctggagcagcgggccagcaccatcgcctacctgacc tcccagctgcacgccgccaagaagaagctcatgagctccagcgggacctcagatgccagc ccgtcagggagccccgtgctggccagctacaagccagcgccccccaaagacaagctaccc gaaacgcctcgccgccgcatgaaaaagagcctctcagcccccttgcacccggaatttgaa gaggtctacagattcggggcagagagcaggaaactccttttgcgggaaccagtggatgct atgcccgaccccaccccatttctgctggctagggagtccgccgaggtccacctcatcaaa gagaggcccctcgtcatcccccccatcgcctccgaccgaagcggcgagcagcacagcccg gcccgcgaaaagccgcacaaggcccacgtcggggtggcacatcggatccaccacgccacc ccgccgcaggcccagcccgaggtgaagaccctggcggtcgaccaggtgaacggaggcaag gtggtgaggaagcactcagggacggacagaactgtgtga >gi568815586f:123912145_124112927|GENSCAN_predicted_peptide_3|101_aa MAAPGDRGAGRLRRSRLHGVLCVFQGALRPAEEASIPLSAEGCCRRTPPLFDHNKGRILT QRLVSQALGFCLPRLHFPMQQGGLAVGGLTTTVEGGNKFSK >gi568815586f:123912145_124112927|GENSCAN_predicted_CDS_3|306_bp atggccgcgccaggggaccggggagccgggcggctgcggaggagccggctgcatggggtg ctgtgtgtttttcagggcgccctgcgtccggcagaggaggcgagcatcccgctcagcgca gaaggctgctgccgccggacgcctccattgtttgaccacaacaagggccggattctcacc cagaggctggtttctcaggccctgggcttctgccttcccagacttcactttcccatgcaa caaggaggattggctgtgggaggtcttactaccacagtggaaggaggaaacaagtttagt aaatga >gi568815586f:123912145_124112927|GENSCAN_predicted_peptide_4|261_aa MIYKCPMCREFFSERADLFMHQKIHTAEKPHKCDKCDKGFFHISELHIHWRDHTGEKVYK CDDCGKDFSTTTKLNRHKKIHTVEKPYKCYECGKAFNWSSHLQIHMRVHTGEKPYVCSEC GRGFSNSSNLCMHQRVHTGEKPFKCEECGKAFRHTSSLCMHQRVHTGEKPYKCYECGKAF SQSSSLCIHQRVHTGEKPYRCCGCGKAFSQSSSLCIHQRVHTGEKPFKCDECGKAFSQST SLCIHQRVHTKERNHLKISVI >gi568815586f:123912145_124112927|GENSCAN_predicted_CDS_4|786_bp atgatctacaagtgccccatgtgtagggaatttttctctgagagagcagatctttttatg catcagaaaattcacacagctgagaagccccataaatgtgacaagtgtgataagggtttc tttcatatatcagaacttcatattcattggagagaccatacaggagagaaggtctataaa tgtgatgattgtggtaaggattttagcactacaacaaaacttaatagacataagaaaatc cacacagtggagaagccctataaatgttacgagtgtggcaaagccttcaattggagctcc catcttcaaattcatatgagagttcatacaggtgagaaaccgtatgtctgtagtgagtgt ggaaggggctttagtaatagttcaaacctttgcatgcatcagagagtccacaccggagag aagccctttaaatgtgaagagtgtgggaaggccttcaggcacacctccagcctctgcatg catcaaagagtccacacaggagagaaaccctataaatgttatgagtgtgggaaggcgttc agtcagagttcgagcctctgcatccaccagagagtccacactggagagaaaccctataga tgttgtggatgtgggaaggccttcagtcagagttcgagcctgtgcatccaccagagagtc cacacaggagagaaacctttcaaatgtgatgagtgcggaaaggccttcagtcagagtacg agcctctgcatccaccagagagtccacacaaaggagagaaaccatctcaaaatatcagtt atataa >gi568815586f:123912145_124112927|GENSCAN_predicted_peptide_5|142_aa MIQSAATWLSEVLCPLLSETVRQFKFQQQRGSYSSPDAGCDFMLQFFMSGVTRDHLPVPS NTGLLGSTLGPQRFACRQMTTTPLPTHCCHHASSYSQVCQTLLPLPDAGSAVVTFTGTAS SDGQCQPFCKITTLGTTATSWG >gi568815586f:123912145_124112927|GENSCAN_predicted_CDS_5|429_bp atgatccagagtgcagccacgtggctttctgaggtgctgtgtcccctgctctctgagact gtaagacaattcaaattccagcaacaacgaggtagttacagttccccagacgcaggctgt gacttcatgcttcagttcttcatgtctggagtcaccagggaccacctcccagtgccttca aacacggggctcttgggcagcactctgggacctcagcgcttcgcctgccggcaaatgacc accacaccgctgccaacccactgctgccaccacgcttccagctacagccaggtctgccag accctgctccctctgccagatgcaggctcagctgttgtcaccttcactgggacagccagc tctgacgggcagtgccagcctttctgcaagatcacaacacttgggaccactgccacttcc tggggatag >gi568815586f:123912145_124112927|GENSCAN_predicted_peptide_6|123_aa MSQPGNQHRHRYVNNSAMSQPGNQHRHRYVNNSAMSQPGNQHRHRYVNNSAMSQPGNQHR HRYVNNSAMSQPGNQHRHRYVNNSAMSQPGNQHRHRYVNNSAMSQPGNQHRHRYASVVKP FSF >gi568815586f:123912145_124112927|GENSCAN_predicted_CDS_6|372_bp atgtcacaaccaggaaatcagcacagacacaggtatgtcaacaatagcgccatgtcacaa ccaggaaatcagcacagacacaggtatgtcaacaatagcgccatgtcacaaccaggaaat cagcacagacacaggtatgtcaacaatagcgccatgtcacaaccaggaaatcagcacaga cacaggtatgtcaacaatagcgccatgtcacaaccaggaaatcagcacagacacaggtat gtcaacaatagcgccatgtcacaaccaggaaatcagcacagacacaggtatgtcaacaat agcgccatgtcacaaccaggaaatcagcacagacacaggtatgcctctgtcgtcaagccc ttcagcttctag >gi568815586f:123912145_124112927|GENSCAN_predicted_peptide_7|187_aa MIFSITIITITIITITIIITIMITITIIIITITTTIIITITITTIITNTITTIIITITII TTIMTTITIIITSSPSSHHHHHHHHHHHIHHHHHHHHHHTITIITITITIITITIITTIM ITITIIITPSPSSHHHHHHHHHHHHNHHYHHDHHHHHHTITINIITPSPSSPLPSPSPSP SSSSLPS >gi568815586f:123912145_124112927|GENSCAN_predicted_CDS_7|564_bp atgatcttctccatcactatcatcaccatcaccatcatcaccatcaccatcatcatcacc atcatgatcaccatcaccatcattatcatcaccatcaccaccaccatcatcatcaccatc accatcaccaccatcatcaccaacaccatcaccaccatcatcatcaccatcaccatcatc actaccatcatgaccaccatcaccatcatcatcacatcatcaccatcatcacaccaccat caccatcaccatcatcaccatcacattcatcaccaccaccatcaccatcatcatcacacc attaccatcatcaccatcaccatcaccatcatcaccatcacaatcatcactaccatcatg atcaccatcaccatcatcatcacaccatcaccatcatcacaccatcaccatcaccaccat caccatcatcatcacaatcatcactaccatcatgatcaccatcaccatcatcataccatc accatcaacatcatcacaccatcaccatcatcaccattaccatcaccatcaccatcacca tcatcatcatcactaccatcatga >gi568815586f:123912145_124112927|GENSCAN_predicted_peptide_8|69_aa RLAIVPVRAKCRGFVLIELNRAIMLTAWPKIPFLGICEAKNPRSENERLAAILEAARSHL GSSKNKDPR >gi568815586f:123912145_124112927|GENSCAN_predicted_CDS_8|210_bp aggctcgccattgttcctgtgcgggctaagtgccgggggttcgtcctaatcgagctgaat agagccataatgctcaccgcatggcccaagattccattccttggaatctgtgaggccaag aaccccaggtcagagaacgagaggcttgccgccatcttggaagcagcccgcagccatctt gggagttctaagaacaaggatccccggtaa