GENSCAN 1.0 Date run: 8-Nov-116 Time: 04:22:01

Sequence gi568815583f:31227213_31472296 : 245084 bp : 49.19% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Init +   4506   4567   62  2  2   97  110     3 0.325   4.12
 1.02 Intr +  17182  17392  211  1  1   86  115   136 0.922  15.02
 1.03 Intr +  21414  21550  137  2  2   41    3   150 0.329   1.17
 1.04 Intr +  22760  22809   50  2  2   76  105     1 0.250  -1.08
 1.05 Intr +  24021  24117   97  1  1   91   93    30 0.364   2.87
 1.06 Intr +  29133  29245  113  0  2   98   39    76 0.242   3.72
 1.07 Intr +  34168  34449  282  2  0  101   98    10 0.134   0.69
 1.08 Intr +  39698  39789   92  0  2  118   45    62 0.760   4.61
 1.09 Term +  40758  40952  195  2  0   59   42   172 0.869   7.11
 1.10 PlyA +  44976  44981    6                               1.05

 2.00 Prom +  65020  65059   40                              -5.16
 2.01 Init +  73738  73836   99  0  0   49   48   182 0.342   8.68
 2.02 Intr +  76977  77085  109  2  1   56   85    52 0.449   1.56
 2.03 Term +  79610  79707   98  0  2   65   44   146 0.494   6.03
 2.04 PlyA +  83826  83831    6                               1.05

 3.00 Prom +  89832  89871   40                              -3.06
 3.01 Init + 100001 100577  577  1  1   85   82   795 0.980  73.90
 3.02 Intr + 107050 107137   88  2  1   42  105    24 0.005  -1.47
 3.03 Intr + 120575 120660   86  2  2  106   93    21 0.011   3.86
 3.04 Term + 144798 145087  290  1  2  135   52   578 0.999  54.44
 3.05 PlyA + 147607 147612    6                               1.05

 4.00 Prom + 148355 148394   40                              -3.56
 4.01 Init + 148605 148718  114  2  0   95   61    64 0.444   4.61
 4.02 Intr + 151129 151228  100  0  1   57   41    76 0.184  -0.52
 4.03 Intr + 152635 152702   68  2  2  115   94    22 0.316   4.12
 4.04 Intr + 157442 157565  124  1  1   40    8   112 0.015  -1.44
 4.05 Intr + 165994 166109  116  2  2   64   53    96 0.167   3.87
 4.06 Intr + 166396 166479   84  0  0   82  100    24 0.666   3.02
 4.07 Intr + 175348 175466  119  0  2   53   49   139 0.170   5.76
 4.08 Intr + 177545 177646  102  2  0   56   72    85 0.235   2.99
 4.09 Intr + 192972 193427  456  0  0   16   30   343 0.203  13.64
 4.10 Intr + 211294 211348   55  1  1  108   77    44 0.531   4.28
 4.11 Term + 213458 213604  147  1  0   67   55    83 0.345   0.80
 4.12 PlyA + 215413 215418    6                               1.05

 5.10 PlyA - 215950 215945    6                               1.05
 5.09 Term - 226991 226414  578  0  2   34   55   308 0.067  16.73
 5.08 Intr - 228653 228468  186  0  0   67   33   216 0.749  13.66
 5.07 Intr - 229559 229400  160  2  1   33   76    50 0.412  -2.14
 5.06 Intr - 230803 230612  192  1  0   80   47    67 0.305   1.49
 5.05 Intr - 234937 234730  208  0  1  104  116    50 0.396   8.48
 5.04 Intr - 238536 238419  118  1  1  152   99    25 0.888   9.52
 5.03 Intr - 240406 240361   46  1  1   24   89    31 0.520  -5.22
 5.02 Intr - 241579 241433  147  1  0  104   42   150 0.785  12.43
 5.01 Init - 242538 242272  267  0  0   55   91   113 0.561   3.49

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term + 166754 166882  129  1  0   63   48    90 0.830   0.68

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815583f:31227213_31472296|GENSCAN_predicted_peptide_1|412_aa
MTYKYQAGHMFLERRPKHVCSNYTHLEEASAAVKCCCLVLASQDSSIPPAPGEPCSVVAS
PTTRPGLLRMATTELFSNTGATLTSKLLVTMRPSEAIGVHEHSPGPEKTIETAEGGMPSF
HQKLSALQGQRAVIVVGKLRLSELEIPHLSPAAGMQPCLGRGLQTLEDLLSAMDKQPFCQ
TLPIGRKLTSSEEDVQKTAAGDSAAPVGLGKPLPEMAYVPHPESQLVGALYMMGSLRLYL
CLWLLLHFLTDLYDRTAGTCRNPRDSPWLLDLPSALVYSSTQLPLGNQGGPPRMRGHSRQ
QEQHVSGYGAVKGMHLWVVGHCYIAEGISGYTGTCSEQLPQASEQNWAAEGPCGDSGSPS
TQIRVAFESSCCTCGPNLNTCRQLQTWGTMPVQQENVTKCLLHDRGGPEELS

>gi568815583f:31227213_31472296|GENSCAN_predicted_CDS_1|1239_bp
atgacctacaagtaccaggcaggacatatgttcctagagaggagacctaaacatgtttgc
agtaattatacccaccttgaagaagcaagtgctgcagtgaagtgctgctgcctggtcctt
gctagtcaagattctagcatccctcctgctcctggagagccctgctccgtggtagcatct
ccgaccaccagacctggcttgttgaggatggctaccactgagctttttagcaatactggt
gctaccctcaccagcaagctccttgtcaccatgaggccctcagaggccattggagtccat
gaacattctccagggccagagaagaccatagagacagcagaaggaggaatgccctccttc
caccagaagctcagcgctttgcagggccagagggccgttattgtggtagggaaactgagg
ctcagcgaacttgagatccctcacctgagccctgcagctggcatgcagccttgcctggga
aggggcctgcagaccttggaggacttgctcagcgcaatggacaagcagcccttctgtcaa
acactccccataggaaggaaactcacctctagtgaggaagatgtgcagaaaacagcagca
ggagacagtgcagcccctgtgggcttaggaaagcctcttccagaaatggcctatgtgcca
cacccagagtctcagctagtcggggctctttacatgatgggatcattgaggctttatttg
tgcctttggttgttgctgcacttcctcactgacctttatgaccggacagcaggtacttgt
aggaaccccagagactccccctggcttctggaccttccttctgccttggtgtatagcagc
acccagcttccccttggtaatcagggtggaccaccccggatgagagggcactcaaggcag
caggaacagcacgtttcagggtatggagcagtaaaaggcatgcacctttgggtcgtggga
cactgctatattgccgaagggatctcagggtacactgggacctgctccgagcagctccct
caggcgtcagagcagaactgggcagctgaaggcccctgtggagacagcggcagtccgtcc
acccagataagggtagcgtttgagtccagctgctgcacctgtggccctaatctcaacaca
tgccgccagttgcagacctggggaacaatgccagtgcagcaggagaatgtcaccaaatgc
ctgcttcatgacagaggaggtcctgaggagctgagttag

>gi568815583f:31227213_31472296|GENSCAN_predicted_peptide_2|101_aa
MWLLKPPNIDPSVLAGLAGRLVQAQLGSRGEQLPGTAWSGRAPPENLQEEDHWGRRQKLA
AMETVALPLATPADKSHIVGLTAAASTCSLHRGCLEALRRA

>gi568815583f:31227213_31472296|GENSCAN_predicted_CDS_2|306_bp
atgtggctcctgaagcccccaaacatcgacccctcggtgctggctgggctggctgggcgc
ctcgtgcaggcgcagctgggcagccgtggggaacagttgccaggcacagcctggagtggg
agggcaccgcctgagaacctgcaggaggaagaccactggggacgtcgccagaaattggca
gccatggagacagtggcccttcccctcgctaccccggccgacaaaagccacattgtgggg
ctgacggcagctgccagcacctgctccctgcaccgcggctgcctcgaggctctgcgccgc
gcctga

>gi568815583f:31227213_31472296|GENSCAN_predicted_peptide_3|346_aa
MAAAAYVDHFAAECLVSMSSRAVVHGPREGPESRPEGAAVAATPTLPRVEERRDGKDSAS
LFVVARILADLNQQAPAPAPAERREGAAARKARTPCRLPPPAPEPTSPGAEGAAAAPPSP
AWSEPEPEAGLEPEREPGPAGSGEPGLRQRVRRGRSRADLESPQRKHKCHYAGCEKVYGK
SSHLKAHLRTHTAHLNAGPVATGSLSPRAGEGLRRVTLSKSCLWGPGAAYRLQGLGAASV
VVAAALVEQEGERPFACSWQDCNKKFARSDELARHYRTHTGEKKFSCPICEKRFMRSDHL
TKHARRHANFHPGMLQRRGGGSRTGSLSDYSRSDASSPTISPASSP

>gi568815583f:31227213_31472296|GENSCAN_predicted_CDS_3|1041_bp
atggcagccgccgcctatgtggaccacttcgccgccgagtgcctcgtgtccatgtcgagc
cgcgcggtcgtgcacgggccgcgggaggggccggagtcccggcccgagggcgcggccgtg
gccgccacccccacgctgccccgcgtcgaggagcgccgcgacggtaaggacagcgcctcg
ctcttcgtggtggcgcggatcctagcggacctcaaccagcaagcgccggcgcccgccccg
gcggagcgcagggagggcgccgcggcccggaaggcgaggaccccctgccgcctgccgccg
cccgcccccgagcccacctcccccggcgccgaaggcgcggcggccgcgccccccagcccg
gcgtggagcgagccggagcccgaggcggggctggagcccgagcgggagccggggcccgcg
gggagcggcgagcccggcctcagacaaagggtccggcggggccgaagtcgcgccgacctc
gagtccccgcagaggaagcacaagtgccactacgcgggctgcgagaaagtttacgggaaa
tcttcgcacctcaaggcgcacctgagaactcacacagcccacctgaatgccggccctgtg
gccacaggcagcctgtctcctagggctggggagggcctacggcgggtgacactctctaag
tcctgcctctggggccctggggcagcatataggctgcaggggttgggggctgcctctgtt
gtagtggctgcagcgctggtggaacaggaaggtgagaggcccttcgcctgcagctggcag
gactgcaacaagaagttcgcgcgctccgacgagctggcgcggcactaccgcacacacacg
ggcgagaagaagttcagctgccccatctgcgagaagcgcttcatgcgcagcgaccacctg
accaagcacgcgcgccgccacgccaacttccacccgggaatgctgcagcggcgcggcggg
ggctcgcggaccggctccctcagcgactacagccgctccgacgccagcagccccaccatc
agcccggccagctcgccctga

>gi568815583f:31227213_31472296|GENSCAN_predicted_peptide_4|494_aa
MVLEVPYLHPAAPPPTTLECKIASLLTLTLFPGPTPRQLPKFLQPTTGTDPKVCSSEDES
QDTGSLGSRKAGLGLEVRFVAVRRSAVSGCIVTQMSTNIHERTFGHTPNALNMSTQGQTR
PKAVHTLVQRDDIELTPTLLGLLGPPSGMQRSVSSRRARPSAFAFPGPDMEVTELQPLRC
SKTREEAKTELSQPGIAALALKGLVLRSDLAVSGTGTQGCPAIPPPGAEELSPAAAAAQS
PRAVTLTVKVQDFILAVSETTNPLERTNSRHGKAARAELSGGAAQNPVGKRLQQELMTLM
ISGDKGISAFLESDNLFKWVGTIHGAAGTVYEDLRYKLLLEFPSGYPYNEPTVKFLTSCY
HPNIDTQGNICLDIWKDKSSVLYGIRTILVSIQSLLGEPNIDSPLNMHAVEFWKNPTAFT
FKKPIQSRLLPQTSSPSSKLIINNSKVHYGHPSRQDQIPRGLSQRPDPKPGLWQLAPFLQ
VSWNRFALAGLVVD

>gi568815583f:31227213_31472296|GENSCAN_predicted_CDS_4|1485_bp
atggtcctggaggtgccctacctgcatcctgctgcacccccacccacgacactcgagtgc
aagatcgcctcactcctaaccctgactttattcccaggccccacaccgaggcagctacca
aagttcctacaacctaccacgggcactgaccccaaagtgtgctccagcgaggacgagtcc
caggacactgggagtctggggagtaggaaggcaggacttggattagaagtcaggtttgtg
gcagttaggcgaagtgctgtgtctggctgcatagtgacccagatgtccaccaacattcat
gagcgtacttttggacataccccaaatgccctcaacatgtccacacaaggacagacacgc
ccaaaggctgtccacacgttggtgcaaagggatgacatagagctgacccccaccctgctg
gggctactgggacctcctagcggcatgcagcgttcagtaagcagccgccgggcccggcct
tcggcctttgcatttcccggtccagatatggaggtgacggagctgcagccgcttcggtgc
tccaagaccagagaggaagcaaagacagagctttcccagccaggcatcgctgctttggcc
ttgaagggtctcgtgctcaggtccgacctggctgtcagcggcactgggacccagggctgc
cccgccatcccaccgccaggagcggaggagctgtctccagcagcagctgctgcacagagc
cccagagctgtaacactcactgtgaaggtccaggacttcattcttgcggtcagcgagacc
acgaacccactggaaagaaccaactcccgacacggaaaggctgcaagagccgagctgagt
gggggtgctgcccagaatcctgtgggcaaaagactacagcaggagctgatgaccctcatg
atatctggtgacaaagggatttctgctttccttgaatcagacaaccttttcaaatgggta
gggaccatccatggagcagctggcacagtgtatgaagacctgaggtataagctcttgcta
gagttccccagtggctacccttacaatgagcccacagtgaagttcctcacatcctgctac
caccccaacatagacacccagggtaacatatgcctggatatctggaaagacaagtcgtct
gtactgtatggcatcaggaccattctggtctccatccagagcctgctaggagaacccaac
attgatagccctttgaacatgcatgctgttgagttctggaaaaaccctacagcttttacc
ttcaagaaacctattcaaagcaggcttcttcctcagaccagctctcccagcagcaagctg
atcatcaataactccaaggtgcactacggccatccttcaaggcaggatcaaattcctcgt
ggactttcccagcgtccagaccccaagcctgggctctggcagctggctcctttcttgcag
gtctcttggaatcgcttcgcgcttgccggcctggttgtggactga

>gi568815583f:31227213_31472296|GENSCAN_predicted_peptide_5|633_aa
MAVAAAITLAAEGRWLGLHTPWSWWEPCVSELGQKLPESRCGRPTHSCRPRPPVLRRRQE
PRPPGQGYSRPNCSCASKPPCALGGGQEQMQDENSDKDAMEVSRKKINTNQLPKDPETKD
PNVLATPVPSGPISIVSMCLEVPQIGDISEPVPGVWCLGARILICPCAFEGSVDGVISTL
SFGAEEKTSSAPSYSWHACCYSGPPCSRCFVAKAPTALGVPSPIRLKGKQERSVMEEGAE
GPGQNLAVNWNAALMSLSKGFKLHLESLPTKAKGEKMASPGVAQLQAEILADPFCNQKAP
CGGLPFVLGHLHLYLQLPVPLRQAGQTLQPEEQGAVDGDNCSRSMSGKQLHSSACRRSSH
IPLYLESQLEVHSIFPGEAEQLNLKGIVENNTTISQQLLELKAGVVRERDSQRGTAKTTA
ITVTVEPKLPEEAIANVQHQGRRRARGGRGLGWRPAQALDCSRMGKAKVPTSKRAPSSPV
AKPGPVKTLTWKKNKKKKRFWKSKAQEVSKKPGSGPGAVVRPPKAPEDFSQNWKVLQEWL
LKQKSQAPEKPFVISQMGSKKKPKIIQQNKKEISPQVKGEEMLAGKDQEASRGSVPSGSK
MDRKAPVPRTKAGGAEHNKKGTKERTNGDIVPE

>gi568815583f:31227213_31472296|GENSCAN_predicted_CDS_5|1902_bp
atggcagtggctgctgccatcacactggctgcagaagggaggtggctggggctgcacact
ccatggagctggtgggagccctgcgtttctgagttggggcagaagctccctgaaagtcgc
tgtggccgcccaacccacagctgcagacccaggcctcctgtgcttcgaagaaggcaggag
ccccgccctcctggacaaggctacagccgtccaaactgtagctgtgcttccaagcctccc
tgtgctcttggaggggggcaggagcagatgcaggatgagaactcggacaaagacgccatg
gaggtttccaggaagaaaatcaacaccaatcaactccccaaagatcccgaaacaaaggat
cctaatgtacttgctactcccgtcccatcaggaccaatcagcatagtgtcaatgtgtctt
gaagtcccccagatcggtgacatctcagagccagtgccaggtgtttggtgtctgggtgcc
aggatcttgatttgcccctgtgcctttgaaggctcagtcgatggtgtgattagcaccctt
tcctttggtgcagaagagaagacttcctctgcgcccagttattcctggcacgcatgttgc
tattctggtccgccgtgtagcaggtgctttgtggccaaggcacccactgctttaggggtg
ccaagccccatcaggctgaagggcaaacaggaaaggtcagtgatggaggagggggcagag
ggccctgggcagaacctggctgtgaactggaatgctgcgctcatgtcgctatccaaaggc
tttaagttgcacctggaatctctgcccacaaaagctaagggggagaaaatggccagtcca
ggtgtcgcacagctgcaggcagagatcctggcggaccccttctgtaaccagaaggcaccc
tgtggggggctcccttttgtcttggggcacctccacctctacttacagctgcctgtgccc
ctgcggcaggcagggcagaccctccagcctgaggagcaaggtgcagtggatggggacaac
tgttccagaagtatgagtgggaagcagctgcattcatctgcatgtaggagaagcagccac
attcccctgtacctggagtctcagctggaggtccacagtatcttcccaggagaggcagag
caattgaatctcaagggcattgttgagaacaacaccacaatcagccagcaattactggag
cttaaagctggtgtggtcagggaaagagacagtcaaagagggactgccaaaaccactgcc
atcactgtgactgtggagccaaagctgcctgaagaagcaatagcaaatgttcaacaccag
gggaggcgccgggcccggggaggccggggtctcgggtggcggccggcccaggcgctggac
tgcagcaggatggggaaggcgaaggtccccacctccaagcgcgccccgagcagccccgtg
gctaagccgggtcctgtcaagacgctcacttggaagaaaaacaagaagaaaaaaaggttt
tggaaaagcaaggcgcaggaagtaagcaagaagccaggaagcggccctggtgctgtggtg
cgacctccaaaggcaccagaagacttttctcaaaactggaaggtgctgcaagagtggctg
ctgaaacaaaaatctcaggccccagaaaagccttttgtcatctctcagatgggttccaaa
aagaagcccaaaattatccagcaaaacaaaaaagagatctcgcctcaagtgaagggagag
gaaatgctggcgggaaaagaccaagaggccagcaggggctctgttccttcaggctccaag
atggacaggaaggcgccagtacctcgcaccaaggccggcggagcagagcacaataagaaa
ggaaccaaggaaaggacaaatggtgatattgttccagaatga