GENSCAN 1.0    Date run: 6-Nov-116    Time: 15:19:52

Sequence gi568815588f:123566408_123788157 : 221750 bp : 46.47% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3673   3869  197  0  2   24   36   151 0.278   1.71
 1.02 Term +   4270   4417  148  1  1   68   48   116 0.295   2.97
 1.03 PlyA +   8728   8733    6                               1.05

 2.00 Prom +  12571  12610   40                              -6.06
 2.01 Init +  13571  13609   39  1  0   81   98    11 0.093   1.59
 2.02 Intr +  16527  16668  142  2  1   79   -1   131 0.172   3.23
 2.03 Intr +  17789  17883   95  0  2   70   60    46 0.151  -0.32
 2.04 Intr +  18125  18272  148  1  1   29   82   170 0.526  10.31
 2.05 Term +  38111  38307  197  0  2   59   37   150 0.524   4.57
 2.06 PlyA +  38611  38616    6                               1.05

 3.00 Prom +  42318  42357   40                              -1.96
 3.01 Init +  44945  45103  159  1  0   79   97   187 0.769  18.52
 3.02 Term +  48394  48513  120  0  0   70   44    71 0.249  -0.63
 3.03 PlyA +  50307  50312    6                               1.05

 4.05 PlyA -  50402  50397    6                               1.05
 4.04 Term -  52848  52813   36  0  0  105   48    36 0.103  -1.36
 4.03 Intr -  57282  57080  203  1  2  104   67   195 0.934  18.00
 4.02 Intr -  58058  58011   48  0  0  108   83     5 0.453   0.65
 4.01 Init -  60647  60617   31  2  1   56   48    35 0.349  -3.78
 4.00 Prom -  64878  64839   40                              -1.96

 5.00 Prom +  65940  65979   40                              -4.56
 5.01 Init +  66833  66907   75  1  0   65   28    24 0.157  -4.78
 5.02 Intr +  67718  67817  100  1  1   93   72    63 0.660   4.88
 5.03 Intr +  69404  69576  173  0  2   60  109    89 0.860   7.86
 5.04 Intr +  75955  76015   61  0  1   91   98    22 0.063   1.81
 5.05 Intr +  79286  79413  128  0  2   29   89   114 0.277   6.00
 5.06 Intr +  93238  93619  382  0  1   59   74   164 0.132   6.58
 5.07 Term +  94698  94804  107  1  2   64   42    35 0.119  -4.93
 5.08 PlyA +  96336  96341    6                               1.05

 6.00 Prom +  98036  98075   40                              -7.26
 6.01 Init + 100001 100668  668  1  2   95   99  1584 0.984 152.90
 6.02 Intr + 108411 108524  114  0  0   82   99   171 0.957  17.16
 6.03 Term + 121522 121753  232  1  1  114   49   329 0.979  27.55
 6.04 PlyA + 123027 123032    6                               1.05

 7.00 Prom + 124634 124673   40                              -6.46
 7.01 Init + 127876 127942   67  0  1   66   94    23 0.081   2.01
 7.02 Intr + 136815 136989  175  1  1  131   82    63 0.143   9.00
 7.03 Intr + 152476 152593  118  1  1   69   80    26 0.285   0.37
 7.04 Intr + 155356 155469  114  0  0  103   77    -2 0.609   0.84
 7.05 Intr + 155878 155969   92  0  2   94   39    91 0.129   3.59
 7.06 Term + 173291 173441  151  2  1  130   50    99 0.410   7.68
 7.07 PlyA + 175824 175829    6                               1.05

 8.13 PlyA - 179252 179247    6                               1.05
 8.12 Term - 180610 180357  254  2  2   93   46   229 0.936  15.00
 8.11 Intr - 186694 186552  143  0  2  128   76   -29 0.579  -0.10
 8.10 Intr - 187121 187037   85  0  1   64   80    49 0.430   0.58
 8.09 Intr - 188355 188256  100  0  1   70   93    92 0.533   7.58
 8.08 Intr - 190945 190806  140  2  2   65   76   186 0.726  15.28
 8.07 Intr - 195762 195465  298  0  1   68   74   417 0.939  34.75
 8.06 Intr - 200745 200566  180  0  0   93  113   149 0.998  17.96
 8.05 Intr - 202315 202119  197  2  2   16   47   516 0.875  39.43
 8.04 Intr - 204632 204509  124  2  1   63   89   130 0.730  10.76
 8.03 Intr - 206326 205657  670  0  1 -114   55   491 0.131  17.31
 8.02 Intr - 207083 206554  530  2  2   47   94   540 0.970  42.23
 8.01 Intr - 213848 213760   89  0  2   82  110   106 0.622  11.89

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Term +  22343  22488  146  0  2   56   54   117 0.935   3.17
S.002 Term - 137241 137072  170  1  2  118   48   110 0.811   8.04

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815588f:123566408_123788157|GENSCAN_predicted_peptide_1|114_aa
RPAHWGGKCGPRTNSPGRTRSLLETQHLRACQTPKSKQEPLTPMHTGVQSTAAEPWSPKL
AMDGSSNLESCAPGEVSEVVESLVPGPSPAFVAWMCQYGDEQTPGAQVINFFKI
>gi568815588f:123566408_123788157|GENSCAN_predicted_CDS_1|345_bp
aggcctgcccactggggtgggaagtgtggtcctcggaccaacagccctggccgcaccagg
agccttttagaaacgcagcatctcagggcttgccagacccctaaatccaaacaagagccc
ctgacgcccatgcacaccggagtacagagcacagctgcagagccctggtcccctaagttg
gccatggatggcagcagcaacctggagagttgtgctcctggggaggtgtcggaggtagtg
gagtctctagtcccaggtccctctcccgcctttgtggcctggatgtgccaatacggtgat
gagcagactccaggggcccaagtgatcaacttctttaagatctga
>gi568815588f:123566408_123788157|GENSCAN_predicted_peptide_2|206_aa
MGSAITHLTESPRVPALHGGYLANNPPTQGWKRMEKEILALFMSTVQKGTSPQVERVLET
EWDAPVLFFNMATEPGSQAEPAPVEGAFPPAEEQGLIQRQQPVSHRLRSFRGPVPGGGDL
PVTSDSGPAQDEGTAEGAFDKSFNNFTGFRVRCRNVSVLEATGTAGPHSALGSQKKEEPW
FPCLDNEELAESDPEEGLCYGLNVSS
>gi568815588f:123566408_123788157|GENSCAN_predicted_CDS_2|621_bp
atgggttctgccatcacccatctcacagagagtccaagggttcctgccctgcatggaggc
tacctggccaacaacccacctacccaaggatggaaaaggatggaaaaggagattttggcg
ctctttatgtccacagtgcagaaagggaccagcccccaagtggaacgggttttggagact
gagtgggatgctcctgtcctgttcttcaacatggccacggagcctggaagtcaggcggag
ccggctcctgtggagggtgcattccctccggcagaggagcagggtctcattcagaggcag
cagcccgtctcccacaggcttcgctccttccgaggcccagttcctggtggaggagacctc
ccagtcaccagcgactcaggtcccgctcaggacgagggcacagcagagggcgccttcgac
aaatccttcaacaacttcactggcttcagagtcagatgtcgaaatgtctctgtcctggaa
gccactggaactgcaggtccccactctgcattaggcagccagaagaaagaggagccttgg
tttccttgtctggacaatgaggaacttgcagaaagtgaccctgaggaaggcctttgctat
ggtttgaatgtttcctcctaa
>gi568815588f:123566408_123788157|GENSCAN_predicted_peptide_3|92_aa
MIIGMTMFHHCHYLYRHHYHHHLHTTIIITIITGIIITLIIMIVIGESQGWGKIKKVQLQ
KSKPFVRCRTAFDNYGGDSDDLALPQHGALCH
>gi568815588f:123566408_123788157|GENSCAN_predicted_CDS_3|279_bp
atgattattggcatgacaatgtttcatcactgtcactatctctatcgccatcactaccat
catcatcttcacaccaccatcatcatcaccatcattactggcatcatcatcaccctcatc
atcatgattgtcattggagagtctcagggttgggggaagataaagaaagtacagcttcaa
aaatctaagccatttgtccgatgcagaacagcatttgacaactatggaggtgactctgat
gacctggctctacctcagcatggggccctgtgtcattga
>gi568815588f:123566408_123788157|GENSCAN_predicted_peptide_4|105_aa
MEQLKLSYTADSDTDTIQTTNYFGCITIVQGRFLNVTTIIIAFFILFVIFTAITYAFFIL
AIIIIGLLTPLTSFFTIHSPLLQRFWLNDQWFKQLGGGLDLGSIY
>gi568815588f:123566408_123788157|GENSCAN_predicted_CDS_4|318_bp
atggagcaactgaaactctcgtacactgctgattctgacacagacacaattcaaacaaca
aattattttggctgcattacaattgtccaaggcaggttccttaatgtcaccaccatcatc
attgccttcttcatccttttcgtcatcttcaccgccatcacctatgccttcttcatcctt
gccatcatcatcattggcctcctcaccccccttacctctttctttacaatacactccccc
ttgctccagaggttttggttaaatgaccaatggttcaagcagctgggtggaggcttggac
ctgggctccatctattga
>gi568815588f:123566408_123788157|GENSCAN_predicted_peptide_5|341_aa
MLGFLGGVTQGPGMWLHIQSYYILPLWVRQPGDPAEMVKEFHWGGAGGSDNQAENTAQGD
QKKEKKKKKKKKKKVLSLFSLFKMSHPADMLTLGDLYRQHIPQQKETPRLGINTCQTLEA
APGLFAILHKDLQPFSGPAPRSISTQHDLGELTFRLIYLVLAALMNVSTDRTIPGPPAQA
VVVGEWGLEWDKELGEIAFPEGSSPGNTRQAPLPHSSSEAGNEQGQTSLEVSLNPEKLLF
MNQFNASQSNTKEDQSHASLLLESACIANKEAAATTDTFNGFSKQGANSLAGCVPVSSVE
RNGFESGFYKNPMKNQICKSFKNRELLVGQGGETRRDLSPE
>gi568815588f:123566408_123788157|GENSCAN_predicted_CDS_5|1026_bp
atgcttggcttcctaggtggtgtgactcagggccctggcatgtggctgcacatccaatct
tactacatcttaccactctgggtacgacagccaggggaccctgctgaaatggtgaaggag
ttccactggggtggggctggggggtcagacaaccaagctgaaaacacagctcagggggac
caaaaaaaagaaaaaaagaagaagaagaagaaaaagaagaaagtattgtcactgttcagc
cttttcaagatgagtcacccagcagatatgctgactctaggtgacctctaccggcagcat
atcccccaacagaaggagactcccaggcttggcattaatacctgccagaccctggaggca
gctcctggcctttttgctatcctgcacaaggatttgcagccattttcaggaccagctccc
aggagcatatccacccaacacgacctgggagagctcaccttccggctcatctacctggtg
ctggctgctctgatgaacgtgagcacagacagaaccattccggggcctcctgctcaggct
gtggttgtgggtgagtgggggctggagtgggacaaagaactaggagaaatagcttttcct
gaaggctccagtcctgggaatacaagacaagctccactgcctcactccagctcagaggca
ggaaatgaacagggccagacatctttagaagtcagtttaaacccagagaagctgttgttc
atgaaccagttcaatgcctcgcagtctaacactaaggaagatcaatcacacgcttctctg
ctcctggagtcagcctgcattgcaaataaagaagctgctgccaccacagacacatttaat
ggattttcaaagcaaggtgcaaactcactggctggctgtgtgcctgtgagttcagtggag
aggaatggctttgagtcagggttctacaagaatcccatgaagaatcagatttgtaaaagc
tttaaaaatagagaactgctggtgggccaaggtggagagactagaagggacctatcccca
gaatga
>gi568815588f:123566408_123788157|GENSCAN_predicted_peptide_6|337_aa
MNSWDAGLAGLLVGTMGVSLLSNALVLLCLLHSADIRRQAPALFTLNLTCGNLLCTVVNM
PLTLAGVVAQRQPAGDRLCRLAAFLDTFLAANSMLSMAALSIDRWVAVVFPLSYRAKMRL
RDAALMVAYTWLHALTFPAAALALSWLGFHQLYASCTLCSRRPDERLRFAVFTGAFHALS
FLLSFVVLCCTYLKVLKVARFHCKRIDVITMQTLVLLVDLHPSVRERCLEEQKRRRQRAT
KKISTFIGTFLVCFAPYVITRLVELFSTVPIGSHWGVLSKCLAYSKAASDPFVYSLLRHQ
YRKSCKEILNRLLHRRSIHSSGLTGDSHSQNILPVSE
>gi568815588f:123566408_123788157|GENSCAN_predicted_CDS_6|1014_bp
atgaactcgtgggacgcgggcctggcggggctactggtgggcacgatgggcgtctcgctg
ctgtccaacgcgctggtgctgctctgcctgctgcacagcgcggacatccgccgccaggcg
ccggcgctcttcaccctgaacctcacgtgcgggaacctgctgtgcaccgtggtcaacatg
ccgctcacgctggccggcgtcgtggcgcagcggcagccggcgggcgaccgcctgtgccgc
ctggctgccttcctcgacaccttcctggctgccaactccatgctcagcatggccgcgctc
agcatcgaccgctgggtggccgtggtcttcccgctgagctaccgggccaagatgcgcctc
cgcgacgcggcgctcatggtggcctacacgtggctgcacgcgctcaccttcccagccgcc
gcgctcgccctgtcctggctcggcttccaccagctgtacgcctcgtgcacgctgtgcagc
cggcggccagacgagcgcctgcgcttcgccgtcttcactggcgccttccacgctctcagc
ttcctgctctccttcgtcgtgctctgctgcacgtacctcaaggtgctcaaggtggcccgc
ttccattgcaagcgcatcgacgtgatcaccatgcagacgctggtgctgctggtggacctg
caccccagtgtgcgggaacgctgtctggaggagcagaagcggaggcgacagcgagccacc
aagaagatcagcaccttcatagggaccttccttgtgtgcttcgcgccctatgtgatcacc
aggctagtggagctcttctccacggtgcccatcggctcccactggggggtgctgtccaag
tgcttggcgtacagcaaggccgcatccgacccctttgtgtactccttactgcgacaccag
taccgcaaaagctgcaaggagattctgaacaggctcctgcacagacgctccatccactcc
tctggcctcacaggcgactctcacagccagaacattctgccggtgtctgagtga
>gi568815588f:123566408_123788157|GENSCAN_predicted_peptide_7|238_aa
MGACLVRGPVVTELSVGPEMVSDQLPGECEERRATGTSEATLGTNRWLEAAQTSDIIQGT
FINRKNVAGQRVEAPANTTSCLPMWLGLHSLANELQDQAFQKVKSQYASTYQALLVSSIM
PPWTWLFVENKQVPKLPSVTKDCQNPRLLWGPAGSKSETQTQEGHSTSNAKTYEEIDEIW
KIQANKVHLLAEGPEMEPDAVESLGFRELEKSGAQRMGSWPLPSPPSPDREPDSCAHA
>gi568815588f:123566408_123788157|GENSCAN_predicted_CDS_7|717_bp
atgggagcctgcctggtgaggggtccagtggtgacagagctcagtgtgggtccagagatg
gtgtcagaccagctcccaggggaatgtgaagaaaggagggccacaggcaccagtgaggcc
accctggggacaaacaggtggttagaggcagctcagacctcagacattatccagggcact
tttataaacaggaaaaatgtggccgggcaaagagtagaggctcctgctaacaccaccagc
tgcctccccatgtggcttggactccacagtttggccaatgagctccaagatcaagcgttc
caaaaggtcaagtcccagtatgcaagcacttaccaagctctgcttgtcagttcaataatg
cctccctggacttggctctttgttgaaaacaaacaagtacccaaactaccttctgtcacc
aaagactgccagaatcccaggctcctgtggggaccagctggctccaaatcagagacacag
actcaggaaggacactcaaccagcaatgcaaagacttacgaagagattgatgagatctgg
aaaatacaagcaaataaggtgcatctgttggcagaaggcccagagatggaacctgatgct
gtggagtcactgggcttccgggagctggaaaagagcggtgcccaaaggatgggatcatgg
cctttgccatctccccctagtcccgacagggagcctgacagctgtgctcatgcttag
>gi568815588f:123566408_123788157|GENSCAN_predicted_peptide_8|936_aa
XPNNYYHRRNEMTTTDDLDFKHHNYKEMRQCSEWAIKKQEITVRKVITTTEGGENHNEGS
EIHNKGSDNHNQGNDDHNEGGDDHNQGGDNHDQGSDNHNEGGDDHNQGGDNHNQGSDNHN
EGADDHNERDKNHNQGGNNHNQGADNHNEGGGDHNNSDKNHNQRGDNHNQGSDNHNQGND
DHKEGGDAHNQRGENHNKGGGDHKQESDKNHNQRGVNHNQESDNHDQGSDDHKEGGDDHS
KGSDNHDQGSDDHNGGGNDHSKGSDNHDQGGDDHNQGGDNYTQGSDNCNQGSDNHDQGGD
NYTQGGDNYTQGSDNQNKAGDDHNQGGDNHTQGGDNCKRGSDNHDQGGGDYNQGGENHNS
EGENHNQGSDSYSNGGDNHNQGGDNHNQGGGNHNEGGDDHNQRGNNHNQGGDSHSQGRCH
HNQGVDNHNQLMKVVNEMCPNITRIYNIGKSHQGLKLYAVEISDHPGEHEVGEPEFHYIA
GAHGNEVLGRELLLLLVQFVCQEYLARNARIVHLVEETRIHVLPSLNPDGYEKAYEGGSE
LGGWSLGRWTHDGIDINNNFPDLNTLLWEAEDRQNVPRKVPNHYIAIPEWFLSENATVAA
ETRAVIAWMEKIPFVLGGNLQGGELVVAYPYDLVRSPWKTQEHTPTPDDHVFRWLAYSYA
STHRLMTDARRRVCHTEDFQKEEGTVNGASWHTVAGSLNDFSYLHTNCFELSIYVGCDKY
PHESQLPEEWENNRESLIVFMEQVHRGIKGLVRDSHGKGIPNAIISVEGINHDIRTGEKQ
KLKEGSDPLMVKGLVGFGFRFLTPRSLLRRHPLSATNPDPYEHGTPPRCTVGPSYLPQLC
PFPVACLPSNTPPNDGDYWRLLNPGEYVVTAKAEGFTASTKNCMVGYDMGATRCDFTLSK
TNMARIREIMEKFGKQPVSLPARRLKLRGQKRRQRG
>gi568815588f:123566408_123788157|GENSCAN_predicted_CDS_8|2811_bp
natcctaataattattatcaccgccggaacgagatgaccaccactgatgacctggatttt
aagcaccacaattataaggaaatgcgccagtgtagtgagtgggcaataaagaagcaggaa
atcacagtgagaaaagtaatcaccaccacagagggaggtgagaaccacaacgagggaagt
gagatccacaacaaaggaagtgataaccacaaccaggggaatgatgaccacaatgaggga
ggtgatgaccacaaccagggaggtgataaccacgaccagggaagtgataaccacaatgag
ggaggtgatgaccacaaccagggaggtgataaccacaaccagggaagtgataaccacaat
gagggagctgatgaccacaatgagagagataaaaaccacaaccagggaggcaataaccac
aaccagggagctgataaccacaatgagggagggggtgaccacaataacagcgataagaac
cacaaccagagaggtgataaccacaatcagggaagtgataaccacaaccaggggaatgat
gaccacaaagagggaggtgatgctcacaatcagagaggtgagaaccacaacaagggaggt
ggtgaccacaaacaagagagtgataagaaccacaaccagagaggtgttaaccacaatcag
gaaagtgataaccatgaccaggggagtgatgaccacaaagagggaggtgatgatcacagt
aagggaagtgataaccatgaccaggggagtgatgaccacaatggaggaggtaatgaccac
agtaagggaagtgataaccatgaccagggaggtgatgaccacaaccagggaggtgataac
tacacccaggggagtgataactgcaaccaggggagtgataaccatgaccagggaggtgac
aactacacccagggaggtgataactacacccagggcagtgataaccagaacaaggcaggt
gatgatcacaaccagggaggtgataaccacacccagggaggtgataactgcaaacggggg
agtgataaccatgaccagggaggtggtgactacaaccagggaggtgagaaccacaacagt
gaaggtgagaaccacaaccagggtagcgatagctatagcaatggaggtgataaccacaac
cagggaggtgataaccacaaccagggaggtggtaaccacaacgaaggaggtgatgatcac
aaccaaagaggtaataaccacaaccagggaggtgatagccacagtcagggaagatgtcac
cataaccagggagttgataatcacaaccaattgatgaaagttgtgaatgaaatgtgtccc
aatatcaccagaatttacaacattggaaaaagccaccagggcctgaagctgtatgctgtg
gagatctcagatcaccctggggagcatgaagtcggtgagcccgagttccactacatcgcg
ggggcccacggcaatgaggtgctgggccgggagctgctgctgctgctggtgcagttcgtg
tgtcaggagtacttggcccggaatgcgcgcatcgtccacctggtggaggagacgcggatt
cacgtcctcccctccctcaaccccgatggctacgagaaggcctacgaagggggctcggag
ctgggaggctggtccctgggacgctggacccacgatggaattgacatcaacaacaacttt
cctgatttaaacacgctgctctgggaggcagaggatcgacagaatgtccccaggaaagtt
cccaatcactatattgcaatccctgagtggtttctgtcggaaaatgccacggtggctgcc
gagaccagagcagtcatagcctggatggaaaaaatcccttttgtgctgggcggcaacctg
cagggcggcgagctggtggtggcgtacccctacgacctggtgcggtccccctggaagacg
caggaacacacccccacccccgacgaccacgtgttccgctggctggcctactcctatgcc
tccacacaccgcctcatgacagacgcccggaggagggtgtgccacacggaggacttccag
aaggaggagggcactgtcaatggggcctcctggcacaccgtcgctggaagtctgaacgat
ttcagctaccttcatacaaactgcttcgaactgtccatctacgtgggctgtgataaatac
ccacatgagagccagctgcccgaggagtgggagaataaccgggaatctctgatcgtgttc
atggagcaggttcatcgtggcattaaaggcttggtgagagattcacatggaaaaggaatc
ccaaacgccattatctccgtagaaggcattaaccatgacatccgaacaggtgaaaaacag
aagctcaaggagggcagtgacccactgatggttaaggggttagtggggtttggcttcagg
tttctgacccccaggtctttgctcagacgtcaccctctcagtgcaaccaaccctgacccc
tatgagcatggcacacctccccgctgcactgttggtccctcttacctgccccaactttgt
cccttccccgtggcatgtctcccttctaacacaccacccaacgatggggattactggcgc
ctcctgaaccctggagagtatgtggtcacagcaaaggccgaaggtttcactgcatccacc
aagaactgtatggttggctatgacatgggggccacaaggtgtgacttcacacttagcaaa
accaacatggccaggatccgagagatcatggagaagtttgggaagcagcccgtcagcctg
ccagccaggcggctgaagctgcgggggcagaagagacgacagcgtgggtga