GENSCAN 1.0 Date run: 5-Nov-116 Time: 17:59:00 Sequence gi568815596f:174619908_174820315 : 200408 bp : 43.25% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.06 PlyA - 206 201 6 1.05 1.05 Term - 5270 5142 129 2 0 1 48 156 0.526 1.08 1.04 Intr - 6053 5914 140 0 2 77 66 75 0.354 4.38 1.03 Intr - 13697 13609 89 1 2 78 71 27 0.044 -0.39 1.02 Intr - 46889 46721 169 0 1 32 96 160 0.029 10.20 1.01 Init - 63065 62867 199 2 1 55 91 151 0.865 9.17 1.00 Prom - 63680 63641 40 -5.06 2.00 Prom + 79079 79118 40 -3.66 2.01 Init + 89778 89912 135 2 0 91 48 135 0.324 7.99 2.02 Term + 95218 95292 75 0 0 129 49 37 0.537 1.74 2.03 PlyA + 96304 96309 6 1.05 3.00 Prom + 96365 96404 40 -2.96 3.01 Init + 100001 100403 403 1 1 79 -4 397 0.011 26.39 3.02 Intr + 106146 106229 84 1 0 123 62 30 0.417 3.69 3.03 Intr + 109814 109917 104 0 2 115 79 71 0.363 8.79 3.04 Intr + 110209 110398 190 0 1 28 -9 139 0.172 -2.64 3.05 Term + 113325 113449 125 1 2 86 49 98 0.846 4.25 3.06 PlyA + 115695 115700 6 1.05 4.03 PlyA - 116991 116986 6 1.05 4.02 Term - 118407 118397 11 1 2 114 43 1 0.212 -3.44 4.01 Init - 124871 124703 169 2 1 64 100 101 0.511 8.60 4.00 Prom - 126829 126790 40 -2.16 5.13 PlyA - 127714 127709 6 1.05 5.12 Term - 128348 128217 132 2 0 99 53 112 0.997 6.89 5.11 Intr - 128912 128673 240 2 0 70 75 252 0.997 19.85 5.10 Intr - 130262 130039 224 0 2 101 113 309 0.984 32.55 5.09 Intr - 133833 133596 238 0 1 89 94 431 0.992 40.99 5.08 Intr - 134507 134312 196 1 1 58 94 330 0.984 30.02 5.07 Intr - 137768 137659 110 2 2 95 63 48 0.974 2.08 5.06 Intr - 139468 139424 45 1 0 121 93 50 0.989 7.41 5.05 Intr - 139726 139581 146 2 2 62 69 232 0.531 18.70 5.04 Intr - 153072 152896 177 1 0 84 14 132 0.027 5.29 5.03 Intr - 158353 158239 115 1 1 53 55 49 0.309 -1.88 5.02 Intr - 159777 159571 207 0 0 11 57 154 0.180 3.87 5.01 Init - 170699 170517 183 2 0 61 56 97 0.118 1.97 5.00 Prom - 173280 173241 40 -4.56 6.07 PlyA - 176950 176945 6 1.05 6.06 Term - 180380 180209 172 1 1 62 32 174 0.758 6.50 6.05 Intr - 180801 180738 64 1 1 96 59 48 0.758 0.48 6.04 Intr - 189135 188998 138 1 0 42 100 104 0.982 7.44 6.03 Intr - 191681 191604 78 0 0 50 87 115 0.930 7.12 6.02 Intr - 192575 192402 174 0 0 92 83 127 0.458 12.51 6.01 Intr - 196530 196473 58 0 1 90 55 61 0.190 1.46 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init - 46896 46721 176 0 2 52 96 164 0.941 12.42 S.002 Sngl + 100001 100411 411 1 0 79 48 405 0.971 31.89 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815596f:174619908_174820315|GENSCAN_predicted_peptide_1|241_aa MWRGGGGAAAPEGRAHISAKFPDGRSLPALSAFPARPAQPCGPREKFPRKNAQRGAGLRS RPEPLRVLEAHPTTEHSVSASASFLQSQAPGHVQAIALNLDDAESGFQLSFLVVKECGGA CGRPVCGKRKRLKSKLREGSSYIIGLLCSSRTEKEIPLALHIHKLCICGFSQLWIEICNF KLSSFRKENPEFFWVGSKHETSTQPSVAQKPIDASRQQLSLAAPRKCLVTGRLPASRPKE T >gi568815596f:174619908_174820315|GENSCAN_predicted_CDS_1|726_bp atgtggcggggcggcgggggggcggcggcgccggaggggagggcccacatctcggcgaag ttcccggatggtcgcagcctcccggcgctgagcgcttttcctgcccgcccggctcagccc tgcggaccccgggagaagtttcccagaaaaaatgcccagcgcggcgcggggctgcggagt cgtccggagccgctgcgcgttttagaggcacatcccaccaccgagcattctgtgtcagca tcagccagcttcctgcaaagccaggcccctgggcatgtgcaggctattgccttaaacctg gatgatgcagaaagtggcttccagctctccttcttagtggtcaaggaatgtggaggggcc tgtggcagacctgtatgtggaaaaagaaaacgtctcaaatccaagttaagagaaggaagt tcctacattattggactgctgtgcagctctaggacagaaaaggagataccgctggccctc catattcacaagctctgcatctgtgggttcagccaactatggatcgaaatatgcaacttc aaattatcttcattcaggaaggagaaccctgagttcttctgggttggatctaagcatgaa acttccactcaaccctcagttgcacagaagcccattgatgccagccggcagcagctgtcc ctggctgctcccagaaagtgcctggtaactgggagactgccagccagccgccccaaggag acctga >gi568815596f:174619908_174820315|GENSCAN_predicted_peptide_2|69_aa MAGCRSRALPCREAAKALREIEIERSAGGPGLLGDPGHPPQGLAQLELVPDLVSMTLGAP GEESTWTES >gi568815596f:174619908_174820315|GENSCAN_predicted_CDS_2|210_bp atggcgggctgcaggtcccgagccctgccctgcagggaggcagctaaggccctgcgagaa atcgaaatcgagcgcagcgctggtgggccagggctgctgggggacccagggcaccctccg caggggctggcccagctagaactggtgcctgacttggtctccatgaccctgggggctcca ggggaagagtcaacctggactgagtcttag >gi568815596f:174619908_174820315|GENSCAN_predicted_peptide_3|301_aa MARTKQTARKSTGGKAPRKQLATKAARKSAPSTGGVKKPHRYRPGTVALREIRRYQKSTE LLIRKLPFQRLVREIAQDFKTDLRFQSAAIGALQEASEAYLVGLFEDTNLCAIHAKRVTI MPKDIQLARRIRGEPGKTLGNTFLSLIVLGSEAKRETLDGVTGGVFGGCSPFHSKHAWEA TLPKAFGQSSVNTASQVWNRWALEGPDAGDGMQRFQVVTPAGRAWVRTATRAGASLLRRC LRKPSQPGAESDEDFLRVQATSRENVYYSPDRTQHKHEPSRLFPQVLCLLVEAFLSLSQT V >gi568815596f:174619908_174820315|GENSCAN_predicted_CDS_3|906_bp atggctcgtacaaagcagactgcccgcaaatcgaccggtggtaaagcacccaggaagcaa ctggctacaaaagccgctcgcaagagtgcgccctctactggaggggtgaagaaacctcat cgttacaggcctggtactgtggcgctccgtgaaattagacgttatcagaagtccactgaa cttctgattcgcaaacttcccttccagcgtctggtgcgagaaattgctcaggactttaaa acagatctgcgcttccagagcgcagctatcggtgctttgcaggaggcaagtgaggcctat ctggttggcctttttgaagacaccaacctgtgtgctatccatgccaaacgtgtaacaatt atgccaaaagacatccagctagcacgccgcatacgtggagaacctgggaaaacccttgga aatacattcttgagccttattgtgttgggctcagaggctaagagggagaccctggatgga gtgacaggtggtgtctttggtggctgttcacccttccacagtaaacatgcttgggaagca actttgccaaaagccttcggacagtccagcgtcaacaccgcaagccaagtgtggaatcgg tgggcactggaaggacccgacgcaggtgatgggatgcagcgcttccaggtggtgaccccg gctggacgtgcgtgggtgcgaactgccacccgggctggggccagcttgctgcgccgctgc ctccgcaagccctcccagcccggagctgagtctgacgaggatttcttacgcgttcaagcg acctctcgtgaaaatgtctattactccccagaccgcactcagcacaagcacgaaccttcc cgtttattcccacaagtgctgtgcctgctggtggaagcatttctctccctttctcagaca gtttag >gi568815596f:174619908_174820315|GENSCAN_predicted_peptide_4|59_aa MSPWPHVFGYEVSSLVRVNAVWNIMMMDKASISPQMVVLAEALHAGKANPYTYVYSSTN >gi568815596f:174619908_174820315|GENSCAN_predicted_CDS_4|180_bp atgagcccatggccacatgtctttggctacgaagtgagttccctggtcagagtcaatgct gtatggaatatcatgatgatggataaggcatctataagtccacagatggtagttttggca gaagcattgcatgcaggaaaggcaaatccatatacttatgtctattccagcacaaactag >gi568815596f:174619908_174820315|GENSCAN_predicted_peptide_5|670_aa MRKLASLGLGPPKYQNRGLALISTSPAKLSLGSAGEGDYKDDGLPATPTGKTSDGIPVFS MEGVVLSALRCLYLNELPQTKAQKDIINPEYLSLVASVEQETELDLGTCELALLHSQIAY FPGNTSECDQWDSGEKPGFDKANSESINADHEHSKHNFWVMTLIFKKSKNMRSWPWKNSA IKAFPALIKWLDPRAAVAKAWLRFVLTLRGPAATVHAYFCQGSTGSETGLVLGSEHETRL VAKLFKDYSSVVRPVEDHRQVVEVTVGLQLIQLINVDEVNQIVTTNVRLKQQWVDYNLKW NPDDYGGVKKIHIPSEKIWRPDLVLYNNADGDFAIVKFTKVLLQYTGHITWTPPAIFKSY CEIIVTHFPFDEQNCSMKLGTWTYDGSVVAINPESDQPDLSNFMESGEWVIKESRGWKHS VTYSCCPDTPYLDITYHFVMQRLPLYFIVNVIIPCLLFSFLTGLVFYLPTDSGEKMTLSI SVLLSLTVFLLVIVELIPSTSSAVPLIGKYMLFTMVFVIASIIITVIVINTHHRSPSTHV MPNWVRKVFIDTIPNIMFFSTMKRPSREKQDKKIFTEDIDISDISGKPGPPPMGFHSPLI KHPEVKSAIEGIKYIAETMKSDQESNNAAAEWKYVAMVMDHILLGVFMLVCIIGTLAVFA GRLIELNQQG >gi568815596f:174619908_174820315|GENSCAN_predicted_CDS_5|2013_bp atgagaaagctggccagtctggggctggggcccccgaaatatcaaaatcgagggcttgct ttgatatctacaagccctgcaaagctttccttggggtcagcaggggagggtgactacaag gatgatggccttccggccactcccacaggaaagacctcagatgggatccccgtgttcagc atggaaggggttgtcctgtcagcgctgaggtgtctctaccttaatgagctgccccagaca aaggcacaaaaggacattatcaaccctgagtacctgtctttagtggcttctgttgaacag gaaactgagctggatctgggcacatgcgagctggctttactgcactctcaaatcgcctac ttccctgggaacacatctgagtgtgatcagtgggattctggagaaaagccgggctttgac aaggcaaattcagagagcataaatgcagaccatgaacacagcaagcataacttttgggtg atgacactgatctttaaaaaatcaaagaatatgagaagctggccatggaagaattctgcc atcaaagcatttcctgcattgatcaagtggttggaccccagagctgccgtggccaaggcc tggcttcggttcgtgttgactctgcgtggtcctgcagccacagtccacgcttacttctgc caggggtccactgggtcagaaactggcctcgtcctgggctccgaacatgagacccgtctg gtggcaaagctatttaaagactacagcagcgtggtgcggccagtggaagaccaccgccag gtcgtggaggtcaccgtgggcctgcagctgatacagctcatcaatgtggatgaagtaaat cagatcgtgacaaccaatgtgcgtctgaaacagcaatgggtggattacaacctaaaatgg aatccagatgactatggcggtgtgaaaaaaattcacattccttcagaaaagatctggcgc ccagaccttgttctctataacaatgcagatggtgactttgctattgtcaagttcaccaaa gtgctcctgcagtacactggccacatcacgtggacacctccagccatctttaaaagctac tgtgagatcatcgtcacccactttccctttgatgaacagaactgcagcatgaagctgggc acctggacctacgacggctctgtcgtggccatcaacccggaaagcgaccagccagacctg agcaacttcatggagagcggggagtgggtgatcaaggagtcccggggctggaagcactcc gtgacctattcctgctgccccgacaccccctacctggacatcacctaccacttcgtcatg cagcgcctgcccctctacttcatcgtcaacgtcatcatcccctgcctgctcttctccttc ttaactggcctggtattctacctgcccacagactcaggggagaagatgactctgagcatc tctgtcttactgtctttgactgtgttccttctggtcatcgtggagctgatcccctccacg tccagtgctgtgcccttgattggaaaatacatgctgttcaccatggtgttcgtcattgcc tccatcatcatcactgtcatcgtcatcaacacacaccaccgctcacccagcacccatgtc atgcccaactgggtgcggaaggtttttatcgacactatcccaaatatcatgtttttctcc acaatgaaaagaccatccagagaaaagcaagacaaaaagatttttacagaagacattgat atctctgacatttctggaaagccagggcctccacccatgggcttccactctcccctgatc aaacaccccgaggtgaaaagtgccatcgagggcatcaagtacatcgcagagaccatgaag tcagaccaggagtctaacaatgcggcggcagagtggaagtacgttgcaatggtgatggac cacatactcctcggagtcttcatgcttgtttgcatcatcggaaccctagccgtgtttgca ggtcgactcattgaattaaatcagcaaggatga >gi568815596f:174619908_174820315|GENSCAN_predicted_peptide_6|227_aa VNRDPSATPSQWMLQQQSLDCGLNVHKQCSKMVPNDCKPDLKHVKKVYSCDLTTLVKAHT TKRPMVVDMCIREIESRGLNSEGLYRVSGFSDLIEDVKMAFDRDGEKADISVNMYEDINI ITGALKLYFRDLPIPLITYDAYPKFIESANTYSTVYLQNYSIAKTARVKGGVTLHEKENL MNAENLGIVFGPTLMRSPELDAMAALNDIRYQRLVVELLIKNEDILF >gi568815596f:174619908_174820315|GENSCAN_predicted_CDS_6|684_bp gtaaacagagacccttctgctacaccttctcagtggatgctgcagcagcagagtctagat tgtggtttgaatgttcataagcagtgttccaagatggtcccaaatgactgtaagccagac ttgaagcatgtcaaaaaggtgtacagctgtgaccttacgacgctcgtgaaagcacatacc actaagcggccaatggtggtagacatgtgcatcagggagattgagtctagaggtcttaat tctgaaggactataccgagtatcaggatttagtgacctaattgaagatgtcaagatggct ttcgacagagatggtgagaaggcagatatttctgtgaacatgtatgaagatatcaacatt atcactggtgcacttaaactgtacttcagggatttgccaattccactcattacatatgat gcctaccctaagtttatagaatctgccaacacttactcaactgtctacttgcaaaactac tccatagcaaagacagccagagtgaagggaggagtgaccctccacgaaaaggagaatctt atgaatgcagagaaccttggaatcgtctttggacccacccttatgagatctccagaacta gacgccatggctgcattgaatgatatacggtatcagagactggtggtggagctgcttatc aaaaacgaagacattttattttaa