GENSCAN 1.0 Date run: 8-Nov-116 Time: 13:29:48 Sequence gi568815581f:56878160_57106232 : 228073 bp : 44.83% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.11 PlyA - 3303 3298 6 1.05 1.10 Term - 6682 6481 202 0 1 75 49 203 0.622 11.96 1.09 Intr - 14070 13545 526 1 1 62 32 548 0.066 38.60 1.08 Intr - 17282 17184 99 0 0 100 65 95 0.984 8.48 1.07 Intr - 17445 17362 84 1 0 102 110 19 0.968 5.29 1.06 Intr - 17793 17767 27 1 0 68 99 29 0.512 0.09 1.05 Intr - 21021 20956 66 1 0 70 100 33 0.732 1.58 1.04 Intr - 23419 23260 160 1 1 61 60 198 0.998 13.86 1.03 Intr - 26329 26096 234 1 0 89 92 349 0.999 33.29 1.02 Intr - 30404 30309 96 2 0 116 100 97 0.998 13.91 1.01 Init - 35829 35233 597 0 0 95 75 920 0.987 86.68 1.00 Prom - 43100 43061 40 -2.46 2.08 PlyA - 43242 43237 6 1.05 2.07 Term - 60995 60912 84 2 0 104 48 42 0.773 -0.55 2.06 Intr - 63964 63876 89 2 2 111 100 80 0.970 11.19 2.05 Intr - 68352 68283 70 0 1 64 105 7 0.484 -1.25 2.04 Intr - 71275 71228 48 1 0 88 116 40 0.924 5.68 2.03 Intr - 71608 71522 87 1 0 79 71 36 0.630 1.17 2.02 Intr - 72837 72054 784 2 1 89 22 327 0.389 17.68 2.01 Init - 82860 82616 245 0 2 84 85 263 0.999 22.81 2.00 Prom - 86902 86863 40 -3.26 3.00 Prom + 89798 89837 40 -5.76 3.01 Init + 100001 100076 76 1 1 67 96 238 0.999 21.75 3.02 Intr + 102923 103071 149 0 2 25 103 226 0.832 17.75 3.03 Intr + 107219 107308 90 1 0 131 27 61 0.923 4.49 3.04 Intr + 109536 109691 156 2 0 41 115 162 0.900 14.41 3.05 Intr + 110057 110131 75 1 0 69 90 60 0.941 4.01 3.06 Intr + 112940 113012 73 1 1 123 111 50 0.997 9.78 3.07 Intr + 116822 116859 38 0 2 63 115 25 0.965 0.68 3.08 Intr + 117348 117476 129 2 0 49 98 101 0.919 8.09 3.09 Intr + 120240 120339 100 2 1 30 65 75 0.651 -0.92 3.10 Intr + 122696 122833 138 0 0 99 49 306 0.989 28.24 3.11 Intr + 123859 124022 164 2 2 115 95 156 0.993 18.69 3.12 Term + 128014 128076 63 0 0 95 45 48 0.737 -0.91 3.13 PlyA + 128310 128315 6 1.05 4.00 Prom + 128479 128518 40 -3.56 4.01 Init + 130556 130620 65 1 2 37 46 128 0.324 4.12 4.02 Intr + 133524 133590 67 0 1 66 109 25 0.145 1.31 4.03 Intr + 162552 162650 99 2 0 85 60 34 0.020 0.61 4.04 Intr + 167538 167718 181 2 1 19 -7 295 0.098 12.54 4.05 Term + 167901 168286 386 1 2 -49 52 658 0.922 43.55 4.06 PlyA + 168615 168620 6 1.05 5.10 PlyA - 170115 170110 6 1.05 5.09 Term - 185722 185574 149 2 2 61 40 97 0.208 0.26 5.08 Intr - 201251 201128 124 2 1 86 62 -7 0.043 -3.34 5.07 Intr - 206859 206681 179 1 2 -13 93 140 0.549 4.04 5.06 Intr - 207313 207112 202 1 1 49 11 166 0.589 3.86 5.05 Intr - 207961 207773 189 1 0 61 76 134 0.949 9.28 5.04 Intr - 216327 216033 295 2 1 11 84 141 0.507 3.01 5.03 Intr - 218242 218097 146 1 2 47 82 123 0.638 6.68 5.02 Intr - 219943 219817 127 0 1 89 47 100 0.898 6.68 5.01 Init - 223790 223756 35 2 2 101 87 -4 0.799 0.14 5.00 Prom - 225094 225055 40 -8.36 6.00 Prom + 226199 226238 40 -8.76 6.01 Init + 227306 228061 756 1 0 74 41 454 0.469 32.01 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Term - 14070 13541 530 1 2 62 48 560 0.920 43.82 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815581f:56878160_57106232|GENSCAN_predicted_peptide_1|696_aa MAELCPLAEELSCSICLEPFKEPVTTPCGHNFCGSCLNETWAVQGSPYLCPQCRAVYQAR PQLHKNTVLCNVVEQFLQADLAREPPADVWTPPARASAPSPNAQVACDHCLKEAAVKTCL VCMASFCQEHLQPHFDSPAFQDHPLQPPVRDLLRRKCSQHNRLREFFCPEHSECICHICL VEHKTCSPASLSQASADLEATLRHKLTVMYSQINGASRALDDVRNRQQDVRMTANRKVEQ LQQEYTEMKALLDASETTSTRKIKEEEKRVNSKFDTIYQILLKKKSEIQTLKEEIEQSLT KRDEFEFLEKASKLRGISTKPVYIPEVELNHKLIKGIHQSTIDLKNELKQCIGRLQEPTP SSGDPGEHDPASTHKSTRPVKKVSKEEKKSKKPPPVPALPSKLPTFGAPEQLVDLKQAGL EAAAKATSSHPNSTSLKAKVLETFLAKSRPELLEYYIKVILDYNTAHNKVALSECYTVAS VAEMPQNYRPHPQRFTYCSQVLGLHCYKKGIHYWEVELQKNNFCGVGICYGSMNRQGPES RLGRNSASWCVEWFNTKISAWHNNVEKTLPSTKATRVGVLLNCDHGFVIFFAVADKVHLM YKFRVDFTEALYPAFWVFSAGATLSICSPKNVKGHELIQDLLSSLHLDSSYPPDAGLSDD DEPPNASLPPDPPLLTVPQMHSVCDQWLQDAFHISL >gi568815581f:56878160_57106232|GENSCAN_predicted_CDS_1|2091_bp atggcagagctgtgccccctggccgaggagctgtcgtgctccatctgcctggagcccttc aaggagccggtcaccactccgtgcggccacaacttctgcgggtcgtgcctgaatgagacg tgggcagtccagggctcgccatacctgtgcccgcagtgccgcgccgtctaccaggcgcga ccgcagctgcacaagaacacggtgctgtgcaacgtggtggagcagttcctgcaggccgac ctggcccgggagccacccgccgacgtctggacgccgcccgcccgcgcctctgcacccagc ccgaatgcccaggtggcctgcgaccactgcctgaaggaggccgccgtgaagacgtgcttg gtgtgcatggcctccttctgtcaggagcacctgcagccgcacttcgacagccccgccttc caggaccacccgctgcagccgcccgttcgcgacctgttgcgccgcaaatgttcccagcac aatcggctgcgggaatttttctgccccgagcacagcgagtgcatctgccacatctgcctg gtggagcataagacctgctctcccgcgtccctgagccaggccagcgccgacctggaggcc accctgaggcacaaactaactgtcatgtacagtcagatcaacggggcgtcgagagcactg gatgatgtgagaaacaggcagcaggatgtgcggatgactgcaaacagaaaggtggagcag ctacaacaagaatacacggaaatgaaggctctcttggacgcctcagagaccacctcgaca aggaagataaaggaagaggagaagagggtcaacagcaagtttgacaccatttatcagatt ctcctcaagaagaagagtgagatccagaccttgaaggaggagattgaacagagcctgacc aagagggatgagttcgagtttctggagaaagcatcaaaactgcgaggaatctcaacaaag ccagtctacatccccgaggtggaactgaaccacaagctgataaaaggcatccaccagagc accatagacctcaaaaacgagctgaagcagtgcatcgggcggctccaggagcccaccccc agttcaggtgaccctggagagcatgacccagcgtccacacacaaatccacacgccctgtg aagaaggtctccaaagaggaaaagaaatccaagaaacctccccctgtccctgccttaccc agcaagcttcccacgtttggagccccggaacagttagtggatttaaaacaagctggcttg gaggctgcagccaaagccaccagctcacatccgaactcaacatctctcaaggccaaggtg ctggagaccttcctggccaagtccagacctgagctcctggagtattacattaaagtcatc ctggactacaacaccgcccacaacaaagtggctctgtcagagtgctatacagtagcttct gtggctgagatgcctcagaactaccggccgcatccccagaggttcacatactgctctcag gtgctgggcctgcactgctacaagaaggggatccactactgggaggtggagctgcagaag aacaacttctgtggggtaggcatctgctacggaagcatgaaccggcagggcccagaaagc aggctcggccgcaacagcgcctcctggtgcgtggagtggttcaacaccaagatctctgcc tggcacaataacgtggagaaaaccctgccctccaccaaggccacgcgggtgggcgtgctt ctcaactgtgaccacggctttgtcatcttcttcgctgttgccgacaaggtccacctgatg tataagttcagggtggactttactgaggctttgtacccggctttctgggtattttctgct ggtgccacactctccatctgctcccccaaaaatgtgaaaggtcatgagctcatccaggac ttgctatcctccctgcatttagacagttcctacccacctgatgctggcctgtctgatgat gatgagcctcccaatgccagcctgccccccgacccgccactcctcactgtgccccagatg cacagtgtttgtgaccagtggctgcaggatgccttccacatcagcctctga >gi568815581f:56878160_57106232|GENSCAN_predicted_peptide_2|468_aa MAASETVRLRLQFDYPPPATPHCTAFWLLVDLNRCRVVTDLISLIRQRFGFSSGAFLGLY LEGGLLPPAESARLVRDNDCLRVKLEERGVAENSVVISNGDINLSLRKAKKRAFQLEEGE ETEPDCKYSKKHWKSRENNNNNEKVLDLEPKAVTDQTVSKKNKRKNKATCGTVGDDNEEA KRKSPKKKEKCEYKKKAKNPKSPKVQAVKDWANQRCSSPKGSARNSLVKAKRKGSVSVCS KESPSSSSESESCDESISDGPSKVTLEARNSSEKLPTELSKEEPSTKNTTADKLAIKLGF SLTPSKGKTSGTTSSSSDSSAESDDQCLMSSSTPECAAGFLKTNPVETPKKDYSLLPLLA AAPQVGEKIAFKLLELTSSYSPDVSDYKEGRILSHNPETQQVDIEILSSLPALREPGKFD LVYHNENGAEVVEYAVTQESKITVFWKELIDPRLIIESPSNTSSTEPA >gi568815581f:56878160_57106232|GENSCAN_predicted_CDS_2|1407_bp atggcagcttccgagacggttaggctacggcttcaatttgattacccgccgccagctacc ccgcactgtacggccttctggcttctggtcgacttgaacagatgccgagtcgtcacagat ctcattagtctcatccgccagcgcttcggcttcagttctggggccttcctaggcctctac ctggagggggggctcttgccccccgccgagagcgcgcgccttgtgagagacaacgactgc ctcagagttaaattagaagagagaggagttgctgagaattctgtagtcatcagtaatggt gacattaatttatctcttagaaaagcaaagaagcgggcatttcagttagaggagggtgaa gaaactgaaccagattgcaaatattcaaagaagcattggaagagtcgagagaacaataac aataatgagaaggtcttggatctggaaccaaaagctgtcacagatcagactgtcagcaaa aaaaacaagagaaaaaataaagcaacctgtggcacagtgggtgatgataacgaagaggcc aaaagaaaatcaccaaagaaaaaggagaaatgtgaatataaaaaaaaggctaagaatccc aagtctccgaaagtacaggcagtgaaagactgggccaatcagagatgtagttctccaaaa ggttctgctagaaacagccttgttaaagccaaaaggaaaggtagtgtaagcgtttgctca aaagagagtcccagttcctcctcggagtctgaatcttgtgatgaatctatcagtgatggt cccagcaaagtcactttggaggccagaaattcctcagagaaattaccaactgagttatca aaggaagaaccctctaccaaaaatacaactgcagacaaactggctataaaacttggcttt agccttacccccagcaagggcaagacctctggaacaacatcttccagttcagactctagt gcagagtcagacgaccaatgcttgatgtcatcgagcaccccggagtgtgctgcgggtttc ttaaagacaaatccagtagagacacccaagaaggactatagtctgttaccactgttagca gctgcccctcaagttggagaaaagattgcatttaagcttttggagctaacatccagttac tctcctgatgtctctgactacaaggaaggaagaatattaagccacaatccagagacccag caagtagatatagaaattctttcatccttacctgccttgagagaacctgggaaatttgat ttagtttatcacaatgaaaatggagccgaggtagtggagtacgctgtgacacaggagagc aagatcactgtattttggaaagagttgattgacccaagactgattattgaatctccaagt aacacatcaagtacagaacctgcctga >gi568815581f:56878160_57106232|GENSCAN_predicted_peptide_3|416_aa MELALRRSPVPRWLLLLPLLLGLNAGAVIDWPTEEGKEVWDYVTVRKDAYMFWWLYYATN SCKNFSELPLVMWLQGGPGGSSTGFGNFEEIGPLDSDLKPRKTTWLQAASLLFVDNPVGT GFSYVNGSGAYAKDLAMVASDMMVLLKTFFSCHKEFQTVPFYIFSESYGGKMAAGIGLEL YKAIQRGTIKCNFAGVALGDSWISPVDSVLSWGPYLYSMSLLEDKGLAEVSKVAEQVLNA VNKGLYREATELWGKAEMIIEQRHVRHLQRDALSQLMNGPIRKKLKIIPEDQSWGGQATN VFVNMEEDFMKPVISIVDELLEAGINVTVYNGQLDLIVDTMGQEAWVRKLKWPELPKFSQ LKWKALYSDPKSLETSAFVKSYKNLAFYWILKAGHMVPSDQGDMALKMMRLVTQQE >gi568815581f:56878160_57106232|GENSCAN_predicted_CDS_3|1251_bp atggagctggcactgcggcgctctcccgtcccgcggtggttgctgctgctgccgctgctg ctgggcctgaacgcaggagctgtcattgactggcccacagaggagggcaaggaagtatgg gattatgtgacggtccgcaaggatgcctacatgttctggtggctctattatgccaccaac tcctgcaagaacttctcagaactgcccctggtcatgtggcttcagggcggtccaggcggt tctagcactggatttggaaactttgaggaaattgggccccttgacagtgatctcaaacca cggaaaaccacctggctccaggctgccagtctcctatttgtggataatcccgtgggcact gggttcagttatgtgaatggtagtggtgcctatgccaaggacctggctatggtggcttca gacatgatggttctcctgaagaccttcttcagttgccacaaagaattccagacagttcca ttctacattttctcagagtcctatggaggaaaaatggcagctggcattggtctagagctt tataaggccattcagcgagggaccatcaagtgcaactttgcgggggttgccttgggtgat tcctggatctcccctgttgattcggtgctctcctggggaccttacctgtacagcatgtct cttctcgaagacaaaggtctggcagaggtgtctaaggttgcagagcaagtactgaatgcc gtaaataaggggctctacagagaggccacagagctgtgggggaaagcagaaatgatcatt gaacagcgccacgtgagacacctacaacgagatgccttaagccagctcatgaatggcccc atcagaaagaagctcaaaattattcctgaggatcaatcctggggaggccaggctaccaac gtctttgtgaacatggaggaggacttcatgaagccagtcattagcattgtggacgagttg ctggaggcagggatcaacgtgacggtgtataatggacagctggatctcatcgtagatacc atgggtcaggaggcctgggtgcggaaactgaagtggccagaactgcctaaattcagtcag ctgaagtggaaggccctgtacagtgaccctaaatctttggaaacatctgcttttgtcaag tcctacaagaaccttgctttctactggattctgaaagctggtcatatggttccttctgac caaggggacatggctctgaagatgatgagactggtgactcagcaagaatag >gi568815581f:56878160_57106232|GENSCAN_predicted_peptide_4|265_aa MHHLNVDRYGTTATLQDSPERCSAELSCSPFPRILLEHVCFPGQHSQPSMVFVLFSRNSL IAGLSLVHRCGPGIWHVSRPPLENADQHLFTLPQGYGQFAFGIFDDSFEIPTFSPGAQAD GSKDPERPWETEHQSRPLANGLDAFAQLLNQFENTGPPPADEEKIQYLPTVPVTEEHVGS GLECPVCKDDYALGEQLPRNHLFHDGCIVHRLEQHDSCPVCRKSLPGHNTATNTPAPGPT GMNCSSSSSSPSSSSPSKENATSNS >gi568815581f:56878160_57106232|GENSCAN_predicted_CDS_4|798_bp atgcatcacctgaatgtggaccgctatggcaccacggccacactccaggacagccctgaa agatgctcagctgagctcagctgctcccccttcccccggatcctcctagagcatgtctgc ttcccgggccagcacagccagccaagtatggtgtttgtcctgttttctagaaactccttg atagcaggcttaagtctggttcatcgatgtggccctggcatctggcacgtaagccggccg ccgttggagaacgcggaccagcacctgttcacgctgccgcagggctacggacagtttgct ttcggcatctttgacgacagcttcgagatccccacgttctctcctggggcgcaggctgac ggcagcaaggaccctgagagaccgtgggagacagagcatcagtcccggcccctggccaac ggcctggacgccttcgcacagctcctcaatcagtttgaaaacacgggccccccaccggca gatgaagagaaaatccagtacctccccaccgtccccgtcaccgaggagcacgtaggctcc gggctcgagtgccccgtgtgcaaggacgactacgcgctgggcgagcagctgccccgcaac cacctgttccacgatggctgcatagtgcaccggctggagcagcacgacagctgccccgtc tgccgaaaaagcctcccgggacacaacacggccacgaacacccccgccccgggcccgact gggatgaactgctcctcctcgtcgtcctccccctcctccagctcgcccagtaaagagaac gccacaagtaactcctga >gi568815581f:56878160_57106232|GENSCAN_predicted_peptide_5|481_aa MEKITWIRKTKREVPAASQNSPQKSNGLTLNSADKPGNCRLQPSAKQENKSLAMSSCQEL GTLSQPNSQKEGQKLNPVVSEYLLGKAFYGQNENPESTTRCSRPSCLLTSATQLKAQSMP SKVPVPAWIGWLLVGHKPRTPSQQATERWAPPRQGNQPRKEGHWIQKRDKRRGEGIPRLK GKQSQGKGMRPGETWNRCEYLGLRMQIPPETPLSLRPPDPNSVRTVPQPETRLDHNRPMS RPIFPVPLARRTELPGAAPAGEPRAPHRPDPAAFGSPAGGPRRSCPRPTSSSATIGVRDA ESARRRRPPAVPRGPARSAALPQLPHVLAQAESRHLSGARERGRAGRYSNRRNLDSLPRK PSTGFAALSRRVNQAISCLGTESALEKAEGPTLVTQERWQDFHPSHLTWNPWINQLPLEA NPETKGAVYTETVAEVTLKPYWIVPDLLLPAHMPLVPVKYSRTLPEENPALNYPEPVWFG L >gi568815581f:56878160_57106232|GENSCAN_predicted_CDS_5|1446_bp atggaaaaaatcacctggatacgtaaaacaaaaagggaggtacctgcagcttctcagaac agccctcagaaaagtaatggcctcactctcaactcagctgacaaacctggtaactgcagg ctacagccttctgctaaacaagaaaacaaatccctggcaatgagcagctgtcaggaactc gggacactttcccagcccaattcacagaaggaaggccagaagcttaacccagtagtgtct gagtacctgcttggcaaagccttctacggccaaaatgaaaaccctgaatccactaccagg tgctctaggcctagttgtctgctgacttcagccacccaactgaaggcccaatccatgcct tcaaaggtgcctgtccctgcatggatcggctggcttcttgtgggccacaaaccacgcaca ccttctcagcaagctactgaacgatgggctcccccaagacaaggaaatcaaccaagaaag gaggggcactggatccagaaacgagacaagagaaggggagagggaattccaaggctgaag gggaagcagagccagggaaaagggatgcgcccaggagaaacgtggaaccgttgcgagtac ctggggctacgaatgcaaattccccccgagacgcccctgagcctccgacctccagacccc aattcggttcgaacggtccctcagcccgaaacacggttggatcacaaccggccgatgtcc cggcccatcttcccagttcccctggctcgccgcacggaactcccaggggctgcgccggcg ggcgagccgagggctccccatcgccccgaccccgccgccttcgggtctccggcagggggt ccccggcgcagctgcccccgacctaccagctcctcggctaccatcggggtgcgggatgcg gagtccgcgcgccgccgccggcccccggctgtgccccgcggcccggcccgcagcgccgcg cttccgcagctcccccacgtgctagcccaggcggagagccgtcatctctccggtgcccgg gagaggggccgggcgggacggtacagcaaccggaggaaccttgattccctgccccgcaag cccagcaccggttttgccgccttgtctcgaagggtcaaccaggccatctcctgcctcggg acggagagcgccctggaaaaggcggaggggccgaccttagtcacacaagagcgatggcaa gattttcacccaagccatctgacttggaacccatggattaaccaactgccactggaggca aatccagagaccaagggagcagtttatacagaaacagttgcagaagtcactctaaaaccc tactggattgtcccagaccttctgctcccagctcatatgcctttggttcctgtcaagtat tccaggacactgccagaagagaaccctgccctgaactaccctgaacctgtctggtttggg ctctga >gi568815581f:56878160_57106232|GENSCAN_predicted_peptide_6|252_aa MAIQFRSLFPLALPGMLALLGWWWFFSRKKGHVSSHDEQQVEAGAVQLRADPAIKEPLPV EDVCPKVVSTPPSVTEPPEKELSTVSKLPAEPPALLQTHPPCRRSESSGILPNTTDMRLR PGTRRDDSTKLELALTGGEAKSIPLECPLSSPKGVLFSSKSAEVCKQDSPFSRVPRKVQP GYPVVPAEKRSSGERARETGGAEGTGDAVLGEKVLEEALLSREHVLELENSKGPSLASLE GEEDKGKSSSSQ >gi568815581f:56878160_57106232|GENSCAN_predicted_CDS_6|756_bp atggcaatccagttccgttcgctcttccccttggcattgcctgggatgctggcgctcctc ggctggtggtggtttttctctcgtaaaaaaggccatgtcagcagccatgatgagcagcag gtggaggctggtgctgtgcagctgagggctgaccctgccatcaaggaacctctccccgtg gaagacgtctgtcccaaagtagtgtccacaccccccagtgtcacagagcctccagaaaag gaactgtccaccgtgagcaagctgcctgcagagcccccagcattgctccagacacaccca ccttgccgaagatcagagtcctcgggcattcttcctaacaccacagacatgagattgcga ccaggaacacgcagagatgacagtacaaagctggagctagccctgacaggtggtgaagcc aaatcgattcctctagagtgccccctttcatccccaaagggtgtactattctccagcaaa tcagctgaggtgtgtaagcaagattcccccttcagcagggtgccaaggaaggtccagcca ggctaccccgtagtccccgcagagaagcgtagctctggggagagggcaagagagacaggt ggggccgaagggactggtgatgccgtgttgggggaaaaggtgcttgaagaagctctgttg tctcgggagcatgtcttggaattggagaacagcaagggccccagcctggcctctttagag ggggaagaagataaggggaagagcagctcatcccag