GENSCAN 1.0    Date run: 6-Nov-116    Time: 00:52:05

Sequence gi568815580f:49462309_49686850 : 224542 bp : 45.16% C+G : Isochore 2 (43 - 51 C+G%)

Parameter matrix: HumanIso.smat

Predicted genes/exons:

Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

 1.01 Intr +   3097   3231  135  0  0   56  116    -1 0.156   0.26
 1.02 Term +   7629   7814  186  2  0   97   43   107 0.740   4.59
 1.03 PlyA +   8198   8203    6                               1.05

 2.07 PlyA -   8514   8509    6                               1.05
 2.06 Term -  20102  20037   66  2  0   96   43    66 0.800   0.84
 2.05 Intr -  27242  27051  192  2  0  131   94   226 0.973  27.19
 2.04 Intr -  28244  28146   99  2  0  102   75    12 0.739   1.61
 2.03 Intr -  28619  28485  135  2  0   74   94    90 0.981   8.96
 2.02 Intr -  30158  29967  192  2  0   63   44    79 0.046   0.69
 2.01 Init -  33618  32995  624  0  0   82   48   379 0.024  28.65
 2.00 Prom -  44792  44753   40                              -3.46

 3.05 PlyA -  45891  45886    6                               1.05
 3.04 Term -  53354  53284   71  0  2  -30   44   156 0.709  -2.50
 3.03 Intr -  54750  54635  116  2  2   92   92    57 0.871   6.59
 3.02 Intr -  55272  55224   49  1  1   74   68    35 0.585  -2.16
 3.01 Init -  56706  56643   64  0  1   76   85    32 0.561   3.02
 3.00 Prom -  58879  58840   40                              -3.16

 4.00 Prom +  59845  59884   40                              -3.26
 4.01 Init +  65776  65858   83  0  2   77   62    76 0.544   4.34
 4.02 Intr +  99025  99152  128  1  2   11   84   137 0.438   5.92
 4.03 Intr + 103009 103190  182  2  2  102  101   116 0.975  13.89
 4.04 Intr + 105134 105313  180  1  0   83   84   203 0.995  19.46
 4.05 Intr + 107129 107240  112  1  1   75   99   138 0.775  13.55
 4.06 Intr + 113061 113282  222  1  0   98   81   364 0.831  34.80
 4.07 Intr + 119107 119349  243  2  0   79   76   223 0.994  17.67
 4.08 Intr + 120054 120174  121  1  1   44  101   113 0.995   7.75
 4.09 Intr + 121248 121466  219  0  0   42  100   275 0.990  21.52
 4.10 Intr + 124438 124542  105  1  0   96   91    55 0.847   5.93
 4.11 Intr + 137332 137354   23  1  2   86   47     0 0.023  -7.11
 4.12 Intr + 138744 138764   21  1  0  122  100    -3 0.413   1.82
 4.13 Intr + 139080 139251  172  1  1   45  115   101 0.599   7.50
 4.14 Term + 142835 142976  142  2  1   79   38   105 0.688   2.00
 4.15 PlyA + 143593 143598    6                               1.05

 5.05 PlyA - 146920 146915    6                               1.05
 5.04 Term - 154838 154759   80  0  2   84   34    94 0.570   1.53
 5.03 Intr - 157604 157447  158  1  2   47  103    57 0.604   2.75
 5.02 Intr - 158831 158748   84  1  0   22   74   144 0.092   5.34
 5.01 Init - 168822 168701  122  0  2   68  113    45 0.099   3.87
 5.00 Prom - 176405 176366   40                              -5.26

 6.08 PlyA - 177252 177247    6                               1.05
 6.07 Term - 177741 177683   59  1  2   95   54    75 0.300   2.65
 6.06 Intr - 178733 178579  155  1  2   87   13    97 0.237   1.82
 6.05 Intr - 189089 188984  106  0  1   77   75    56 0.078   2.47
 6.04 Intr - 197255 197153  103  2  1   35   15   105 0.004  -2.35
 6.03 Intr - 211427 211323  105  2  0   63   66    56 0.125   1.31
 6.02 Intr - 218731 218603  129  1  0  141   52    78 0.934  10.39
 6.01 Init - 220855 220850    6  1  0   93   93     0 0.684   1.85

Suboptimal exons with probability > 0.800

Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr..
----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------

S.001 Sngl -  33618  32842  777  0  0   82   37   417 0.800  32.02
S.002 Init -  80339  80283   57  2  0   99   92    36 0.931   6.41

Predicted peptide sequence(s):

Predicted coding sequence(s):

>gi568815580f:49462309_49686850|GENSCAN_predicted_peptide_1|106_aa
ASCGAMVMNSLQGQLLAARLRKCCREFECSKSSDSSIPGYCLPDKELPRAIQLSPQVDAL
ILCHSQGVQPSGPYLPPYPPPAPGMPPVNPLAPGLVGPGVIVDKKM

>gi568815580f:49462309_49686850|GENSCAN_predicted_CDS_1|321_bp
gcctcctgtggtgctatggtaatgaattccttgcaaggccagctgttagcagcaagactc
agaaagtgctgcagagagtttgaatgctccaaaagcagtgactccagcattcctgggtat
tgcctgcctgataaggagctccccagggcaatccagctttccccccaggtggacgccctc
atcctgtgccacagccagggtgtccaaccttcgggtccctaccttcctccatacccacca
cctgcccctggaatgcctcctgtgaatcccttggcccctggcttggttggaccaggagtg
atagtggacaagaagatgtag

>gi568815580f:49462309_49686850|GENSCAN_predicted_peptide_2|435_aa
MVSAFVIMYSHEEDIDSAIEVFTHTIQWCQNHWPKSPAHLSLIREAANFKLKYRQKKEAI
SDLEQQWKQNPKDFHTLTQLISAYSLVDPEKDKALCEHLSLSDSMSLKVDVKALENSPGA
TYIWKKGGKVTGDSQQKEQGWGNLKKKKKMGKLPENYDPKVTPDPERWLPMQEHSYYWGR
KKGKKDQVGKGIQGATAGASSELDARKTQPEVSVSPALLRGPNPPAIAAILASGAPASRP
PGGELLGLLNPRWRRWLLMREDVAEGLTLGAGNTRETAQAIKGMHIRKATKYLKDVTLQK
QCVPFRRYNGGVGRCAQAKQWGWTQGRWPKKSAEFLLHMLKNAESNAELKGLDVDSLVIE
HIQVNKAPKMRRRTYRAHGRINPYMSSPCHIEMILTEKEQIVPKPEEEVAQKKKGADMNG
LPTKGPTEICDKKKD

>gi568815580f:49462309_49686850|GENSCAN_predicted_CDS_2|1308_bp
atggtgtctgcattcgtgatcatgtatagccatgaagaagatattgatagtgccattgag
gtcttcacacatactatccagtggtgtcaaaaccattggcccaaatctcctgctcatttg
tccttgataagagaagctgcgaacttcaaactcaaatatagacagaagaaggaggcaatt
agtgacctagaacaacaatggaaacaaaatccaaaagattttcacaccctgacacagctt
atttctgcttactcacttgtagatccagagaaagacaaagctctttgtgaacacttgtca
ttatcagatagcatgtctctaaaagtagatgtcaaggctcttgaaaattctcctggtgct
acgtacatttggaagaagggtggcaaagttactggagatagtcaacaaaaggagcaaggg
tggggaaatttgaaaaagaagaaaaagatgggaaaattgcctgagaattatgacccaaaa
gttaccccagatccagaaagatggctaccaatgcaagaacattcttattactggggaaga
aagaaaggtaaaaaggatcaggttggaaaagggatccagggagcaactgcaggagcttca
tcagaactggatgccagaaaaactcagcctgaggtgagtgtttctcctgcgttgctccga
gggcccaatcctcctgccatcgccgccatcctggcttcgggggcgccggcctccaggccc
ccgggaggagaactcctagggctactaaatcctcgctggaggcggtggcttcttatgcgg
gaggacgtggcggagggcctgactttgggagccgggaacactcgtgaaactgctcaggcc
atcaagggtatgcatatacgaaaagccacgaagtatctgaaagatgtcactttacagaaa
cagtgtgtaccattccgacgttacaatggtggagttggcaggtgtgcgcaggccaagcaa
tggggctggacacaaggtcggtggcccaaaaagagtgctgaatttttgctgcacatgctt
aaaaacgcagagagtaatgctgaacttaagggtttagatgtagattctctggtcattgag
catatccaagtgaacaaagcacctaagatgcgccgccggacctacagagctcatggtcgg
attaacccatacatgagctctccctgccacattgagatgatccttacggaaaaggaacag
attgttcctaaaccagaagaggaggttgcccagaagaaaaagggtgcagacatgaatgga
ttaccaacaaaaggaccaacagaaatctgtgataaaaagaaagactaa

>gi568815580f:49462309_49686850|GENSCAN_predicted_peptide_3|99_aa
MGVSKKRAASKTDVQAQLLQRGCRPTLQILDLPASIITYEPIRLSSVTPHGFQGQQVLLS
RKDSRSAGYFVSKDPNKYEQAKPYARINDTRIQEDSFDL

>gi568815580f:49462309_49686850|GENSCAN_predicted_CDS_3|300_bp
atgggtgtaagcaagaaaagggcagcttcgaaaacagatgtgcaagcacagttgttgcag
cgaggctgcaggcctaccctgcagattttggacttgccagcctccatcatcacctatgag
cctattcgcctcagttcagttactcctcatggcttccagggccagcaagttctgctgagc
agaaaagacagtcgatcagctggttactttgtctcaaaagacccaaataagtatgaacaa
gcaaagccttatgcacgcatcaacgacaccaggattcaagaggacagctttgacctctag

>gi568815580f:49462309_49686850|GENSCAN_predicted_peptide_4|650_aa
MPKEKIKLKAESCKKLPFVPKQIATDKSFPNIPKPPIHYHPRIADERTASEKLAGSLGVL
QLVAERAFRADKLHKPKATQTEVKPSVRFNLRTSKDPEHEGCYLSVGHSQPLEDCSFNMT
AKTFFIIHGWTMSGIFENWLHKLVSALHTREKDANVVVVDWLPLAHQLYTDAVNNTRVVG
HSIARMLDWLQEKDDFSLGNVHLIGYSLGAHVAGYAGNFVKGTVGRITGLDPAGPMFEGA
DIHKRLSPDDADFVDVLHTYTRSFGLSIGIQMPVGHIDIYPNGGDFQPGCGLNDVLGSIA
YGTITEVVKCEHERAVHLFVDSLVNQDKPSFAFQCTDSNRFKKGICLSCRKNRCNSIGYN
AKKMRNKRNSKMYLKTRAGMPFRVYHYQMKIHVFSYKNMGEIEPTFYVTLYGTNADSQTL
PLEIVERIEQNATNTFLVYTEEDLGDLLKIQLTWEGASQSWYNLWKEFRSYLSQPRNPGR
ELNIRRIRVKSGETQRKLTFCTEDPENTSISPGRELWFRKCRDGWRMKNETRDFAFSIIC
RNMIRQGTTSITHIHEEWPGDVLPAEFPPVMGRDATWKEEQLREQEVQGEGIYELSPGHA
PAISKAAVLKTAACIPSVDLGPWFWKEKSKSYSQRIVFVCSNLSVGYLKD

>gi568815580f:49462309_49686850|GENSCAN_predicted_CDS_4|1953_bp
atgccaaaggaaaaaatcaagctgaaagctgagtcatgcaagaagctgccttttgttcct
aagcagatagctacagataaaagttttcccaacatccctaagccgccgatacattatcac
ccacgtattgcggacgagagaaccgcctcggagaagctggctggctcgcttggagttttg
cagctagtggcggagcgagcattccgagcagataagctccacaaacccaaagctacacag
actgaggtcaaaccatctgtgaggtttaacctccgcacctccaaggacccagagcatgaa
ggatgctacctctccgtcggccacagccagcccttagaagactgcagtttcaacatgaca
gctaaaacctttttcatcattcacggatggacgatgagcggtatctttgaaaactggctg
cacaaactcgtgtcagccctgcacacaagagagaaagacgccaatgtagttgtggttgac
tggctccccctggcccaccagctttacacggatgcggtcaataataccagggtggtggga
cacagcattgccaggatgctcgactggctgcaggagaaggacgatttttctctcgggaat
gtccacttgatcggctacagcctcggagcgcacgtggccgggtatgcaggcaacttcgtg
aaaggaacggtgggccgaatcacaggtttggatcctgccgggcccatgtttgaaggggcc
gacatccacaagaggctctctccggacgatgcagattttgtggatgtcctccacacctac
acgcgttccttcggcttgagcattggtattcagatgcctgtgggccacattgacatctac
cccaatgggggtgacttccagccaggctgtggactcaacgatgtcttgggatcaattgca
tatggaacaatcacagaggtggtaaaatgtgagcatgagcgagccgtccacctctttgtt
gactctctggtgaatcaggacaagccgagttttgccttccagtgcactgactccaatcgc
ttcaaaaaggggatctgtctgagctgccgcaagaaccgttgtaatagcattggctacaat
gccaagaaaatgaggaacaagaggaacagcaaaatgtacctaaaaacccgggcaggcatg
cctttcagagtttaccattatcagatgaaaatccatgtcttcagttacaagaacatggga
gaaattgagcccaccttttacgtcaccctttatggcactaatgcagattcccagactctg
ccactggaaatagtggagcggatcgagcagaatgccaccaacaccttcctggtctacacc
gaggaggacttgggagacctcttgaagatccagctcacctgggagggggcctctcagtct
tggtacaacctgtggaaggagtttcgcagctacctgtctcaaccccgcaaccccggacgg
gagctgaatatcaggcgcatccgggtgaagtctggggaaacccagcggaaactgacattt
tgtacagaagaccctgagaacaccagcatatccccaggccgggagctctggtttcgcaag
tgtcgggatggctggaggatgaaaaacgaaaccagagattttgcttttagtatcatctgt
agaaatatgattagacaagggacaaccagtatcacacacattcatgaagaatggcctgga
gatgttttaccagcagaatttcctcctgtgatgggaagggacgccacctggaaagaggag
cagctacgggaacaggaggtgcaaggagaaggaatctatgaactgagtcctggacacgct
ccagctatcagcaaggcagcagtcctgaagactgcagcctgcattccaagtgtggatctt
ggtccctggttctggaaggagaagagcaagtcttattctcaaagaattgtgtttgtctgt
tctaacctgtctgtgggctacctgaaggactaa

>gi568815580f:49462309_49686850|GENSCAN_predicted_peptide_5|147_aa
MSWMAPEGLWALRCGPLELLRISDLQWGLNVPPQKLLTLPSTIRNGKDMKSTQMPINDRL
DKEKAKPGGCPRLVQAPTTDFWDSPNNTLNPCHTRWYLPSTSYRIPPQITAIQVALFIVL
YRPSDENPLSRTDLKAESEPPKAGMTP

>gi568815580f:49462309_49686850|GENSCAN_predicted_CDS_5|444_bp
atgtcctggatggcacccgaaggcctctgggctttacgctgtgggcccctggagctactg
aggatttctgatctgcagtggggcctaaatgtgcccccacaaaagctcctaacacttcca
agcactattcgcaacggcaaagacatgaagtcaacccaaatgcccatcaatgatagactg
gataaagaaaaggcgaaacctgggggatgccctcggctggtgcaggcacccacaacagat
ttctgggactctcccaacaataccctaaacccctgtcacacgcgttggtacctccctagt
actagctatcgcattccaccacagatcacagccatccaagtggctctgttcattgttctt
taccgcccttctgatgaaaacccactctcaagaacggacctgaaagcagaatcagagcca
ccgaaagctggaatgacaccttga

>gi568815580f:49462309_49686850|GENSCAN_predicted_peptide_6|220_aa
MTVLQLLLCPMDFVREDWPLVDLVNTTQDPSALGFYLLSNPIPDWKVDLQPTPALRAEGF
ASGQTKPVWCALVEEVGGKQNGKVEKPSNFTVEKPDKFHLRQVIKVNINSDKLCQRAMLR
EGEESRTQHGKQEWGKTATAGWKTVQQIEGTRRPGGAPATPTSPGMPAPLTKQRIGEEEN
CLSLNGPPYNLPLLEEEEDKPENSTVTKNVMIQTTLAAQM

>gi568815580f:49462309_49686850|GENSCAN_predicted_CDS_6|663_bp
atgactgtcctgcagcttctcctttgccccatggattttgtaagagaagactggcctttg
gtagatctcgtcaacacaacccaagaccccagtgccctgggcttctatctgctgagcaac
cccatcccagactggaaagtagacctgcaacctacaccagctttgagagcagaagggttt
gccagtggtcagaccaagcctgtttggtgtgctttggtggaggaggttggtggaaaacag
aatggaaaagtggaaaaacccagtaactttacagtggagaaacctgacaaattccacctc
cgccaggtgatcaaggttaacatcaacagtgataaattgtgtcaaagggcaatgctcagg
gaaggtgaggaaagcaggactcagcacggcaaacaggagtggggcaaaactgccacagca
ggttggaagacagttcaacagatcgaaggcacgaggagacccggaggagccccagccaca
cccacctccccaggaatgccggccccactgaccaagcagaggataggagaggaagagaac
tgtctgtctctcaatggacccccctataacttgcccctcctggaggaagaggaagacaag
cctgagaacagcaccgtcactaaaaacgtgatgattcaaaccacgctggcagctcagatg
tga