GENSCAN 1.0 Date run: 7-Nov-116 Time: 18:52:17 Sequence gi568815597f:111655925_111866896 : 210972 bp : 44.24% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Init + 25764 25983 220 2 1 43 9 146 0.135 0.99 1.02 Term + 27577 28007 431 2 2 19 41 188 0.141 2.66 1.03 PlyA + 28094 28099 6 1.05 2.00 Prom + 29875 29914 40 -2.26 2.01 Init + 35437 35493 57 0 0 66 119 40 0.991 6.24 2.02 Intr + 39417 39485 69 2 0 132 93 55 0.996 9.78 2.03 Intr + 41517 41573 57 2 0 123 70 35 0.947 4.28 2.04 Intr + 45091 45130 40 0 1 110 101 -6 0.407 0.70 2.05 Intr + 47485 47552 68 2 2 34 95 100 0.403 3.92 2.06 Intr + 48419 48562 144 1 0 133 76 85 0.993 12.28 2.07 Term + 53225 53311 87 1 0 57 47 69 0.544 -2.64 2.08 PlyA + 54318 54323 6 1.05 3.04 PlyA - 54557 54552 6 1.05 3.03 Term - 71880 71044 837 0 0 126 41 501 0.909 42.16 3.02 Intr - 83504 83262 243 2 0 97 99 50 0.327 4.69 3.01 Init - 83696 83610 87 2 0 94 17 110 0.342 3.16 3.00 Prom - 87175 87136 40 -5.46 4.00 Prom + 88957 88996 40 -5.86 4.01 Init + 100001 100301 301 1 1 108 89 447 0.998 42.21 4.02 Intr + 100722 100816 95 1 2 112 109 47 0.994 8.88 4.03 Intr + 103476 103644 169 2 1 71 99 2 0.855 -0.88 4.04 Intr + 104550 104664 115 1 1 73 69 68 0.790 2.91 4.05 Intr + 104782 104924 143 1 2 82 75 18 0.557 -0.10 4.06 Intr + 106982 107083 102 0 0 114 88 53 0.994 8.05 4.07 Term + 109813 110975 1163 2 2 77 38 446 0.963 30.53 4.08 PlyA + 111313 111318 6 1.05 5.08 PlyA - 112297 112292 6 1.05 5.07 Term - 120354 120153 202 2 1 66 41 172 0.720 7.16 5.06 Intr - 121349 121102 248 2 2 131 94 182 0.997 19.36 5.05 Intr - 122568 122512 57 0 0 69 70 56 0.663 0.98 5.04 Intr - 124390 124301 90 1 0 113 74 110 0.999 12.29 5.03 Intr - 124867 124766 102 1 0 64 90 178 0.992 15.97 5.02 Intr - 131182 131020 163 0 1 130 121 169 0.814 24.38 5.01 Init - 131842 131835 8 1 2 46 95 0 0.293 -3.19 5.00 Prom - 135306 135267 40 -0.66 6.00 Prom + 143919 143958 40 -2.96 6.01 Init + 143992 144036 45 0 0 47 78 38 0.190 -1.89 6.02 Intr + 149241 149312 72 2 0 121 116 0 0.632 5.70 6.03 Intr + 152752 152957 206 0 2 85 36 94 0.345 1.90 6.04 Intr + 157528 157712 185 1 2 92 80 28 0.163 1.83 6.05 Intr + 174356 174474 119 0 2 97 91 28 0.618 4.18 6.06 Intr + 175214 175234 21 1 0 114 116 -6 0.558 2.54 6.07 Intr + 176982 177042 61 2 1 83 75 34 0.213 -0.09 6.08 Intr + 184072 184171 100 2 1 65 -1 144 0.443 2.37 6.09 Intr + 188515 188628 114 1 0 85 32 95 0.667 3.16 6.10 Intr + 190856 190994 139 2 1 78 19 74 0.093 -0.03 6.11 Intr + 200101 200246 146 0 2 120 75 43 0.205 5.28 6.12 Intr + 200638 200827 190 1 1 91 75 35 0.087 2.09 6.13 Intr + 201296 201452 157 1 1 71 74 15 0.041 -2.02 6.14 Term + 202090 202205 116 2 2 30 54 108 0.082 0.33 6.15 PlyA + 204396 204401 6 1.05 7.00 Prom + 207841 207880 40 -4.26 7.01 Init + 209086 209188 103 0 1 65 90 95 0.275 7.80 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815597f:111655925_111866896|GENSCAN_predicted_peptide_1|216_aa MWDYVKRPNLRLIGIPESDGENGTKLENAPGYYPELPQPSKTDQHSNSGNTENITKILLK KSNPKTHNRQIHQVIQTTTREYKHLYANKLEKLEEMDKFLDTYTLPRLNHKEVNSLNRPI TSSEIEAVNNSLPTKKSPGPDRFTAEFYQRYKEELVPFLLKLFQTTEEERLLPNSFYEAS VILIPKPGRDTTKKENFRPISLMTIDAKILNKILAN >gi568815597f:111655925_111866896|GENSCAN_predicted_CDS_1|651_bp atgtgggactatgtgaaaagaccaaacctacgattgattggtatacctgaaagtgacggg gagaatggaaccaagttggaaaatgctccaggatattatccagaacttccccaacctagc aagacagaccaacattcaaattcaggaaatacagagaacatcactaagatactcctcaag aagagcaaccctaaaacacataatcgtcagattcaccaagtaatacaaactaccaccaga gaatataaacacctctatgcaaataaactagaaaagctagaagaaatggataaattcctg gatacatataccctcccaagactaaaccacaaagaagtcaactccttgaatagaccaata acaagttctgaaattgaggcagtaaataatagcctaccaaccaaaaaaagtccaggacca gacagattcacagccgaattctaccagaggtacaaagaggagctggtaccattccttctg aaactattccaaacaacagaagaagagcgactgctccctaactcattttatgaggccagt gtcatcctgatacccaaacctggcagagacacaacaaaaaaggaaaatttcaggccaata tccctgatgaccatcgatgcaaaaatcctcaataaaatactggcaaactga >gi568815597f:111655925_111866896|GENSCAN_predicted_peptide_2|173_aa MREYKLVVLGSGGVGKSALTVQFVQGIFVEKYDPTIEDSYRKQVEVDCQQCMLEILDTAG TIVLHTVTIIIPFKSQSTFNDLQDLREQILRVKDTEDVPMILVGNKCDLEDERVVGKEQG QNLARQWCNCAFLESSAKSKINVNEIFYDLVRQINRKTPVEKKKPKKKSCLLL >gi568815597f:111655925_111866896|GENSCAN_predicted_CDS_2|522_bp atgcgtgagtacaagctagtggtccttggttcaggaggcgttgggaagtctgctctgaca gttcagtttgttcagggaatttttgttgaaaaatatgacccaacgatagaagattcctac agaaagcaagttgaagtcgattgccaacagtgtatgctcgaaatcctggatactgcaggg acaattgttctccacacagtaaccatcataattccatttaaatctcagtccacgtttaac gacttacaggacctgagggaacagattttacgggttaaggacacggaagatgttccaatg attttggttggcaataaatgtgacctggaagatgagcgagtagttggcaaagagcagggc cagaatttagcaagacagtggtgtaactgtgcctttttagaatcttctgcaaagtcaaag atcaatgttaatgagatattttatgacctggtcagacagataaataggaaaacaccagtg gaaaagaagaagcctaaaaagaaatcatgtctgctgctctag >gi568815597f:111655925_111866896|GENSCAN_predicted_peptide_3|388_aa MPSAALAGTRCVLASPSGCARVLCKHCGRHRPRAASRWGSDSESRARSAGSAPRCERYGR RGGEPARSAVSSGAGVSPGPSLFEPNPTNGRMTMESREMDCYLRRLKQELMSMKEVGDGL QDQMNCMMGALQELKLLQVQTALEQLEISGGGPVPGSPEGPRTQCEHPCWEGGRGPARPT VCSPSSQPSLGSSTKFPSHRSVCGRDLAPLPRTQPHQSCAQQGPERVEPDDWTSTLMSRG RNRQPLVLGDNVFADLVGNWLDLPELEKGGEKGETGGAREPKGEKGQPQELGRRFALTAN IFKKFLRSVRPDRDRLLKEKPGWVTPMVPESRTGRSQKVKKRSLSKGSGHFPFPGTGEHR RGENPPTSCPKALEHSPSGFDINTAVWV >gi568815597f:111655925_111866896|GENSCAN_predicted_CDS_3|1167_bp atgccctccgccgcgctggccgggaccaggtgtgtcctcgccagccccagtggctgcgcc cgggtcctgtgcaaacactgcggtaggcaccgaccgcgcgcggctagccgctggggaagc gactctgagtcccgggctcggagcgcaggctcagctccgcgctgcgagcgctacgggcgc aggggcggggagccggcccggagcgcagtttccagtggggccggggtttcacccgggccc tctctgtttgaaccgaacccgacaaatgggcgcatgacgatggagagcagggaaatggac tgctatctccgtcgcctcaaacaggagctgatgtccatgaaggaggtgggtgatggctta caggatcagatgaactgcatgatgggtgcactgcaagaactgaagctcctccaggtgcag acagcactggaacagctggagatctctggagggggtcctgtgccaggcagccctgaaggt cccaggacccagtgcgagcacccttgttgggagggtggcagaggtcctgccaggcccaca gtctgttccccctccagtcaaccttctcttggcagcagcaccaagtttccatcccatagg agtgtctgtggaagggatttagcccccttgcccaggacacagccacatcaaagctgtgct cagcaggggccagagcgagtggaaccggatgactggacctccacgttgatgtcccggggc cggaatcgacagcctctggtgttaggggacaacgtttttgcagacctggtgggcaattgg ctagacttgccagaactggagaagggtggggagaagggtgagactgggggggcacgtgaa cccaaaggagagaaaggccagccccaggagctgggccgcaggttcgccctgacagcaaac atctttaagaagttcttgcgtagtgtgcggcctgaccgtgaccggctgctgaaggagaag ccaggctgggtgacacccatggtccctgagtcccgaaccggccgctcacagaaggtcaag aagcggagcctttccaagggctctggacatttccccttcccaggcaccggggagcacagg cgaggggagaatccccccacaagctgccccaaggccctggagcactcaccctcaggattt gatattaacacagctgtttgggtctga >gi568815597f:111655925_111866896|GENSCAN_predicted_peptide_4|695_aa MAAAFEASGALAAVATAMPAEHVAVQVPAPEPTPGPVRILRTAQDLSSPRTRTGDVLLAE PADFESLLLSRPVLEGLRAAGFERPSPVQLKAIPLGRCGLDLIVQAKSGTGKTCVFSTIA LDSLVLENLSTQILILAPTREIAVQIHSVITAIGIKMEGLECHVFIGGTPLSQDKTRLKK CHIAVGSPGRIKQLIELDYLNPGSIRLFILDEADKLLEEGSFQEQINWIYSSLPASKQML AVSATYPEFLANALTKYMRDPTFVRLNSSDPSLIGTLGLTVTYCCRGEEENMMMRIAQKC NINLLPLPDPIPSGLMEECVDWDVEVKAAVHTYGIASVPNQPLKKQIQKIERTLQIQKAH GDHMASSRNNSVSGLSVKSKNNTKQKLPVKSHSECGIIEKATSPKELGCDRQSEEQMKNS VQTPVENSTNSQHQVKEALPVSLPQIPCLSSFKIHQPYTLTFAELVEDYEHYIKEGLEKP VEIIRHYTGPGDQTVNPQNGFVRNKVIEQRVPVLASSSQSGDSESDSDSYSSRTSSQSKG NKSYLEGSSDNQLKDSESTPVDDRISLEQPPNGSDTPNPEKYQESPGIQMKTRLKEGASQ RAKQSRRNLPRRSSFRLQTEAQEDDWYDCHREIRLSFSDTYQDYEEYWRAYYRAWQEYYA AASHSYYWNAQRHPSWMAAYHMNTIYLQEMMHSNQ >gi568815597f:111655925_111866896|GENSCAN_predicted_CDS_4|2088_bp atggcggcggcatttgaagcctcgggagccttagcagcagtggcgactgctatgccggct gagcatgtggccgtgcaggtcccggccccagagccaacacccgggcctgtgaggatcctg cggaccgctcaggatctcagcagcccgcggacccgcacgggggatgtgctgttggcggag ccggccgacttcgagtcactgctgctttcgcggccggtgctggaggggctgcgggcggcc ggcttcgagaggccctcgccggtgcagctcaaggccatcccgttggggcgctgcgggctc gatttaattgttcaagctaaatctggcaccgggaaaacctgtgtgttctccaccatagct ttggactctcttgttcttgaaaacttaagtacccagattttgatcttggctcctacaaga gaaattgctgtacagatacattctgttattacagccattggaataaaaatggaaggctta gagtgtcatgtctttattggagggaccccattatcacaagacaaaaccagacttaaaaag tgtcatattgctgttggatctcctggcagaattaagcaactcatagaacttgactacttg aacccaggcagtatacgcctctttattcttgatgaagcagataagcttttagaagaaggc agcttccaggagcaaataaattggatttattcttccttgcctgccagtaaacagatgctg gcagtatcagctacttatcccgaatttttggctaatgctttgacaaagtacatgagagat cccacttttgtaagactgaattccagtgatccaagtctcataggtacattggggctgaca gtgacctactgttgccggggagaggaagaaaatatgatgatgagaattgcccagaaatgt aatatcaaccttctccctttaccagatcccattccttctggtctgatggaagaatgtgtg gattgggatgtggaagttaaagctgctgtgcatacatatggtatagcaagtgtacctaac caacccttaaaaaagcaaattcagaaaatagagagaacccttcaaattcagaaagctcat ggtgaccacatggcttcctctagaaataattctgtatctggactatcagtcaaatcaaaa aataataccaaacaaaagcttcctgtgaaaagccactcagaatgtggaatcatagaaaaa gcaacgtcaccaaaagaactgggctgtgacaggcaatccgaagagcaaatgaagaattct gttcagactcccgttgaaaactccaccaacagtcagcaccaggtcaaagaagctttacct gtgtcactcccccagattccttgtctgtcttcctttaaaatccatcagccatacacgttg acttttgctgaattggtagaggattatgaacattatattaaagaggggttagagaaacct gtggaaatcatcaggcactacacaggccctggggatcagactgtgaatcctcaaaatggt tttgtgagaaataaagttattgaacagagagtccctgtgttggcaagtagtagccaatct ggagactctgagagtgacagtgattcttacagctcaagaacctcttcccagagcaaagga aataagtcatacttggaaggctcttctgataatcagctgaaagactctgaatctacgcct gtggatgatcgtatttctttggaacaaccaccaaatggaagtgacacccccaatccagag aaatatcaagaatcacctggaatccagatgaagacaagacttaaagagggggctagccag agagctaagcagagccggagaaacctacccaggcggtcttccttcagattgcagactgaa gcccaggaagatgattggtatgactgtcatagggaaatacgtctgagtttttctgatacc tatcaggattatgaggagtactggagagcttactacagggcatggcaagaatattatgct gccgcttctcattcatattattggaatgctcagagacatccaagttggatggcagcttat cacatgaataccatttatctacaagaaatgatgcatagtaaccagtga >gi568815597f:111655925_111866896|GENSCAN_predicted_peptide_5|289_aa MNRYGDMVPKTIAGKIFGSICSLSGVLVIALPVPVIVSNFSRIYHQNQRADKRRAQKKAR LARIRVAKTGSSNAYLHSKRNGLLNEALELTGTPEEEHMGKTTSLIESQHHHLLHCLEKT TGLSYLVDDPLLSVRTSTIKNHEFIDEQMFEQNCMESSMQNYPSTRSPSLSSHPGLTTTC CSRRSKKTTHLPNSNLPATRLRSMQELSTIHIQGSEQPSLTTSRSSLNLKADDGLRPNCK TSQITTAIISIPTPPALTPEGESRPPPASPGPNTNIPSIASNVVKVSAL >gi568815597f:111655925_111866896|GENSCAN_predicted_CDS_5|870_bp atgaatagatacggagacatggtgcctaagacgattgcagggaagatcttcggctccatc tgctccttgagtggcgtcctggtcattgccctgccagtccctgtgattgtttccaacttt agccggatttaccaccagaatcagagagctgataaacgcagggcacaaaagaaggcccgc cttgccaggatccgtgtggccaaaacaggcagttcgaatgcatacctgcacagcaagcgc aacgggctcctcaacgaggcgctggagctgacgggcaccccagaagaggagcacatgggc aagaccacctcactcatcgagagccagcatcatcacctgctgcactgcctggaaaaaacc actgggttgtcctatcttgtggatgatcccctgttatctgtacgaacctccaccatcaag aaccacgagtttattgatgagcagatgtttgagcagaactgcatggagagttcaatgcag aactacccatccacaagaagtccctcactgtccagccacccaggcctcactaccacctgc tgctcccgtcgtagtaagaagaccacacacctgcccaattctaacctgccagctactcgc ctgcgcagcatgcaagagctcagcacgatccacatccagggcagtgagcagccctccctc acaaccagtcgctccagccttaatttgaaagcagacgacggactgagaccaaactgcaaa acatcccagatcaccacagccatcatcagcatccccactcccccagcgctaaccccagag ggggaaagtcggccaccccctgccagcccaggccccaacacgaacattccttccatagcc agcaatgttgtcaaggtctccgccttgtaa >gi568815597f:111655925_111866896|GENSCAN_predicted_peptide_6|556_aa MKIGPPRLASRPVREGPFKMTQQTISLNGISTPTGGKKKSFSSQLILEILMHKVGEGKWL EMDINRTNKIPSSKRRLPTLQKALNTCALQRLFKDLENQDVPCQRGLLSEPPDPLHIIAH VEDDNIRPGHWSQQRLPSVMRGKQLKALAVLGSPSHSRAKGSVPHTPVPAFQIKACFFMQ HKRLAQIWAQCGSSDLALIHDFVSWPIFQGKRKLMKLTTSKQTSYWVQEVIKVSGAVLQP GKSKFKVPADVVSSEGLIVRDGRPCCILTCFLVAENLGICCHPTLSPAIIPAGPRIHGDH SNNSCLPSFVKQGLSSSFDNGGKLSLREAIALGNEAALGSSGFRFERRPLTHSTALARTP TGQTQPEARMEKDGEQTWTETLPDEYPRDCLRTLHLPCRGEKRPDPPLHMVDPEPLPDLG SAPGSLLPAPPAKGDCWLALACPSIWQSLLMSTGSKPQVTALLGVKDHHRGVTDALGSTV PERRCSPGRGVTARAEEDRDVCTILLLCRGHGAGFADPGQETDTQKSDVTYWRLSAAGST SQRKAPSNMWLPKLVN >gi568815597f:111655925_111866896|GENSCAN_predicted_CDS_6|1671_bp atgaaaattgggcccccccgcctggccagccgccccgtccgggagggcccctttaaaatg actcaacagactattagtctcaatggaatatcaaccccaacagggggaaaaaaaaagagc ttctcatctcagctcattctggaaattctgatgcacaaagttggtgaggggaagtggcta gagatggacatcaataggaccaacaaaatccccagctccaagaggagactgccaactctg caaaaagcattaaatacttgtgctctgcagcgcctgtttaaggacttggagaaccaagat gtgccttgccaaagaggcctcttgtcagagcctcctgaccctcttcacatcatagcacat gtggaagatgataatattcgcccaggacactggagtcaacagaggctgccatctgtaatg agaggcaagcagctgaaggctctagctgtcttaggctcgcctagccacagccgggccaag gggtccgtgccccacacacctgtaccagccttccagataaaggcctgtttcttcatgcag cataaaaggcttgctcagatctgggcccagtgcggttcctcagaccttgcactcattcat gactttgtatcttggcctatattccagggaaagagaaaattgatgaagctcaccaccagc aaacagacatcatactgggtccaggaggtaattaaagtttcaggggcagttctgcagcct ggaaagtccaagtttaaggtgccagcagatgtggtgtccagcgagggcctgatcgtcaga gatggccgtccttgctgcatcctcacatgctttctggtcgctgagaaccttggcatctgc tgtcatcctactctgagccctgccatcatcccagctggccccaggatccacggggaccac tctaacaacagctgtctgcccagttttgtgaagcaggggttgtcatcttcatttgacaat ggggggaaactgagtcttagagaagctattgccctaggtaatgaagcagccctgggcagt tctggattcaggtttgaacgcaggcctctaactcacagcacggctcttgccaggaccccc actggccaaacccagccagaagccaggatggagaaggatggcgagcagacctggacggag accctccctgatgaatacccaagggactgcctccgcactctgcacctgccttgcagggga gagaagaggccagacccccccttgcacatggtggatcctgagcctctgcccgacttgggg tcagcgccaggaagcctgctgccagccccacctgccaaaggtgactgctggctagcactg gcctgtcccagcatctggcagagcttacttatgagcactggttccaagccccaagtgact gcattgctaggtgtcaaggatcatcatcgcggtgttacagatgctctgggctccacagtc cctgagagaaggtgcagccctgggcgtggggtcacagcgcgtgctgaggaggacagagat gtctgcaccattctgcttctgtgcagagggcacggggctggttttgctgacccaggacag gaaactgatactcagaagagtgacgtgacttactggaggttgtcggctgccggttctaca tctcagcgaaaggcaccttcaaacatgtggctgcccaagttagtgaactga >gi568815597f:111655925_111866896|GENSCAN_predicted_peptide_7|35_aa MFEEQPDIRQPNLHLGLKVSLAMVLNAKAPLSGRX >gi568815597f:111655925_111866896|GENSCAN_predicted_CDS_7|105_bp atgttcgaggaacagccagacattcggcaacccaatcttcatcttggtctgaaagtgtcc ttggccatggtgctgaatgctaaggccccactatcaggtagagnn