GENSCAN 1.0 Date run: 5-Nov-116 Time: 10:24:53 Sequence gi568815582f:19073428_19367073 : 293646 bp : 45.40% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 494 608 115 1 1 86 79 78 0.331 6.72 1.02 Intr + 2294 2493 200 0 2 21 86 232 0.331 15.37 1.03 Intr + 3879 3947 69 2 0 60 50 99 0.798 2.68 1.04 Term + 5560 5646 87 0 0 83 42 61 0.593 -1.34 1.05 PlyA + 5810 5815 6 1.05 2.00 Prom + 20633 20672 40 -1.46 2.01 Init + 41035 42631 1597 0 1 79 61 1747 0.121 163.70 2.02 Term + 51513 51634 122 1 2 130 38 50 0.364 2.84 2.03 PlyA + 52878 52883 6 1.05 3.00 Prom + 89735 89774 40 -3.16 3.01 Init + 95220 95234 15 2 0 100 91 30 0.526 4.82 3.02 Intr + 100017 100096 80 2 2 64 61 92 0.375 2.45 3.03 Intr + 106964 107112 149 2 2 105 91 157 0.985 17.58 3.04 Intr + 110101 110720 620 2 2 70 110 1023 0.853 94.74 3.05 Intr + 120899 121095 197 1 2 78 42 107 0.449 3.31 3.06 Intr + 131517 131592 76 0 1 41 42 85 0.001 -1.28 3.07 Intr + 141170 141284 115 1 1 71 16 105 0.039 1.62 3.08 Intr + 144275 144408 134 0 2 71 6 105 0.019 0.96 3.09 Intr + 149618 149738 121 1 1 92 86 54 0.949 5.67 3.10 Intr + 151256 151411 156 0 0 72 96 183 0.991 17.48 3.11 Intr + 154831 154917 87 2 0 65 95 25 0.250 0.84 3.12 Intr + 162170 162282 113 0 2 -9 70 112 0.022 -0.20 3.13 Term + 162710 162781 72 1 0 127 42 16 0.462 -1.19 3.14 PlyA + 164282 164287 6 1.05 4.03 PlyA - 165190 165185 6 1.05 4.02 Term - 167419 167045 375 1 0 108 38 172 0.825 8.94 4.01 Init - 168546 168448 99 0 0 76 32 91 0.427 2.57 4.00 Prom - 175536 175497 40 -3.26 5.00 Prom + 185969 186008 40 -4.16 5.01 Init + 193117 193201 85 0 1 74 94 -3 0.525 0.00 5.02 Term + 193453 193649 197 2 2 86 55 254 0.969 19.47 5.03 PlyA + 193681 193686 6 -0.45 6.03 PlyA - 194876 194871 6 1.05 6.02 Term - 201764 201302 463 1 1 35 39 257 0.460 10.03 6.01 Init - 206936 206803 134 2 2 39 92 47 0.103 -0.06 6.00 Prom - 209940 209901 40 -6.06 7.00 Prom + 210357 210396 40 -3.06 7.01 Init + 212425 212512 88 0 1 53 111 60 0.511 3.80 7.02 Intr + 225246 225411 166 1 1 102 111 51 0.933 7.82 7.03 Intr + 230635 230728 94 1 1 64 94 46 0.911 2.77 7.04 Intr + 234118 234233 116 0 2 33 47 210 0.230 10.65 7.05 Intr + 255568 255615 48 1 0 89 96 40 0.027 2.70 7.06 Intr + 271064 271169 106 2 1 102 98 36 0.509 6.22 7.07 Term + 280635 280691 57 2 0 100 48 36 0.228 -1.51 7.08 PlyA + 282004 282009 6 1.05 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 41035 42642 1608 0 0 79 40 1756 0.870 165.26 S.002 Term - 138842 138658 185 0 2 29 55 150 0.820 3.51 S.003 Init - 140608 140527 82 1 1 61 100 38 0.825 3.44 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815582f:19073428_19367073|GENSCAN_predicted_peptide_1|156_aa KMWDQEKDHLKKFNELMVTFRVRPTVLMPLWNVLGFALGAGTALLGKEGAMACTVAVEES IAHHYNNQIRTLMEEDPEKYEELLQVFIRALERGCSRRKMADSNWLIKKFRDEELEHHDI GLDHDAELNSYDEIFTLKLIEGGALGKCEVMRVEPS >gi568815582f:19073428_19367073|GENSCAN_predicted_CDS_1|471_bp aaaatgtgggatcaagaaaaggaccatttgaaaaagttcaatgagttgatggttacgttc agggtccggccaacagttctgatgcccttgtggaacgtgctggggtttgcactgggggcg gggaccgccttgctcgggaaggaaggtgccatggcctgcaccgtggcggtggaagagagc atagcacatcactacaacaaccagatcaggacgctgatggaggaggaccctgaaaaatac gaggaacttcttcaggtatttatccgtgctctagaacggggctgctcaaggaggaaaatg gcagatagcaattggctgataaagaaatttcgggatgaagagcttgagcaccatgacata ggcctcgaccatgatgcagaattgaattcatacgatgaaatcttcactctcaagttgata gaaggtggggcccttgggaagtgtgaggtcatgagagtggagccctcatga >gi568815582f:19073428_19367073|GENSCAN_predicted_peptide_2|572_aa MSVHYTLNLRVFWPLVTGLCTALVCLYHVLRGSGGARAEPADGVDGGFPLLKVAVLLLLS YVLLRCRHAVRQRFLPGSPRLEGHAAFSSRHFREPGLSILLESYYEHEVRLSPHVLGHSK AHVSRIVGELVRAGRARGSPGLIPGGALALAFRGDFIQVGSAYEQHKIRRPDSFDVLVPL RLPPLVALEPRSLGEEPALAPAFRGCFLCALKAPPSPSGASGGHWLRDCKPFADAFCVDV RGRRHLSATLVLRWFQSHLQRSLATVRYSLEGRCRVTLTPGGLEQPPTLHILPCRTDYGC CRLSMAVRLIPAVHLGDGVFLVAPPPPPLPSAPLLELPEGLRAEALWGVNTARQEQKLLS WLQERAAPGACYLKCLQLLKALRDLGARGLDSAAATQWGRILSSYVLKTVLLAVLLRKGA PGQGWDEEHLGRCLEELVQFLRDCLLRRHTLFHCVLGPGGAAAEVGPLPKALREAAPVDL LAAFDGHARELAAARLLSTWQRLPQLLRAYGGPRYLARCPPPRSQRTQGFLEGSVSLSST PDHLSATQFLFISFLDVQGIFHYEKVLFGILW >gi568815582f:19073428_19367073|GENSCAN_predicted_CDS_2|1719_bp atgtcggtgcactacaccctcaatctacgcgtcttctggcccctggtgaccggcctgtgc accgccctggtgtgcctctaccatgtcctgcggggaagcgggggcgcccgggccgagccc gccgacggcgtggatggcggcttcccgttgcttaaggtggccgtcctgctcctcctcagc tatgtcctcctgcgctgtcgccacgctgtccggcagcgcttcctgcccgggtctccccgt ctggagggtcacgccgccttctcctcgagacacttccgagagccgggcctcagcatcctg ctggagagttactacgagcatgaggtgcgcctgtctccgcacgtgttgggccacagcaag gcgcacgtgagccggatcgtgggcgagctggtgcgggctggccgcgcccgggggtccccc ggtctcattcctgggggagcgctggccttggccttccgcggagacttcatccaggtgggc agcgcctacgagcaacataaaatccgccggcccgacagcttcgacgtgctggtgccactg cgcctcccgccgcttgtggcgctggagccacggagcctgggcgaggagccagcgctggcc ccggccttccgcggctgcttcttgtgcgccctcaaggcaccaccctcaccatcgggggcc tcggggggccactggcttcgggactgcaaaccctttgctgatgccttctgcgtggatgtg cgcgggcggcgtcacctctctgctactctggtgctgcgctggttccagtcgcatctgcag cgctccttggccactgtgcgttacagcctggaggggcgctgtcgggtcaccttgacccca ggtggcctggaacagccccccaccttacacatcttgccctgccgcactgactacggctgc tgccgcctttctatggctgtgcgtctcatccccgctgtccatctgggagatggggtcttc cttgtggcgccaccaccgccacccttgcccagcgcgcccctgttggagctccctgagggc ctgcgtgcggaggcactgtggggtgtgaacacagcacgccaggagcagaagctgctgagt tggctgcaggaacgggcagctccaggtgcctgctacctcaagtgcctgcagttgcttaag gctctgcgcgatctgggggcccgtgggctggactcagcggccgccacccagtggggacgc atcctatcctcatatgtgctcaagacagtgctgctggcagtgctgctgcgcaagggggcc cctgggcaaggctgggacgaggagcacctgggaaggtgtttggaggagttggtgcagttc cttagggactgcctgctgcgacgccatacgctcttccactgcgtcctgggccctggtggg gcggctgccgaggtgggtcccctgcccaaggcactgagggaagccgccccagttgacctc ctggccgctttcgacgggcacgcccgggaacttgcagcagcgcggttgctgtccacgtgg caaaggctgccccagcttctccgggcctacgggggtccccgctaccttgccaggtgcccc ccaccccggagtcagcgcacccagggcttccttgaagggtctgtttccctgtcgtccacc cccgatcatctctctgctacccagtttttgttcatctccttccttgatgtccagggcatt ttccactatgaaaaggttttattcgggatactatggtag >gi568815582f:19073428_19367073|GENSCAN_predicted_peptide_3|644_aa MAYIQNLWSAAVQMDLPALLSEVLRVQLLPVKMASRSSDKDGDSVHTASEVPLTPRTNSP DGRRSSSDTSKSTYSLTRRISSLESRRPSSPLIDIKPIEFGVLSAKKEPIQPSVLRRTYN PDDYFRKFEPHLYSLDSNSDDVDSLTDEEILSKYQLGMLHFSTQYDLLHNHLTVRVIEAR DLPPPISHDGSRQDMAHSNPYVKICLLPDQKNSKQTGVKRKTQKPVFEERYTFEIPFLEA QRRTLLLTVVDFDKFSRHCVIGKVSVPLCEVDLVKGGHWWKALIPSSQILGVGRIVNAAE PPLRLSQITERDVVSQMLLAPGYFSLVFTLGQGSVFWGQQETPSLSILLMYRKRSHSLHS SANQASPINPFKPFYSYNQIVRADIYDAQVNDVIHTYNIFWGRDPECDGNSPFPPGLGIH LGMELLGHTPTCLPKQLHQFSLPPAVYSVLVYHILANTAYDFNEVELGELLLSLNYLPSA GRLNVDVIRAKQLLQTDVSQGSDPFVKIQLVHGLKLVKTKKTSFLRGTIDPFYNESFSFK VPQEELENASLVFTGGNCQLGQWSADAVREVLYSPFTEDPLLSYANHKMAQMLELSDEDF EATIITMLHEVRVNPLEKNDKKTLAIHEEAVWHHGQSQPELASV >gi568815582f:19073428_19367073|GENSCAN_predicted_CDS_3|1935_bp atggcgtacatccagaatctctggtctgctgctgtgcagatggacctgccggcactgctg tcagaagtgctacgagtccagctgttgccagtcaagatggccagccggagcagtgacaag gatggtgactctgtccacacggccagcgaagtcccgctgaccccacggaccaattccccg gatggaagacgctcgtcctcagacacatccaagtctacatacagcctgacgcggaggatt tcgagtcttgagtcaagacgtcccagctctccactcatcgatattaaacccatcgagttt ggcgttctcagcgccaagaaggagcccatccaaccttcggtgctcagacggacctataac cccgacgactatttcaggaagttcgaaccccacctgtactccctcgactccaacagcgac gatgtggactctctgacagacgaggagatcctgtccaagtaccagctgggcatgctgcac ttcagcactcagtacgacctgctgcacaaccacctcaccgtgcgcgtgatcgaggccagg gacctgccacctcccatctcccacgatggctcgcgccaggacatggcgcactccaacccc tacgtcaagatctgtctcctgccagaccagaagaactcaaagcagaccggggtcaaacgc aagacccagaagcccgtgtttgaggagcgctacaccttcgagatccccttcctggaggcc cagaggaggaccctgctcctgaccgtggtggattttgataagttctcccgccactgtgtc attgggaaagtttctgtgcctttgtgtgaagttgacctggtcaagggcgggcactggtgg aaggcgctgattcccagttctcagatactgggagtagggaggattgtgaatgctgcagaa ccacctctccgcctctcacaaatcacagagagagatgttgtgtcccagatgttgctggcc cctgggtacttctccttggtcttcactttgggccagggcagcgtgttctggggtcaacag gaaacaccctcactcagcatcctgctgatgtacaggaaaaggagccatagcctacactca tcagcaaatcaggccagccccatcaaccccttcaaacccttctactcctacaaccagatt gttagagccgacatttatgatgcccaagtgaacgatgttattcatacgtacaacatattc tggggcagagaccctgagtgtgatggcaactctccttttcctccagggctgggtatacac ctaggaatggaactgctgggtcatacaccaacctgtttaccaaagcagctgcaccagttt tcattaccaccagcagtatacagcgttcttgtttaccacatcctggccaacactgcttat gattttaatgaagtggagctgggggagctgcttctgtcactgaattatctcccaagtgct ggcagactgaatgttgatgtcattcgagccaagcaacttcttcagacagatgtgagccaa ggttcagacccctttgtgaaaatccagctggtgcatggactcaaacttgtgaaaaccaag aagacgtccttcttaaggggcacaattgatcctttctacaatgaatccttcagcttcaaa gttccccaagaagaactggaaaatgccagcctagtgtttacaggtggcaactgtcagctg gggcagtggtcagctgatgctgtcagggaagtcctgtactctccattcacagaagacccg cttctctcctatgccaaccacaagatggcccagatgttggaattatcagatgaagacttt gaagcaactattataaccatgctccacgaagtaagggtgaaccctcttgaaaagaatgat aagaaaacattagcaattcatgaggaagctgtgtggcaccatggtcaaagtcagccagag ctggcttctgtgtga >gi568815582f:19073428_19367073|GENSCAN_predicted_peptide_4|157_aa MTPIDDSEGPSGAATAIALAVVVGGSCGERAVVTLGASKYDGEAEEGPRTAYHWPAGASR HEQPERHEQPGMSSRRQTGFWAEEGWSLVKPRLQTREGLKPGDQAASPVDQSGNLWCFSL GQPMAAHGPISIHFLPSEAHKNPGLSQTQGREQRDDG >gi568815582f:19073428_19367073|GENSCAN_predicted_CDS_4|474_bp atgactcctattgatgatagtgaagggccatccggagcagccactgccattgctctggct gtggtggtggggggcagttgcggggaacgggcagtggtgactttgggtgccagcaagtat gacggggaggctgaggaggggccaaggacagcctaccactggcctgcaggtgcctctcgg catgaacagcctgagcgccatgagcaacctggcatgagcagcaggagacagacaggcttc tgggcagaagagggctggtccctggtgaagccccgccttcaaaccagggagggcctgaag cctggggaccaggccgccagtcccgtggaccagagtggaaacctctggtgcttttccctg ggccagcccatggctgcccatggaccaatcagcatacacttcctcccctctgaggcccat aaaaaccctggactcagccagactcagggtagagaacagagagacgatgggtag >gi568815582f:19073428_19367073|GENSCAN_predicted_peptide_5|93_aa MPWDIRLRLHIAGGAAVWQKEQPMPRLRVFGHNMKSSNDFIGRIVIGQYSSGPSETNHWR RMLNTHRTAVEQWHSLRSRAECDRVSPASLEVT >gi568815582f:19073428_19367073|GENSCAN_predicted_CDS_5|282_bp atgccgtgggacatacgtttaaggctgcatattgcaggcggtgcagctgtctggcagaag gaacagccaatgccaaggctgagagttttcggccacaacatgaagagcagcaatgacttc atcgggaggatcgtcattggccagtactcttcaggcccctctgagaccaaccactggagg cgcatgctcaacacgcaccgcacagccgtggagcagtggcatagcctgaggtcccgagct gagtgtgaccgcgtgtctcctgcctccctggaggtgacctga >gi568815582f:19073428_19367073|GENSCAN_predicted_peptide_6|198_aa MHTISLEASKCALKQNLLSCMVIGLKVLKIKHKPSSSNWLTYNESHPIVAQWAHEQSGHG GRDGGYTWIQQHGLPLTKADLATVATAESSISQQQNPALSTQYGIIPWGDQLTTLWQFDY IGPLPSSKRQHFVLIGIDTDSEYGFAFLPRNASDKTTMHGLTECFVHHHGIPHNIASDQG THFTPKEVWQWAHAHGIH >gi568815582f:19073428_19367073|GENSCAN_predicted_CDS_6|597_bp atgcacacaattagcctggaagcttctaagtgtgccctgaagcagaatcttctctcctgt atggtcatagggttgaaagtgctgaaaatcaaacacaagccctcatcctccaattggctg acttacaatgaaagccaccccatcgttgcccaatgggctcatgaacaaagtggccatggt ggcagggatggaggttatacatggattcagcaacatggacttccactcaccaaggctgac ctggctacagtggccactgctgagtcctcaatcagccagcagcagaacccagcactgagc acccaatatggcatcattccttggggtgatcagctaactaccttgtggcagtttgattat attggaccacttccatcatcgaagaggcagcattttgtccttattggaatagacactgac tctgaatatggatttgccttccttccacgcaatgcttctgacaaaactaccatgcatgga cttacagaatgctttgtccaccatcatggtattccacacaacattgcttctgaccaagga actcactttacacccaaagaagtgtggcagtgggctcatgctcatggaattcactga >gi568815582f:19073428_19367073|GENSCAN_predicted_peptide_7|224_aa MQRWTLWAAAFLTLHSAQAFPQTDISISPALPELPLPSLCPLFWMEFKGHCYRFFPLNKT WAEADLYCSEFSVGRKSAKLASIHSWEENVFVYDLVNSCVPGIPADVWTGLHDHRQEGQF EWTDGSSYDYSYWDGSQPDDGVHADPEEEDCVQIWPKRLSGLSPPSNRDHRASSGLFMHN DNLLNTQQVHTSGILICYPEENGMKGVKSCCVAFGTEDTKASGV >gi568815582f:19073428_19367073|GENSCAN_predicted_CDS_7|675_bp atgcaaaggtggacactgtgggctgcagccttcctgaccctccactctgcacaggccttt ccacaaacagacatcagtatcagtccagccctgccagagctgcccctgccttccctgtgc cccctgttctggatggagttcaaaggccactgctatcgattcttccctctcaataagacc tgggctgaggccgacctctactgttctgagttctctgtgggcaggaagtccgccaagctg gcctccatccacagctgggaggagaatgtctttgtatatgacctcgtgaacagctgtgtt cccggcatcccagctgacgtctggacaggccttcatgatcacagacaggaagggcagttt gaatggactgatggctcatcctatgactacagctactgggatggcagccagccagatgat ggcgtccacgcggacccagaagaagaggactgcgtgcagatatggcccaagcgactctct ggcctcagccccccaagcaaccgggaccacagggcctcaagtggtctctttatgcacaac gacaatctcctgaatacacaacaggtgcacacttctggaattttaatctgttatcctgag gagaatggaatgaagggggttaagagctgctgtgtggcttttggaactgaggacaccaaa gcctcaggtgtctga