GENSCAN 1.0 Date run: 5-Nov-116 Time: 11:04:15 Sequence gi568815586f:52849174_53052839 : 203666 bp : 48.69% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 8451 8561 111 1 0 75 38 63 0.032 0.35 1.02 Intr + 19631 19755 125 0 2 99 56 34 0.166 1.60 1.03 Intr + 23664 23823 160 2 1 48 49 128 0.979 4.46 1.04 Intr + 24645 24825 181 1 1 96 78 132 0.981 11.93 1.05 Term + 25757 25829 73 2 1 101 44 56 0.608 -0.12 1.06 PlyA + 29071 29076 6 1.05 2.00 Prom + 36058 36097 40 -3.06 2.01 Init + 36690 36933 244 2 1 50 84 131 0.373 6.80 2.02 Term + 47640 47662 23 1 2 77 50 36 0.002 -2.83 2.03 PlyA + 47858 47863 6 1.05 3.11 PlyA - 48043 48038 6 1.05 3.10 Term - 48445 48255 191 2 2 110 47 194 0.943 15.11 3.09 Intr - 49346 49288 59 1 2 58 94 78 0.999 3.93 3.08 Intr - 49726 49506 221 1 2 102 80 535 0.993 51.20 3.07 Intr - 50892 50602 291 0 0 149 80 554 0.998 58.03 3.06 Intr - 51510 51415 96 0 0 105 87 117 0.999 13.51 3.05 Intr - 52046 51986 61 1 1 109 100 110 0.992 13.04 3.04 Intr - 52899 52691 209 0 2 108 63 393 0.999 36.68 3.03 Intr - 55797 55485 313 2 1 75 100 605 0.845 56.59 3.02 Intr - 57574 57475 100 2 1 83 91 61 0.818 5.07 3.01 Init - 63572 63518 55 2 1 99 75 2 0.341 1.24 3.00 Prom - 64336 64297 40 -3.36 4.00 Prom + 64340 64379 40 -7.26 4.01 Init + 67545 67636 92 2 2 86 51 72 0.661 3.26 4.02 Term + 68692 69121 430 1 1 -38 55 819 0.984 60.77 4.03 PlyA + 69880 69885 6 1.05 5.00 Prom + 70771 70810 40 -10.35 5.01 Init + 71439 71572 134 2 2 65 115 80 0.402 7.34 5.02 Intr + 76720 76746 27 1 0 109 91 13 0.203 0.93 5.03 Intr + 100000 100417 418 1 1 84 110 591 0.032 54.93 5.04 Intr + 101155 101237 83 0 2 67 108 175 0.999 15.84 5.05 Intr + 101577 101818 242 0 2 90 7 333 0.681 22.59 5.06 Intr + 102318 102472 155 1 2 -39 96 306 0.390 18.49 5.07 Intr + 102558 102683 126 2 0 78 105 131 0.999 14.68 5.08 Intr + 102946 103169 224 0 2 119 78 438 0.997 42.83 5.09 Term + 103549 103669 121 1 1 81 46 129 0.982 5.95 5.10 PlyA + 103709 103714 6 1.05 6.00 Prom + 122861 122900 40 -5.26 6.01 Init + 126604 126703 100 0 1 109 75 43 0.776 5.68 6.02 Intr + 131158 131232 75 2 0 71 86 54 0.620 2.99 6.03 Intr + 134121 134152 32 1 2 81 97 1 0.066 -1.85 6.04 Intr + 153819 153900 82 2 1 82 106 49 0.105 5.31 6.05 Intr + 167300 167437 138 0 0 52 48 136 0.976 6.44 6.06 Intr + 169625 169833 209 0 2 78 101 136 0.999 12.70 6.07 Intr + 170737 170853 117 0 0 92 88 98 0.985 10.86 6.08 Intr + 172633 172687 55 0 1 21 116 81 0.964 2.75 6.09 Intr + 173320 173454 135 2 0 42 98 155 0.996 12.44 6.10 Intr + 178609 178746 138 2 0 67 76 253 0.999 22.44 6.11 Intr + 178842 179015 174 1 0 53 75 173 0.807 12.41 6.12 Intr + 184633 184861 229 2 1 22 85 191 0.528 9.13 6.13 Intr + 185439 185536 98 0 2 62 100 26 0.715 0.85 6.14 Intr + 188236 188449 214 2 1 101 113 99 0.963 11.47 6.15 Intr + 190457 190529 73 2 1 90 94 50 0.978 5.21 6.16 Term + 190970 191050 81 1 0 92 41 107 0.964 4.09 6.17 PlyA + 191137 191142 6 1.05 7.00 Prom + 194136 194175 40 -4.56 7.01 Init + 198126 198200 75 2 0 85 109 77 0.567 8.59 7.02 Intr + 200914 201087 174 0 0 59 53 106 0.283 4.34 7.03 Intr + 202682 202790 109 1 1 92 78 85 0.924 7.76 7.04 Intr + 203282 203319 38 0 2 117 98 17 0.867 3.58 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Init + 100001 100417 417 1 0 80 110 586 0.966 56.43 S.002 Init + 157311 157323 13 2 1 115 99 15 0.859 5.20 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815586f:52849174_53052839|GENSCAN_predicted_peptide_1|216_aa XEKYMLNSNPGYPDYPEHESFAFNLYHQATSTFHRYEAGIATPPPEGMEESKERMHSGQM GQLAEPWAGQLVKAPVKGSAKQPNPQNRNPPTQRTTAPADRQTVPGEPDHCALPPTTTTL GAPTLELMLLGEEQFLKQMTACGVADCGQRTSAVKTVAGNAPVPLSLANKAACLLQGGAE CPAETFREELRGGARAPEGLGPDATLQARLNPYSSA >gi568815586f:52849174_53052839|GENSCAN_predicted_CDS_1|651_bp nctgagaagtacatgctgaattctaacccaggatatcctgactatcctgaacatgagtca ttcgcattcaacctttaccatcaagccacatcgacattccataggtatgaagctggaata gccacccctccaccagaggggatggaggaatccaaagagagaatgcactcaggccagatg gggcaactagcagagccatgggctgggcagctggtgaaggctccagtgaaggggagtgcc aagcagccaaatccccaaaaccggaatccaccaacccagaggaccaccgcccctgctgac cgccaaaccgtccctggagaaccagaccattgtgccctcccaccaaccacaaccaccctg ggggcccccaccctggagctcatgctgttgggagaggaacaatttctgaagcagatgaca gcgtgtggtgttgctgactgtggacagagaacttctgccgtcaagaccgtagcaggaaat gcccctgtgcccctgagcctggcaaacaaggcagcctgtctccttcaaggaggagctgag tgccccgcagagacattccgggaagagctcagagggggtgccagggcccctgaaggtctg gggccagatgctaccctgcaggccaggctcaatccctacagcagtgcctga >gi568815586f:52849174_53052839|GENSCAN_predicted_peptide_2|88_aa MASKAGNLYVPAEPNQVGGHQNQRYQCVSPQDQKVLQLFHLCQIFNGSFVKLNKASTNML RIVEPYIAWGYPNMKSVNELIYSPDAKV >gi568815586f:52849174_53052839|GENSCAN_predicted_CDS_2|267_bp atggcaagcaaagctggcaacctctatgtacctgcagaacccaaccaagttggcggtcat cagaatcagaggtatcaatgtgtgagcccacaggaccaaaaggtattgcaactttttcac ctttgtcaaatcttcaatggaagctttgtgaagctcaacaaggcttccactaacatgctg aggattgtagaaccatatattgcatgggggtacccaaatatgaagtcagtaaatgaacta atctatagtcctgatgccaaggtgtag >gi568815586f:52849174_53052839|GENSCAN_predicted_peptide_3|531_aa MAMEKAGNSLNCGCLSVSGDLAVICWSSEFSFIQSPHHTLQYLPENPAPAIRVTQKSYKV STSGPRAFSSRSYTSGPGSRISSSSFSRVGSSNFRGGLGGGYGGASGMGGITAVTVNQSL LSPLVLEVDPNIQAVRTQEKEQIKTLNNKFASFIDKVRFLEQQNKMLETKWSLLQQQKTA RSNMDNMFESYINNLRRQLETLGQEKLKLEAELGNMQGLVEDFKNKYEDEINKRTEMENE FVLIKKDVDEAYMNKVELESRLEGLTDEINFLRQLYEEEIRELQSQISDTSVVLSMDNSR SLDMDSIIAEVKAQYEDIANRSRAEAESMYQIKYEELQSLAGKHGDDLRRTKTEISEMNR NISRLQAEIEGLKGQRASLEAAIADAEQRGELAIKDANAKLSELEAALQRAKQDMARQLR EYQELMNVKLALDIEIATYRKLLEGEESRLESGMQNMSIHTKTTSGYAGGLSSAYGGLTS PGLSYSLGSSFGSGAGSSSFSRTSSSRAVVVKKIETRDGKLVSESSDVLPK >gi568815586f:52849174_53052839|GENSCAN_predicted_CDS_3|1596_bp atggccatggaaaaggctgggaatagcctaaactgtggctgcctctctgtgtcaggggat ctggctgtcatctgttggtcctctgagttcagtttcatccagtccccacatcacaccctg cagtatcttccagagaacccagccccagccatcagggtgacccagaagtcctacaaggtg tccacctctggcccccgggccttcagcagccgctcctacacgagtgggcccggttcccgc atcagctcctcgagcttctcccgagtgggcagcagcaactttcgcggtggcctgggcggc ggctatggtggggccagcggcatgggaggcatcaccgcagttacggtcaaccagagcctg ctgagcccccttgtcctggaggtggaccccaacatccaggccgtgcgcacccaggagaag gagcagatcaagaccctcaacaacaagtttgcctccttcatagacaaggtacggttcctg gagcagcagaacaagatgctggagaccaagtggagcctcctgcagcagcagaagacggct cgaagcaacatggacaacatgttcgagagctacatcaacaaccttaggcggcagctggag actctgggccaggagaagctgaagctggaggcggagcttggcaacatgcaggggctggtg gaggacttcaagaacaagtatgaggatgagatcaataagcgtacagagatggagaacgaa tttgtcctcatcaagaaggatgtggatgaagcttacatgaacaaggtagagctggagtct cgcctggaagggctgaccgacgagatcaacttcctcaggcagctatatgaagaggagatc cgggagctgcagtcccagatctcggacacatctgtggtgctgtccatggacaacagccgc tccctggacatggacagcatcattgctgaggtcaaggcacagtacgaggatattgccaac cgcagccgggctgaggctgagagcatgtaccagatcaagtatgaggagctgcagagcctg gctgggaagcacggggatgacctgcggcgcacaaagactgagatctctgagatgaaccgg aacatcagccggctccaggctgagattgagggcctcaaaggccagagggcttccctggag gccgccattgcagatgccgagcagcgtggagagctggccattaaggatgccaacgccaag ttgtccgagctggaggccgccctgcagcgggccaagcaggacatggcgcggcagctgcgt gagtaccaggagctgatgaacgtcaagctggccctggacatcgagatcgccacctacagg aagctgctggagggcgaggagagccggctggagtctgggatgcagaacatgagtattcat acgaagaccaccagcggctatgcaggtggtctgagctcggcctatgggggcctcacaagc cccggcctcagctacagcctgggctccagctttggctctggcgcgggctccagctccttc agccgcaccagctcctccagggccgtggttgtgaagaagatcgagacacgtgatgggaag ctggtgtctgagtcctctgacgtcctgcccaagtga >gi568815586f:52849174_53052839|GENSCAN_predicted_peptide_4|173_aa MRLSQNPPQSSAAEASVDKPAFSHADHAGAGKTSSQKKKKKEKEEKEKKKKEKEKRKKRK RKKKKRKRKKKKKERKERKKKEKKKRRRRRRRRGRRRRRRRGGGGEEEEEEEEEEEEEEG GGEGEGEGEEEEEEEEEEEEEEEEQEEEEEEEEEEEEEEEEETKQSLRQLGSS >gi568815586f:52849174_53052839|GENSCAN_predicted_CDS_4|522_bp atgaggctgtctcaaaacccaccacagtcctcagcagcagaagcaagtgtggacaagccc gccttctcgcatgctgaccatgctggagcaggcaagacttcatctcaaaagaagaagaag aaggagaaggaggagaaggagaaaaagaagaaggagaaggagaagaggaagaagaggaag aggaagaagaagaagaggaagaggaagaagaagaagaaggagaggaaggagaggaagaag aaggagaagaagaagaggaggaggagaagaagaagaagaggaagaagaaggagaagaaga agaggaggaggaggagaagaagaagaggaagaagaagaagaagaagaggaagaagaagga gggggagagggagagggagaaggggaagaagaagaagaagaggaagaggaagaagaagaa gaagaagaagaacaagaagaagaagaagaagaagaagaagaagaagaagaagaagaagaa gaagaaacaaaacagagtctgcgtcaactgggaagttcatga >gi568815586f:52849174_53052839|GENSCAN_predicted_peptide_5|509_aa MDSPMRAETLTMLLTVTSTALERHLTGTPPVYDERIHKWLNQPARAHYSELGPSMSFTTR STFSTNYRSLGSVQAPSYGARPVSSAASVYAGAGGSGSRISVSRSTSFRGGMGSGGLATG IAGGLAGMGGIQNEKETMQSLNDRLASYLDRVRSLETENRRLESKIREHLEKKGPQVRDW SHYFKIIEDLRAQIFANTVDNARIVLQIDNARLAADDFRVKYETELAMRQSVENDIHGLR KVIDDTNITRLQLETEIEALKEELLFMKKNHEEASRGHWPGQGLRGQEKSGSENRQDKPT ASLQAQIASSGLTVEVDAPKSQDLAKIMADIRAQYDELARKNREELDKYWSQQIEESTTV VTTQSAEVGAAETTLTELRRTVQSLEIDLDSMRNLKASLENSLREVEARYALQMEQLNGI LLHLESELAQTRAEGQRQAQEYEALLNIKVKLEAEIATYRRLLEDGEDFNLGDALDSSNS MQTIQKTTTRRIVDGKVVSETNDTKVLRH >gi568815586f:52849174_53052839|GENSCAN_predicted_CDS_5|1530_bp atggacagccccatgagggcagagaccttgactatgttgttgaccgtcacatccacagca ttagaaagacatctaacaggcactccaccagtatatgatgaacgaattcacaaatggctg aaccagcctgccagggcccattactctgagctgggccccagcatgagcttcaccactcgc tccaccttctccaccaactaccggtccctgggctctgtccaggcgcccagctacggcgcc cggccggtcagcagcgcggccagcgtctatgcaggcgctgggggctctggttcccggatc tccgtgtcccgctccaccagcttcaggggcggcatggggtccgggggcctggccaccggg atagccgggggtctggcaggaatgggaggcatccagaacgagaaggagaccatgcaaagc ctgaacgaccgcctggcctcttacctggacagagtgaggagcctggagaccgagaaccgg aggctggagagcaaaatccgggagcacttggagaagaagggaccccaggtcagagactgg agccattacttcaagatcatcgaggacctgagggctcagatcttcgcaaatactgtggac aatgcccgcatcgttctgcagattgacaatgcccgtcttgctgctgatgactttagagtc aagtatgagacagagctggccatgcgccagtctgtggagaacgacatccatgggctccgc aaggtcattgatgacaccaatatcacacgactgcagctggagacagagatcgaggctctc aaggaggagctgctcttcatgaagaagaaccacgaagaggcaagcaggggccactggcca ggccagggattgaggggccaagagaagtctgggtcggagaatagacaagacaaaccaact gcaagcctacaagcccagattgccagctctgggttgaccgtggaggtagatgcccccaaa tctcaggacctcgccaagatcatggcagacatccgggcccaatatgacgagctggctcgg aagaaccgagaggagctagacaagtactggtctcagcagattgaggagagcaccacagtg gtcaccacacagtctgctgaggttggagctgctgagacgacgctcacagagctgagacgt acagtccagtccttggagatcgacctggactccatgagaaatctgaaggccagcttggag aacagcctgagggaggtggaggcccgctacgccctacagatggagcagctcaacgggatc ctgctgcaccttgagtcagagctggcacagacccgggcagagggacagcgccaggcccag gagtatgaggccctgctgaacatcaaggtcaagctggaggctgagatcgccacctaccgc cgcctgctggaagatggcgaggactttaatcttggtgatgccttggacagcagcaactcc atgcaaaccatccaaaagaccaccacccgccggatagtggatggcaaagtggtgtctgag accaatgacaccaaagttctgaggcattaa >gi568815586f:52849174_53052839|GENSCAN_predicted_peptide_6|649_aa MEKPHGMQCTESFPGLRQALWPRKVHPVLCYLGAISPSPSEALAKCSGIQKELDEYGEGV CKEGSSITKKGVQIQALREGSWTSCKKEFWASPKSETKKKNKKGKTISLTDFLAEDGGTG GGSTYVSKPVSWADETDDLEGDVSTTWHSNDDDVYRAPPIDRSILPTAPRAAREPNIDRS RLPKSPPYTAFLGNLPYDVTEESIKEFFRGLNISAVRLPREPSNPERLKGFGYAEFEDLD SLLSALSLNEESLGNRRIRVDVADQAQDKDRDDRSFGRDRNRDSDKTDTDWRARPATDSF DDYPPRRGDDSFGDKYRDRYDSDRYRDGYRDGYRDGPRRDMDRYGGRDRYDDRGSRDYDR GYDSRIGSGRRAFGSGYRRDDDYRGGGDRYEDRYDRRDDRSWSSRDDYSRDDYRRDDRGP PQRPKLNLKPRSTPKEDDSSASTSQSTRAASIFGGAKPVDTAAREREVEERLQKEQEKLQ RQLDEPKLERRPRERHPSWRSEETQERERSRTGSESSQTGTSTTSSRNARRRESEKSLEN ETLNKEEDCHSPTSKPPKPDQPLKVMPAPPPKENAWVKRSSNPPARSQSSDTEQQSPTRK DGKKDQDSRSAPEPKKPEENPASKFSSASKYAALSVDGEDENEGEDYAE >gi568815586f:52849174_53052839|GENSCAN_predicted_CDS_6|1950_bp atggagaagccacatggcatgcagtgtacggaatccttccctggactgcgccaggccctc tggcccagaaaggttcatcctgtcctgtgctacctgggagcgatctctccctcaccttcc gaggctctggctaaatgctctggtatacagaaggagcttgatgagtatggggaaggggtt tgtaaggaaggcagtagcataacaaagaaaggggtccagatccaggccctacgagagggt tcttggacctcatgcaagaaagaattctgggcaagtccaaaaagtgaaacaaaaaagaag aataagaaggggaagactatctccctaacagactttctggctgaggatgggggtactggt ggaggaagcacctatgtttccaaaccagtcagctgggctgatgaaacggatgacctggaa ggagatgtttcgaccacttggcacagtaacgatgacgatgtgtatagggcgcctccaatt gaccgttccatccttcccactgctccacgggctgctcgggaacccaatatcgaccggagc cgtcttcccaaatcgccaccctacactgcttttctaggaaacctaccctatgatgttaca gaagagtcaattaaggaattctttcgaggattaaatatcagtgcagtgcgtttaccacgt gaacccagcaatccagagaggttgaaaggttttggttatgctgaatttgaggacctggat tccctgctcagtgccctgagtctcaatgaagagtctctaggtaacaggagaattcgagtg gacgttgctgatcaagcacaggataaagacagggatgatcgttcttttggccgtgataga aatcgggattctgacaaaacagatacagactggagggctcgtcctgctacagacagcttt gatgactacccacctagaagaggtgatgatagctttggagacaagtatcgagatcgttat gattcagaccggtatcgggatgggtatcgggatgggtatcgggatggcccacgccgggat atggatcgatatggtggccgggatcgctatgatgaccgaggcagcagagactatgataga ggctatgattcccggataggcagtggcagaagagcatttggcagtgggtatcgcagggat gatgactacagaggaggcggggaccgctatgaagaccgatatgacagacgggatgatcgg tcgtggagctccagagatgattactctcgggatgattataggcgtgatgatagaggtccc ccccaaagacccaaactgaatctaaagcctcggagtactcctaaggaagatgattcctct gctagtacctcccagtccactcgagctgcttctatctttggaggggcaaagcctgttgac acagctgctagagaaagagaagtagaagaacggctacagaaggaacaagagaagttgcag cgtcagctggatgagccaaaactagaacgacggcctcgggagagacacccaagctggcga agtgaagaaactcaggaacgggaacggtcgaggacaggaagtgagtcatcacaaactggg acctccaccacatctagcagaaatgcacgaaggagagagagtgagaagtctctagaaaat gaaacactcaataaggaggaagattgccactctccaacttctaaacctcccaaacctgat cagcccctaaaggtaatgccagcccctccaccaaaggagaatgcttgggtgaagcgaagt tctaaccctcctgctcgatctcagagctcagacacagagcagcagtcccctacaaggaaa gatggcaaaaaggatcaagactccagatctgcacctgagccaaagaaacctgaggaaaat ccagcttccaagttcagttctgcaagcaagtatgctgctctctctgttgatggtgaagat gaaaatgagggagaagattatgccgaatag >gi568815586f:52849174_53052839|GENSCAN_predicted_peptide_7|132_aa MGWPGGSPCCCPAPPRPRPAGRPPQGKRLPPPGRFQEAPGQAPALFRPWGQHPSQPNTMK SSGPVERLLRALGRRDSSRAASRPRKAEPHSFREKVFRKKPPVCAVCKVTIDGTGVSCRV CKVATHRKCEAK >gi568815586f:52849174_53052839|GENSCAN_predicted_CDS_7|396_bp atgggctggcccggcggctccccctgctgctgccccgccccgccgcgcccccgcccggcc gggcgccccccgcaggggaagcggctgcctccgccaggccgcttccaggaagccccgggc caggccccagcattgttcaggccctggggccagcaccccagccagccgaacaccatgaag tccagcggccctgtggagaggctgctcagagccctggggaggagggacagcagccgggcc gcaagcaggcctaggaaagctgagcctcatagcttccgggagaaggttttccggaagaaa cctccagtctgtgcagtatgtaaggtgaccatcgatgggacaggcgtttcgtgcagagtc tgcaaggtggcgacgcacagaaaatgtgaagcaaag