GENSCAN 1.0 Date run: 5-Nov-116 Time: 05:42:45 Sequence gi568815587f:45705802_45911332 : 205531 bp : 48.59% C+G : Isochore 2 (43 - 51 C+G%) Parameter matrix: HumanIso.smat Predicted genes/exons: Gn.Ex Type S .Begin ...End .Len Fr Ph I/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ 1.01 Intr + 4364 4474 111 2 0 24 87 76 0.164 0.59 1.02 Intr + 10032 10126 95 0 2 92 68 68 0.868 4.81 1.03 Intr + 16263 16406 144 1 0 68 53 101 0.252 4.85 1.04 Intr + 21404 21426 23 0 2 75 121 -1 0.143 -0.84 1.05 Intr + 23475 23562 88 2 1 76 81 128 0.631 10.44 1.06 Intr + 35921 36061 141 0 0 31 76 101 0.168 3.52 1.07 Intr + 45151 45314 164 2 2 69 76 61 0.201 2.69 1.08 Term + 46291 46446 156 0 0 34 49 87 0.324 -2.67 1.09 PlyA + 52390 52395 6 1.05 2.00 Prom + 55023 55062 40 -4.36 2.01 Init + 65784 65913 130 2 1 74 106 85 0.940 9.21 2.02 Intr + 66192 66220 29 1 2 103 58 8 0.432 -2.87 2.03 Intr + 68231 68297 67 1 1 103 76 64 0.635 5.18 2.04 Intr + 83873 84050 178 0 1 54 63 64 0.011 -0.52 2.05 Intr + 100009 100535 527 1 2 49 109 959 0.687 86.69 2.06 Term + 104975 105534 560 0 2 121 53 1000 0.931 94.31 2.07 PlyA + 107198 107203 6 1.05 3.00 Prom + 125102 125141 40 -2.76 3.01 Init + 141627 141904 278 2 2 63 78 459 0.851 36.86 3.02 Intr + 150181 150289 109 1 1 61 121 45 0.969 5.39 3.03 Intr + 152930 153072 143 1 2 67 110 221 0.998 21.35 3.04 Intr + 153957 154067 111 0 0 53 42 148 0.929 6.19 3.05 Intr + 155047 155231 185 1 2 69 65 373 0.989 32.53 3.06 Intr + 156259 156347 89 2 2 119 86 24 0.914 4.99 3.07 Intr + 161811 161951 141 2 0 98 116 141 0.999 18.45 3.08 Intr + 163705 164016 312 0 0 86 100 240 0.969 21.38 3.09 Intr + 164252 164403 152 1 2 80 94 246 0.999 23.36 3.10 Intr + 164457 164731 275 0 2 93 56 353 0.789 29.78 3.11 Intr + 165041 165133 93 0 0 138 99 12 0.993 7.24 3.12 Term + 166291 166430 140 2 2 101 54 174 0.990 13.43 3.13 PlyA + 168835 168840 6 1.05 4.00 Prom + 174439 174478 40 -5.26 4.01 Init + 180020 180120 101 1 2 80 94 131 0.334 10.64 4.02 Intr + 185971 186059 89 1 2 53 37 84 0.160 -0.59 4.03 Intr + 191071 191149 79 2 1 65 58 122 0.422 5.51 4.04 Intr + 192284 192389 106 2 1 99 90 174 0.980 18.92 4.05 Intr + 194337 194651 315 2 0 106 94 423 0.898 40.96 4.06 Intr + 195000 195098 99 2 0 69 59 138 0.995 9.31 4.07 Intr + 196029 196260 232 2 1 83 111 92 0.966 8.35 4.08 Intr + 196571 197383 813 0 0 94 81 1066 0.964 97.91 4.09 Intr + 197564 197639 76 0 1 79 94 114 0.922 9.67 4.10 Intr + 198188 198360 173 2 2 58 72 356 0.992 30.59 4.11 Intr + 198654 198763 110 1 2 91 77 127 0.994 11.90 4.12 Intr + 198917 199033 117 1 0 88 57 136 0.996 11.16 4.13 Intr + 199170 199240 71 2 2 112 108 -1 0.994 2.28 4.14 Intr + 199350 199448 99 0 0 59 71 171 0.670 11.73 4.15 Term + 199848 199920 73 0 1 90 37 153 0.940 7.78 4.16 PlyA + 200640 200645 6 -1.95 5.05 PlyA - 200756 200751 6 -3.94 5.04 Term - 200889 200768 122 1 2 126 46 73 0.973 5.54 5.03 Intr - 201141 201008 134 2 2 104 34 46 0.810 1.09 5.02 Intr - 204257 204101 157 0 1 66 81 35 0.108 -0.33 5.01 Intr - 205161 205097 65 2 2 103 82 95 0.945 8.76 Suboptimal exons with probability > 0.800 Exnum Type S .Begin ...End .Len Fr Ph B/Ac Do/T CodRg P.... Tscr.. ----- ---- - ------ ------ ---- -- -- ---- ---- ----- ----- ------ S.001 Sngl + 132909 133160 252 2 0 52 47 157 0.897 3.09 Predicted peptide sequence(s): Predicted coding sequence(s): >gi568815587f:45705802_45911332|GENSCAN_predicted_peptide_1|307_aa XHFGCVNQKDKATSHQGPGKRRWTEEMRIKGAEKKRGRHALLFSPNAGSPEKVELTLGIQ KHFTNPLASEPQTHSFFGNFFTPFTEKTLDFECEKWLNLLRKRIKPLPQDLAKRELTECP YCLFKCPTSITANENRKSHVKGYTSPSITKRALKAANLMQTQLEKEEDSQVIPAKATLDQ PASCQAASDHRVQLTTASPGPAVRFQVSPESSTSSFDLATEYSIVLINMPVSPSYPHILG ILKRTVMDTKLFSVGPKYLNLKVIFAALFTAVDNGKQPKCPVMEASYINFGTCIHVENAK AYGRKNT >gi568815587f:45705802_45911332|GENSCAN_predicted_CDS_1|924_bp nnccattttggatgtgttaaccagaaggacaaagccacaagccatcagggaccagggaag aggagatggacagaggaaatgcgtatcaagggtgcagagaagaagaggggaagacatgct ctcctcttctcccccaatgcaggctctccagagaaagtggaactgacattagggattcag aagcattttaccaaccccttagcctcagagccccaaacgcattccttctttggaaatttc tttacaccctttactgagaagaccctggattttgagtgtgaaaaatggctgaatctactt cggaaaagaataaaaccgttaccccaggatctcgcaaagcgagaattgactgaatgtccc tactgcctattcaagtgtccaactagtatcaccgccaatgaaaaccggaagtcccatgtc aaaggatataccagccccagcatcacaaagagagcattaaaagcagcaaacttgatgcaa acacaacttgaaaaagaagaggacagccaagtcatcccagccaaggccaccctggaccag ccagcctcttgccaagctgccagtgaccacagagtccaattaacaacagccagtcctggc ccagcagtacgtttccaggtgtcaccagaatcctcaacgtcttcctttgatttggccaca gaatattccattgtgttaataaacatgccggtttcaccttcttatccccatattctgggc attctgaagaggactgtgatggacaccaagttgttctcagtagggcctaaatatttgaat ctgaaggtgatctttgcagcattatttacagcagtagataatggaaaacaacctaaatgt cccgtaatggaggccagttacatcaattttggtacatgcatacacgttgaaaatgcaaag gcgtacgggcggaagaacacataa >gi568815587f:45705802_45911332|GENSCAN_predicted_peptide_2|496_aa MGSGAETVTVTVLFREEGRRNQCRGRLWGVDRRCPQLTGSMTLEEWKLRQSRDLEKRLDE AYIYWYPGMYKGQCPVHQLVTGKHTVVHDKRNQRLIHATTWMNLKSTMLSERSQTQKAIY YMIPFVRHYGKAKTRAPLKRSRILHMALTGASDPSAEAEANGEKPFLLRALQIALVVSLY WVTSISMVFLNKYLLDSPSLRLDTPIFVTFYQCLVTTLLCKGLSALAACCPGAVDFPSLR LDLRVARSVLPLSVVFIGMITFNNLCLKYVGVAFYNVGRSLTTVFNVLLSYLLLKQTTSF YALLTCGIIIGGFWLGVDQEGAEGTLSWLGTVFGVLASLCVSLNAIYTTKVLPAVDGSIW RLTFYNNVNACILFLPLLLLLGELQALRDFAQLGSAHFWGMMTLGGLFGFAIGYVTGLQI KFTSPLTHNVSGTAKACAQTVLAVLYYEETKSFLWWTSNMMVLGGSSAYTWVRGWEMKKT PEEPSPKDSEKSAMGV >gi568815587f:45705802_45911332|GENSCAN_predicted_CDS_2|1491_bp atggggagtggggcagaaactgtcactgtcacggtcctgtttagagaggaaggtcggaga aaccagtgccgtggaaggctctggggcgttgacaggcgctgtccccagctcaccggctct atgaccctggaagaatggaaacttcgtcagagcagggacttagaaaagcgactggatgag gcgtacatttactggtaccctgggatgtataagggccagtgtccggtccatcaactggtg actggaaaacatactgtggtacatgataaaaggaaccaacgactgatacatgcaacaaca tggatgaaccttaaaagcaccatgttaagtgaaagaagtcagacacaaaaggctatatac tatatgattccatttgtaagacattatggaaaagccaaaactagggcccctctgaagcgg tccaggatcctgcacatggcgctgaccggggcctcagacccctctgcagaggcagaggcc aacggggagaagccctttctgctgcgggcattgcagatcgcgctggtggtctccctctac tgggtcacctccatctccatggtgttccttaataagtacctgctggacagcccctccctg cggctggacacccccatcttcgtcaccttctaccagtgcctggtgaccacgctgctgtgc aaaggcctcagcgctctggccgcctgctgccctggtgccgtggacttccccagcttgcgc ctggacctcagggtggcccgcagcgtcctgcccctgtcggtggtcttcatcggcatgatc accttcaataacctctgcctcaagtacgtcggtgtggccttctacaatgtgggccgctca ctcaccaccgtcttcaacgtgctgctctcctacctgctgctcaagcagaccacctccttc tatgccctgctcacctgcggtatcatcatcgggggcttctggcttggtgtggaccaggag ggggcagaaggcaccctgtcgtggctgggcaccgtcttcggcgtgctggctagcctctgt gtctcgctcaacgccatctacaccacgaaggtgctcccggcggtggacggcagcatctgg cgcctgactttctacaacaacgtcaacgcctgcatcctcttcctgcccctgctcctgctg ctcggggagcttcaggccctgcgtgactttgcccagctgggcagtgcccacttctggggg atgatgacgctgggcggcctgtttggctttgccatcggctacgtgacaggactgcagatc aagttcaccagtccgctgacccacaatgtgtcgggcacggccaaggcctgtgcccagaca gtgctggccgtgctctactacgaggagaccaagagcttcctctggtggacgagcaacatg atggtgctgggcggctcctccgcctacacctgggtcaggggctgggagatgaagaagact ccggaggagcccagccccaaagacagcgagaagagcgccatgggggtgtga >gi568815587f:45705802_45911332|GENSCAN_predicted_peptide_3|675_aa MGGVHVAYRGGAGVAGAVWTVMAATVATAAAVAPAPAPGTDSASSVHWFRKGLRLHDNPA LLAAVRGARCVRCVYILDPWFAASSSVGINRWRFLLQSLEDLDTSLRKLNSRLFVVRGQP ADVFPRLFKEWGVTRLTFEYDSEPFGKERDAAIMKMAKEAGVEVVTENSHTLYDLDRAVT LLLLLLLLLLLPSQQPPPLPTPTSQPSAGCQEARIIELNGQKPPLTYKRFQAIISRMELP KKPVGLVTSQQMESCRAEIQENHDETYGVPSLEELGFPTEGLGPAVWQGGETEALARLDK HLERKAWVANYERPRMNANSLLASPTGLSPYLRFGCLSCRLFYYRLWDLYKKVKRNSTPP LSLFGQLLWREFFYTAATNNPRFDRMEGNPICIQIPWDRNPEALAKWAEGKTGFPWIDAI MTQLRQEGWIHHLARHAVACFLTRGDLWVSWESGVRVFDELLLDADFSVNAGSWMWLSCS AFFQQFFHCYCPVGFGRRTDPSGDYIRMGYPGPFEGGSGYADGSSGVSYFRRYLPKLKAF PSRYIYEPWNAPESIQKAAKCIIGVDYPRPIVNHAETSRLNIERMKQIYQQLSRYRGLCL LASVPSCVEDLSHPVAEPSSSQAGSMSSAGPRPLPSGPASPKRKLEAAEEPPGEELSKRA RVAELPTPELPSKDA >gi568815587f:45705802_45911332|GENSCAN_predicted_CDS_3|2028_bp atgggcggggtccacgtcgcctaccggggcggagcgggggtggctggagcagtctggaca gtcatggcggcgactgtggcgacggcggcagctgtggccccggcgccagcgcccggcacg gacagcgcctcttcggtgcactggttccgcaaagggctgcgactccacgacaacccggcg ttgctggcggccgtgcgcggggcgcgctgcgtgcgctgcgtttacattctcgacccgtgg ttcgcggcctcctcctcagtcgggatcaaccgatggaggttcctacttcagtctctggaa gatttggacacaagtttaaggaaactgaactcccgcctgtttgtagtccggggacagcca gccgacgtgttcccaaggctgttcaaggaatggggagtgacccgcttgacctttgaatat gactctgaaccctttgggaaagaacgggatgcagccatcatgaagatggccaaggaggct ggtgtggaagtagtgacggagaattctcataccctctatgacctggacagagctgtcacc ctcctgctgctgctgctgctgttgctgctgctgccaagccagcagccgccgccgctcccc acccccacttcccaacccagtgctggctgccaagaagccaggatcattgagctgaatggg cagaagccaccccttacatacaagcgctttcaggccatcatcagccgcatggagctgccc aagaagccagtgggcttggtgaccagccagcagatggagagctgcagggccgagatccag gagaaccacgacgagacctacggcgtgccctccctggaggagctggggttccccactgaa ggacttggtccagctgtctggcagggaggagagacagaagctctggcccgcctggataag cacttggaacggaaggcctgggttgccaactatgagagaccccgaatgaacgccaactcc ctcctggccagccccacaggcctcagcccctacctgcgctttggttgtctctcctgccgc ctcttctactaccgcctgtgggacctgtataaaaaggtgaagcggaacagcacacctccc ctctccctatttgggcaactcctatggcgagagttcttctacacggcagctaccaacaac cccaggtttgaccgcatggaggggaaccccatctgcatccagatcccctgggaccgcaat cctgaggccctggccaagtgggctgagggcaagacaggcttcccttggattgatgccatc atgacccaactgaggcaggagggctggatccaccacctggcccggcatgccgtggcctgc ttcctgacccgcggggacctctgggtcagctgggagagcggggtccgggtatttgatgag ctgctcctggatgcagatttcagcgtgaacgcaggcagctggatgtggctgtcctgcagt gctttcttccagcagttcttccactgctactgccctgtgggctttggccgtcgcacggac cccagtggggactacatcaggatgggataccctgggccttttgaaggagggtctgggtat gctgatgggtcatctggtgtatcttatttcaggcgatacctgcccaaattgaaagcgttc ccctctcgatacatctatgagccctggaatgccccagagtcaattcagaaggcagccaag tgcatcattggtgtggactacccacggcccatcgtcaaccatgccgagaccagccggctt aacattgaacgaatgaagcagatttaccagcagctttcgcgctaccggggactctgtcta ctggcatctgtcccttcctgtgtggaagacctcagtcaccctgtggcagagcccagctcg agccaggctggcagcatgagcagtgcaggcccaagaccactacccagtggcccagcatcc cccaaacgcaagctggaagcagccgaggaaccacctggtgaagaactcagcaaacgggcc cgggtggcagagttgccaaccccagagctgccgagcaaggatgcctga >gi568815587f:45705802_45911332|GENSCAN_predicted_peptide_4|850_aa MAERESGGLGGGAASPPAASPFLGLHIASPPNFRLYNVDSKRLHHISSADGHDGTSPLVL GDAGLSMQLVLKMDSSPDNDSWLEDQWERWLTHDISLEEFEDEDLSEITDECGISLQCKD TLSLRPPRAGLLSAGGGGAGSRLQAEMLQMDLIDATGDTPGAEDDEEDDDEERAARRPGA GPPKAESGQEPASRGQGQSQGQSQGPGSGDTYRPKRPTTLNLFPQVPRSQDMGLLVKVEQ EQGLEKRASEMVIVGEAVGDRGERRVTSEAFDSGGGVDTANGPRDGLRGQGRGLPVPGTV DHFHHKSLLFPAQDTLNNNSLGKKHSWQDRVSRSSSPLKTGEQTPPHEHICLSDELPPQS GPAPTTDRGTSTDSPCRRSTATQMAPPGGPPAAPPGGRGHSHRDRIHYQADVRLEATEEI YLTPVQRPPDAAEPTSAFLPPTESRMSVSSDPDPAAYPSTAGRPHPSISEEEEGFDCLSS PERAEPPGGGWRGSLGEPPPPPRASLSSDTSALSYDSVKYTLVVDEHAQLELVSLRPCFG DYSDESDSATVYDNCASVSSPYESAIGEEYEEAPRPQPPACLSEDSTPDEPDVHFSKKFL NVFMSGRSRSSSAESFGLFSCIINGEEQEQTHRAIFRFVPRHEDELELEVDDPLLVELQA EDYWYEAYNMRTGARGVFPAYYAIEVTKEPEHMAALAKNSDWVDQFRVKFLGSVQVPYHK GNDVLCAAMQKIATTRRLTVHFNPPSSCVLEISVRGVKIGVKADDSQEAKGNKCSHFFQL KNISFCGYHPKNNKYFGFITKHPADHRFACHVFVSEDSTKALAESVGRAFQQFYKQFVEY TCPTEDIYLE >gi568815587f:45705802_45911332|GENSCAN_predicted_CDS_4|2553_bp atggcggagcgagaaagcggcggcctgggagggggggccgcgtccccgcccgccgcctcc ccgttcctggggctgcacatcgcttcgcctcccaatttcagactttataatgttgattcc aaaagactgcatcacatttcatctgctgatggccacgatggaaccagccccctggtgctg ggtgacgcagggctctccatgcagctggtgctgaagatggattcgagcccagacaatgac agctggttggaggatcaatgggagcgctggctcacccatgacatcagcctggaggagttt gaggatgaagacctctcggagatcactgatgagtgtggcatcagcttacagtgcaaagac accctgtccttacggcccccgcgcgccgggctgctctctgcgggcggcggcggcgcgggg agccggttgcaggccgagatgctgcagatggacctgatcgacgcgacgggggacactccc ggggccgaggacgacgaggaggacgacgacgaggagcgcgcggcccggcggccgggagcg gggccgcccaaggccgagtccggccaggagccggcgtcccgcggccagggccagagccaa ggccagagccagggcccgggcagcggggacacgtaccggcccaagcggcccaccacgctc aacctctttccgcaggtgccgcggtctcaggatatggggctcctggtgaaagtggagcag gagcagggcctggagaaaagggcatctgaaatggtcatcgtgggggaggccgtgggagat cgtggcgagaggagagttacgagtgaggcctttgacagtggaggcggggtggacacagca aatggccctagggatggcctgagggggcagggccggggcctccctgtgccgggcactgtt gaccacttccatcacaagagccttttgttccctgcacaggacacactgaataataattct ctgggcaaaaagcacagttggcaggatcgggtgtctcgatcatcctcacccctgaagaca ggggagcagacaccaccgcatgaacacatctgcctgagcgatgagctgcccccccagagc ggccccgcccccaccacagatcgaggcacctccaccgacagcccttgccgccgcagcaca gccacccagatggcacctccgggtggtccccctgctgccccgcctgggggtcggggccac tcgcatcgagaccgaatccactaccaggccgatgtgcgactagaggccactgaggagatc tacctgaccccagtgcagaggcccccagacgctgcagagcccacctccgccttcctgccg cccactgagagccggatgtcagtcagctccgatccagaccctgccgcctacccctccacg gcagggcggccgcacccctccatcagtgaagaggaagagggcttcgactgcctgtcgtcc ccagagcgggctgagcccccaggcggagggtggcgggggagcctgggggagccgccgcca cctccacgggcctctctgagctcggacaccagcgccctgtcctatgactctgtcaagtac acgctggtggtagatgagcatgcacagctggagctggtgagcctgcggccgtgcttcgga gactacagtgacgagagtgactctgccaccgtctatgacaactgtgcctccgtctcctcg ccctatgagtcggccatcggagaggaatatgaggaggccccgcggccccagccccctgcc tgcctctccgaggactccacgcctgatgaacccgacgtccatttctccaagaaattcctg aacgtcttcatgagtggccgctcccgctcctccagtgctgagtccttcgggctgttctcc tgcatcatcaacggggaggagcaggagcagacccaccgggccatattcaggtttgtgcct cgacacgaagacgaacttgagctggaagtggatgaccctctgctagtggagctccaggct gaagactactggtacgaggcctacaacatgcgcactggtgcccggggtgtctttcctgcc tattacgccatcgaggtcaccaaggagcccgagcacatggcagccctggccaaaaacagt gactgggtggaccagttccgggtgaagttcctgggctcagtccaggttccctatcacaag ggcaatgacgtcctctgtgctgctatgcaaaagattgccaccacccgccggctcaccgtg cactttaacccgccctccagctgtgtcctggagatcagcgtgcggggtgtgaagataggc gtcaaggccgatgactcccaggaggccaaggggaataaatgtagccactttttccagtta aaaaacatctctttctgcggatatcatccaaagaacaacaagtactttgggttcatcacc aagcaccccgccgaccaccggtttgcctgccacgtctttgtgtctgaagactccaccaaa gccctggcagagtccgtggggagagcattccagcagttctacaagcagtttgtggagtac acctgccccacagaagatatctacctggagtag >gi568815587f:45705802_45911332|GENSCAN_predicted_peptide_5|159_aa XARILFLLQLLADHVPGVGLVTTRTLSLLGQEAFTRSGIQAPLPQKAVTSPHGAREAARL GSHRSEHCCSLAGPWSQRSVPEAFSAPLELSQPLSGLVDGSISLVPFPYPSLTVETPLRD YGILPKHPRPRGPRPLLSRAQQRKRDGPDLAEYYYDAHL >gi568815587f:45705802_45911332|GENSCAN_predicted_CDS_5|480_bp nnggccaggatcctcttcctgctccagttgctggccgaccacgtccctggcgttggcctg gtcacaacccgcaccttgtcactgctgggccaagaagccttcactaggagtgggatccag gctcctctcccacagaaagcggtgacttcacctcatggagcccgggaagctgctcgcctc ggcagccataggagcgaacactgctgctctctcgctggcccctggtctcagcggtctgtt cctgaggcattttccgcccccctggaactctcgcagccactttccggcctggtggatggt agtatctctcttgtccccttcccctacccatccctgacagtggagacgcccctaagagac tatggcatcctccccaagcacccaaggccgcgagggcctcgacccctcctgtctagggcc cagcagcgcaagcgggacgggcccgaccttgccgagtattactatgatgcacacctatga